Contents
DAY 1 (Mon, 16-Jun 2014)
Session 1: Welcome & Introduction
Chair: Paola Grosso & Cees De Laat
- Welcome and “Why are we here?”
- Using the OSDC for Data-Intensive Research (Robert Grossman, UChicago)
Session 2: OSDC PIRE participant self-introduction session
Chair: Heidi Alvarez, FIU
Participants in the workshop, including organizers and speakers, will introduce themselves and provide a record about themselves for consultation during and after the workshop.
Session 3: Enabling data-intensive discovery with the Open Science Data Cloud
Chair: Maria Patterson, UChicago
- The Namibia Flood Dashboard (Jill Hardy and Race Clark, U of Oklahoma)
- Combining Latent Topics with Document Attributes in Text Analysis (Nelson Auner, UChicago)
- Tutorial and Hands-on OSDC exercises (Maria Patterson, UChicago)
- Reproducible Research and Collaborative Tools (Maria Patterson, UChicago)
- Wrap-up, Review, and PIRE Challenge Kick-off
DAY 2 (Tues, 17-Jun 2014)
Session 4: Applications & Infrastructure 10-20 Minute Talks
Chair: Robert Grossman, UChicago
- Project Matsu: OSDC’s Earth Satellite Imagery Project with NASA (Maria Patterson, UChicago)
- Yates and Tukey: OSDC’s Scientific Cloud Software StacK (Rafael Suarez, UChicago)
- Bionimbus: OSDC’s Protected Data Cloud for Human Health Data (Robert Grossman, UChicago)
- Big Data & Scientific Remote Collaboration Projects at LARC-USP(Fernando Redigolo, USP LARC)
- Scrying the next generation of data-intensive research infrastructure: Research at the Data-Intensive Research Group in Edinburgh (Paul Martin, University of Edinburgh)
Session 5a: Amsterdam Big Data Research
Chair: Paola Grosso & Cees de Laat, U of Amsterdam
- Global Biodiversity Information Facility (GBIF): Free and Open Access to Biodiversity Data (Cees Hof, University of Amsterdam)
- The Amsterdam Data Science Research Center (Marcel Worring, University of Amsterdam)
- Environmental Research Infrastrutures with the ENVRI Project and Other Big Data Infrastructure (Cees de Laat, University of Amsterdam)
- Data Visualization (Jeff Weekley, Naval Postgraduate School)
Session 5b: Using ENVRI
Chair: Massimo Argenti, ESA
DAY 3 (Wed, 18-Jun 2014)
Session 6: The research challenge/bazaar
Chair: Cees de Laat/Paola Grosso
- Group brainstorming on the OSDC PIRE challenge
DAY 4 (Thurs, 19-Jun 2014)
Session 7: Tools for Development and Support of Large-Scale Data-Intensive Applications
- RADICAL-Cybertools Also see the online tutorial. (Shantenu Jha, Rutgers University)
- IBM’s Watson (Marc Teerling, IBM)
Session 8: PIRE partners contributions
Chair: Paola Grosso
- Data-Intensive Research at AIST (Dr. Isao Kojima, ITRI-AIST)
- Research at the University of the West Indies (Dr. Brigitte Collins, UWI)
- AgrineTT at UWI (Dr. Margaret Bernard, UWI)
Session 9: Parallel Sessions on Project Development
Session 9A: Project Development in AIST
Chair: Isao Kojima
- Orientation to research at AIST.
- Extended lunch with informal discussions
Session 9B: Project Development in Sao Paolo
Chairs: Fernando Redigolo
- Extended lunch with informal discussions
Session 9C: Project Development in Edinburgh
Chairs: Paul Martin
- Extended lunch with informal discussions
Session 9D: Project Development in Amsterdam
Chairs: Paola Grosso
- Introduction to the projects in Amsterdam:
- Ana Oprescu on OSDC to OSDC
- Zhiming Zhao on OSDC to SDN
- Miroslav Zivkovic on OSDC and Big Data issues
DAY 5 (Fri, 20-Jun 2014)
Session 10: Challenge presentations and prizes
Chair: Cees de Laat/Paola Grosso
- A student from each group will present their work and the judges will decide on the winner.
Challenge results
Team | Title | Score | Rank |
---|---|---|---|
Race Clark, Chris Natoli, Jill Hardy, William Matthews | Using OSDC to Advance Public Understanding of Temperature Extremes (slides) (paper) (github) | 32 | 2 |
Nelson Auner, Cody Buntain | Mayfly - Rapid, Accessible, Reproducible Research (slides) (paper) (github) | 38 | 1 |
Alexander Moreno, Keval Shah, Yuan Zhao | Automatic Variable Detection and Formatting for Cross-disciplinary Data Set Compatibility (slides) (paper) (github) | 32 | 2 |
Nathiel Butler, Michael Lewis, Weiwei Zhang | Tourist Buddy (slides) (paper) | 34 | 2 |
Eric Griffis, Josh Eisenberg | Client-side plug-ins for Tukey (slides) (paper) | 31 | 3 |
Session 11: What have we learn & where next?
Chair: Heidi Alvarez, Bob Grossman, Paola Grosso & Cees De Laat
- What have we understood this week about data and how to make the best use of the big data in a scientific context?
- What do we need to do & understand to make data-use easy and effective?
- How should we do that?
- Wrap-up & valediction
More links from the workshop
- More is different (Anderson, 1972)
- The Data Center as a Computer
- A LaTeX template for PIRE fellows
- RADICAL – Cybertools – SAGA and Pilot tutorial
- Data-driven documents
- mbostock’s blocks gallery