Contents
See also the full agenda and Cees de Laat’s PIRE webpage.
DAY 1 (Mon, 08-Jun 2015)
Session 1: Welcome & Introduction
Chair: Paola Grosso & Cees de Laat, University of Amsterdam
- Welcome and “Why are we here?”
- Clouds and Commons for the Data Intensive Science Community (Robert Grossman, University of Chicago)
Session 2: OSDC PIRE participant self-introduction session
Chair: Maria Patterson, University of Chicago
Participants in the workshop, including organizers and speakers, will introduce themselves and provide a record about themselves for consultation during and after the workshop.
Session 3: Big Data Challenges, Solutions, and the Open Science Data Cloud
Chair: Maria Patterson, University of Chicago
- Open Science, Big Data, and Research Reproducibility (Tony Hey, eScience Institute, University of Washington)
- Data Commons for the Genomics Community (Allison Heath, University of Chicago)
- Cloud-based Analysis of NASA Satellite Data on OSDC (Maria Patterson, University of Chicago)
- Intro to OSDC (Maria Patterson, University of Chicago)
- Wrap-up, Review, and PIRE Challenge Kick-off
DAY 2 (Tues, 09-Jun 2015)
Session 4: Research at the PIRE hosts
Chair: Robert Grossman, University of Chicago
- Data Intensive Research Projects at ITRI/AIST (Jason Haga, AIST)
- Research at LARC-USP: E-Science, Cloud & Big Data Projects (Fernando Redigolo, USP LARC)
- Data intensive research in Edinburgh (Malcolm Atkinson, University of Edinburgh)
Session 5: Amsterdam Big Data Research
Chair: Paola Grosso, University of Amsterdam
- Sharing and using biodiversity data for e-Science (Cees Hof, Netherlands Biodiversity Information Facility)
- Big Data and Deep Learning: A Powerful Mix (Max Welling, University of Amsterdam)
- Smart Cyber Infrastructure for Big Data Processing (Cees de Laat & Paola Grosso, University of Amsterdam)
Session 6: Data Intensive Science: Workflow and Provenance
Chair: Zhiming Zhao, University of Amsterdam
- User Centered Provenance Management for Data-Intensive Platforms (Alessandro Spinuso, KNMI)
- dispel4py tutorial: basics (Rosa Filgueira, University of Edinburgh)
- dispel4py tutorial: advanced (Rosa Filgueira, University of Edinburgh)
- See also dispel4py scripts.
DAY 3 (Wed, 10-Jun 2015)
Session 7: The research challenge
Chair: Cees de Laat/Paola Grosso
- Towards an Open Science Commons in Europe (Tiziana Ferrari, European Grid Infrastructure)
- Project Matsu in Namibia (Race Clark, University of Oklahoma)
DAY 4 (Thurs, 11-Jun 2015)
Session 8: Handling data
Chair: Miroslav Zivkovic
- Anomaly Detection (Piotr Zuraniewski, TNO, The Hague)
- We are Big Data (Sander Klous, KPMG and the University of Amsterdam)
Session 9: Data Transfer and Networking
Chair: Paola Grosso
- Applications for High Speed Data Transfer (Joshua Miller, University of Chicago)
- AmLight SDN Testbeds: The Future of Collaboration Featuring AtlanticWave-SDX (Heidi Morgan, Florida International University)
Session 10: Parallel Sessions on Project Development
- Orientation to research at each host location.
- Extended lunch with informal discussions.
Session 10A: Project Development in AIST
Chair: Jason Haga
Session 10B: Project Development in Sao Paolo
Chair: Fernando Redigolo
Session 10C: Project Development in Edinburgh
Chairs: Malcolm Atkinson and Rosa Filgueira
Session 10D: Project Development in Amsterdam
Chair: Zhiming Zhao
DAY 5 (Fri, 12-Jun 2015)
Session 10: Challenge presentations and prizes
Chair: Cees de Laat/Paola Grosso
- A student from each group will present their work and the judges will decide on the winner.
Challenge results
Team | Title | Score | Rank |
---|---|---|---|
Steven Rapp, Melissa Bica, Theano Stavrinos, Race Clark | gui4dispel4py: A dispel4py graphical user interface on the OSDC (slides) (paper) (github) | 264 | 4 |
Nam Pho, Josh Miller | A Reproducible and Automated Deployment of an HPC Application on a Private Cloud (slides) (paper) (github) | 269 | 3 |
Ryan Mork, Genevieve Shattow | Lasso: A meta search process to find collaborators (slides) (paper) (github) (website) | 277 | 2 |
Grace Lu, Shelby Matlock, Jennifer Piscionere | Daisy: Data Made Easy (slides) (paper) (website) | 281 | 1 |
Session 11: What have we learn & where next?
Chairs: Heidi Morgan, Bob Grossman, Paola Grosso & Cees de Laat
- What have we understood this week about data and how to make the best enable the use of big data in a scientific context?
- What do we need to do & understand to make data-use easy and effective?
- How should we do that?
- Wrap-up & valediction
More links from the workshop
- More is different (Anderson, 1972)
- A LaTeX template for PIRE fellows
- Data-driven documents
- mbostock’s blocks gallery