Open Science Data Cloud Two-Day Tutorial

Two Day Tutorial Introduction to Data Intensive Science and Cloud Computing University of Chicago May 30 – 31, 2011

Please Note: This event is now full.

Scientific instruments are producing unprecedented amounts of data, yet our ability to manage, analyze, integrate and share this data has not been keeping up. As the amount of data grows, so does our ability to make new discoveries by integrating and analyze existing datasets.

We will give a tutorial introduction to utility clouds and to data clouds, both of which are being used to support data intensive computing. Utility clouds provide user-provisioned, on-demand infrastructure. Amazon Web Services (AWS) is a commercial provider of utility cloud services, and groups can set up their own utility clouds with OpenStack, as well as other applications. Data clouds provide large-scale storage over commodity hardware and simple to use parallel processing over the storage. Hadoop is the best known example of a data cloud. Both OpenStack and Hadoop are open source.

In this two day tutorial, we will give an introduction to these topics using several case studies. The tutorials include hands-on laboratory sessions each day. The laboratory sessions require that you bring your own laptop.

For more information: Please contact Cindy Rogowski at

Sponsors: The two day workshop is sponsored by the Open Science Data Cloud. The Open Science Data Cloud is a cloud-based platform for managing, analyzing, integrating and sharing scientific datasets operated by the Open Cloud Consortium. The workshop is also sponsored by the Computation Institute and the Institute for Genomics and Systems Biology at the University of Chicago. Additional funding is provided by the NSF through its PIRE Program (NSF Award #1129076).

This event will require registration. Receipt of a confirmation email is required for complete registration to the tutorial.

Share this on: TwitterGoogle+Facebook