Working Groups

Working groups support the interest and activities of OCC Members. The current working groups include:

The Open Science Data Cloud (OSDC) Working Group

The OSDC Working group tmanages and operates the Open Science Data Cloud (OSDC), which is is a petabyte scale science cloud for researchers to manage, analyze and share their data. Individual researchers may apply for accounts to analyze data hosted by the OSDC. Research projects with TB-scale datasets are encourage to join the OSDC and contribute towards its infrastructure.

By housing the entire scientific research pipeline, from the raw data and the entire computing environment to the analysis tools and results, the OSDC makes scientific research open, transparent, and reproducible. OSDC is also built on open source technology as a model for the scientific community to extend and grow.

Resources in the OSDC ecosystem include the Bionimbus PDC, a secure biomedical cloud operated at FISMA moderate as IaaS with an NIH Trusted Partner status for analyzing and sharing protected datasets. The Bionimbus PDC is a collaboration between the University of Chicago Center for Data Intensive Science (CDIS) and the Open Commons Consortium (OCC). The Bionimbus PDC allows users authorized by NIH to compute over human genomic data in a secure compliant fashion.

Learn more about available resources at opensciencedatacloud.org

BloodPAC Data Commons Working Group

The BloodPAC Data Commons Working Group supports and manages the BloodPAC data commons and the surrounding community interested accelerating the development and validation of liquid biopsy assays to improve the outcomes of patients with cancer through sharing data in a commons.

Working group activities include: i) developing and operating the BloodPAC data commons; ii) working with the BloodPAC members community to determine key datasets to be included; iii) working with the BloodPAC community to prioritize key middleware and data services; iv) providing a governance framework to align the needs of the various project stakeholders.

To learn more about the BloodPAC visit bloodpac.org

Environmental Data Commons Working Group

The Environmental Data Commons Working Group supports and manages the Environmental data commons and the surrounding community interested in the open redistribution of Environmental datasets from OCC stakeholders like NASA and NOAA.

Working group activities include: i) developing and operating a petabyte-scale environmental data commons; ii) working with the alliance community to determine key datasets to be included; iii) working with the alliance community to prioritize key middleware and data services; iv) implementing functional networking and connectivity between data owners and the Environmental Data Commons; v) working with other environmental data alliances to adopt data peering and standards for metadata and ID services; and vi) developing a self-sufficient and sustainable ecosystem through which the Environmental Data Commons can reliably release key datasets to the public.

To learn more about the Environmental Data Commons visit edc.occ-data.org

Project Matsu Working Group

Project Matsu is a collaboration between the NASA Goddard Space Flight Center and the Open Commons Consortium to develop open source technology for cloud-based processing of satellite imagery to support the earth science research community as well as human assisted disaster relief. This working group develops and operates the OCC Matsu Cloud. Matsu resources are also available to those who needed elastic computing capacity to assist with flood and fire disaster relief.

Biomedical Data Commons (BDC) Working Group

The Biomedical Data Commons (BDC) is cloud-based infrastructure that provides secure, compliant cloud services for managing and analyzing genomic data, electronic medical records (EMR), medical images, and other PHI data. It provides resources to researchers so that they can more easily make discoveries from large complex controlled access datasets. The BDC provides resources to those institutions in the BDC Working Group. It is an example of what is sometimes called condominium model of sharing research infrastructure in which the research infrastructure is operated by a consortium of educational and research organizations and provides resources to the consortium.

Working group activities include: i) developing and operating a secure, compliant cloud for human genomic data; ii) developing and operating a secure cloud-based enclave that can be used for storing and analyzing electronic medical and health records in a HIPAA-compliant environment; and iii) contributing towards a petabyte-scale repository of open and controlled access biomedical data.

The OCC Testbed Working Group

This working group manages and operates the OCC Testbed. The OCC Testbed (OCC-T) is a geographically distributed cloud testbed spanning four data centers and connected with 10G and 100G network connections. The OCC-T is used to develop new cloud computing software and infrastructure. Membership in this working group is limited to OCC members who contribute computing, networking, or other resources to the OCC Testbed. A current focus of the OCC-T is developing an OpenFlow enabled version of Hadoop, a project support by the NSF.