I'm a data scientist who has been working with big data and predictive Analytics since 1988. I'm a faculty member at the University of Chicago, the Director of the Open Commons Consortium, and the Founder and a Partner of Analytic strategy Partners.

Announcing the Data Commons Framework (DCF) and Gen3 at ISMB 2018

Announcing the Data Commons Framework (DCF) and Gen3 at ISMB 2018

narrow-middle-arch-18-v4.jpg

ISMB 2018 is meeting in Chicago this weekend and the Center for Data Intensive Science (CDIS) is making two announcements there.

First, we are announcing release 1.0 of the Gen3 platform, which you can use for setting up, developing and operating data commons.  Gen3 is open source and licensed under the Apache license and is being used for several data commons, including the BloodPAC Data Commons, the BRAIN Data Commons, the Melanoma Genetics Consortium (GenoMEL), and the Kids First Data Catalog.  

Second, we are making a joint announcement along with National Cancer Institute (NCI) Center for Biomedical Informatics and Information Technology (CBIIT) of the Data Commons Framework or DCF.  The DCF is a set of software services to make it easier to develop, operate, and interoperate data commons, data clouds, knowledge bases, and other resources for managing, analyzing, and sharing cancer research data that are part of NCI's Cancer Research Data Commons (CRDC).  The DCF is powered by the Gen3 platform.

Both Gen3 and the DCF use what you might call an end-to-end design or narrow middle architecture for a data commons.  I wrote a recent Medium post about the narrow middle architecture for data commons.

In addition to posters announcing Gen3 (A-318) and DCF (Poster A-381), CDIS also has posters at ISMB 2018 about our new workflow automation system (A-359), and a tumor-only variant calling pipeline (B-927).  This is a good time to mention that CDIS is hiring data scientists, software engineers, bioinformaticians, project managers, and product managers.  

The Genomic Data Commons (GDC) Two Year Anniversary

The Genomic Data Commons (GDC) Two Year Anniversary