Primeur weekly 2013-11-04

The Cloud

Clustrix announces industry's first scale-out database for real-time analytics on live operational data ...

IDC MarketScape names DataDirect Networks a leader in object-based storage market ...

George Washington School of Engineering and Applied Science receives grant to improve Cloud computing reliability ...

Calxeda announces new fabric-based platform for Clouds ...

Securing data for Cloud computing ...

Expensify selects Oracle Sales Cloud to empower growing sales team ...

Baylor, DNAnexus, Amazon Web Services collaboration enables largest-ever Cloud-based analysis of genomic data ...

Red Hat launches new initiative to drive enterprise adoption of OpenStack ...

SoftLayer, an IBM company, and Cloudera launch turnkey Big Data solutions on bare metal servers ...

CANCOM accelerates client innovation through the Cloud with IBM PureSystems ...

European managed services provider taps IBM Flash for faster Cloud solutions ...

Desktop Grids

New Virtual Team to promote Desktop Grids among the EGI scientific communities ...

New BoincTasks version release issued ...

Remotis Technologies achieves milestone with World Community Grid ...

EuroFlash

N8 supercomputer tracks first dinosaur steps in over 90 million years ...

Cray expands research and development in Europe ...

Points north: the future on display in Lithuania ...

Points north: the future on display in Lithuania - Part II ...

New California Energy Commission study validates energy savings of Asetek RackCDU for data centres ...

PRACE event "HPC@Horizon2020" rallies support for "Supercomputers for all" ...

PRACE Winter School 2014 to be held in Tel Aviv, Israel ...

New insight into the Earth's deep interior ...

The ALiX project provides Tenerife with Teide-HPC, the second most powerful supercomputer in Spain ...

New material for quantum computing discovered out of the blue ...

USFlash

National Energy Research Scientific Computing Center (NERSC) drives major scientific discoveries with powerful, converged, site-wide storage from DataDirect Networks ...

RainStor delivers Hadoop on EMC Isilon NAS for efficient Big Data management ...

HP helps Johnson Memorial Hospital deliver high-quality patient care ...

Baylor, DNAnexus, Amazon Web Services collaboration enables largest-ever Cloud-based analysis of genomic data

25 Oct 2013 Houston - With their participation in the completion of the largest Cloud-based analysis of genome sequence data, researchers from the Baylor College of Medicine (BCM) Human Genome Sequencing Center are helping to usher genomic scientists and clinicians around the world into a new era of high-level data analysis. A "Cloud" is a virtual network of remote internet servers used to store, manage and process information.

"The mission of the Baylor Human Genome Sequencing Center is to drive genomics and genomic analysis to be at the leading edge of everything in the field", stated Dr. Jeffrey Reid, assistant professor in the Human Genome Sequencing Center at BCM, who led the BCM portion of the project. "In terms of analysis, the future of genomic research and genomic medicine is in the Cloud. We are very much going towards more computing and not less."

Together with the Platform-as-a-Service company DNAnexus and Amazon Web Services, the largest provider of Cloud computing, BCM sequenced the DNA of more than 14,000 individuals (3,751 whole genomes and 10,771 whole exomes) using next generation sequencing. An exome contains all the genes in a genome and is the part of the genome that provides the blueprints for proteins. The individuals whose genetic material was sequenced are part of the Cohorts for Heart and Aging Research in Genomic Epidemiology Consortium, or CHARGE, a project aimed at advancing understanding of human genetics and its contributions to heart disease and aging.

Dr. Reid gave a presentation on the project on 25 October at the American Society of Human Genetics annual meeting in Boston.

The BCM Human Genome Sequencing Center-developed Mercury pipeline, a semi-automated and modular set of tools for the analysis of next generation sequencing data in both research and clinical contexts, was an integral part of the project. The pipeline identifies mutations from genomic data, setting the stage for determining the significance of these mutations as a cause of serious disease.

Led by Dr. Eric Boerwinkle, professor and director of the Human Genetics Center at the University of Texas Health Science Center at Houston and associate director of the Human Genome Sequencing Center at BCM, the CHARGE project involves more than 300 researchers across five institutions around the world. The Cloud-based analysis makes it possible for the large group to have access to an expansive network of data over a server that is HIPAA certified to not compromise patient privacy.

"The collaboration between the CHARGE consortium and the Human Genome Sequencing Center is leading to discovery of those genes contributing to risk of the most important diseases plaguing the U.S. population across all age groups", stated Dr. Boerwinkle. "Ultimately, these discoveries forge a path toward novel therapeutics and diagnostics. The use of Cloud computing and collaboration with DNAnexus is allowing us to achieve our goals faster and in a more cost-effective manner." Dr. Boerwinkle will give an updated presentation November 15 at the Cold Spring Harbor Laboratory's Personal Genomes & Pharmacogenomics Meeting.

"Having access to this much data was unique", stated Dr. Reid. "Many institutions do not have the local compute resources and infrastructure to support large scale analysis projects like this one, so we were lucky to come together with DNAnexus and Amazon Web Services to make this project possible."

Over a four-week period, the project consumed approximately 2.4 million core-hours of computational time, generated 440 TB (terabytes) of results and used nearly a petabyte of storage.

By comparison, the 1000 Genomes Project sequenced 2,535 exomes and required 25 TB of data.
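
To put the reported 2.4 million core-hours over four weeks in perspective, the short Python sketch below estimates how many processor cores would have had to be busy at once. It assumes the cores ran around the clock at an even load for the whole period, which the source does not state; the function name `average_cores` is ours and purely illustrative.

```python
# Back-of-the-envelope estimate only. The even, round-the-clock
# utilization is an assumption, not something the article reports.
CORE_HOURS = 2.4e6   # approximate core-hours quoted in the article
WEEKS = 4            # project duration quoted in the article


def average_cores(core_hours: float, weeks: float) -> float:
    """Average number of cores busy at once under constant utilization."""
    wall_clock_hours = weeks * 7 * 24  # weeks -> days -> hours
    return core_hours / wall_clock_hours


if __name__ == "__main__":
    print(f"about {average_cores(CORE_HOURS, WEEKS):.0f} cores on average")
```

Under that assumption the figure comes out at roughly 3,600 cores in continuous use. Real Cloud workloads are typically bursty, so the peak number of cores may well have been higher.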

"It is very important for us to create a centralized space where researchers from all over the world can come and collaborate with the data", stated Dr. Reid. "This project creates expansive access to this data over a protected network that will advance research."
Source: Baylor College of Medicine
