Back to Table of contents

Primeur weekly 2015-04-20

Special

Silvana Muscella to look back with satisfaction on a content-rich CloudScape VII event ...

Open standard Intercloud run by industry with global international standard on top of it would be ideal scenario ...

Rüdiger Dorn to present Microsoft's Trusted Cloud Initiative at CloudScape VII ...

The Cloud

ISC Cloud & Big Data keynote will recount bank's path into Cloud computing ...

Say it loud - it's fog and it's Cloud ...

IBM and partners to transform personal health with Watson and Open Cloud ...

Ministry of Health Brazil deploys Oracle Healthcare Master Person Index to improve health data interoperability ...

Desktop Grids

Alantu seeks beta testers ...

DENIS@Home to join BOINC ...

EuroFlash

RSC Group has deployed a massively parallel MVS-10P MP supercomputer based on RSC PetaStream architecture and the highest-performing Intel Xeon Phi 7120D at JSCC RAS ...

CoolIT Systems selected to host Birds-of-a-Feather (BoF) session at ISC 2015 in Frankfurt ...

PRACE Preparatory Access completes 20th cut-off with strong participation of industry ...

We-NMR cluster at Utrecht Bijvoet Center accommodated into Life Science Grid ...

CHAIN-REDS paves way for global e-Infrastructure ecosystem for researchers ...

USFlash

Intel Enterprise Edition for Lustre 2.2 launched ...

Storage boost at the National Computational Infrastructure in Australia ...

CASC announces new Director ...

OLCF selects application readiness projects to prepare for next-generation Summit supercomputer ...

Researchers to Tap NVLink to Power Next-Gen Research on Summit Supercomputer - ...

Computer-designed Rocker protein world's first to biomimic ion transport ...

For Big Data researchers, network and compute capabilities are lynchpin to success ...

Ultra-high-speed broadband bolsters UConn research ...

NIST tightens the bounds on the quantum information 'speed limit' ...

Electrical control of quantum bits in silicon paves the way to large quantum computers ...

Quantum cryptography at the speed of light: Researchers design first all-photonic repeaters ...

For Big Data researchers, network and compute capabilities are lynchpin to success

13 Apr 2015 Washington D.C. - For many researchers in the life sciences, Big Data is not just a buzz word - it's the daily reality for carrying out their work in areas like genomics, which is expected to equal if not surpass the data output of the particle physics community. For many scientists, in order to keep pace with the data deluge, the often less glamorous side of Big Data research - the network, computing and Cloud architecture required to support their work - must be at the forefront of their minds. At the Internet2 Global Summit meeting taking place April 26-30 in Washington, D.C., researchers like Genetics and Biochemistry Associate Professor Alex Feltus of Clemson University will come together with network engineers, chief information officers, and other technology leaders in the research and education community to discuss ways they can collaborate to advance research capabilities in IT infrastructure and applications.

At Clemson, Alex Feltus uses genomics research to develop new agricultural crop varieties that address population pressure, bioenergy, food security and climate change. He will be at the Global Summit meeting to present how his team and collaborators at The National Center for Biotechnology Information (NCBI) in Maryland at the National Library of Medicine (NLM) are leveraging the advanced Internet2 infrastructure, including its Advanced Layer 2 Service high-speed connections and perfSONAR network monitoring, to substantially accelerate genomic Big Data transfers and transform researcher collaboration.

As DNA data sets get bigger and bigger, Alex Feltus sees a need to change the way data is stored and transferred. "I hope that as DNA sequencing becomes cheaper it will make more sense to regenerate the data than to store it long term", he stated. "Of course we need bigger boxes, but we also need faster ways to put stuff into them. There is a serious data transfer bottleneck at the network-hard-drive interface. Thus, we need faster, reasonably-priced storage that can keep up with the advanced networks such as the Internet2 Network."

Alex Feltus' supercomputing capabilities at Clemson are best-in-class thanks to the university's high-performance computing (HPC) resource, Palmetto. But, Alex Feltus said, when it comes to collaborating with other research teams across the country, one's compute power is only as good as the connection it's hooked up to - which is where Internet2 comes in. "With the Internet2 Network, I can quickly download more data to Palmetto from public repositories like the National Center for Biotechnology Information and scale up my crop genomics experiments", he stated. "Besides scale up, low latency networks like Internet2 have opened up new possibilities for my research. In collaboration with Melissa Smith's group at Clemson for example, we were able to run GPU-enabled visualization algorithms on the Palmetto cluster at Clemson and beam results to our Supercomputing Conference booth in New Orleans for near real-time visualization of gene interaction networks. You can process data on the fastest nodes in the world, but it's pointless for real-time applications if the supercomputer is hooked up to a slow pipe."

These advanced technology capabilities allow Alex Feltus and his team to focus on their work, rather than attempting to build their own network and computing infrastructure to enable that work.

Across the country, Arizona State University (ASU) is also looking to do just that for its researchers in the College of Life Sciences' Adaptive Complex Systems Science programme, which studies highly interactive and dynamic systems that change over individual and evolutionary time scales, such as epidemics, obesity and cancer.

ASU's response to the Big Data challenge has been to develop what they call the Next Generation Cyber Capability (NGCC) - a "First Generation Data Science Research Instrument" that they liken to instruments such as the Hubble Space Telescope or the Large Hadron Collider. NGCC is already proving useful in the growing area of personalized medicine - specifically tailored disease treatment that takes into account an individual patient's own molecular constitution and that of the disease.

Also called precision medicine, this field of study comes with the challenge of managing and analyzing Big Data related to both genomic information and associated imaging data for each individual. ASU's Director of Operations, Research Computing and Senior HPC Architect Jay Etchings, who will also present at Internet2's Global Summit meeting, said the NGCC is poised to address precision medicine challenges and beyond.

"The potential for patient-focused, precision medicine care roadmaps crystalizes with personalized medicine if we can simply sort out the data", stated Jay Etchings. "Additionally, genomic data is only one of the 'varieties' of large volume Big Data. Diverse clinical observations and patient-reported outcomes also must be integrated and interpreted."

Jay Etchings said enabling this integration of multidimensional molecular and clinical data is where NGCC's new model of computing becomes essential.

"The traditional model of accessing data at the node still has its place in the greater data centre/Cloud infrastructure arena", stated Jay Etchings. "However, discounting many core, software-defined and virtual instances would be in error."

The NGCC's new model marries several essential capabilities for Big Data research. The first is physical capacity, which means being connected to the ultrahigh bandwidth Internet2 Network, having large-scale storage - on the order of 2 Petabytes or more - and integrating multiple types of computation, including utility computing, traditional HPC and new Big Data computing. The second element to NGCC is advanced logical capabilities such as software-defined storage and networking, metadata processing and semantics. The final element is the human factor - transdisciplinary teams of researchers, network engineers and computing professionals working together on the system as a whole.

It's this last capability that both Jay Etchings and Alex Feltus note is essential to the success of their work in Big Data.

"A key aspect is the side of cyberinfrastructure that can't be coded: personal relationships", stated Alex Feltus. "Recently, my collaboration with network and storage researchers and engineers has opened my eyes to innovative possibilities that will impact my research via the human network."

That "human network" will be made possible at the Global Summit meeting, which will provide yet another opportunity for researchers like Alex Feltus and architects like Jay Etchings to check in, collaborate and potentially develop even more meaningful interactions that just may lead to the next big breakthrough in Big Data science.

Presentations include:

  • "Innovations in High-Volume Life Sciences Research", Jay Etchings, Monday, April 27, 3 p.m., Room: Mount Vernon A, Renaissance Washington DC Downtown Hotel.
- "From CC-NIE/IIE/DNI to Building a Cohesive Platform for Collaboration Over Advanced Cyberinfrastructure", Alex Feltus et al., Tuesday, April 28, 8:45 a.m., Room: Mount Vernon A, Renaissance Washington DC Downtown Hotel.
Source: Internet2

Back to Table of contents

Primeur weekly 2015-04-20

Special

Silvana Muscella to look back with satisfaction on a content-rich CloudScape VII event ...

Open standard Intercloud run by industry with global international standard on top of it would be ideal scenario ...

Rüdiger Dorn to present Microsoft's Trusted Cloud Initiative at CloudScape VII ...

The Cloud

ISC Cloud & Big Data keynote will recount bank's path into Cloud computing ...

Say it loud - it's fog and it's Cloud ...

IBM and partners to transform personal health with Watson and Open Cloud ...

Ministry of Health Brazil deploys Oracle Healthcare Master Person Index to improve health data interoperability ...

Desktop Grids

Alantu seeks beta testers ...

DENIS@Home to join BOINC ...

EuroFlash

RSC Group has deployed a massively parallel MVS-10P MP supercomputer based on RSC PetaStream architecture and the highest-performing Intel Xeon Phi 7120D at JSCC RAS ...

CoolIT Systems selected to host Birds-of-a-Feather (BoF) session at ISC 2015 in Frankfurt ...

PRACE Preparatory Access completes 20th cut-off with strong participation of industry ...

We-NMR cluster at Utrecht Bijvoet Center accommodated into Life Science Grid ...

CHAIN-REDS paves way for global e-Infrastructure ecosystem for researchers ...

USFlash

Intel Enterprise Edition for Lustre 2.2 launched ...

Storage boost at the National Computational Infrastructure in Australia ...

CASC announces new Director ...

OLCF selects application readiness projects to prepare for next-generation Summit supercomputer ...

Researchers to Tap NVLink to Power Next-Gen Research on Summit Supercomputer - ...

Computer-designed Rocker protein world's first to biomimic ion transport ...

For Big Data researchers, network and compute capabilities are lynchpin to success ...

Ultra-high-speed broadband bolsters UConn research ...

NIST tightens the bounds on the quantum information 'speed limit' ...

Electrical control of quantum bits in silicon paves the way to large quantum computers ...

Quantum cryptography at the speed of light: Researchers design first all-photonic repeaters ...