Back to Table of contents

Primeur weekly 2011-10-24

Special

Interview with Tomi Ilijas from Arctur HPC centre in Slovenia ...

Exascale supercomputing

Fujitsu releases world's highest-performance file system ...

The Cloud

Bull announces bullion Cloud platform: the fast, cost-effective solution to build simple and secure private Clouds ...

Bull launches Le Cloud by Bull: a strategic approach, to achieve a smooth transition to the enterprise Cloud ...

IBM introduces new systems to accelerate smarter computing ...

HP speeds clients' Cloud evolution ...

New VMAXe enhancements deliver more powerful, trusted and efficient enterprise storage for VMware environments ...

VMware unveils management portfolio for the Cloud era ...

American Diversified Holdings Corporation Cloud networking subsidiary Rebel Networks to substantially expand operating capacity ...

Joint integration of EMC VPLEX and VMware vSphere delivers higher levels of availability and workload mobility ...

HP provides customers affordable route to Cloud with financing offer ...

Building a Cloud for High Performance Computing with OpenNebula ...

IEEE/ACM Utility and Cloud Computing Conference to issue Call for Participation ...

Desktop Grids

University of Westminster to hire reseach assistant in distributed, Grid and Cloud computing ...

EuroFlash

CURIE: the first large scale hybrid system available in PRACE ...

Dr. Maria Ramalho took appointment as PRACE Managing Director ...

Ryohin Keikaku expands HiQube deployment throughout Japanese retail store chain ...

Lightning strikes in the form of bits and bytes ...

UPC scientists perform first 3D simulations of nova explosions ...

USFlash

HP doubles customer base of 3PAR Utility Storage and scores world-record benchmark ...

Magnifying research: Scientists team together to upgrade supercomputer ...

SDSC and Calit2 awarded $1.4 million NSF grant for new bioinformatics tools ...

Indiana University awarded $1.5 million NSF grant for new national centre to support human genome research ...

Exa's PowerFLOW supports Cray XE6 supercomputers with excellent scalability to thousands of cores ...

Thomas Bogdan named president of University Corporation for Atmospheric Research ...

CCGrid 2012 to issue Call for Papers ...

Universities select HP to get to head of the class ...

Vertica announces Community Edition version of Vertica Analytic Database ...

SDSC and Calit2 awarded $1.4 million NSF grant for new bioinformatics tools

18 Oct 2011 San Diego - Researchers at the San Diego Supercomputer Center (SDSC) and the California Institute for Telecommunications and Information Technology (Calit2) at the University of California, San Diego, have been awarded a three-year, $1.4 million grant from the National Science Foundation (NSF) to create a Kepler Scientific Workflow System module. Researchers will develop new tools to help manage ever-growing data sets used in next-generation DNA sequencing.

"Next-generation DNA sequencing is now creating such a large amount of sequence data that it is overwhelming current computational tools and resources", stated Ilkay Altintas, director of the Scientific Workflow Automation Technologies (SWAT) Lab within SDSC's Cyberinfrastructure Research, Education And Development (CI-RED) group, and Principal Investigator for the project. "New computational techniques and efficient implementation mechanisms for this data-intensive workload are needed to enable rapid analysis of these next-generation sequence data."

The project receiving the NSF award is called "Advances in Biological Informatics Development: bioKepler: A Comprehensive Bioinformatics Scientific Workflow Module for Distributed Analysis of Large-Scale Biological Data". Bioinformatics refers to a field of science that combines biology, information technology, computers and statistical techniques to create research-driven solutions such as customized medications and treatments to help prevent disease, three-dimensional models of genomes and proteins, and advanced agricultural technologies.

"The enormous growth in data-intensive research means that as these data sets get larger, moving data over the network becomes more complicated, error-prone and costly to maintain", stated Ilkay Altintas, who also serves as SDSC's deputy coordinator for research.

The bioKepler project is motivated by the following three challenges that remain unsolved:

  • How can large-scale sequencing data be analyzed systematically in a way that incorporates and enables reuse of best practices by the scientific community?
  • How can such analysis be easily configured or programmed by end-users with various skill levels to formulate actual bioinformatics work flows?
  • How can such workflows be executed in computing resources available to scientists in an efficient and intuitive manner?

To create such an environment, the bioKepler project will create scientific work flow components to develop an array of bioinformatics tools using distributed execution techniques. Once customized, these components will be used on multiple distributed platforms, including various Cloud and Grid computing platforms. The tools will be selected to meet the diverse needs of researchers, and organized into eight groups covering most aspects of bioinformatics applications: sequence database searches; mapping; sequence assembly; gene prediction; clustering; multiple sequence alignment, phylogeny and taxonomy; protein annotation; and other miscellaneous utilities such as data format transformation and parsing.

"These tools will be applicable to a wide range of bioinformatics and computational biology problems", stated Ilkay Altintas, noting that "a key part of this project will also focus on education and outreach efforts, underscoring the importance of training next-generation scientists, as well as the need to narrow the gap between bioinformatics and technology."

All the resources, materials, and open-source software products produced by the bioKepler project will be integrated with Calit2's Community Cyberinfrastructure for Advanced Microbial Ecology Research and Analysis (CAMERA), a data repository and a bioinformatics resource for metagenomic analysis.

"The Kepler workflow system has already been used comprehensively in the CAMERA project", stated project co-investigator Weizhong Li, a research scientist at Calit2 and the Center for Research in Biological Systems (CRBS), and Bioinformatics group leader for CAMERA. "With the proposed developments in bioKepler, the CAMERA project and its large user communities will benefit from a larger set of next generation sequence analysis tools with much better scalability and flexibility. Other projects that heavily rely on next-generation sequencing, such as various microbiome projects, can also take advantage of the bioKepler software."

Moreover, bioKepler will be packaged to be installed on diverse, distributed execution environments (e.g., as a Web service and as virtual machines tuned for various Grid and Cloud systems), which in turn will enable deployment of bioKepler on public and private clusters and Clouds.

In addition to Ilkay Altintas and Weizhong Li, the bioKepler research team includes Eric E. Allen, assistant professor of marine biology at the Scripps Oceanography Institute (SIO); Jianwu Wang, project scientist with SWAT; Daniel Crawl, work flow specialist with SWAT; and Shulei Sun and Sitao Wu, bioinformaticians at CRBS.

The bioKepler project is funded by NSF DBI-1062565 under the CI Reuse and Advances in Bioinformatics programmes.
Source: San Diego Supercomputer Center

Back to Table of contents

Primeur weekly 2011-10-24

Special

Interview with Tomi Ilijas from Arctur HPC centre in Slovenia ...

Exascale supercomputing

Fujitsu releases world's highest-performance file system ...

The Cloud

Bull announces bullion Cloud platform: the fast, cost-effective solution to build simple and secure private Clouds ...

Bull launches Le Cloud by Bull: a strategic approach, to achieve a smooth transition to the enterprise Cloud ...

IBM introduces new systems to accelerate smarter computing ...

HP speeds clients' Cloud evolution ...

New VMAXe enhancements deliver more powerful, trusted and efficient enterprise storage for VMware environments ...

VMware unveils management portfolio for the Cloud era ...

American Diversified Holdings Corporation Cloud networking subsidiary Rebel Networks to substantially expand operating capacity ...

Joint integration of EMC VPLEX and VMware vSphere delivers higher levels of availability and workload mobility ...

HP provides customers affordable route to Cloud with financing offer ...

Building a Cloud for High Performance Computing with OpenNebula ...

IEEE/ACM Utility and Cloud Computing Conference to issue Call for Participation ...

Desktop Grids

University of Westminster to hire reseach assistant in distributed, Grid and Cloud computing ...

EuroFlash

CURIE: the first large scale hybrid system available in PRACE ...

Dr. Maria Ramalho took appointment as PRACE Managing Director ...

Ryohin Keikaku expands HiQube deployment throughout Japanese retail store chain ...

Lightning strikes in the form of bits and bytes ...

UPC scientists perform first 3D simulations of nova explosions ...

USFlash

HP doubles customer base of 3PAR Utility Storage and scores world-record benchmark ...

Magnifying research: Scientists team together to upgrade supercomputer ...

SDSC and Calit2 awarded $1.4 million NSF grant for new bioinformatics tools ...

Indiana University awarded $1.5 million NSF grant for new national centre to support human genome research ...

Exa's PowerFLOW supports Cray XE6 supercomputers with excellent scalability to thousands of cores ...

Thomas Bogdan named president of University Corporation for Atmospheric Research ...

CCGrid 2012 to issue Call for Papers ...

Universities select HP to get to head of the class ...

Vertica announces Community Edition version of Vertica Analytic Database ...