Back to Table of contents

Primeur weekly 2020-05-25

Focus

InsideHPC's Rich Brueckner passes away at 58 ...

Only half of the EuroHPC countries confirmed they will participate in co-funding EuroHPC projects ...

Quantum computing

Atos and CSC empower the Finnish quantum research community with Atos Quantum Learning Machine ...

Quantum Hall effect 'reincarnated' in 3D topological materials ...

University of California Los Angeles physicists develop world's best quantum bits ...

Quantum leap: Photon discovery is a major step toward at-scale quantum technologies ...

Focus on Europe

HPC-AI Advisory Council and ISC Group continue Annual Student Cluster Competition as an online event ...

ELASTIC software architecture advances urban mobility in Florence ...

Middleware

Waste energy of LUMI supercomputer produces 20 percent of the district heat of Kajaani ...

Italy-based Do IT Systems becomes Bright Services Partner ...

Hardware

Microsoft claims to have built new supercomputer for OpenAI ...

New York University and IBM research takes electrons for a spin in moving toward more efficient, higher density data ...

GRC wins Phase I of AFWERX Programme ...

Samsung Electronics expands its foundry capacity with a new production line in Pyeongtaek, Korea ...

HPE reports Q2 results ...

Intersect360 Research invites your input in an all new survey ...

Applications

Scientists seeking therapies for deadly bacterial disease ...

2020 OLCF User Meeting will happen remote together ...

Summit supercomputer is mining for COVID-19 connections ...

A new method for unraveling complex gene interactions ...

ACM honours computing innovators for advances in research, education and industry ...

Discovery about the edge of fusion plasma could help realize fusion power ...

Superconductors with 'zeitgeist' - When materials differentiate between past and future ...

NIST team builds hybrid quantum system by entangling molecule with atom ...

BIOS IT partners with Panasas to deliver ActiveStor Ultra turnkey HPC storage appliances ...

Supercomputer model simulations reveal cause of Neanderthal extinction ...

Pittsburgh Supercomputing Center is supporting COVID-19 research ...

The Cloud

New technique separates industrial noise from natural seismic signals ...

Lyell Immunopharma goes all-in on AWS as its Cloud provider ...

WekaIO furthers Weka AI by integrating with deep learning pipeline management solution from Valohai ...

A new method for unraveling complex gene interactions


Clustering SNP-to-phenotype associations (A & B) allows for the kinds of complex patterns in C.
18 May 2020 Oak Ridge - The first step for biologists who want to develop new, more efficient biofuels is to understand the genetic underpinnings of plants that can be digested by microbes into chemical compounds. One species in particular, the black cottonwood tree, or Populus trichocarpa (poplar tree), has long been a focus of scientists interested in bioenergy.
In A, the module (yellow) shows that both phenotypes (purple) interact with the same SNP. A researcher would be unable to modify the gene without affecting both phenotypes. In B, the presence of two separate modules indicates that modifying one SNP will only affect the associated phenotype.

To maximize poplar's potential as a biofuel, researchers map the interactions between genes and expressed traits, or phenotypes, in order to predict how desired traits can be expressed based on the subject's genetic make-up. With this knowledge, researchers can breed and genetically modify crops to maximize their potential as a biofuel.

However, this relationship between genes and their associated phenotypes often becomes complicated by phenomena like pleiotropy, which occurs when one gene is associated with two or more seemingly unrelated traits.

To unravel these complex interactions, Deborah Weighill and Daniel Jacobson enlisted the help of the Titan supercomputer at the Oak Ridge Leadership Computing Facility (OLCF), a US Department of Energy (DOE) Office of Science User Facility at DOE’s Oak Ridge National Laboratory (ORNL).

Working with the ORNL's Center for Bioenergy Innovation (CBI), Deborah Weighill and her team used the now-decommissioned Cray XK7 supercomputer as well as the OLCF's Eos cluster to analyze the results of a genome-wide association study (GWAS) which used genomic variants from the sequenced genomes of around 1000 poplar trees to look for pleiotropic signatures - signatures of genes involved in multiple functions or associated with multiple different phenotypes.

In the study, the team developed and used a new process called multi-phenotype association (MPA) decomposition, a network-based method for characterizing the complex relationship between genetic variants and their associated phenotypes.

The study looked at the way single nucleotide polymorphisms (SNPs, pronounced "snips"), which are substitutions of a single base pair in DNA, are correlated with phenotypes and then clustered them based on these associated phenotypes. Specifically, the study mapped the relationship between SNPs and hundreds of individual metabolite levels.

"So, SNPs represent variants across our population of poplar trees", Deborah Weighill stated. "We cluster the SNPs so that what we get are groups of mutations that are grouped together if they're associating with the same phenotypes in our poplar population."

This clustering process involved taking a massive matrix of SNP-to-phenotype associations from the GWAS results and calculating the similarities between all pairs of SNPs.

Associating a gene with its multiple phenotypes helps to identify pleiotropy, but it doesn't show the full genetic picture.

"In a gene-to-phenotype network", Deborah Weighill stated, "all we can see about genes is that they are connected to multiple phenotypes each. So, we can see they are pleiotropic in nature because they affect multiple phenotypes based on our GWAS results, but we can't tell anything about which variants within that gene are affecting the phenotypes or what type of signature it is."

To understand the topology of different phenotype associations within each gene, the MPA decomposition process forms what is called an association module, a discrete pairing of a single SNP and any phenotypes associated with that SNP.

"Modules allow us to a construct a new space, what we call powerset space", Deborah Weighill stated. "When we map genes into this space, it unravels the type of signature that the SNPs within that gene have. So that's what MPA association is trying to do. It computationally and mathematically characterizes the different types of SNP-to-phenotype patterns so we can cluster genes not only based on what phenotypes they're associating with but on the SNP-to-phenotype association pattern."

Visualizing MPA signatures in the powerset space is important for researchers attempting gene modification experiments because they can gain an understanding of which genes could be potentially altered without affecting other phenotypes. As an added benefit, through MPA decomposition, biologists have a method for mining massive datasets to create numerically interpretable information from which they can derive hypotheses.

"Our aim is to help generate specific hypotheses that will move the field of bioenergy forward", Deborah Weighill stated. "We want to take data and then analyze it and encourage it into a format so that we can work together with biologists to formulate specific hypotheses on how to improve the poplar tree to be a more favorable bioenergy resource or identify new genes that are involved in functions that are of interest to the bioenergy research at CBI."

The method developed by Weighill is not species specific, meaning that it can be applied to any organism as long as the mutations and phenotypes across its population have been measured. In the future, MPA decomposition can be used by researchers to extract meaningful information with enormous implications for biofuel development.

Also participating in the study were individuals from the University of Tennessee's (UTK) Bredesen Center, ORNL's Biosciences Division, UTK's Department of Plant Sciences, West Virginia University's Department of Biology, the DOE Joint Genome Institute, and HudsonAlpha Institute for Biotechnology.

Computing time was awarded through the OLCF Director's Discretion program as well as the DOE INCITE programme. Support for the Poplar GWAS dataset was provided by ORNL’s BioEnergy Science Center and the CBI, which are funded by the DOE Office of Science. The research also used resources from the ORNL Compute and Data Environment for Science (CADES) facility.

D. Weighill, P. Jones, C. Bleker, P. Ranjan, M. Shah, N. Zhao, M. Martin, S. DiFazio, D. Macaya-Sanz, J. Schmutz, A. Sreedasyam, T. Tschaplinski, G. Tuskan, and D. Jacobson are the authors of the paper titled "Multi-Phenotype Association Decomposition: Unraveling Complex Gene-Phenotype Relationships". The study has been published inFrontiers in Genetics10 (2019): 417, doi:10.3389/fgene.2019.00417.
Source: Oak Ridge Leadership Computing Facility - OLCF

Back to Table of contents

Primeur weekly 2020-05-25

Focus

InsideHPC's Rich Brueckner passes away at 58 ...

Only half of the EuroHPC countries confirmed they will participate in co-funding EuroHPC projects ...

Quantum computing

Atos and CSC empower the Finnish quantum research community with Atos Quantum Learning Machine ...

Quantum Hall effect 'reincarnated' in 3D topological materials ...

University of California Los Angeles physicists develop world's best quantum bits ...

Quantum leap: Photon discovery is a major step toward at-scale quantum technologies ...

Focus on Europe

HPC-AI Advisory Council and ISC Group continue Annual Student Cluster Competition as an online event ...

ELASTIC software architecture advances urban mobility in Florence ...

Middleware

Waste energy of LUMI supercomputer produces 20 percent of the district heat of Kajaani ...

Italy-based Do IT Systems becomes Bright Services Partner ...

Hardware

Microsoft claims to have built new supercomputer for OpenAI ...

New York University and IBM research takes electrons for a spin in moving toward more efficient, higher density data ...

GRC wins Phase I of AFWERX Programme ...

Samsung Electronics expands its foundry capacity with a new production line in Pyeongtaek, Korea ...

HPE reports Q2 results ...

Intersect360 Research invites your input in an all new survey ...

Applications

Scientists seeking therapies for deadly bacterial disease ...

2020 OLCF User Meeting will happen remote together ...

Summit supercomputer is mining for COVID-19 connections ...

A new method for unraveling complex gene interactions ...

ACM honours computing innovators for advances in research, education and industry ...

Discovery about the edge of fusion plasma could help realize fusion power ...

Superconductors with 'zeitgeist' - When materials differentiate between past and future ...

NIST team builds hybrid quantum system by entangling molecule with atom ...

BIOS IT partners with Panasas to deliver ActiveStor Ultra turnkey HPC storage appliances ...

Supercomputer model simulations reveal cause of Neanderthal extinction ...

Pittsburgh Supercomputing Center is supporting COVID-19 research ...

The Cloud

New technique separates industrial noise from natural seismic signals ...

Lyell Immunopharma goes all-in on AWS as its Cloud provider ...

WekaIO furthers Weka AI by integrating with deep learning pipeline management solution from Valohai ...