May 18, 2016

Researchers develop new way to decode large amounts of biological data

by University of Maryland School of Medicine

In recent years, the amount of genomic data available to scientists has exploded. With faster and cheaper techniques increasingly available, hundreds of plants, animals and microbes have been sequenced in recent years. However, this ever-expanding trove of genetic information has created a problem: how can scientists quickly analyze all of this data, which could hold the key to better understanding many diseases, and solving other health and environmental issues.

Now, two researchers have developed an innovative computing technique that, on very large amounts of data, is both faster and more accurate than current methods. To spur research, a program using this technique is being offered for free to the biomedical research community.

"This is a whole new approach, with multiple opportunities for further development," said Andrew F. Neuwald, PhD, Professor of Biochemistry & Molecular Biology at the Institute for Genome Sciences (IGS) at the University of Maryland School of Medicine.

A description of the new method was published today in PLOS Computational Biology. Dr. Neuwald collaborated on the work with Stephen F. Altschul, PhD, a senior investigator at the National Center for Biotechnology Information at the National Institutes of Health.

Genomic sequence data encodes information regarding the structure and function of proteins, which comprise the basic cellular machinery and thus determine the structure and function of all microbes, plants and animals.

The new program is called GISMO, an acronym for "Gibbs Sampler for Multi-Alignment Optimization". Gibbs sampling, a statistical technique for solving highly complex problems, is a central feature of the approach. In this case, sampling is used to find biological signals - relevant patterns that can help scientists better understand how organisms work. Neuwald says the approach improves upon conventional sequence alignment programs, which, unlike GISMO, can easily mistake random patterns in the data for biologically valid signals.

Current widely-used methods typically compare each sequence to every other sequence; this takes a prohibitively long time to compute for sets of a hundred thousand or more related protein sequences, which are now available for analysis. Neuwald describes these methods as "bottom up." He and Dr. Altschul developed a technique that is "top down"; instead of comparing sequences to each other, it compares each sequence to an evolving statistical model. This approach is not only faster, but is also better at finding biologically relevant signals, which can, for example, help researchers unravel the mechanisms underlying cancer and inherited diseases. This technique becomes progressively faster than other methods as the size of the data set becomes larger.

Dr. Neuwald has a varied background, in molecular biology, computer science and Bayesian statistics and has been working on this technique for years. Dr. Altschul, whose formal training is in mathematics, was the first author on two landmark publications describing the popular sequence database search programs BLAST and PSIBLAST. They confirmed GISMO's superior performance on large, diverse sequence sets by testing it against five widely used conventional methods. Dr. Neuwald is excited about GISMO's potential: "Because researchers have been finding ways to speed up and improve conventional methods for decades and because GISMO takes such a new and different approach, I am confident that we can make GISMO even faster and more accurate going forward."

Journal information: PLoS Computational Biology

Provided by University of Maryland School of Medicine

Citation: Researchers develop new way to decode large amounts of biological data (2016, May 18) retrieved 18 April 2024 from https://phys.org/news/2016-05-decode-large-amounts-biological.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Does order matter in protein sequence alignment?

18 shares

Feedback to editors

Researchers develop new way to decode large amounts of biological data

Skyrmions move at record speeds: A step towards the computing of the future

A third of China's urban population at risk of city sinking, new satellite data shows

Novel material supercharges innovation in electrostatic energy storage

Scientists discover forests that may resist climate change

Invasive species sound off about impending ecosystem changes

Materials follow the 'Rule of Four,' but scientists don't know why yet

Drawing a line back to the origin of life: Graphitization could provide simplicity scientists are looking for

Hubble goes hunting for small main belt asteroids

Dense network of seismometers reveals how the underground ruptures

Scientists grow human mini-lungs as animal alternative for nanomaterial safety testing

Relevant PhysicsForums posts

Can four legged animals drink from beneath their feet?

Mold in Plastic Water Bottles? What does it eat?

Dolphins don't breathe through their esophagus

Is this egg-laying or something else?

Color Recognition: What we see vs animals with a larger color range

How to Implement Beamforming in Ultrasound Diffraction Tomography

Does order matter in protein sequence alignment?

An algorithm is sped up to predict harmful effects from specific gene mutations

Scientists re-imagine how genomes are assembled

Search technique helps researchers find DNA sequences in minutes rather than days

A faster sequence homology search algorithm based on database subsequence clustering

Improved method for protein sequence comparisons is faster, more accurate, sensitive

Linking environmental influences, genetic research to address concerns of genetic determinism of human behavior

40 years of crop research shows inequities

AI-generated disproportioned rat genitalia makes its way into peer-reviewed journal

Unpacking social equity from biodiversity data: An interdisciplinary policy perspective

A whiff of tears reduces male aggression, says study

Solicitor in 19th-century Tasmania traded human Aboriginal remains for scientific accolades, study reveals

Medical Xpress

Tech Xplore

Science X

Researchers develop new way to decode large amounts of biological data

Skyrmions move at record speeds: A step towards the computing of the future

A third of China's urban population at risk of city sinking, new satellite data shows

Novel material supercharges innovation in electrostatic energy storage

Scientists discover forests that may resist climate change

Invasive species sound off about impending ecosystem changes

Materials follow the 'Rule of Four,' but scientists don't know why yet

Drawing a line back to the origin of life: Graphitization could provide simplicity scientists are looking for

Hubble goes hunting for small main belt asteroids

Dense network of seismometers reveals how the underground ruptures

Scientists grow human mini-lungs as animal alternative for nanomaterial safety testing

Relevant PhysicsForums posts

Related Stories

Does order matter in protein sequence alignment?

An algorithm is sped up to predict harmful effects from specific gene mutations

Scientists re-imagine how genomes are assembled

Search technique helps researchers find DNA sequences in minutes rather than days

A faster sequence homology search algorithm based on database subsequence clustering

Improved method for protein sequence comparisons is faster, more accurate, sensitive

Recommended for you

Linking environmental influences, genetic research to address concerns of genetic determinism of human behavior

40 years of crop research shows inequities

AI-generated disproportioned rat genitalia makes its way into peer-reviewed journal

Unpacking social equity from biodiversity data: An interdisciplinary policy perspective

A whiff of tears reduces male aggression, says study

Solicitor in 19th-century Tasmania traded human Aboriginal remains for scientific accolades, study reveals

Newsletter sign up

Donate and enjoy an ad-free experience