IBM researchers unveil genetic breakthrough

IBM researchers unveil genetic breakthrough


Discovery promises to shed light on workings of human genome

IBM researchers have identified numerous DNA patterns shared by areas of the human genome which they believe have been wrongly dismissed as functionless by scientists.

The boffins believe that these regions of the human genome that were assumed largely to contain evolutionary leftovers, or "junk DNA", may actually hold significant clues that can add to scientists' understanding of cellular processes.

The breakthrough occurred after the researchers discovered that these regions contain numerous short DNA "motifs", or repeating sequence fragments which are also present in the parts of the genome that give rise to proteins.

If verified experimentally, the discovery suggests a potential connection between these coding and non-coding parts of the human genome.

IBM said that this could have "a profound impact on genomic research and provide important insights on the workings of cells".

Ajay Royyuru, head of the Computational Biology Center at IBM Research, said: "Our goal is to apply advanced computational techniques to analyse the workings of processes and systems, in this case the function of the human genome.

"Using these tools, we have been able to shed new light on parts of the DNA that were traditionally thought of as not having a specific purpose.

"We believe that the innovative application of technology can provide further understanding in the life sciences at large."

The IBM team used a mathematical tool called pattern-discovery, often employed to mine useful information from very large repositories of data in business and scientific applications, to sift through the approximately six billion letters in the non-coding regions of the human genome and look for repeating sequence fragments, or motifs.

Among the millions of discovered motifs, the team identified approximately 128,000 that also occur in the coding region of the genome and are significantly over-represented in genes involved in specific biological processes such as cell communication, regulation of transcription, transport and others.