Story Summary: DNA sequences are not random: they contain repeating sections, palindromes, and other features that could be represented by fewer bits than are required to spell out the complete sequence in binary. Many computer users are familiar with lossy compression software that removes redundant data from a music file to produce an mp3, or from an image to make a jpg. Less familiar to many users are lossless compression methods such as FLAC for sound files, TIFF for images, and the zip format for documents and other files. LCA achieves a better compression ratio than existing compression algorithms for DNA, such as GenCompress, DNACompress, and DNAPack, the team says. The same approach could also be used for protein sequences. The team is now investigating ways in which the results of the compression might be used to differentiate between sections of a DNA sequence that code for proteins and those that do not, the so-called non-coding regions.
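
The point about redundancy can be illustrated with a small sketch. It is not the LCA method described in the story, whose details are not given here. It assumes a naive baseline of 2 bits per base (A, C, G, T) and uses Python's general-purpose zlib compressor as a stand-in for a lossless algorithm. The function name pack_2bit and the two test sequences are made up for illustration.

```python
import random
import zlib


def pack_2bit(seq: str) -> bytes:
    """Naive baseline: store each base in 2 bits, four bases per byte."""
    code = {"A": 0, "C": 1, "G": 2, "T": 3}
    out = bytearray()
    for i in range(0, len(seq), 4):
        chunk = seq[i:i + 4]
        byte = 0
        for base in chunk:
            byte = (byte << 2) | code[base]
        # Left-align a final partial chunk so every byte has the same layout.
        byte <<= 2 * (4 - len(chunk))
        out.append(byte)
    return bytes(out)


# Hypothetical inputs: one sequence with no structure, one built from a repeat.
rng = random.Random(0)
random_seq = "".join(rng.choice("ACGT") for _ in range(4000))
repeat_seq = "ACGTTGCA" * 500  # 4000 bases of a repeated 8-base unit

for name, seq in (("random", random_seq), ("repetitive", repeat_seq)):
    packed = pack_2bit(seq)               # 2 bits per base
    squeezed = zlib.compress(packed, 9)   # lossless, general-purpose
    print(f"{name}: {len(seq)} bases, {len(packed)} bytes packed, "
          f"{len(squeezed)} bytes after zlib")
```

Both sequences occupy the same space at 2 bits per base, but only the repetitive one shrinks much further under lossless compression. That structure is what a DNA-specific compressor tries to exploit.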







