The storage and data transferring of large genome data are becoming important concerns for biomedical researchers. We present a novel multi-reference based genome compression method with a hierachical structure. Our approach works for the de facto standard alignment format (i.e., BAM) compression that is the pressing need at present. We align new sequences to a reference sequence using SOAP3, a GPU-based aligning software, and summarize mapping properties and information for exact mapped reads. To increase the exact aligning rate, we also realign the approximately mapped and unmapped reads by changing the reference sequence or shortening the read length. Meanwhile, we further the study using “lossy” quality values through k-means clustering scheme and find its minute effect on downstream applications. The proposed method has achieved compression ratios from 0.5 to 0.65, which corresponds to space savings of 35%-50%, on experimental datasets.

Features

  • Efficient
  • Fast
  • Promising

Project Samples

Project Activity

See All Activity >

Categories

Bio-Informatics

License

Academic Free License (AFL)

Follow Hierachical_DNAcoder

Hierachical_DNAcoder Web Site

Other Useful Business Software
Claims Processing solution for healthcare practitioners. Icon
Claims Processing solution for healthcare practitioners.

Very easy to use for medical, dental and therapy offices.

Speedy Claims became the top CMS-1500 Software by providing the best customer service imaginable to our thousands of clients all over America. Medical billing isn't the kind of thing most people get excited about - it is just a tedious task you have to do. But while it will never be a fun task, it doesn't have to be as difficult or time consumimg as it is now. With Speedy Claims CMS-1500 software you can get the job done quickly and easily, allowing you to focus on the things you love about your job, like helping patients. With a simple interface, powerful features to eliminate repetitive work, and unrivaled customer support, it's simply the best HCFA 1500 software available on the market. A powerful built-in error checking helps ensure your HCFA 1500 form is complete and correctly filled out, preventing CMS-1500 claims from being denied.
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Hierachical_DNAcoder!

Additional Project Details

Operating Systems

Linux

Languages

English

Intended Audience

Information Technology, Science/Research, Engineering

Programming Language

C++

Related Categories

C++ Bio-Informatics Software

Registered

2013-06-18