Next-Generation Sequencing (NGS) | Biocomputing @ Synergy Lab.

Next-Generation Sequencing

Next-generation sequencing (NGS) sequences millions of DNA fragments from a single sample concurrently. Analyzing the resulting large-scale data ("big data") requires a series of tools, often organized as a pipeline, that cover short-read mapping (e.g., BWA), variant identification (e.g., Dindel), and variant filtration and refinement.

Locality-Aware Burrows-Wheeler Aligner (LA-BWA)


A locality-aware variant of BWA that "regularizes" memory-access patterns so that lookups make better use of the caches on modern multicore processors, which improves performance.
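
To illustrate what regularizing memory access can mean in practice, here is a minimal C++ sketch. It is not the LA-BWA algorithm itself: `batched_lookup` is a hypothetical helper that stands in for a batch of index lookups, and it assumes the lookups are independent, so they can be processed in any order. The idea shown is only that sorting a batch of lookups by the address they touch turns scattered accesses into a near-sequential sweep.

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <numeric>
#include <vector>

// Hypothetical helper (not part of BWA or LA-BWA): look up table[idx] for
// every idx in `queries`, visiting the table in ascending index order rather
// than arrival order. Results come back in the original query order.
std::vector<std::uint32_t> batched_lookup(const std::vector<std::uint32_t>& table,
                                          const std::vector<std::size_t>& queries) {
    // order[i] is the position (in `queries`) of the i-th lookup to perform.
    std::vector<std::size_t> order(queries.size());
    std::iota(order.begin(), order.end(), std::size_t{0});

    // Sort query positions by the table index they touch, so consecutive
    // lookups land in the same or neighboring cache lines.
    std::sort(order.begin(), order.end(),
              [&](std::size_t a, std::size_t b) { return queries[a] < queries[b]; });

    std::vector<std::uint32_t> results(queries.size());
    for (std::size_t pos : order) {
        results[pos] = table.at(queries[pos]);  // .at() bounds-checks the index
    }
    return results;
}
```

The sort itself costs time, so a reordering like this only pays off when the table is much larger than the cache and the batch is large enough to amortize it.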

Paper

pDindel


A parallel version of Dindel, optimized with thread-level parallelism (OpenMP) and data-level parallelism (SIMD) to improve performance on modern multicore processors.
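
The two levels of parallelism named above can be combined in a single loop nest. The C++ sketch below is a toy, not pDindel's code: `count_mismatches` and `score_reads` are hypothetical names, and a simple mismatch count stands in for Dindel's actual per-read computation. It assumes each read can be scored independently of the others.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Count mismatching bases between a read and a candidate haplotype.
// Hypothetical stand-in for a real per-read scoring function.
int count_mismatches(const std::string& read, const std::string& haplotype) {
    const std::size_t n = read.size() < haplotype.size() ? read.size() : haplotype.size();
    const char* r = read.data();
    const char* h = haplotype.data();
    int mismatches = 0;
    // Data-level parallelism: ask the compiler to vectorize the comparison loop.
    #pragma omp simd reduction(+:mismatches)
    for (std::size_t i = 0; i < n; ++i) {
        mismatches += (r[i] != h[i]) ? 1 : 0;
    }
    return mismatches;
}

// Score every read against one haplotype.
std::vector<int> score_reads(const std::vector<std::string>& reads,
                             const std::string& haplotype) {
    std::vector<int> scores(reads.size());
    const long long n = static_cast<long long>(reads.size());
    // Thread-level parallelism: reads are independent, so each thread
    // scores its own slice and writes to distinct elements of `scores`.
    #pragma omp parallel for schedule(static)
    for (long long i = 0; i < n; ++i) {
        const std::size_t k = static_cast<std::size_t>(i);
        scores[k] = count_mismatches(reads[k], haplotype);
    }
    return scores;
}
```

With GCC or Clang, the pragmas take effect when compiling with `-fopenmp`; without it they are ignored and the code runs serially with the same results.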

Paper

SeqInCloud


A highly scalable, cloud-based implementation of the Genome Analysis Toolkit (GATK) that has been further accelerated with big-data software (e.g., Hadoop) and algorithmic refactoring (e.g., loci-based).
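
As a rough picture of what a loci-based decomposition might look like, the C++ sketch below groups aligned reads into fixed-width genomic windows so that each window can be handed to a separate worker, MapReduce style. This is an assumption-laden illustration, not SeqInCloud's scheme: the `Read` record, the `LocusKey` type, the `partition_by_locus` function, and the fixed window width are all hypothetical.

```cpp
#include <cstdint>
#include <map>
#include <string>
#include <utility>
#include <vector>

// Minimal aligned-read record; the fields are hypothetical.
struct Read {
    std::string chrom;
    std::uint64_t pos;  // leftmost mapped position (0-based by assumption)
    std::string name;
};

// Partition key: (chromosome, window index).
using LocusKey = std::pair<std::string, std::uint64_t>;

// "Map" step: assign each read to the fixed-width window that contains its
// start position. Each resulting group can then be processed independently.
std::map<LocusKey, std::vector<Read>> partition_by_locus(const std::vector<Read>& reads,
                                                         std::uint64_t window) {
    std::map<LocusKey, std::vector<Read>> parts;
    if (window == 0) {
        return parts;  // avoid division by zero on a degenerate window width
    }
    for (const Read& r : reads) {
        LocusKey key(r.chrom, r.pos / window);
        parts[key].push_back(r);
    }
    return parts;
}
```

A real pipeline would also need to handle reads that span a window boundary, for example by overlapping adjacent windows; this sketch ignores that case.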

Download Paper Download GATK 1.6

FastR


A GPU-accelerated C++ version of IndelRealigner, a GATK tool from the Broad Institute that was originally written in Java.

Download