CppCon 2014 has ended
Back To Schedule
Thursday, September 11 • 4:45pm - 5:45pm
Gamgee: A C++14 library for genomics data processing and analysis

Log in to save this to your schedule, view media, leave feedback and see who's attending!

Our group has defined the standards for DNA and RNA sequencing data processing and analysis for disease research and clinical applications. In the last 5 years we have published our tools in the GATK (genome analysis toolkit) which is completely written in java. With the scaling of next generation sequencing and the immense amount of that needs to be processed we hit a performance wall and found ourselves limited by the language to make optimizations and rewrite the algorithms in a way that would conform better to modern hardware.

Enter Gamgee. A free and open source C++14 library that offers much of the functionality of the GATK framework with the performance necessary to scale to the hundreds of petabytes of todays complex diseases projects. We will show how the tools developed using the Gamgee library replaced legacy java GATK tools in the production pipeline of the Broad Institute. We will also talk about how the algorithms have changed to take advantage of the native libraries and modern hardware features such as SSE/AVX and GPUs.


Mauricio Carneiro

Group Lead, Computational Technology Development, Broad Institute of MIT and Harvard
Dr. Carneiro leads the computational technology development team at the Broad Institute of MIT and Harvard. He has contributed to major advances in DNA sequencing analysis with compression algorithms, statistical methods, heterogeneous compute optimizations and a systematic approach... Read More →

Thursday September 11, 2014 4:45pm - 5:45pm PDT

Attendees (0)