Haplotype Assembly
Nadia Pisanti,
Universitá di Pisa –
Abstract:
The human genome is diploid, which requires to assign heterozygous single nucleotide polymorphisms (SNPs) to the two copies of the genome. The resulting haplotypes, lists of SNPs belonging to each copy, are crucial for downstream analyses in population genetics. Currently, statistical approaches, which are oblivious to direct read information, constitute the state-of-the-art. Haplotype assembly, which addresses phasing directly from sequencing reads, suffers from the fact that sequencing reads of the current generation are too short to serve the purposes of genome-wide phasing.
While future-technology sequencing reads will contain sufficient amounts of SNPs per read for phasing, they are also likely to suffer from higher sequencing error rates.
I will describe WhatsHap, the first approach that yields provably optimal solutions to the weighted minimum error correction problem in runtime linear in the number of SNPs. WhatsHap is a fixed parameter tractable (FPT) approach with coverage as the parameter. We demonstrate that WhatsHap can handle datasets of coverage up to 15x, and that 15x are generally enough for reliably phasing long reads, even at significantly elevated sequencing error rates.
I will then show some theoretical results on the optimization problem that lead to HapCol, a fixed parameter algorithm and tool with the number of errors as the parameter. HapCol can handle coverage higher than WhatsHap while being more sensible to the error rate.
Bio
http://pages.di.unipi.it/pisanti/
Date: 2016-Jun-29 Time: 14:00:00 Room: 408
For more information:
- luis.russo@tecnico.ulisboa.pt
- 21 31 00272
Upcoming Events
INESC-ID ESR Talks – February 2023

If you are a masters/PhD student or a postdoctoral fellow, come and present your work in an informal and friendly environment – and savour some tasty snacks!
Individual talks will be 10-15 minutes plus time for feedback. Enroll on your selected date by emailing pedro.ferreira[at]inesc-id.pt.
Happening on the second Wednesday of every month (4pm-5pm):
- 15 February (Alves Redol, Room 9)
- 15 March (Alves Redol, Room 9)
- 12 April (Alves Redol, Room 9)
- 10 May (Alves Redol, Room 9)
- 14 June (Alves Redol, Room 9)
- 12 July (Alves Redol, Room 9)
We hope to see you there!