Parallel efficient alignment of reads for re-sequencing applications
D – Departamento de Engenharia Informática, Instituto Superior Técnico, Universidade de Lisboa –
In bioinformatics, in the context of resequencing projects,
the e cient and accurate mapping of reads to a reference
genome is a critical problem. One instance of this problem
is the local alignment of pyrosequencing reads produced
by the 454 GS FLX system against a reference sequence,
an instance for which the software tool TAPyR (Tool for
the Alignment of Pyrosequencing Reads) was developed.
TAPyR implements a methodology to e ciently solve this
problem, which proved to yield results of a quality (both in
terms of content and execution speed) higher than those of
mainstream applications. With the goal of further improving
this platforms results, we produced a parallel implementation
of the query and reference sequence access procedures
of the original version. Through the use of multithreading,
this new version, P-TAPyR, produces considerable
reductions in the processing time of queries, scaling with
the amount of hardware-supported threads (not accounting
for hyper-threading) available. For larger data sets, we
were able to observe running times roughly 26 times faster
than serial execution with 30 executing threads, showing
an experimental (progressively-decreasing) execution serial
fraction of 0.8% (determined by the Karp-Rabin Metric described
in a posterior section). Herein we present the modi
cations made to this software tool to allow for parallel
querying of reads against an indexed reference which, scales
proportionally to the amount of available physical cores.
Date: 2013-Sep-26 Time: 11:00:00 Room: 336
For more information:
INESC-ID ESR Talks – February 2023
If you are a masters/PhD student or a postdoctoral fellow, come and present your work in an informal and friendly environment – and savour some tasty snacks!
Individual talks will be 10-15 minutes plus time for feedback. Enroll on your selected date by emailing pedro.ferreira[at]inesc-id.pt.
Happening on the second Wednesday of every month (4pm-5pm):
- 15 February (Alves Redol, Room 9)
- 15 March (Alves Redol, Room 9)
- 12 April (Alves Redol, Room 9)
- 10 May (Alves Redol, Room 9)
- 14 June (Alves Redol, Room 9)
- 12 July (Alves Redol, Room 9)
We hope to see you there!