Speaker and Content Identification
Telefonica Research –
In this talk I will cover two of the topics I have been recently working on. On the one hand, with regard to speaker identification, I will introduce the use of binary fingerprints to model the voice of a speaker. Based on the projection of standard acoustic vectors into a special GMM model (representing the speaker acoustic space), high-dimensional binary vectors, which have been proven successful in identifying speakers for speaker verification and diarization tasks, are obtained. On the other hand, I will talk about current developments in pattern matching approaches that allow for the development of content-centric applications when little or no training data is available for a particular language. In particular, I will describe a query-by-example system I presented to Mediaeval 2011 evaluation, which uses a novel feature extraction front-end I further described in a paper at ICASSP 2012.
Date: 2012-Apr-13 Time: 15:00:00 Room: 336
For more information:
INESC-ID ESR Talks – February 2023
If you are a masters/PhD student or a postdoctoral fellow, come and present your work in an informal and friendly environment – and savour some tasty snacks!
Individual talks will be 10-15 minutes plus time for feedback. Enroll on your selected date by emailing pedro.ferreira[at]inesc-id.pt.
Happening on the second Wednesday of every month (4pm-5pm):
- 15 February (Alves Redol, Room 9)
- 15 March (Alves Redol, Room 9)
- 12 April (Alves Redol, Room 9)
- 10 May (Alves Redol, Room 9)
- 14 June (Alves Redol, Room 9)
- 12 July (Alves Redol, Room 9)
We hope to see you there!