Phone Recognition and Language Modeling for Variety Identification
Oscar Koller, INESC-ID
Abstract:
This talk introduces the phonotactic approach “Phone Recognition and Language Modeling” (PRLM) for language and variety identification. After a detailed look at this token-based method, I will present a specialized phone recognizer that distinguishes African Portuguese from European Portuguese with high accuracy. In contrast to other PRLM-based methods, the tokenizer incorporates knowledge about the differences between the target varieties. This knowledge is introduced into an MLP phone recognizer by training the two varieties’ mono-phonemes as contrasting phoneme-like classes within a single tokenizer. The approach achieved significant improvements in identification rate and computational cost over conventional single-tokenizer PRLM systems and over combinations of up to five parallel PRLM identifiers.
Date: 2010-Feb-19 Time: 14:00:00 Room: 336
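For readers unfamiliar with PRLM, the general idea in the abstract can be sketched as follows: a phone recognizer turns speech into a sequence of phone tokens, one n-gram language model per variety is trained on such sequences, and a test utterance is assigned to the variety whose model scores it highest. The sketch below is only an illustration under stated assumptions, not the system presented in the talk: the phone recognizer is skipped entirely (sequences are given as lists of symbols), the bigram order and add-one smoothing are choices made here for brevity, and the function names, variety labels, and toy phone strings are all hypothetical.

```python
import math
from collections import Counter

def train_bigram_lm(sequences, vocab):
    """Return an add-one-smoothed bigram log-probability function
    estimated from a list of phone-token sequences."""
    bigrams, contexts = Counter(), Counter()
    for seq in sequences:
        padded = ["<s>"] + list(seq)  # "<s>" marks the utterance start
        for prev, cur in zip(padded, padded[1:]):
            bigrams[(prev, cur)] += 1
            contexts[prev] += 1
    v = len(vocab)

    def logprob(prev, cur):
        return math.log((bigrams[(prev, cur)] + 1) / (contexts[prev] + v))

    return logprob

def score(logprob, seq):
    """Average bigram log-likelihood of a phone sequence under one model."""
    padded = ["<s>"] + list(seq)
    total = sum(logprob(p, c) for p, c in zip(padded, padded[1:]))
    return total / max(len(seq), 1)

def identify(models, seq):
    """Pick the variety whose language model scores the sequence highest."""
    return max(models, key=lambda name: score(models[name], seq))

# Toy, made-up phone strings (assumption: real input would come from a
# phone recognizer run on speech, which is not modeled here).
vocab = {"a", "b", "c"}
models = {
    "variety_A": train_bigram_lm([["a", "b", "a", "b"], ["b", "a", "b", "a"]], vocab),
    "variety_B": train_bigram_lm([["a", "c", "a", "c"], ["c", "a", "c", "a"]], vocab),
}

print(identify(models, ["a", "b", "a"]))
```

In a conventional parallel setup, several such tokenizer-plus-model pipelines are run side by side and their scores combined; the abstract describes using a single tokenizer whose phone classes already contrast the two varieties.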