Spoken Dialogue Systems: Progress and Challenges
Prof. Steve Young,
University of Cambridge, UK –
The potential advantages of statistical dialogue systems include lower development cost, increased robustness to noise and the ability to learn on-line so that performance can continue to improve over time. This talk will briefly review the basic principles of statistical dialogue systems including belief tracking and policy representations. Recent developments at Cambridge in the areas of rapid adaptation and on-line learning using Gaussian processes will then be described. The talk will conclude with a discussion of some of the major issues limiting progress.
Steve Young received a BA in Electrical Sciences from Cambridge University in 1973 and a PhD in Speech Processing in 1978. He held lectureships at both Manchester and Cambridge Universities before being elected to the Chair of Information Engineering at Cambridge University in 1994. He was a co-founder and Technical Director of Entropic Ltd from 1995 until 1999 when the company was taken over by Microsoft. After a short period as an Architect at Microsoft, he returned full-time to the University in January 2001 where he is now Senior Pro-Vice-Chancellor.
His research interests include speech recognition, language modelling, spoken dialogue and multi-media applications. He is the inventor and original author of the HTK Toolkit for building hidden Markov model-based recognition systems (see http://htk.eng.cam.ac.uk), and with Phil Woodland, he developed the HTK large vocabulary speech recognition system which has figured strongly in DARPA/NIST evaluations since it was first introduced in the early nineties. More recently he has developed statistical dialogue systems and pioneered the use of Partially Observable Markov Decision Processes for modelling them. He also has active research in voice transformation, emotion generation and HMM synthesis.
He has written and edited books on software engineering and speech processing, and he has published as author and co-author, more than 250 papers in these areas. He is a Fellow of the Royal Academy of Engineering, the IEEE, the IET and the Royal Society of Arts. He served as the senior editor of Computer Speech and Language from 1993 to 2004 and he was Chair of the IEEE Speech and Language Processing Technical Committee from 2009 to 2011. In 2004, he received an IEEE Signal Processing Society Technical Achievement Award. He was elected ISCA Fellow in 2008 and he was awarded the ISCA Medal for Scientific Achievement in 2010. He is the recipient of the 2013 Eurasip Individual Technical Achievement Award.
Isabel Maria Martins Trancoso
Anfiteatro do Complexo Interdisciplinar, IST Alameda