CSC 2518 -- Spoken Language Processing

  Fall 2011




Contact information

Instructor: Gerald Penn
Office: PT 396B (St. George campus)
Tel: 978-7390
Email: gpenn@cs.utoronto.ca

Meeting times

Lectures: Thursdays 1-3, GB 120

Presented Readings

 

Who | When | What | Where
Sajad Shirali-Shahreza | 24 November | (1) Perception of Synthetic Speech, D. B. Pisoni; (2) Perception and Comprehension of Synthetic Speech, S. J. Winters and D. B. Pisoni | (1) Chapter 43 of Progress in Speech Synthesis, J.P.H. van Santen, R. W. Sproat, J. P. Olive and J. Hirschberg eds., Springer 1997; (2) Progress Report 26 (2003-2004), Dept. of Psychology, Indiana University.
Anthony McCallum | 17 November | Intonation, J. van Santen, C. Shih and B. Möbius | Chapter 6 of Multilingual Text-to-Speech Synthesis: The Bell Labs Approach, R. Sproat ed., Kluwer 1998.
Navdeep Jaitly | 3 November | (1) Prosody and the Selection of Source Units for Concatenative Synthesis, N. Campbell and A. Black; (2) Optimal Coupling of Diphones, A. Conkie and S. Isard | (1) Chapter 22 of Progress in Speech Synthesis, J.P.H. van Santen, R. W. Sproat, J. P. Olive and J. Hirschberg eds., Springer 1997; (2) Chapter 23 of the same volume.
Tong Wang | 27 October | Homograph Disambiguation in Text-to-Speech Synthesis, D. Yarowsky | Chapter 12 of Progress in Speech Synthesis, J.P.H. van Santen, R. W. Sproat, J. P. Olive and J. Hirschberg eds., Springer 1997.
Jackie Cheung | 20 October | All-Prosodic Speech Synthesis, A. Dirksen and J. S. Coleman | Chapter 8 of Progress in Speech Synthesis, J.P.H. van Santen, R. W. Sproat, J. P. Olive and J. Hirschberg eds., Springer 1997.
Siavash Kazemian | 13 October | Weighted Finite State Transducers in Speech Recognition, M. Mohri, F.C.N. Pereira and M. Riley | Computer Speech and Language 16, pp. 69-88
Sammy Davis-Mendelow | 6 October | (1) Web-derived Pronunciations, Arnab Ghoshal, Martin Jansche, Sanjeev Khudanpur, Michael Riley and Morgan Ulinski; (2) Re-Engineering Letter-to-Sound Rules, Martin Jansche | (1) ICASSP 2009; (2) NAACL 2001
Joanna Drummond | 29 September | Normalization of Non-standard Words, Richard Sproat, Alan Black, Stanley Chen, Shankar Kumar, Mari Ostendorf and Christopher Richards | Computer Speech and Language 15, pp. 287-333
Gerald Penn | 15 September | The Quantitative Study of Writing Systems |

Additional Readings for the Lectures


Title | Author | Publication Details
Spoken Language Processing | X. Huang, A. Acero and H.-W. Hon | Prentice Hall, 2001.
Discrete-Time Processing of Speech Signals, Chapters 5 and 6. | J.R. Deller, Jr., J.H.L. Hansen, and J.G. Proakis | IEEE Press, 2000.
Open Finite-State Transducer Tutorial | C. Allauzen, M. Jansche and M. Riley |



Tentative course outline


Calendar of important course-related events


Date | Event
Thurs, 15 September | First lecture
Mon, 26 September | Last day to add course
Mon, 31 October | Last day to drop course
Thurs, 10 November | No lecture - will be rescheduled before first lecture
Thurs, 1 December | Nominal last lecture - will be rescheduled before first lecture
Tue, 20 December | Final papers/projects due



Evaluation

Your final mark will be determined by a term paper or project and an in-class presentation of a paper. The relative weights of these components are shown in the table below:
 

Class presentation 20%
Term paper/project 80%



Announcements

In this space, you will find announcements related to the course. Please check this space at least weekly.

Lecture Slides



Gerald Penn, 20 November, 2011
This web-page was adapted from the web-page for another course, created by Vassos Hadzilacos.