Publications

An audio-visual corpus for speech perception and automatic speech recognition

Authors
Martin Cooke, Jon Barker, Stuart Cunningham, Xu Shao.
Year
2006
Journal
Journal of the Acoustical Society of America
DOI
ISSN

An audio-visual corpus has been collected to support the use of common material in speech perception and automatic speech recognition studies. The corpus consists of high-quality audio and video recordings of 1000 sentences spoken by each of 34 talkers. Sentences are simple, syntactically identical phrases such as “place green at B 4 now.” Intelligibility tests using the audio signals suggest that the material is easily identifiable in quiet and low levels of stationary noise. The annotated corpus is available on the web for research use.