This paper presents a critique by a ''speech technologist'' of ''The potential role of speech production models in automatic speech recognition'' by R. The dynamics of active categorical perception in an evolved model agent. Words and voices: Implicit and explicit memory for spoken words. Recent developments in facial animation: An inside view. Vocal affect expression: A review and model for future research.
Preventing reading failure in young children with phonological processing disabilities: Group and individual responses to instruction. Influences of categorization on perceptual discrimination. Long-term retention of new phonetic categories. The goal of this paper is to illustrate the point that other potential candidates for explaining these speech errors, which have gone largely unnoticed, provide a better explanatory framework for speech errors than do segments. J Exp Psychol Gen 2002, 131:477—493.
Spanish-dominant individuals, but not Catalan-dominant individuals, exhibited repetition priming for minimal pairs differing in only one feature that is nondistinctive in Spanish e. Cognitive Dynamics: Conceptual Change in Humans and Machines. Perceptual categories in vision and audition. The perceiver's ability to discriminate sounds improves as the sounds become less similar—going from discriminations of sounds that differ by one step to two steps to three steps along the horizontal continuum. Categorical Perception: The Groundwork of Cognition.
However, the quantity of training data required to improve state-of-the-art systems seems to be growing exponentially and performance appears to be asymptotic to a level that may be inadequate for many real-world applications. Training included practice at the segment and the word level. On the comparison between identification and discrimination tests in speech perception. Acquired distinctiveness of cues: I. The nature of learned categorical perception effects: a psychophysical approach. Index Terms : speech modelling, case based reasoning, exemplar-based systems. We argue that a full understanding of the process of lexical access during speech comprehension will depend on resolving several key representational issues: what is the form of the representations used for lexical access; how is phonological information coded in the mental lexicon; and how is the morphological and semantic information about each word stored? Cognitive representations of semantic categories.
Motor theory of speech perception: a critical review. Perception and production improved for each of the 7 children. Perceiving talking faces: From speech perception to a behavioral principle. In: Honavar, V, Uhr, L, eds. Journal of Phonetics, 21, 3— 28. J Acoust Soc Am 1978, 63:456—468.
Peter Wallis for his suggestion to use a clapping robot as a challenging interaction scenario, and the reviewers for their helpful and insightful comments. New York: Plenum Press; 1978, 125—169. Categorical perception of facial expression. J Exp Psychol Learn Mem Cogn 1987, 13:640—645. This paper discusses these issues, and presents an argument in favor of the derivation of a 'unifying theory' that would be capable of explaining and predicting both human and machine spoken language processing behavior, and hence serve both communities as well as representing a long-term 'grand challenge' for the scientific community in the emerging field of 'cognitive informatics'. First and second language vowel perception in early bilinguals. Language development in profoundly deaf children with cochlear implants.
Seven students with hearing loss between the ages of 8 and 13 were trained for 6 hours across 21 weeks on 8 categories of segments 4 voiced vs. Journal of Speech and Hearing Research, 37, 687—699. New York: Bell Telephone Laboratories. This editorial provides a brief introduction and overview of the following five chapters dedicated to a subset of topics relating to the creation of real-time computational systems capable of engaging in affective interaction with humans. This paper compares the research objectives of the two fields, and attempts to draw informative lessons from one to the other. Language-particular sound patterns are in part patterns of temporal coordination among gestures.
An important goal of such systems is to be able to form a loop with the human by conducting appropriate behaviours at appropriate times, in order to establish and maintain different senses of connectedness between the human and agent. It can be claimed that spoken language is the most sophisticated behavior of the most complex organism in the known universe Gopnik et al, 2001;Dawkins, 1991 much more challenging than rocket science-and this places spoken language technology at the heart of artificial cognitive systems and therefore firmly within the newly emerging transdisciplinary field of 'Cognitive Informatics' Wang, 2003a Wang, , 2003b Moore, 2005b -see Figure 6. Categorical perception of familiar objects. The structure of the color space in naming and memory for two languages. Auditory and phonetic memory codes in the discrimination of consonants and vowels.
However, despite these impressive achievements, we still have a long way to go before these two areas of knowledge and discovery converge on a coherent 'theory' of spoken language processing; a theory that could serve to both explain the intricacies of human speech behaviour as well as support a truly ubiquitous technology for spoken language processing. Speech Lang Adv Basic Res Pract 1984, 10:243—335. Proc Natl Acad Sci U S A 2003, 100:9085—9089. Cued speech, manual complement to lipreading, for visual reception of spoken language: Principles, practice and prospects for automation. Learned categorical perception in neural nets: implications for symbol grounding. Acta Oto-Rhino-Laryngologica Belgica, 42, 375—384.