
Perceptuo-motor idiosyncrasies as a test case for the proximity of the speaking tongue and the hearing ear In other words, our overall goal is to assess how speech perception processes intervene in speech production and, conversely, how speech production knowledge constrains and informs speech perception. The key aim is to really understand the role of the “hearing ear” when the tongue speaks, and the role of the “speaking tongue” when the ear hears.

Globally, it appears that, in the human brain, “the hearing ear is always found close to the speaking tongue”, quoting part of the title of a recent paper by. A converging set of recent studies confirms that the speech production system is activated and has a significant role in the speech perception process, particularly in adverse conditions. On the flip side, neurophysiological and behavioral experimental data have also shown more and more clearly that both auditory and motor systems are activated during speech perception. The role of auditory representations in speech production appears crucial, not only for learning phonetic targets, but also for adapting and compensating for perturbations. Progress in neurocognitive techniques and paradigms has strongly strengthened the experimental bases of this debate in the last twenty years and made it clear that perceptuo-motor interactions shape both sides of the speech communication process. The debate propagated to the domain of speech production, once again contrasting the view that motor targets would be specified in terms of articulatory constrictions and vocal tract shapes and arguments in favor of auditory targets as reference frames for programming speech gestures. The debate was first focused on speech perception, opposing tenants of auditory theories and auditory invariants and proponents of the motor theory of speech perception claiming that phonetic invariants would be motor or rather articulatory in the direct realist perspective. The nature of phonetic invariance has been at the heart of lively exchanges among speech communication researchers. articulatory invariance in speech communication The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.Ĭompeting interests: The authors have declared that no competing interests exist.Īuditory vs. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.ĭata Availability: All relevant data are available from figshare ( ).įunding: This research was supported by a grant from the European Research Council (FP7/2007-2013 Grant Agreement no. Received: SeptemAccepted: DecemPublished: January 11, 2019Ĭopyright: © 2019 Barnaud et al. We draw conclusions about the role and importance of motor processes in speech perception, and propose a perceptuo-motor model in which auditory processing would enable optimal processing of learned sounds and motor processing would be helpful in unlearned adverse conditions.Ĭitation: Barnaud M-L, Schwartz J-L, Bessière P, Diard J (2019) Computer simulations of coupled idiosyncrasies in speech perception and speech production with COSMO, a perceptuo-motor Bayesian model of speech communication.

On the contrary, motor and perceptuo-motor models provide perception idiosyncrasies correlated with idiosyncrasies in production. We show that no idiosyncrasy in perception can be obtained in the auditory model, since it is optimally tuned to the learning environment, which does not include the motor variability of the Learning Agent. Second, we implement within COSMO three models for speech perception that are, respectively, auditory, motor and perceptuo-motor. First, we show that if the learning process in COSMO includes a communicative mechanism between a Learning Agent and a Master Agent, vowel production does display idiosyncrasies.


In this paper, we attempt to simulate one study on coupled idiosyncrasies in the perception and production of French oral vowels, within COSMO, a Bayesian computational model of speech communication. Indeed, a number of studies explore inter-individual variability in auditory and motor prototypes within a given language, and provide evidence for a link between both sets. The existence of idiosyncrasies in production and in perception sheds interesting light on the nature of the link. The existence of a functional relationship between speech perception and production systems is now widely accepted, but the exact nature and role of this relationship remains quite unclear.
