Catching words in a stream of speech. Computational simulations of segmenting transcribed child-directed speech
PhD ceremony: Mr. C. Çöltekin, 14.30 uur, Aula Academiegebouw, Broerstraat 5, Groningen
Dissertation: Catching words in a stream of speech. Computational simulations of segmenting transcribed child-directed speech
Promotor(s): prof. J. Nerbonne
Faculty: Arts
Segmenting continuous speech into lexical units is one of the early tasks an infant needs to tackle during language acquisition. Çağrı Çöltekin’s thesis investigates this particular problem, segmentation, by means of computational modeling and simulations.
The segmentation problem is more difficult than it may be appreciated at first sight. Children need to find words in a continuous stream of speech, with no knowledge of words to start with. Fortunately, experimental studies reveal that children and adults use a number of cues in the input and simple strategies that exploit these cues in order to segment the speech. More interestingly, some of these cues are language independent, allowing a learner to segment the continuous input before knowing any words.
Two major aspects set the models presented in this thesis apart from other computational models in the literature. First, the models presented here use simple local strategies - as opposed to global optimization - that rely on cues known to be used by children, namely, predictability statistics, phonotactics and lexical stress. Second, these cues are combined using an explicit cue-combination model which can easily be extended to include more cues.
The models are tested using real-world transcribed child-directed speech. The simulation results show that the performance of individual strategies are comparable to the state-of-the-art computational models of segmentation. Furthermore, combinations of individual cues provide a consistent increase in performance. The combined model performs on a par with the reference state-of-the-art model, while while employing only mechanisms more similar to those available to humans performing the same task.
Last modified: | 13 March 2020 01.13 a.m. |
More news
-
24 March 2025
UG 28th in World's Most International Universities 2025 rankings
The University of Groningen has been ranked 28th in the World's Most International Universities 2025 by Times Higher Education. With this, the UG leaves behind institutions such as MIT and Harvard. The 28th place marks an increase of five places: in...
-
12 March 2025
Breaking news: local journalism is alive
Local journalism is alive, still plays an important role in our lives and definitely has a future. In fact, local journalism can play a more crucial role than ever in creating our sense of community. But for that to happen, journalists will have to...
-
11 March 2025
Student challenge: Starting Stories
The Challenge Starting Stories dares you to think about the beginning of recent novels for ten days.