Skip to ContentSkip to Navigation
University of Groningenfounded in 1614  -  top 100 university
About us Latest news News News articles

Catching words in a stream of speech. Computational simulations of segmenting transcribed child-directed speech

08 December 2011

PhD ceremony: Mr. C. Çöltekin, 14.30 uur, Aula Academiegebouw, Broerstraat 5, Groningen

Dissertation: Catching words in a stream of speech. Computational simulations of segmenting transcribed child-directed speech

Promotor(s): prof. J. Nerbonne

Faculty: Arts

Segmenting continuous speech into lexical units is one of the early tasks an infant needs to tackle during language acquisition. Çağrı Çöltekin’s thesis investigates this particular problem, segmentation, by means of computational modeling and simulations.

The segmentation problem is more difficult than it may be appreciated at first sight. Children need to find words in a continuous stream of speech, with no knowledge of words to start with. Fortunately, experimental studies reveal that children and adults use a number of cues in the input and simple strategies that exploit these cues in order to segment the speech. More interestingly, some of these cues are language independent, allowing a learner to segment the continuous input before knowing any words.

Two major aspects set the models presented in this thesis apart from other computational models in the literature. First, the models presented here use simple local strategies - as opposed to global optimization - that rely on cues known to be used by children, namely, predictability statistics, phonotactics and lexical stress. Second, these cues are combined using an explicit cue-combination model which can easily be extended to include more cues.

The models are tested using real-world transcribed child-directed speech. The simulation results show that the performance of individual strategies are comparable to the state-of-the-art computational models of segmentation. Furthermore, combinations of individual cues provide a consistent increase in performance. The combined model performs on a par with the reference state-of-the-art model, while while employing only mechanisms more similar to those available to humans performing the same task.

Last modified:13 March 2020 01.13 a.m.
Share this Facebook LinkedIn
View this page in: Nederlands

More news

  • 09 September 2025

    Art + science = 1-0 for humanity

    PhD candidate in Media Studies Marije Miedema and theater maker Mees van den Bergh joined forces. The result is the theatrical audio installation "Future of the Past," a project about how people want to be remembered digitally.

  • 26 August 2025

    Free rein for the crypto coin

    Canadian-Dutch political economist Malcolm Campbell-Verduyn is fascinated by cryptocurrencies. It is full steam ahead under American President Donald Trump, he says. At the same time, the economist himself was involved in the crypto coin Ada as an...

  • 17 July 2025

    Veni-grants for eleven UG researchers

    The Dutch Research Council (NWO) has awarded a Veni grant of up to €320,000 each to eleven researchers of the University of Groningen and the UMCG: Quentin Changeat, Wen Wu, Femke Cnossen, Stacey Copeland, Bart Danon, Gesa Kübek, Hannah Laurens, Adi...