Lexical acquisition for computational grammars. A unified model
PhD ceremony: Mr. K.D. Cholakov, 14.30 uur, Academiegebouw, Broerstraat 5, Groningen
Dissertation: Lexical acquisition for computational grammars. A unified model
Promotor(s): prof. G.J.M. van Noord, prof. J. Nerbonne
Faculty: Arts
Words are the building blocks in the implementation of many natural language processing systems. The lexical information in such systems is usually encoded in lexicons where the words are mapped to linguistic descriptions. However, lexicons will always be incomplete. Natural language is constantly evolving and new words emerge every day. It is impossible to list each word in a language in a lexicon. Kostadin Cholakov’s thesis describes a novel automated lexical acquisition model.
Cholakovs model learns the morphosyntactic properties of words which are not listed in lexicons employed by computational grammars of natural language. Two major aspects of the model set it apart from existing lexical acquisition techniques. First, it enables the acquisition of the full morphological paradigm of the unknown word. Second, different contexts of this word are considered during the acquisition process. This increases the amount and the diversity of the linguistic information available for the unknown word.
For each unknown word, a set of linguistic features is constructed automatically. Those features are used as an input to a statistical classifier which maps all forms in the paradigm of the unknown word to descriptions in the lexicon of the grammar. The lexical acquisition model is tested with computational grammars of Dutch and German. The results demonstrate its high-quality performance. Further, the model is applied to learn proper linguistic descriptions for words with wrong or incomplete entries in the lexicon of the grammar.
Finally, the work in this thesis goes beyond syntax. The lexical acquisition model is combined with vector-based semantic space techniques to acquire semantic properties of unknown words.
Last modified: | 13 March 2020 12.59 a.m. |
More news
-
13 May 2024
Trapping molecules
In his laboratory, physicist Steven Hoekstra is building an experimental set-up made of two parts: one that produces barium fluoride molecules, and a second part that traps the molecules and brings them to an almost complete standstill so they can...
-
06 May 2024
Impact: Utilization of geospatial data within international development cooperation
One of students nominated for the Ben Feringa Impact Award 2024 is Jonas Göbel. Göbel is nominated because of his internship research around the utilization of geospatial data in the field of international development cooperation.
-
03 May 2024
NWO Impact Explorer for Suzanne Manizza-Roszak's impactful postcolonial literary research
Suzanne Manizza-Roszak, Assistent Professor English at the Faculty of Arts has received an Impact Explorer grant from the Dutch Research Council (NWO) for her postcolonial literary research and the project to translate the results into social...