Mining for meaning. The extraction of lexico-semanticknowledge from text
PhD ceremony: Mr. T. van de Cruys, 14.45 uur, Academiegebouw, Broerstraat 5, Groningen
Thesis: Mining for meaning. The extraction of lexico-semanticknowledge from text
Promotor(s): prof. J. Nerbonne
Faculty: Arts
Words have a particular meaning. While language users have no problems inferring those meanings, this is a hard task for a computer system. In his dissertation, Tim van de Cruys investigates how a computer might be able to infer the meaning of words automatically from large text collections. The basic approach for doing so is by comparing the contexts of words (such as the surrounding words, or the syntactic relations in which the word takes part), in order to determine how similar those contexts are. This information enables a computer to automatically extract groups of words from text that are similar to each other.
An important part of the research focuses on dimensionality reduction, and its application to language. The use of large text collections brings about a large number of contexts in which a word occurs. Using a mathematical dimensionality reduction, the abundance of individual contexts can be reduced to a limited number of significant dimensions. Characteristic for these dimensions is that they contain `latent semantics': the value of a word on a particular dimension indicates the score of the word for a particular semantic field (such as economics, transport, food, ...). The research shows that, with a number of simple algorithms, the meaning of words can automatically be extracted from text, and this is an important step towards a system that is able to understand what is written in texts.
Last modified: | 13 March 2020 01.15 a.m. |
More news
-
16 September 2025
Space for art: How creativity and science can complement each other
The Dutch countryside is in a state of transition: land use conflicts are surfacing, infrastructural developments are changing the landscape, and quality of life is under pressure due to population decline and ageing. Cultural geographer and social...
-
15 September 2025
Successful visit to the UG by Rector of Institut Teknologi Bandung
The Rector of Institut Teknologi Bandung (ITB), Prof Tatacipta Dirgantara, paid a 3-day visit to the UG.
-
09 September 2025
Art + science = 1-0 for humanity
PhD candidate in Media Studies Marije Miedema and theater maker Mees van den Bergh joined forces. The result is the theatrical audio installation "Future of the Past," a project about how people want to be remembered digitally.