Publication

Empirical studies on word representations

Suster, S. 2016 [Groningen]: Rijksuniversiteit Groningen. 232 p.

Research output: ScientificDoctoral Thesis

Documents

  • Simon Suster
One of the most fundamental tasks in natural language processing is representing words with mathematical objects (such as vectors). The word representations, which are most often estimated from data, allow capturing the meaning of words. They enable comparing words according to their semantic similarity, and have been shown to work extremely well when included in complex real-world applications. A large part of our work deals with ways of estimating word representations directly from large quantities of text. Our methods exploit the idea that words which occur in similar contexts have a similar meaning. How we define the context is an important focus of our thesis. The context can consist of a number of words to the left and to the right of the word in question, but, as we show, obtaining context words via syntactic links (such as the link between the verb and its subject) often works better. We furthermore investigate word representations that accurately capture multiple meanings of a single word. We show that translation of a word in context contains information that can be used to disambiguate the meaning of that word.
Original languageEnglish
QualificationDoctor of Philosophy
Awarding Institution
Supervisors/Advisors
  • Noord, van, Gerardus, Supervisor
  • Hendriks, Petra, Assessment committee
  • Søgaard, Anders, Assessment committee, External person
  • Vossen, P.T.J.M. , Assessment committee, External person
Award date10-Nov-2016
Place of Publication[Groningen]
Publisher
Print ISBNs978-90-367-9261-5
Electronic ISBNs978-90-367-9260-8
StatePublished - 2016

View graph of relations

Download statistics

No data available

ID: 36592753