Geographically constrained information retrieval

21 May 2010

PhD ceremony: Mr. G. Andogah, 14.45, Academiegebouw, Broerstraat 5, Groningen

Dissertation: Geographically constrained information retrieval

Supervisor(s): Prof. Dr. Ir. J. Nerbonne

Faculty: Mathematics and Natural Sciences

Contact: Geoffrey Andogah, tel. +256 471432921, e-mail: g.andogah@gu.ac.ug

Eighteen percent of information seekers demand geographically intelligent information retrieval systems (Sanderson and Kohler, 2004). State-of-the-art information retrieval (IR) systems lack the geographical intelligence needed to effectively answer geography-dependent questions. Two specific research objectives are addressed in this thesis: (1) how to mine and analyze the geographical information (GI) implicit in texts, and (2) how to use the geographical knowledge obtained in this way to build models for answering geography-dependent questions.

We assume that every document and every search query has a geographical scope (i.e., the area in which the events described are situated). To exploit the notion of geographical scope, we first developed techniques to detect the geographical scope of documents and to resolve the scopes when the indications are complex or inconsistent.

The thesis then turns to problems whose solution may be improved by incorporating the notion of geographical scope, namely (i) toponym resolution, i.e. determining which place is referred to when ambiguous place names (toponyms) are used, (ii) query expansion, the enrichment of queries often used in IR, and (iii) relevance ranking. The toponym resolution strategy prefers candidate places in top-ranked scopes, and the query expansion strategy prefers place names in commonly shared scopes. The relevance ranking strategy incorporates scope information in the score calculation. New evaluation metrics that measure small discrepancies among toponym and scope resolution systems are also proposed. The scope and toponym resolution strategies achieved scores of 70% to 90% against human annotators. The query expansion and relevance ranking strategies outperformed state-of-the-art IR systems by 9%.
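
As a rough illustration of the toponym resolution and relevance ranking ideas described above, the following Python sketch picks the candidate place whose scope ranks highest and blends a text score with a scope overlap term. It is not the method used in the thesis: the `Candidate` type, the scope labels, the Jaccard overlap and the `weight` parameter are all assumptions made for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    name: str      # toponym as written in the text
    place_id: str  # hypothetical gazetteer identifier
    scope: str     # hypothetical scope label, e.g. a country


def resolve_toponym(candidates, ranked_scopes):
    """Return the candidate whose scope ranks highest, or None if there are none.

    ranked_scopes is assumed to be ordered from most to least likely scope
    of the document. Candidates outside every ranked scope come last.
    """
    rank = {scope: i for i, scope in enumerate(ranked_scopes)}
    worst = len(ranked_scopes)
    return min(candidates, key=lambda c: rank.get(c.scope, worst), default=None)


def scope_aware_score(text_score, doc_scopes, query_scopes, weight=0.3):
    """Blend a text relevance score with the scope overlap (assumed Jaccard)."""
    doc, query = set(doc_scopes), set(query_scopes)
    union = doc | query
    overlap = len(doc & query) / len(union) if union else 0.0
    return (1 - weight) * text_score + weight * overlap


# Usage: "Groningen" is ambiguous between the Netherlands and Suriname.
candidates = [
    Candidate("Groningen", "nl-1", "Netherlands"),
    Candidate("Groningen", "sr-1", "Suriname"),
]
chosen = resolve_toponym(candidates, ["Suriname", "Brazil"])
score = scope_aware_score(0.5, ["Netherlands"], ["Netherlands"])
```

In this sketch a document whose top-ranked scope is Suriname resolves "Groningen" to the Surinamese candidate, and a document sharing the query's scope receives a higher blended score than its text score alone. The linear blend is only one of many possible ways to fold scope information into a ranking score.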

Last modified: 13 March 2020 01.14 a.m.