ICTIR 2015 paper online

“Entity Linking in Queries: Tasks and Evaluation,” an upcoming ICTIR 2015 paper by Faegheh Hasibi, Svein Erik Bratsberg, and myself is available online now. The resources developed within this study are also made publicly available.

Annotating queries with entities is one of the core problem areas in query understanding. While seeming similar, the task of entity linking in queries is different from entity linking in documents and requires a methodological departure due to the inherent ambiguity of queries. We differentiate between two specific tasks, semantic mapping and interpretation finding, discuss current evaluation methodology, and propose refinements. We examine publicly available datasets for these tasks and introduce a new manually curated dataset for interpretation finding. To further deepen the understanding of task differences, we present a set of approaches for effectively addressing these tasks and report on experimental results.

Two fully funded PhD positions available

I have two fully funded PhD positions available in the context of the FAETE project.
The positions are for three years and come with no teaching duties! (There is also possibility for an extension to four years with 25% compulsory duties.) Starting date can be as early as Sept 2015, but no later than Jan 2016.

Further details and application are on Jobbnorge.
Application deadline: Aug 3, 2015

Evaluating document filtering systems over time

Performance of three systems over time. Systems A and B degrade, while System C improves over time, but they all have the same average performance over the entire period. We express the change in system performance using the derivative of the fitted line (in orange) and compare performance at what we call the “estimated end-point” (the large orange dots).

Our IPM paper “Evaluating document filtering systems over time” with Tom Kenter and Maarten de Rijke as co-authors is available online. In this paper we propose a framework for measuring the performance of document filtering systems. Such systems, up to now, have been evaluated in terms of traditional metrics like precision, recall, MAP, nDCG, F1 and utility. We argue that these metrics lack support for the temporal dimension of the task. We propose a time-sensitive way of measuring performance by employing trend estimation. In short, the performance is calculated for batches, a trend line is fitted to the results, and the estimated performance of systems at the end of the evaluation period is used to compare systems. To demonstrate the results of our proposed evaluation methodology, we analyze the runs submitted to the Cumulative Citation Recommendation task of the 2012 and 2013 editions of the TREC Knowledge Base Acceleration track, and show that important new insights emerge.


The Exploiting Semantic Annotations in Information Retrieval (ESAIR) workshop series aims to advance the general research agenda on the problem of creating and exploiting semantic annotations. The eighth edition of ESAIR, with a renewed set of organizers, sets its focus on applications. We invite presentations of prototype systems in a dedicated “Annotation in Action” demo track, in addition to the regular research and position paper contributions. A Best Demonstration Award, sponsored by Google, will be presented to the authors of the most outstanding demo at the workshop.

Submissions: regular research papers (4+ pages), position papers (2+1 pages), demo papers (4+ pages)
Deadline: July 2nd

The workshop also offers a track for authors of papers that were not successful at the main conference for their work to be considered for presentation at the workshop; the deadline for these contributions is July 8. In this case, authors are required to attach the reviews for their paper along with the paper so as to facilitate the decision process.

See the workshop’s homepage for details.