The tutorial consists of three parts: (i) entity linking (Edgar), (ii) entity retrieval (me), and a hands-on lab session (Daan). The hands-on session is further subdivided into entity linking and entity retrieval parts. The slides are made available on github. We also created a Mendeley group with all the papers that were discussed. The tags, entity linking and entity retrieval, hint the part of the tutorial to which each paper belongs. We intend to maintain and expand this repository, so it might be useful for you to follow this group.
Given that this was a half-day tutorial, we had to be quite selective in what we presented. A full-day version of the same tutorial will be given by us at SIGIR’13 in July. If you have suggestions for improvements and pointers to papers, approaches, services, etc. that we could/should cover (yes, this includes your own work) then don’t hesitate to get in touch with us!
The basic idea of living labs for IR is that rather than individual research groups independently developing experimental search infrastructures and gathering their own groups of test searchers for IR evaluations, a central and shared experimental environment is developed to facilitate the sharing of resources.
Living labs would offer huge benefits to the community, such as: availability of, potentially larger, cohorts of real users and their behaviours, e.g. querying behaviours, for experiment purposes; cross-comparability across research centres; and greater knowledge transfer between industry and academia, when industry partners are involved. The need for this methodology is further amplified by the increased reliance of IR approaches on proprietary data; living labs are a way to bridge the data divide between academia and industry.
There are many challenges to be overcome before the benefits associated with living labs for IR can be realised, including challenges associated with living labs architecture and design, hosting, maintenance, security, privacy, participant recruiting, and scenarios and tasks for use development.
This workshop aims to bring together for the first time people interested in progressing the living labs for IR evaluation methodology. An interactive forum for researchers to share ideas and initiate collaborations will be provided, with the explicit goal of determining means for progressing towards living labs for IR and formulating practical next steps for progression.
See the Call-for-Papers for more details.
As part of the workshop, we are considering organising a challenge in the e-commerce domain with the involvement of a medium-sized online retailer. The goal of this challenge would be to (i) allow academics to work with real users and data (esp. those who otherwise would have no access to such data) and (ii) to provide a starting point for the discussions at the workshop.
We will set up and run this challenge if there is sufficient interest in the community. We have made a poll to collect some initial feedback — please let us know what you think!
It’s almost mid Feb, so I won’t even attempt to make it a Happy New Year entry. And I’ll keep it short.
As of Jan 1 this year, I’m working as an Associate Professor at the University of Stavanger. Don’t look for the IR group’s homepage, there is no such thing. Yet
Briefly about (some of) my recent work. Not surprisingly, it’s all related to entities. In a SPIRE’12 paper we study ad-hoc entity retrieval in Linked Data in a distributed setting, with focus on the problems of collection ranking and collection selection. In a short position paper, written for the ESAIR’12 workshop, we discuss how to make entity retrieval temporally-aware, using semantic knowledge bases that are enriched with temporal information (like YAGO2). In a CIKM’12 poster we introduce the task of target type identification for entity-oriented queries, where types are organized hierarchically. We also made all related resources publicly available.
Most recently, just earlier this week, I gave a lecture on Semistructured Data Search at the PROMISE Winter School. At some point in the not-too-distant future there might be a written version of this material. So if you have any feedback, comments, suggestions, etc. please don’t hesitate to contact me.
Finally, I decided to set up and maintain a separate page with a list of entity-oriented benchmarking campaigns, workshops, and journal special issues. I hope people will find it useful. If you have a relevant piece to be added here, let me know.
The aim of the Promise Winter School on “Bridging between Information Retrieval and Databases” is to give participants a grounding in the core topics that constitute the multidisciplinary area of information access and retrieval to unstructured, semistructured, and structured information.
The idea of the school stems from the observation that nowadays databases are more and more getting into techniques that have traditionally been typical of information retrieval and, viceversa, information retrieval is using more and more database-oriented techniques.
The school is a week-long event consisting of guest lectures from invited speakers who are recognized experts in the field of information retrieval and databases.
The school mainly targets PhD and Master students but all other researchers or participation from industry is welcome.
Registration deadline: 20 December 2012
Winter School: 4-8 February 2013
For details, see http://www.promise-noe.eu/events/winter-school-2013/.