Archive for the 'Course IR' Category

Guest lecture by Thijs Westerveld

Wednesday, October 7th, 2009, posted by Djoerd Hiemstra

Thijs Westerveld from Teezir will give a guest lecture for the course Information Retrieval.

When: Wednesday, October 14, 2009
Where: HO-B1212
Title: Automatically Analyzing Word of Mouth And Focused Crawling

Teezir is a young and innovative technology company that develops and deploys comprehensive search solutions. Teezir lets companies take advantage of large and diverse collections of documents or texts, using breakthrough search technology. Teezir’s search platform provides functionality for the entire process of disclosing data: from gathering content, analyzing documents, and building indexes for efficient access, to effective querying and ranking of information. Teezir’s framework is based on full-text retrieval techniques.

Handouts for practical work

Monday, October 5th, 2009, posted by Paul van der Vet

The handout for the practical part of the course Information Retrieval has been added under Course Materials on Blackboard. Additionally, you will find two useful handouts there that help you to write your report and to insert citations in it.

Deadline to form groups: 30 September

Tuesday, September 29th, 2009, posted by Djoerd Hiemstra

The deadline to form pairs for the Information Retrieval course project is 30 September. Please send names and email addresses to the course staff. Groups will be numbered and listed (under Email) on Blackboard.

Information Retrieval Models Tutorial

Thursday, August 20th, 2009, posted by Djoerd Hiemstra

Many applications that handle information on the internet would be completely inadequate without the support of information retrieval technology. How would we find information on the world wide web if there were no web search engines? How would we manage our email without spam filtering? Much of the development of information retrieval technology, such as web search engines and spam filters, requires a combination of experimentation and theory. Experimentation and rigorous empirical testing are needed to keep up with increasing volumes of web pages and emails. Furthermore, experimentation and constant adaptation of technology are needed in practice to counteract the effects of people who deliberately try to manipulate the technology, such as email spammers. However, if experimentation is not guided by theory, engineering becomes trial and error. New problems and challenges for information retrieval come up constantly. They cannot possibly be solved by trial and error alone. So, what is the theory of information retrieval? There is not one convincing answer to this question. There are many theories, here called formal models, and each model is helpful for the development of some information retrieval tools, but not so helpful for the development of others. In order to understand information retrieval, it is essential to learn about these retrieval models. In this chapter, some of the most important retrieval models are gathered and explained in a tutorial style.

The tutorial will be published in Ayse Goker and John Davies (eds.), Information Retrieval: Searching in the 21st Century, Wiley, 2009.

[download draft]

[download exercise solutions]

Blackboard site for Information Retrieval

Friday, July 17th, 2009, posted by Paul van der Vet

As of today, the Blackboard site of the Information Retrieval course will be gradually filled with information. You can now register for the course. The deadline for self-enrollment is not known at the time of writing, so don’t wait until the last moment!

Photos of DIR 2009 on-line

Thursday, February 5th, 2009, posted by Djoerd Hiemstra

DIR 2009

Excellent photos by Sander Bockting

DIR industry talk by Rene van Erk

Tuesday, January 13th, 2009, posted by Djoerd Hiemstra

Rene van Erk is a member of the European management team of Wolters Kluwer, where he is responsible for all Product and Business Development. In this role, his key responsibility is to optimize the WK portfolio for maximum growth, meaning:
a) M&A focus: responsible for identifying acquisition opportunities with a good strategic portfolio
b) Leading innovation: overall responsible for Product Development & Product Management, moving WK from content provider to information solutions provider
c) Leading our Online and Software Businesses: Wolters Kluwer currently owns around 30 software development companies and leading online properties across Europe.

Rene will talk about “Communities and Workflow: Driving Information Consumption”.

More info at: DIR 2009

Call for Participation: Dutch-Belgian IR workshop

Friday, January 9th, 2009, posted by Djoerd Hiemstra

The 9th Dutch-Belgian Information Retrieval Workshop (DIR) will take place at the University of Twente, Enschede, the Netherlands, on February 2-3, 2009. The workshop is organised under the auspices of the Working Community on Information Sciences (WGI) and the School for Information and Knowledge Systems (SIKS). The workshop is sponsored by NWO, CTIT and the University of Twente.

Keynote Speaker: Prof. Gerhard Weikum (MPII, Saarbruecken, Germany)

Gerhard Weikum is Research Director at the Max-Planck Institute for Informatics (MPII) in Saarbruecken, Germany, where he leads the department on databases and information systems. Prof. Weikum is an ACM Fellow and a renowned expert in the field of databases. He received the VLDB 10-Year Achievement Award in 2002. Since then, he has focused on several information retrieval problems such as peer-to-peer search, search efficiency, and database and search integration, resulting in, for instance, six full papers at the most recent SIGIR conferences.

Due to the cooperation with SIKS, SIKS members can participate without paying an entrance fee. PhD students working on the SIKS focus areas “Web based Information Systems” and “Data management, Storage and Retrieval” are especially encouraged to participate. There is a limited number of registrations for SIKS members. Registrations will be honored on a first-come, first-served basis.

Participants of DIR are invited to the Annual Meeting of the Werkgemeenschap Informatiewetenschap, which directly follows DIR on Tuesday.

Search for the Future

Friday, October 17th, 2008, posted by Djoerd Hiemstra

Information Retrieval is the discipline that studies computer-based search tools. Many applications that handle information on the internet would be completely inadequate without the support of information retrieval technology. How would we manage our email without spam filtering? How would we find information on the world wide web if there were no web search engines? The rise of web search engines has been one of the major success stories in computer science of the last decade: Internet and search companies like Google and Yahoo are now among the world’s most influential information technology companies.

Today, search technology is provided and developed by major search providers like Google and Yahoo, and by small specialized companies with specialized staff. But as search technology matures, it will have to be available to non-expert application developers as well. A major obstacle to achieving this is the lack of theories and high-level abstractions of search systems and the lack of declarative query languages. Another obstacle is the lack of methods to handle non-textual data, such as images, audio and video. Several projects of the Database Group of the University of Twente try to solve these problems for application areas such as Entity Search, Expert Search, Video Search, and Distributed Search. The models and approaches that are developed in these projects are evaluated on large-scale, realistic testbeds, and implemented in the group’s open source search system PF/Tijah, a search system that combines keyword queries with structured queries on XML databases. The research contributes to several courses in the university’s graduate programs, for instance Information Retrieval, XML & Databases 1, and XML & Databases 2.

Supervisors for groups

Wednesday, October 1st, 2008, posted by Paul van der Vet

All groups have now been assigned a supervisor; see under “Messages/groups” on TeleTOP. The deadline for the project plan is 30 October. Make sure you have talked with your supervisor at least twice before you submit your plan. Send your supervisor an email to make an appointment.