Welcome to the Information Retrieval (IR) course! It is difficult to imagine living without search engines. The availability of big data has necessitated a systematic study of retrieval techniques, and the principles and practices of information retrieval have been a focus of researchers and practitioners alike. This course is not just about search engines. It is about dealing with big data and retrieving information from it, which opens up several interesting applications. The course introduces students to key parts of IR such as indexing techniques, challenges in query processing, and well-known retrieval models.
Key Learning Objectives
At the end of this course, you should be able to:
- Understand and apply text indexing techniques to big data.
- Understand and apply text ranking techniques.
- Analyze and evaluate existing retrieval systems.
The lecture schedule of the 2019 offering (which was a 2-credit course) is available here.
|Part 1: Building a Search System - An Overview|
|1||Introduction to Information Retrieval||Chapter 1 from CPS||[Video][Slides]|
|2||Building a Simple Retrieval System||Chapter 1 from CPS||[Video][Slides]|
|3||Query Processing with Inverted Index||Chapter 2 from CPS||[Video][Slides]|
|Assignment 1 released. Please visit Moodle for details.|
|4||Evaluating Retrieval Systems||Chapter 8 from CPS||[Video][Slides]|
|Tutorial: Lucene Demo||Lucene Tutorial||[Video][LuceneDemo.zip]|
|Bonus Task 1 released. Please visit Moodle for details. A preview of a bonus task from an earlier course offering is here.|
|Part 2: Components of a Retrieval System|
|5||Indexing: Query Processing Order||Section 1.3 from CPS||[Video][Slides]|
|6||Indexing: Challenges||Chapter 2 from CPS||[Video][Slides]|
The Power of Indexing
|Bonus Task 2 released. Please visit Moodle for details.|
|2.2 Query Understanding|
|7||Query Understanding: Segmentation and Spelling Correction||Chapter 3 from CPS||[Video][Slides]|
|8||Query Understanding: Phonetic Correction||Chapter 3 from CPS||[Video][Slides]|
Handling Wildcard Queries
|Assignment 2 released. Please visit Moodle for details.|
|2.3 Index Compression|
|10||Index Compression||Chapter 5 from CPS||[Video][Slides]|
|11||Crawlers||Chapter 20 from CPS, Chapter 3 from BDT||[Video][Slides]|
|12||Evaluation||Chapter 8 from CPS|
|Part 3: Advanced Topics in Information Retrieval|
|Assignments (4 * 10% each)||40%|
Familiarity with Java will help in coding with Lucene. You may use your favourite programming language for assignments as long as the objectives of the assignment are met. A basic understanding of linear algebra, set theory, and probability will be useful in understanding the IR models. However, there are no specific prerequisites for this course. We will revise the fundamentals wherever necessary.
- [BDT] Search Engines: Information Retrieval in Practice. Bruce Croft, Donald Metzler, Trevor Strohman