198845 VU Data Analysis II: Information Retrieval and Full-Text Search

winter semester 2020/2021 | Last update: 02.12.2020 Place course on memo list
198845
VU Data Analysis II: Information Retrieval and Full-Text Search
VU 3
5
weekly
annually
English

The students should get an overview over Information Retrieval while deepening their knowledge on text processing methods used for full text search engines. They will acquire necessary abilities to create and adjust indexing and searching algorithms.

After a general introduction to Information Retrieval (definition, tasks etc.) students will be confronted with challenges in creating a search engine. They will learn different ways of processing texts (e.g. tokenization, stemming, lemmatization etc.) in order to overcome those challenges and create a functioning program that is able to perform full text search.

Lecture with exercises.

The following factors will influence the final grade: Participation in exercises (60%), and the result of the exams (40%).

Ch. D. Manning, P. Raghavan, H. Schütze: Introduction to Information Retrieval; Online Edition, Cambridge UP 2009 https://nlp.stanford.edu/IR-book/information-retrieval-book.html

Other literature will be announced at the beginning of the course.

In order to finish the course, students will need their own computer on which different tools will be installed. It is imperative that the students know at least one programming language. During the course we will be programming in Python.

The acceptance is based on prioritised randomisation. Active students of Complementary Subject Area Digital Science get precedence.

see dates
Group 0
Date Time Location
Mon 2020-10-05
09.00 - 11.45 eLecture - online eLecture - online
Mon 2020-10-12
09.00 - 11.45 eLecture - online eLecture - online
Mon 2020-10-19
09.00 - 11.45 eLecture - online eLecture - online
Mon 2020-11-09
09.00 - 11.45 eLecture - online eLecture - online
Mon 2020-11-16
09.00 - 11.45 eLecture - online eLecture - online
Mon 2020-11-23
09.00 - 11.45 eLecture - online eLecture - online
Mon 2020-11-30
09.00 - 11.45 eLecture - online eLecture - online
Mon 2020-12-07
09.00 - 11.45 eLecture - online eLecture - online
Mon 2020-12-14
09.00 - 11.45 eLecture - online eLecture - online
Mon 2021-01-11
09.00 - 11.45 eLecture - online eLecture - online
Mon 2021-01-18
09.00 - 11.45 eLecture - online eLecture - online
Mon 2021-01-25
09.00 - 11.45 eLecture - online eLecture - online