👷 Introduction to Information Retrieval
doc. RNDr. Petr Sojka, Ph.D.
👷 Introduction to Information Retrieval

Dear students,

welcome to the PV211 Introduction to Information Retrieval course.

The course is based on the Introduction to Information Retrieval textbook by Manning, Raghavan and Schutze (hard copies available in MU libraries) taught at Stanford, Munich, and other places. In the course you will, among other things, learn how it is possible to fulfill seekers' information needs at the pace of 10,000+ questions per second on the global web scale within milliseconds.

Students will be motivated to try active/flipped learning approaches wherever possible.

The course has moved from its own web presentation to IS MU. Please, have a look if you would like to take a sneak peek at the all-in-one preliminary teaching materials and the topics that we will discuss in the course. However, this interactive syllabus is the primary source of information in this course.

Course trailer (in Czech)
A trailer for the PV211 Introduction to Information Retrieval course by Tomáš Effenberger
Second term project assignments
Slides introducing the second term project by Vít Novotný
Second term project assignments (GitHub)
Google Colaboratory code for the second term project
Second term project assignments (JupyterHub)
A JupyterHub cluster kindly provided to the course by ICT MU. You can use JupyterHub to work on your first term project assignment. Compared to Google Colaboratory, JupyterHub offers up to 32 CPUs, 2 NVIDIA A40 GPUs and 64G RAM. Notebooks will be closed after 3 days of inactivity; make sure you download your work!
Second term project leaderboard (TREC collection)
Google Spreadsheet leaderboard for the second term project
Alternative second term project leaderboard (ARQMath collection)
Google Spreadsheet leaderboard for the alternative second term project

Kapitola obsahuje:
2
Diskusní fórum
4
PDF
1
Složka
1
Video
1
Studijní text
5
Web
Učitel doporučuje studovat od 9. 2. 2022 do 20. 2. 2022.
Kapitola obsahuje:
1
Odpovědník
5
PDF
1
Složka
1
Video
1
Studijní text
5
Web
Učitel doporučuje studovat od 19. 2. 2022 do 27. 2. 2022.
Kapitola obsahuje:
3
PDF
1
Složka
1
Video
1
Studijní text
1
Web
Učitel doporučuje studovat od 26. 2. 2022 do 6. 3. 2022.
Kapitola obsahuje:
3
Odevzdávárna
1
Odpovědník
7
PDF
1
Složka
2
Video
1
Studijní text
7
Web
Učitel doporučuje studovat od 5. 3. 2022 do 13. 3. 2022.

2022-03-14: Submissions due for the first term project

Kapitola obsahuje:
3
Odevzdávárna
1
Odpovědník
3
Vzájemné hodnocení
5
PDF
1
Složka
1
Video
1
Studijní text
1
Web
Učitel doporučuje studovat od 12. 3. 2022 do 20. 3. 2022.

2022-03-21: Peer reviews due for the first term project

Kapitola obsahuje:
1
Odpovědník
4
PDF
1
Video
1
Studijní text
11
Web
Učitel doporučuje studovat od 19. 3. 2022 do 27. 3. 2022.
Kapitola obsahuje:
9
PDF
1
Složka
2
Video
1
Studijní text
6
Web
Učitel doporučuje studovat od 26. 3. 2022 do 3. 4. 2022.
Kapitola obsahuje:
1
Odpovědník
5
PDF
1
Složka
1
Video
1
Studijní text
4
Web
Učitel doporučuje studovat od 2. 4. 2022 do 10. 4. 2022.
Kapitola obsahuje:
3
Odevzdávárna
3
PDF
1
Složka
1
Video
1
Studijní text
1
Web
Učitel doporučuje studovat od 9. 4. 2022 do 17. 4. 2022.

2022-05-02: Submissions due for the second term project

Kapitola obsahuje:
3
Odevzdávárna
1
Odpovědník
8
PDF
1
Složka
3
Video
1
Studijní text
5
Web
Učitel doporučuje studovat od 16. 4. 2022 do 24. 4. 2022.
Kapitola obsahuje:
3
Odevzdávárna
6
PDF
1
Složka
1
Video
1
Studijní text
2
Web
Učitel doporučuje studovat od 23. 4. 2022 do 1. 5. 2022.

2022-05-09: Peer reviews due for the second term project

Kapitola obsahuje:
1
Odpovědník
4
Vzájemné hodnocení
4
PDF
1
Složka
1
Video
1
Studijní text
1
Web
Učitel doporučuje studovat od 30. 4. 2022 do 8. 5. 2022.
Kapitola obsahuje:
1
Odpovědník
3
PDF
1
Video
1
Studijní text
2
Web
Učitel doporučuje studovat od 7. 5. 2022 do 15. 5. 2022.

The following topics will not be covered in the 2022 course run:

Here are materials from the previous runs of the course:

I will be glad if you get encouraged into course topics and you decide to get insight into it by solving [mini]projects. Activities in this direction will be rewarded by the nontrivial number of premium points towards successful grading. The number of stars below is an estimate of project difficulty, from miniproject [(*), 10 points] to big project size [(*****), 30+ points]. I am also open to assigning/extending a project as a Bachelor/ Masters/ Dissertation thesis. 

  • (*)+ Pointing to any (factual, typographical) errors in the course materials.
  • (**)+ Preparation of Deepnote instructions, documentation, and support for the solution of course projects
  • (**)+ Preparation of hot topic slides, production or preparation of motivating Khan-Academy style video, or other course materials in LaTeX.
  • (**)+ Presentation or teaching video on topics relevant to the course. Possible topics: Sketch Engine, search with linguistic attributes, random walks in texts, topic search and corpora, time-constrained search, topic modeling with gensim, LDA, Wolfram Alpha, specifics of search of structured data (chemical and mathematical formulae, linguistic trees - syntactic or dependency), etc.
  • (***) Participation in IR competition at Kaggle.com.
  • (***)+ Participation in IR research in our group Math Information Retrieval on research agendas and ARQMath task or EuDML project or DML project.
  • (***)+ Evaluation of Math Information Retrieval in system MIaS - possible as a Dean project under the supervision of Vít Novotný or Martin Geletka or  or as a Bachelor/Masters/Dissertation thesis.

To a pupil who was in danger, Master said, “Those who do not make mistakes, they are most mistaken for all – they do not try anything new.” Anthony de Mello

Předchozí