IA161 Advanced Techniques of Natural Language Processing

Faculty of Informatics
Autumn 2016
Extent and Intensity
1/1/0. 2 credit(s) (plus extra credits for completion). Type of Completion: k (colloquium).
Teacher(s)
doc. RNDr. Aleš Horák, Ph.D. (lecturer)
Mgr. et Mgr. Vít Baisa, Ph.D. (lecturer)
RNDr. Miloš Jakubíček, Ph.D. (lecturer)
RNDr. Vojtěch Kovář, Ph.D. (lecturer)
RNDr. Jiří Materna, Ph.D. (lecturer)
RNDr. Marek Medveď, Ph.D. (lecturer)
RNDr. Zuzana Nevěřilová, Ph.D. (lecturer)
RNDr. Adam Rambousek, Ph.D. (lecturer)
RNDr. Jan Rygl (lecturer)
RNDr. Vít Suchomel, Ph.D. (lecturer)
Mgr. Ján Švec (lecturer)
Guaranteed by
doc. RNDr. Aleš Horák, Ph.D.
Department of Machine Learning and Data Processing – Faculty of Informatics
Supplier department: Department of Machine Learning and Data Processing – Faculty of Informatics
Timetable
Wed 10:00–11:50 A219
Prerequisites
PA153 NL Processing || IB030 Introduction to CL || NOW( PA153 NL Processing ) || NOW( IB030 Introduction to CL )
All students should have basic practical knowledge of programming in Python.
Course Enrolment Limitations
The course is also offered to the students of the fields other than those the course is directly associated with.
fields of study / plans the course is directly associated with
Course objectives
The course participants will have the opportunity to learn about, test and experiment with advanced techniques of natural language processing (NLP) and to develop an understanding of the limits of those techniques. The course aims to introduce current research issues, and to meet in practice with particular programming techniques used in language technology applications.
Syllabus
  • The presented NLP problems will concentrate on practical problems connected with processing human-produced textual data. Particular topics include:
  • Opinion mining, sentiment analysis
  • Machine translation
  • Parsing of Czech: Between Rules and Statistics
  • Named Entity Recognition
  • Building Language Resources from the Web (effective crawling, boilerplate removal, tokenisation, near duplicates identification)
  • Language modelling
  • Topic identification, topic modelling
  • Extracting structured information from text
  • Automatic relation extraction (hypernyms, synonyms, ...)
  • Adaptive electronic dictionaries
  • Terminology identification (keywords, key phrases)
  • Anaphora resolution
  • Stylometry
  • Automatic language corrections
Literature
  • JURAFSKY, Dan and James H. MARTIN. Speech and language processing : an introduction to natural language processing, computational linguistics and speech recognition. 2nd ed. New Jersey: Pearson, 2009, 1024 s. ISBN 9780135041963. info
  • MANNING, Christopher D. and Hinrich SCHÜTZE. Foundations of statistical natural language processing. Cambridge, Mass.: MIT Press, 1999, xxxvii, 68. ISBN 9780262133609. info
  • MERLO, Paola, Harry BUNT and Joakim NIVRE. Trends in Parsing Technology: Dependency Parsing, Domain Adaptation, and Deep Parsing. Springer Netherlands, 2011, 297 pp. Text, Speech and Language Technology, Vol. 43. ISBN 978-90-481-9351-6. URL info
  • https://www.aclweb.org/anthology/
Teaching methods
Each lecture consists of a one-hour lesson about the theoretical issues connected with a particular NLP problem, and a one-hour practical work in a computer laboratory devoted to the implementation, adaptation and evaluation of the presented techniques on real-world data.
Assessment methods
Solving tasks in the practical part of the lecture or in homeworks.
Language of instruction
Czech
Further Comments
Study Materials
The course is taught annually.
Teacher's information
http://nlp.fi.muni.cz/AdvancedNlpCourse
The course is also listed under the following terms Autumn 2011, Autumn 2013, Autumn 2014, Autumn 2015, Autumn 2017, Autumn 2018, Autumn 2019, Autumn 2020, Autumn 2021, Autumn 2022, Autumn 2023, Autumn 2024.
  • Enrolment Statistics (Autumn 2016, recent)
  • Permalink: https://is.muni.cz/course/fi/autumn2016/IA161