FI:PA153 NL Processing - Course Information
PA153 Natural Language Processing
Faculty of InformaticsAutumn 2019
- Extent and Intensity
- 2/0/0. 2 credit(s) (plus extra credits for completion). Recommended Type of Completion: zk (examination). Other types of completion: k (colloquium), z (credit).
- Teacher(s)
- prof. PhDr. Karel Pala, CSc. (lecturer), doc. RNDr. Aleš Horák, Ph.D. (deputy)
RNDr. Vojtěch Kovář, Ph.D. (assistant)
RNDr. Zuzana Nevěřilová, Ph.D. (alternate examiner) - Guaranteed by
- prof. PhDr. Karel Pala, CSc.
Department of Machine Learning and Data Processing – Faculty of Informatics
Contact Person: prof. PhDr. Karel Pala, CSc.
Supplier department: Department of Machine Learning and Data Processing – Faculty of Informatics - Timetable
- Wed 12:00–13:50 C416
- Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
- fields of study / plans the course is directly associated with
- Image Processing and Analysis (programme FI, N-VIZ)
- Applied Informatics (programme FI, N-AP)
- Information Technology Security (eng.) (programme FI, N-IN)
- Information Technology Security (programme FI, N-IN)
- Bioinformatics and systems biology (programme FI, N-UIZD)
- Bioinformatics (programme FI, N-AP)
- Computer Games Development (programme FI, N-VIZ_A)
- Computer Graphics and Visualisation (programme FI, N-VIZ_A)
- Computer Networks and Communications (programme FI, N-PSKB_A)
- Cybersecurity Management (programme FI, N-RSSS_A)
- Formal analysis of computer systems (programme FI, N-TEI)
- Graphic design (programme FI, N-VIZ)
- Graphic Design (programme FI, N-VIZ_A)
- Hardware Systems (programme FI, N-PSKB_A)
- Hardware systems (programme FI, N-PSKB)
- Image Processing and Analysis (programme FI, N-VIZ_A)
- Information security (programme FI, N-PSKB)
- Information Systems (programme FI, N-IN)
- Informatics (eng.) (programme FI, D-IN4)
- Informatics (programme FI, D-IN4)
- Information Security (programme FI, N-PSKB_A)
- Quantum and Other Nonclassical Computational Models (programme FI, N-TEI)
- Parallel and Distributed Systems (programme FI, N-IN)
- Computer graphics and visualisation (programme FI, N-VIZ)
- Computer Graphics (programme FI, N-IN)
- Computational Linguistics (programme FF, N-PLIN_) (3)
- Computer Networks and Communication (programme FI, N-IN)
- Computer Networks and Communications (programme FI, N-PSKB)
- Computer Systems and Technologies (eng.) (programme FI, D-IN4)
- Computer Systems and Technologies (programme FI, D-IN4)
- Computer Systems (programme FI, N-IN)
- Principles of programming languages (programme FI, N-TEI)
- Embedded Systems (eng.) (programme FI, N-IN)
- Embedded Systems (programme FI, N-IN)
- Cybersecurity management (programme FI, N-RSSS)
- Services development management (programme FI, N-RSSS)
- Software Systems Development Management (programme FI, N-RSSS)
- Services Development Management (programme FI, N-RSSS_A)
- Service Science, Management and Engineering (eng.) (programme FI, N-AP)
- Service Science, Management and Engineering (programme FI, N-AP)
- Social Informatics (programme FI, B-AP)
- Software Systems Development Management (programme FI, N-RSSS_A)
- Software Systems (programme FI, N-PSKB_A)
- Software systems (programme FI, N-PSKB)
- Machine learning and artificial intelligence (programme FI, N-UIZD)
- Theoretical Informatics (programme FI, N-IN)
- Upper Secondary School Teacher Training in Informatics (programme FI, N-SS) (2)
- Artificial Intelligence and Natural Language Processing (programme FI, N-IN)
- Computer Games Development (programme FI, N-VIZ)
- Processing and analysis of large-scale data (programme FI, N-UIZD)
- Image Processing (programme FI, N-AP)
- Natural language processing (programme FI, N-UIZD)
- Course objectives
- The course offers a deeper knowledge about the natural language processing and computational linguistics.
- Learning outcomes
- The students will learn about the particular levels of linguistic analysis - morphology, syntax, semantics and pragmatics.
They will be able to use language data - corpora, types of corpora, corpus tools, perform tagging corpus texts, disambiguation with rule based and statistical systems.
They will be acquainted with representation of the morphological stuctures, notation and algorithms for morphological analysis.
The students will be able to work with the representations of syntactic structures - formal grammars and their types. They will learn about context-free, functional and definite-clause grammars and related parsing algorithms.
The data structures such as valency frames and their types will be explained as well.
They will learn about lexical semantics - meanings of words and collocations, machine readable dictionaries, lexical databases (WordNet, EuroWordNet, thesauri).
Semantic analysis of sentence, principles of logical semantic and Normal Translation Algorithm will be presented.
Pragmatics and discourse analysis and its segmentation, anaphora and (co-)reference will be explained.
The students obtain basic knowledge about dialogue systems, inference systems and knowledge representation for NLP systems.
They will be able to understand the principles of the communication agents and main evaluation techniques. - Syllabus
- Natural language processing and computational linguistics.
- Natural language and understanding.
- Levels of linguistic analysis - morphology, syntax, semantics.
- Language data - corpora. Types of corpora. Corpus tools. Tagging corpus texts. Disambiguation, rule based and statistical systems.
- Representation of the morphological stuctures, notation, morphological algorithms.
- Representation of syntactic structures - formal grammars and their types. Context-free and definite-clause grammars. Parsing algorithms. Valency frames and their types.
- Semantic representation. Lexical meanings (words and collocations), machine readable dictionaries, lexical databases (WordNet, EuroWordNet, thesauri).
- Semantic analysis of sentence meaning, Normal Translation Algorithm.
- Pragmatics.
- Discourse analysis and its segmentation. Anaphora and (co-)reference.
- Inference and knowledge representation for NL systems.
- Dialogue systems.
- Communication agents.
- Evaluation techniques
- Literature
- Teaching methods
- Teaching is performed in the form of oral lectures and seminars, in which the slides and demos of the relevant software tools are combined. Students work out homeworks, prepare presentations based on the literature they had read and develop smaller projects. At the appropriate points of the teaching the open dialog between a teacher and students is used.
- Assessment methods
- written test
- Language of instruction
- Czech
- Further Comments
- Study Materials
The course is taught annually.
- Enrolment Statistics (Autumn 2019, recent)
- Permalink: https://is.muni.cz/course/fi/autumn2019/PA153