FI:IB030 Introduction to CL - Course Information
IB030 Introduction to Computational Linguistics
Faculty of Informatics, Autumn 2004
- Extent and Intensity
- 2/0. 2 credit(s) (plus extra credits for completion). Recommended Type of Completion: zk (examination). Other types of completion: k (colloquium), z (credit).
- Teacher(s)
- doc. RNDr. Pavel Smrž, Ph.D. (lecturer)
- Guaranteed by
- prof. PhDr. Karel Pala, CSc.
Department of Machine Learning and Data Processing – Faculty of Informatics
Contact Person: prof. PhDr. Karel Pala, CSc.
- Timetable
- Wed 8:00–9:50 B411
- Prerequisites
- ! I030 Introduction to CL
Before IB030, I recommend enrolling in PV122 Formal Structure of Natural Language. Knowledge of Prolog is useful.
- Course Enrolment Limitations
- The course is also offered to students of fields other than those it is directly associated with.
- fields of study / plans the course is directly associated with
- Applied Informatics (programme FI, B-AP)
- Czech Language and Literature (programme FF, M-FI) (2)
- Czech Language and Literature (programme FF, M-HS)
- Informatics with another discipline (programme FI, B-BI)
- Informatics with another discipline (programme FI, B-FY)
- Informatics with another discipline (programme FI, B-GE)
- Informatics with another discipline (programme FI, B-GK)
- Informatics with another discipline (programme FI, B-CH)
- Informatics with another discipline (programme FI, B-IO)
- Informatics with another discipline (programme FI, B-MA)
- Informatics with another discipline (programme FI, B-SO)
- Informatics with another discipline (programme FI, B-TV)
- Informatics (programme FI, B-IN)
- Course objectives
- This course presents the main principles of natural language processing. It discusses the algorithmic description of the main levels of language, in particular morphology, syntax, semantics and pragmatics. Corpora, as sources of natural language data, are also covered. The role of knowledge representation and inference, and their relation to AI, is touched on as well.
- Syllabus
- Introduction to Computational Linguistics.
- Natural language as the main tool of human communication. Language data in corpora; an overview of corpus linguistics.
- Levels of description: phonetics and phonology, morphology, syntax, semantics and pragmatics. Traditional vs. formal grammars: representation of morphological and syntactic structures (DAGs), meaning representation. Grammars: context-free, context-sensitive, logical (DCG), transformational. Generation and recognition: morphological, syntactic, semantic. Parsing: morphological parser (AJKA), syntactic parser (KLARA). Techniques of analysis: top-down, bottom-up, mixed, heuristics. The problem of ambiguity and search.
- Electronic or machine-readable dictionaries: representation of lexical knowledge. Types of machine-readable dictionaries: monolingual, thesauri, idiomatic, morphological (stem) dictionaries, bilingual or multilingual translation dictionaries; ways of formalizing them.
- Semantic representation of sentence meaning: logical vs. lexical semantics. The Compositionality Principle.
- Semantic classification of verbs, valency frames, predicates, transparent intensional logic (TIL) and its application to semantic analysis of Czech sentences.
- Pragmatics: semantic and pragmatic nature of noun groups, discourse structure, deictic expressions, verbal and non-verbal contexts. Natural Language Understanding: semantic representation, inference and knowledge representation - are they the same? Structure of dialog systems.
- Literature
- CHOMSKY, Noam. Syntaktické struktury. Logický základ teorie jazyka. O pojmu gramatické pravidlo (Syntactic Structures). 1st ed. Praha: Academia, 1966, 209 pp.
- PALA, Karel. Počítačové zpracování přirozeného jazyka (Natural Language Processing). 1st ed. Brno: FI MU, 2000, 190 pp.
- SGALL, Petr, Eva HAJIČOVÁ and Jarmila PANEVOVÁ. The meaning of the sentence in its semantic and pragmatic aspects. 1st ed. Prague: Academia, 1986, ix, 353 pp.
- Assessment methods
- The final assessment is based on a written examination. Attendance at lectures is not compulsory.
- Language of instruction
- Czech
- Further Comments
- The course is taught annually.
- Permalink: https://is.muni.cz/course/fi/autumn2004/IB030