IB030 Introduction to Computational Linguistics

Faculty of Informatics
Autumn 2004
Extent and Intensity
2/0. 2 credit(s) (plus extra credits for completion). Recommended Type of Completion: zk (examination). Other types of completion: k (colloquium), z (credit).
Teacher(s)
doc. RNDr. Pavel Smrž, Ph.D. (lecturer)
Guaranteed by
prof. PhDr. Karel Pala, CSc.
Department of Machine Learning and Data Processing – Faculty of Informatics
Contact Person: prof. PhDr. Karel Pala, CSc.
Timetable
Wed 8:00–9:50 B411
Prerequisites (in Czech)
! I030 Introduction to CL
Před IB030 doporučuji zapsat PV122 Formální struktura přirozeného jazyka. Vhodná je znalost Prologu.
Course Enrolment Limitations
The course is also offered to the students of the fields other than those the course is directly associated with.
fields of study / plans the course is directly associated with
Course objectives
In this course the main principles of natural language processing are offered. The algorithmic description of the main language levels will be discussed, particularly, morphology, syntax, semantics and pragmatics. Also the resources of natural language data - corpora will be mentioned. The role of knowledge representation, inference and relations to AI will be touched as well.
Syllabus
  • Introduction to Computational Linguistics.
  • Natural language as a main tool of human communication. Language data in corpora, information about corpus linguistics.
  • Levels of description: phonetics and phonology, morphology, syntax, semantics and pragmatics. Traditional vs. formal grammars: representation of morphological and syntactic structures -- DAGs, meaning representation. Grammars: context-free, context-sensitive, logical - DCG, transformational. Generating and recognition: morphological, syntactic, sémantic. Parsing: morphological parser -- AJKA, syntactic -- KLARA, Techniques of analysis: top-down, bottom-up, mixed, heuristics. Problem of ambiguity and searching.
  • Electronic or machine readable dictionaries: representation of lexical knowledge. Types of the machine readable dictionaries: monolingual, thesauri, idiomatic, morphological dictionaries (stems), translation dictionaries, - bi- or multilingual, the ways of their formalization.
  • Semantic representation of sentece meanings: logical vs. lexical sémantics. The Compositionality Principle.
  • Semantic classification of verbs, valency frames, predicates, transparent intensional logic (TIL) and its application to semantic analysis of Czech sentences.
  • Pragmatics: sémantic and pragmatic nature of noun groups, discourse structure, deictic expressions, verbal and non-verbal contexts. Natural Language Understanding: semantic representation, inference and knowledge representations - are they the same? Structure of dialog systems.
Literature
  • CHOMSKY, Noam. Syntaktické struktury., Logický základ teorie jazyka., O pojmu gramatické pravidlo (Syntactic Structures). 1st ed. Praha: Academia, 1966, 209 s. info
  • PALA, Karel. Počítačové zpracování přirozeného jazyka (Natural Language Processing). 1st ed. Brno: FI MU, 2000, 190 pp. info
  • SGALL, Petr, Eva HAJIČOVÁ and Jarmila PANEVOVÁ. The meaning of the sentence in its semantic and pragmatic aspects. 1. vyd. Prague: Academia, 1986, ix, 353 s. info
Assessment methods (in Czech)
Závěrečné hodnocení se děje na základě písemné zkoušky. Účast na přednáškách není povinná.
Language of instruction
Czech
Further Comments
The course is taught annually.
The course is also listed under the following terms Autumn 2002, Autumn 2003, Autumn 2005, Spring 2007, Spring 2008, Spring 2009, Spring 2010, Spring 2011, Spring 2012, Spring 2013, Spring 2014, Spring 2015, Spring 2016, Spring 2017, Spring 2018, Spring 2019, Spring 2020, Spring 2021, Spring 2022, Spring 2023, Spring 2024, Spring 2025.
  • Enrolment Statistics (Autumn 2004, recent)
  • Permalink: https://is.muni.cz/course/fi/autumn2004/IB030