CJBB105 Introduction in Corpus Linguistics - lecture

Faculty of Arts
Spring 2009
Extent and Intensity
2/0/0. 4 credit(s). Type of Completion: k (colloquium).
Teacher(s)
doc. PhDr. Klára Osolsobě, Dr. (lecturer)
Guaranteed by
doc. PhDr. Klára Osolsobě, Dr.
Department of Czech Language – Faculty of Arts
Contact Person: Jaroslava Vybíralová
Timetable
Fri 8:20–9:55 zruseno D31
Course Enrolment Limitations
The course is also offered to the students of the fields other than those the course is directly associated with.
fields of study / plans the course is directly associated with
there are 10 fields of study the course is directly associated with, display
Course objectives
The aim of the course is to give the first information about corpus-based approach to language and linguistics. Following issues are to be discussed: 1) Corpus linguistics - history 2) What is a corpus and what is in it? 3) Quantitative data 4) The use of corpora in language studies 5) Corpora and computational linguistics 6) Corpus managers 7) Part of speech analysis and tagging of a corpus 8) Czech national corpus 9) Corpora at MU
Syllabus
  • 1) Corpus linguistics - history 2) What is a corpus and what is in it? 3) Quantitative data 4) The use of corpora in language studies 5) Corpora and computational linguistics 6) Corpus managers 7) Part of speech analysis and tagging of a corpus 8) Czech national corpus 9) Corpora at MU
Literature
  • Čermák F, Blatná R. (eds.) (1995): Manuál lexikografie. Jinočany : H&H.
  • Čermák František (1999): Oxfordská lexikografie přechází také plně na korpus. Slovo a slovesnost, 60, s. 136-141.
  • Čermák F., Králík J., Kučera K. (1997): Recepce současné češtiny a reprezentativnost korpusu (Výsledky a některé souvislosti jedné orientační sondy na pozadí budování Českého národního korpusu). SaS, 58, 2, s. 118-124.
  • Čermák F., Klímová J., Petkevič V. (eds.) (2000): Studie z korpusové lingvistiky , Praha: FF UK.
  • http://ucnk.ff.cuni.cz/
  • Čermák, F.: Jazykový korpus: Prostředek a zdroj poznání. SaS, 56, 1995, s. 119-140.
  • Burnard L. (1993): A Gentle Introduction to XML.
  • McEnery A., Wilson A. (1996): Corpus Linguistics. Edinburgh University Press, Edinburgh.
  • Karlík P., Nekula M., Pleskalová J. (eds.) (2002): Encyklopedický slovník češtiny. Praha : Nakladatelství Lidové noviny.
  • Barnbrook G. (1996): Language and Computers. Edinburgh University Press, Edinburgh. Boguraev B., Briscoe T. (1989): Computational Lexicography for Natural Language Processing. Longman, London - New York.
Assessment methods
Individual text studium and consult. Written test : terminology, definitions - (knowledge of entered texts); colloquium
Language of instruction
Czech
Follow-Up Courses
Further comments (probably available only in Czech)
Study Materials
The course is taught each semester.
Listed among pre-requisites of other courses
The course is also listed under the following terms Spring 2006, Autumn 2006, Spring 2007, Autumn 2007, Spring 2008, Autumn 2008, Autumn 2009, Spring 2010, Autumn 2010, Spring 2011, Autumn 2011, Spring 2012, Autumn 2012, Spring 2013, Autumn 2013, Spring 2014, Autumn 2014, Spring 2015, Autumn 2015, Autumn 2016, Autumn 2017, Autumn 2018, Spring 2020, Spring 2021, Spring 2022, Spring 2023, Spring 2024, Spring 2025.
  • Enrolment Statistics (Spring 2009, recent)
  • Permalink: https://is.muni.cz/course/phil/spring2009/CJBB105