PLIN011 Transcription of Spoken Czech
Faculty of Arts
Spring 2025
- Extent and Intensity
- 0/2/0. 4 credit(s). Type of Completion: z (credit).
In-person direct teaching
- Teacher(s)
- Mgr. Dana Hlaváčková, Ph.D. (lecturer)
- Guaranteed by
- Mgr. Dana Hlaváčková, Ph.D.
Department of Czech Language – Faculty of Arts
Contact Person: Bc. Silvie Hulewicz, DiS.
Supplier department: Department of Czech Language – Faculty of Arts
- Timetable
- Tue 10:00–11:40 C42, except Mon 21. 4. to Sun 27. 4.
- Prerequisites
- Students will need their own laptops.
- Course Enrolment Limitations
- The course is also offered to students of fields other than those it is directly associated with.
The capacity limit for the course is 15 student(s).
Current registration and enrolment status: enrolled: 10/15, only registered: 0/15, only registered with preference (fields directly associated with the programme): 0/15
- Fields of study / plans the course is directly associated with
- There are 6 fields of study the course is directly associated with.
- Course objectives
- Students will learn how to process spoken language. The core requirement is to make a recording of spontaneous spoken Czech and to transcribe it. The aim is to teach students how to make recordings and how to transcribe them. Students will produce an orthographic transcription of their recording in Transcriber. In parallel, they will test AI transcription tools (probably with subsequent editing of the output). The recordings and their transcriptions will be used to build a new spoken corpus in Sketch Engine Manager. For this purpose, the data will also need to be annotated.
- Learning outcomes
- Upon completion of the course, the student will be able to:
- understand the issues related to the acquisition and transcription of spoken data;
- work with automatic tools for transcribing Czech;
- make a recording and then transcribe it according to established transcription rules;
- work with Transcriber;
- prepare linguistic data for the creation of a spoken corpus.
- Syllabus
- 1. Introduction to spoken data processing.
- 2. Instructions for making a recording of spoken Czech.
- 3. Introduction to the Transcriber tool and instructions for orthographic transcription of recordings.
- 4. Editing audio recordings and transcribing them.
- 5. Testing AI tools for automatic transcription of Czech.
- 6. Work on transcription and editing of automatically generated data.
- 7. Annotating data to create a prevertical in Sketch Engine.
- 8. Final evaluation of the annotation process.
- Teaching methods
- Frontal teaching (delivery of information and instruction), workshop teaching (transcription, editing, annotation), independent work (making recordings), discussion and consultation.
- Assessment methods
- Active participation in seminars. Making a good-quality recording and its orthographic transcription. Testing at least one tool for automatic transcription of spoken Czech and editing the resulting text. Annotating data for the purpose of building a spoken corpus.
- Language of instruction
- Czech
- Further Comments
- The course is taught annually.
- Permalink: https://is.muni.cz/course/phil/spring2025/PLIN011