Laboratory of Electronic and Multimedia Applications (Research Section)

[Martin Geletka]: Visual Document Understanding 14. 4. 2022

Abstract

We will present the individual tasks in the area of Visual Document Understanding. We will illustrate these theoretical tasks with the practical needs of an Intelligent Back Office. We will describe interesting approaches that combine information from images and text.

Presentation

Visual Document Understanding

Slides presented at the seminar on April 14, 2022

2022-04-14-geletka.mp4

Recording of Martin Geletka's lecture, 14. 4. 2022

Readings

Xu, Y. et al. (2020). LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding (https://arxiv.org/abs/2012.14740)
Kim, G. et al. (2022). Donut: Document Understanding Transformer without OCR (https://arxiv.org/abs/2111.15664)
Li, M. et al. (2021). TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models (https://arxiv.org/abs/2109.10282)


Laboratory of Electronic and Multimedia Applications (Research Section)
- Introduction 17. 2. 2022
- Topics, Lecture Allocation, Literature Review Methodology 24. 2. 2022
- [Marek Petrovič]: One Bit at a Time: Impact of Quantisation on NMT Robustness 17. 3. 2022
- [Lukáš Mikula]: Think Twice Before You Answer: Mitigating Biases of Question Answering Models 24. 3. 2022
- [Katarína Grešová]: Modeling Small RNA Binding Rules 31. 3. 2022
- [Michal Štefánik, Martin Geletka, Petr Sojka]: Math Information Retrieval: The past, the present, and the bright ARQMath 3 future 7. 4. 2022
- [Martin Geletka]: Visual Document Understanding 14. 4. 2022
- [Jakub Ryšavý]: Decentralized Finance Backtesting 28. 4. 2022
- [Dávid Čechák + Vlasta Martinek]: Deep Learning for Drug Discovery 5. 5. 2022
- [Michal Štefánik]: Robustness of Neural Language Models 12. 5. 2022
