Seminar group 02 of the course Laboratory of Electronic and Multimedia Applications

Michal Štefánik: Unsupervised Estimation of Out-of-Distribution Performance, 1 April 2021, 10:00

Abstract

Neural language models consistently advance the state of the art (SOTA) on a wide range of NLP tasks, but they do not perform consistently well under domain shift, i.e. when applied to samples from a different language domain. This prevents their deployment in some critical applications. The questionable comparability of models evaluated only in-domain also slows further research progress in this direction.

We propose a set of simple evaluation methods that estimate the expected performance of a system on out-of-distribution (OOD) samples. We show how well each of these methods corresponds to the true measured performance on OOD data and demonstrate the practical implications of our work in zero-shot evaluation.
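
The abstract does not detail the proposed estimators, so the following is only a minimal sketch of one generic, label-free baseline for estimating OOD performance: the average maximum softmax confidence over unlabeled OOD samples, which for a reasonably calibrated classifier tracks its accuracy. The function names and the synthetic ood_logits array are illustrative assumptions, not the methods from the talk.

import numpy as np

def softmax(logits: np.ndarray) -> np.ndarray:
    # Row-wise softmax with the usual max-shift for numerical stability.
    shifted = logits - logits.max(axis=1, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=1, keepdims=True)

def average_confidence(logits: np.ndarray) -> float:
    # Mean maximum softmax probability over unlabeled samples.
    # For a reasonably calibrated classifier, this is a crude, label-free
    # proxy for its accuracy on those same samples.
    probs = softmax(logits)
    return float(probs.max(axis=1).mean())

# Hypothetical usage: ood_logits stands in for a model's raw outputs on
# unlabeled out-of-distribution text, with shape (n_samples, n_classes).
rng = np.random.default_rng(0)
ood_logits = rng.normal(size=(1000, 3))
print(f"Estimated OOD accuracy: {average_confidence(ood_logits):.3f}")

The quality of such a proxy depends on calibration: an overconfident model overestimates its OOD accuracy, which is why calibration steps such as temperature scaling are commonly applied before using confidence as a performance estimate.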

Finally, we present a set of observations that refine our understanding of neural language models, based on the novel insights these evaluation methods provide.



Readings

Unsupervised Estimation of Out-of-Domain Performance of Language Models
Talk by Michal Štefánik at the PV173 NLP seminar
Increasing Data Efficiency: Hugging Face Quantifies the Benefits of Prompts for Pretrained Language Models
A research team from Hugging Face shows that prompting is indeed beneficial for fine-tuning pre-trained language models and that this benefit can be quantified as worth hundreds of data points on average across classification tasks.