Seminar group 02 of the course Laboratory of Electronic and Multimedia Applications

Michal Štefánik: Unsupervised Estimation of Out-of-Distribution Performance (14. 10. 2021)

Abstract

Neural language models consistently advance the SOTA on a wide range of NLP tasks, but their performance degrades under a domain shift, i.e. when they are applied to samples from a different language domain. This prevents their deployment in some critical applications. The questionable comparability of models evaluated only in-domain also slows down research progress in this direction.

We propose a set of simple evaluation methods that estimate a system's expected performance on out-of-distribution (OOD) samples. We show how well each of these methods corresponds to the true measured performance on OOD data and demonstrate the practical implications of our work in zero-shot evaluation.

Finally, we present a set of observations that refine our understanding of neural language models, based on the novel insight these evaluation methods provide.
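
To make the idea of label-free OOD estimation concrete, below is a minimal Python sketch of one common unsupervised proxy, average predictive entropy on unlabeled samples. This is not the method presented in the talk; the model checkpoint and example texts are only illustrative.

    # A minimal sketch (not the talk's actual method): average predictive
    # entropy on unlabeled samples is one common unsupervised proxy --
    # higher entropy on OOD inputs often correlates with a larger accuracy drop.
    import torch
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    # Example sentiment checkpoint; any fine-tuned classifier would do.
    model_name = "distilbert-base-uncased-finetuned-sst-2-english"
    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForSequenceClassification.from_pretrained(model_name)
    model.eval()

    def avg_predictive_entropy(texts):
        """Mean entropy of the predicted class distribution over unlabeled texts."""
        with torch.no_grad():
            inputs = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")
            log_probs = torch.log_softmax(model(**inputs).logits, dim=-1)
            entropy = -(log_probs.exp() * log_probs).sum(dim=-1)
        return entropy.mean().item()

    # Hypothetical samples: movie reviews (in-domain) vs. financial news (OOD).
    in_domain = ["The movie was great.", "A dull, lifeless film."]
    out_of_domain = ["The quarterly earnings beat analyst estimates."]

    print(avg_predictive_entropy(in_domain), avg_predictive_entropy(out_of_domain))

A larger gap between the two averages would suggest a bigger expected performance drop on the new domain, without requiring any OOD labels.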


Seminar, 1. 4. 2021, 10:00
Michal Štefánik: Unsupervised Estimation of Out-of-Distribution Performance

Readings

Unsupervised Estimation of Out-of-Domain Performance of Language Models
Talk by Michal Štefánik at the PV173 NLP seminar
Increasing Data Efficiency: Hugging Face Quantifies the Benefits of Prompts for Pretrained Language Models
A research team from Hugging Face shows that prompting is indeed beneficial for fine-tuning pre-trained language models and that this benefit can be quantified as worth hundreds of data points on average across classification tasks.