Přeskočit na horní lištu Přeskočit na hlavičku Přeskočit na obsah Přeskočit na patičku

👷 Seminar on Machine Learning, Information Retrieval, and Scientific Visualization

Interaktivní osnova

👷 Seminar on Machine Learning, Information Retrieval, and Scientific Visualization

[Marek Kadlčík]: Can language models use external tools? 20. 4. 2023

Abstract

Large language models (LLMs) are commonly used for solving
natural language tasks like question answering or generating text. However,
their outputs can be factually incorrect, untruthful, outdated, or
otherwise limited by the knowledge encoded in the trained parameters. One
promising way to mitigate this problem is to allow LLMs to interact with
external tools, such as an IR system, a calculator, a knowledge graph, a
private database, or a code interpreter.

Visual Abstract

Llm tools

Slides

Lecture slides of Marek Kadlčík: Can large language models use external tools?

Presentation recordings

Lecture recording

Readings

Parisi, Aaron & Zhao, Yao & Fiedel, Noah. (2022). TALM: Tool Augmented Language Models. doi.org/10.48550/arXiv.2205.12255.
Timo Schick, Jane Dwivedi-Yu, Roberto Dessì, Roberta Raileanu, Maria Lomeli, Luke Zettlemoyer, Nicola Cancedda, Thomas Scialom (2022). Toolformer: Language Models Can Teach Themselves to Use Tools. doi.org/10.48550/arXiv.2302.04761
https://platform.openai.com/docs/plugins/introduction
https://python.langchain.com/en/latest/

Předchozí

👷 Seminar on Machine Learning, Information Retrieval, and Scientific Visualization
- Nyní studovat
  
  [Richard Šoltis, Šárka Ščavnická, Dávid Meluš]: Topics and research of diploma thesis 23. 2. 2023
- Nyní studovat
  
  [Jakub Ryšavý]: Confidence Intervals 9. 3. 2023
- Nyní studovat
  
  [Katarína Grešová]: Using Attribution Sequence Alignment to Interpret Deep Learning Models for MiRNA Binding Site Prediction 16. 3. 2023
- Nyní studovat
  
  [Dávid Meluš]: Utilization of contextual information for post-OCR error correction using language models 23. 3. 2023
- Nyní studovat
  
  [Michal Štefánik et al.]: Intelligent Back Office: the past, present, and future 30. 3. 2023
- Nyní studovat
  
  [Šárka Ščavnická]: Multimodal Question Answering 13. 4. 2023
- Nyní studovat
  
  [Marek Kadlčík]: Can language models use external tools? 20. 4. 2023
- Nyní studovat
  
  [David Čechák]: Deep learning in DNA decay prediction 4. 5. 2023
- Nyní studovat
  
  [Michal Štefánik, Marek Kadlčík]: EACL breaking news 18. 5. 2023

Operace

Prohlédnout vše