Interactive outline
👷 Seminar on Machine Learning, Information Retrieval, and Scientific Visualization
[Marek Kadlčík]: Teaching Models to Use a Calculator for Solving Math Word Problems 23. 11. 2023
Abstract
Large language models (LLMs) are commonly used for solving natural language tasks like question answering or generating text. However, their outputs can be outdated, factually incorrect, or untruthful. In particular, LLMs are notoriously bad at arithmetic computation. A promising way to mitigate this problem is to allow LLMs to interact with external tools, such as a calculator, a computer algebra system, or a code interpreter.
In this talk, we will cover the training of calculator-using models, compare their capability of solving math word problems to vanilla LLM baselines, and discuss possible improvements in the training workflow.
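The interaction loop behind calculator-using models can be sketched in a few lines: generation pauses whenever the model emits a calculator call, the host evaluates the expression, and the result is appended before decoding resumes. The sketch below is illustrative only; the `<gadget>`/`<output>` tag format follows the Calc-X convention, while `run_with_calculator` and `model_step` are hypothetical names standing in for a real decoding API.

```python
import re

def run_with_calculator(model_step, prompt, max_calls=8):
    """Alternate between model generation and calculator evaluation.

    `model_step` is a stand-in for an LLM decoding call that returns
    the next chunk of text; the <gadget>/<output> tags follow the
    Calc-X convention (illustrative, not a real library API)."""
    text = prompt
    for _ in range(max_calls):
        text += model_step(text)
        # A pending tool call is a <gadget> tag at the end of the text.
        call = re.search(r'<gadget id="calculator">([^<]+)</gadget>$', text)
        if not call:
            break  # no pending tool call: generation has finished
        # Evaluate the arithmetic expression (eval with empty builtins
        # as a minimal safeguard; a real system would use a proper parser).
        result = eval(call.group(1), {"__builtins__": {}}, {})
        text += f"<output>{result}</output>"
    return text
```

With a mock `model_step` that first emits `<gadget id="calculator">4*7</gadget>` and then a final answer, the loop interleaves `<output>28</output>` into the transcript exactly where the model requested the computation.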
Visual Abstract
Slides
Presentation recordings
Readings
- Parisi, A., Zhao, Y., & Fiedel, N. (2022). TALM: Tool Augmented Language Models. doi.org/10.48550/arXiv.2205.12255
- Schick, T., Dwivedi-Yu, J., Dessì, R., Raileanu, R., Lomeli, M., Zettlemoyer, L., Cancedda, N., & Scialom, T. (2023). Toolformer: Language Models Can Teach Themselves to Use Tools. doi.org/10.48550/arXiv.2302.04761
- Gao, L., et al. (2022). PAL: Program-aided Language Models.
- Kadlčík, M., et al. (2023). Calc-X and Calcformers: Empowering Arithmetical Chain-of-Thought through Interaction with Symbolic Systems.
- LangChain: https://www.langchain.com/
Catering
The talk itself was very foody.