Machine Learning in Image Processing
Week 10 - Attention
One of the most important recent developments in machine learning is the invention of attention mechanisms. Previously, neural networks could only process information within receptive fields of limited size, whereas attention lets a model focus on important details regardless of their position in the input. This powerful technique forms the basis of the transformer architecture, which is the fundamental building block of models such as ChatGPT. In this seminar, we will demonstrate the basic operations inside an attention module and show how they can be used to build models for image captioning.
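The core operation inside an attention module is scaled dot-product attention: queries are compared against keys, the similarities are normalized with a softmax, and the resulting weights mix the values. A minimal NumPy sketch (function and variable names are illustrative, not from a particular library):

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def scaled_dot_product_attention(Q, K, V):
    """Compute softmax(Q K^T / sqrt(d_k)) V."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)      # (n_queries, n_keys) similarities
    weights = softmax(scores, axis=-1)   # each query's weights sum to 1
    return weights @ V, weights

# Toy example: 3 queries attend over 4 key/value pairs of dimension 8.
rng = np.random.default_rng(0)
Q = rng.normal(size=(3, 8))
K = rng.normal(size=(4, 8))
V = rng.normal(size=(4, 8))
output, weights = scaled_dot_product_attention(Q, K, V)
print(output.shape, weights.shape)  # (3, 8) (3, 4)
```

Note that the output for each query is a weighted average of the values, so the mechanism can pull in information from any position, near or far.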
Goals:
- Examine a single attention module in detail.
- Gain experience with the various techniques used alongside attention, such as positional encoding.
- Implement a vision transformer and use it to perform image classification.
- Demonstrate image captioning using a sample implementation of the paper "Show, Attend and Tell".
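One of the techniques listed above is positional encoding: attention by itself is permutation-invariant, so information about token (or image patch) positions must be injected separately. A minimal sketch of the sinusoidal encoding used in the original transformer paper (the function name is my own):

```python
import numpy as np

def sinusoidal_positional_encoding(n_positions, d_model):
    # PE[pos, 2i]   = sin(pos / 10000^(2i / d_model))
    # PE[pos, 2i+1] = cos(pos / 10000^(2i / d_model))
    pos = np.arange(n_positions)[:, None]          # (n_positions, 1)
    i = np.arange(d_model // 2)[None, :]           # (1, d_model // 2)
    angles = pos / (10000 ** (2 * i / d_model))    # (n_positions, d_model // 2)
    pe = np.zeros((n_positions, d_model))
    pe[:, 0::2] = np.sin(angles)  # even dimensions get sines
    pe[:, 1::2] = np.cos(angles)  # odd dimensions get cosines
    return pe

pe = sinusoidal_positional_encoding(50, 16)
print(pe.shape)  # (50, 16)
```

The encoding is simply added to the input embeddings before the first attention layer, giving each position a unique, smoothly varying signature.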