For any presentation in class:
to improve understanding
Find words in The Little Prince that have more than one meaning.
Choose one or more methods for creating word embeddings (word2vec, FastText, GloVe, …), run the training on the same data with different parameters (and/or numbers of epochs), and evaluate the stability of the results.
Stability can be computed in several ways:
How many pairwise similarities stay the same. This can be computed on the whole vocabulary or on a sample (for example: 10 words with frequencies from [100, 400, 1600, 6400, 25600, …]).
Percentage of changes in analogy tasks. The same percentage in the task doesn’t mean that the same analogy items were successful. Calculate how many items changed between a successful and an unsuccessful estimation.
Percentage of changes in the Outlier Detection task
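
The first two stability measures above could be sketched roughly as follows. This is only one possible reading of the task, not a prescribed solution. The function names, the tolerance `tol` (how close two similarities must be to count as "the same"), the toy vectors, and the per-item True/False result lists are all assumptions made for illustration. In practice the vectors would come from two training runs and the result lists from an analogy or outlier detection evaluation.

```python
import math
from itertools import combinations


def cosine(u, v):
    """Cosine similarity of two vectors given as lists of floats."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def pair_similarity_stability(emb_a, emb_b, words, tol=0.05):
    """Fraction of word pairs whose cosine similarity differs by at most
    `tol` between two runs. `tol` is an assumed threshold, not a standard."""
    pairs = list(combinations(words, 2))
    if not pairs:
        raise ValueError("need at least two words")
    same = sum(
        abs(cosine(emb_a[w1], emb_a[w2]) - cosine(emb_b[w1], emb_b[w2])) <= tol
        for w1, w2 in pairs
    )
    return same / len(pairs)


def changed_items(results_a, results_b):
    """Fraction of task items whose outcome (success or failure) flipped
    between two runs. Both lists must cover the same items in the same order."""
    if len(results_a) != len(results_b) or not results_a:
        raise ValueError("result lists must be non-empty and of equal length")
    flipped = sum(a != b for a, b in zip(results_a, results_b))
    return flipped / len(results_a)


# Toy embeddings standing in for two training runs (hypothetical values).
run_a = {"rose": [1.0, 0.0], "fox": [0.0, 1.0], "prince": [1.0, 1.0]}
run_b = {"rose": [0.0, 1.0], "fox": [-1.0, 0.0], "prince": [1.0, 1.0]}
stability = pair_similarity_stability(run_a, run_b, ["rose", "fox", "prince"])

# Hypothetical per-item analogy outcomes: both runs score 50 %,
# yet half of the items changed between success and failure.
analogy_a = [True, True, False, False]
analogy_b = [True, False, True, False]
accuracy_a = sum(analogy_a) / len(analogy_a)
accuracy_b = sum(analogy_b) / len(analogy_b)
flip_rate = changed_items(analogy_a, analogy_b)

print(f"pair similarity stability: {stability:.2f}")
print(f"accuracy A: {accuracy_a:.2f}, accuracy B: {accuracy_b:.2f}, changed: {flip_rate:.2f}")
```

The same `changed_items` function could be applied to per-item outcomes of the Outlier Detection task, provided both runs are evaluated on the same items in the same order.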