- What are similarity metrics? Why does cosine similarity work best?
- Why do matching networks use the LSTM architecture to obtain embeddings?
- What are the disadvantages associated with the contrastive loss function, and how does the triplet loss function assist in solving it?
- What is the curse of dimensionality? How can we deal with it?




















































