Let's evaluate our newly acquired knowledge by answering the following questions:
- What is the difference between the skip-gram and CBOW models?
- What is the loss function of the CBOW model?
- What is the need for negative sampling?
- Define PV-DM.
- What is the role of the encoder and decoder in the skip-thoughts vector?
- What are quick thoughts vector?