Word embeddings is a method which allows mapping words to numeric vectors for capturing the meaning of the words. This is used for visualization or model training.
With ML.NET 0.4, the word embedding transform is added to ML.NET allowing you to use pre-trained or existing word embedding models in pipelines. There are several different pretrained models such as GloVe, fastText, and SSWE which are available. Adding this transform method along with the existing transforms help improve the model’s metrics.
SymSGD is now available in ML.NET 0.4 for binary classification. SymSGD is a parallel SGD algorithm which retains the sequential semantics of SGD. It offers a much better performance based on multithreading. It is fast, scales well on multiple cores, and achieves the same accuracy as sequential SGD.
SymSGD can be applied to any linear learner where update rule is linear like a binary classification or a linear regression.
This is how you can add a SymSGD Binary Classifier learner to the pipeline:
pipeline.Add(new SymSgdBinaryClassifier() { NumberOfThreads = 1});
Even though there is no multithreading enabled in SymSGD at the moment, it can be helpful in cases where you want to try many different learners and limit each of them to a single thread.
ML.NET did not provide support for F# records. With ML.NET 0.4 release, you can use property-based row classes in F#.
To get more coverage, check out the official Microsoft blog.
Microsoft Open Sources ML.NET, a cross-platform machine learning framework
Create machine learning pipelines using unsupervised AutoML [Tutorial
Top AutoML libraries for building your ML pipelines