Search icon CANCEL
Arrow left icon
Explore Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Conferences
Free Learning
Arrow right icon

Introducing Spleeter, a Tensorflow based python library that extracts voice and sound from any music track

Save for later
  • 2 min read
  • 05 Nov 2019

article-image

On Monday, Deezer, a French online music streaming service, released Spleeter which is a music separation engine.  It comes in the form of a Python Library based on Tensorflow. Stating the reason behind Spleeter, the researchers state, “We release Spleeter to help the Music Information Retrieval (MIR) community leverage the power of source separation in various MIR tasks, such as vocal lyrics analysis from audio, music transcription, any type of multilabel classification or vocal melody extraction.”

Spleeter comes with pre-trained models for 2, 4 and 5 track separation. These include:

  • Vocals (singing voice) / accompaniment separation (2 stems)
  • Vocals / drums / bass / other separation (4 stems)
  • Unlock access to the largest independent learning library in Tech for FREE!
    Get unlimited access to 7500+ expert-authored eBooks and video courses covering every tech area you can think of.
    Renews at $19.99/month. Cancel anytime
  • Vocals / drums / bass / piano / other separation (5 stems)


It can also train source separation models or fine-tune pre-trained ones with Tensorflow if you have a dataset of isolated sources.

Deezer benchmarked Spleeter against Open-Unmix another open-source model recently released and reported slightly better performances with increased speed. It can perform separation of audio files to 4 stems 100x faster than real-time when running on a GPU.

You can use Spleeter straight from the command line as well as directly in your own development pipeline as a Python library. It can be installed with Conda, with pip or be used with Docker. Spleeter creators mention a number of potential applications of source separation engine including remixes, upmixing, active listening, educational purposes, and pre-processing for other tasks such as transcription.

Spleeter received mostly positive feedback on Twitter, as people experimented to separate vocals from music.

https://twitter.com/lokijota/status/1191580903518228480

https://twitter.com/bertboerland/status/1191110395370586113

https://twitter.com/CholericCleric/status/1190822694469734401

Wavy.org also ran several songs through the two-stem filter and evaluated them in a blog post. They tried a variety of soundtracks across multiple genres. The performance of audio was much better than expected, however, vocals sometimes felt robotically autotuned. The amount of bleed was shockingly low relative to other solutions and surpassed any available free tool and rival commercial plugins and services.

https://twitter.com/waxpancake/status/1191435104788238336

Spleeter will be presented and live-demoed at the 2019 ISMIR conference in Delft.

For more details refer to the official announcement.

DeepMind AI’s AlphaStar achieves Grandmaster level in StarCraft II with 99.8% efficiency.

Google AI introduces Snap, a microkernel approach to ‘Host Networking’

Firefox 70 released with better security, CSS, and JavaScript improvements