Upload an audio file. We separate the vocals from the instrumental using self-hosted Demucs v4, the open-source model Facebook AI Research released in 2023. The whole track gets processed, not a clip. Free. No watermark. No signup.
Waiting for the Demucs worker to pick up your job.
Output stems auto-delete after 1 hour. Save them now if you want to keep them.
Until 2020, removing vocals from a finished mix was nearly impossible. Studios used phase-cancellation tricks that worked only on stereo recordings where the vocal sat dead-centre, and even then the result was muddy. The breakthrough came when researchers stopped trying to subtract the vocal mathematically and started training neural networks on thousands of pairs: a full mix and its isolated stems.
Demucs v4, the model running on this page, takes that further. It works in both the time domain and the frequency domain simultaneously. The time-domain branch listens to the raw waveform like a person would. The frequency-domain branch looks at the spectrogram, the visual representation of which frequencies are active when. The two views feed into a transformer that decides, for each tiny slice of audio, what proportion belongs to vocals versus everything else.
The model has been trained on the MusDB18 dataset (150 full songs with isolated stems) and millions of additional samples. It does not know your song specifically. It knows what vocals tend to look like in a spectrogram, what drums look like, what a bass line looks like, and it applies that pattern to whatever you give it.
The most popular use. Strip the vocal off any song, sing over the instrumental yourself. Quality is genuinely good enough that the result feels professional.
Producers extract vocals from a track to remix or mash up over their own beat. Note: releasing a remix commercially still requires sample clearance from the rights holder.
Hip-hop and electronic producers isolate instrumental hooks from older recordings to flip into new productions. Demucs catches details the original mix bus may have buried.
Vocal coaches isolate a singer's performance to study technique, phrasing, breath control. Producers isolate drum stems to study programming and groove.
Content creators strip vocals from licensed music to use as soundtrack without copyright issues with the original lyric. (Still need to license the underlying composition.)
Labels and curators isolate vocal performances when judging a demo, to evaluate the singer separately from the production polish.
Most vocal-removal services charge $10–$30 per month for a few separations. We thought about it and ran the numbers: Demucs is open-source, the only real cost is the CPU time on our server, and the page is small enough to support itself with a couple of ads. So we run it on our own machine and keep it free, unlimited, no watermark.
The trade-off is processing time. We don't have a GPU, so a 3-minute song takes about 2 to 3 minutes of real-world CPU time. The page tells you exactly when your job is queued, processing, and done. You can leave it open and come back, or close it and come back via the same URL. One running job at a time site-wide to keep the server happy. If there's a queue ahead of you, you'll see it.
You get 15 separations per IP per day. The 150 MB file size cap covers full albums and extended DJ sets. If you have a real production workflow that needs more than that, get in touch.
If your track is afro house or deep house, send it to Ben. €3, listened to in full, written feedback within 72 hours.
Pitch your track — €3 →