Online Audio Stem Separator
Easily isolate vocals, drums, bass, and other instruments from your audio tracks using the power of Demucs, completely free.
No file selected
Tip: Uses CPU SERVER for processing and may take up to 10 minutes to complete, Do not close or navigate away from this page during processing... FASTER GPU SERVER COMING SOON!
Unlock Your Audio: What is Stem Separation?
Stem separation, also known as music source separation, is the process of isolating individual instrumental and vocal tracks (called "stems") from a mixed audio recording. Imagine having a finished song and being able to extract just the vocals, or only the drum track, or the bassline – that's the magic of stem separation!
This technology has revolutionized how musicians, producers, DJs, and audio enthusiasts interact with music, opening up a world of creative possibilities.

Powered by Demucs: State-of-the-Art AI
This online tool utilizes Demucs, a cutting-edge deep learning model for music source separation developed by Meta AI (formerly Facebook Research). Demucs is renowned for its high-quality separation capabilities, leveraging advanced neural network architectures trained on vast amounts of music data.
Key features of Demucs include:
- High Fidelity: It excels at producing clean stems with minimal artifacts compared to many older methods.
- Multiple Models: Demucs offers various pre-trained models, each with different characteristics. This service typically uses a hybrid transformer model (like
htdemucs_ft
) known for its excellent balance of quality and computational efficiency. - Open Source: The Demucs project is open-source (learn more about open source software), and the Demucs code itself can be found on GitHub, fostering a community of researchers and developers who contribute to its advancement.
By using Demucs, this tool aims to provide you with professional-grade stem separation results, accessible directly from your browser.
How to Use This Stem Separator
Getting your audio stems is simple:
- Choose Your Audio File: Click the "Choose Audio File" button and select an audio file from your device. We support common formats like MP3, WAV, FLAC, M4A, AAC, and OGG. For best results, use a high-quality source file.
- Upload & Process: Once a file is selected, the "Separate Stems" button will activate. Click it to upload your file to our server for processing. You'll see a progress bar and log messages updating you on the status.
- Be Patient: Can take up to 10 minutes to complete one 5 minute song. Stem separation is a computationally intensive task. Depending on the length of your audio file and current server load, processing can take several minutes. Please keep the browser tab open.
- Download Your Stems: When processing is complete, a download will automatically begin for a ZIP file containing your separated stems (typically vocals, drums, bass, and "other" instruments).
Supported Stems: This service typically separates audio into four main stems:
- Vocals
- Drums
- Bass
- Other (includes guitars, keyboards, synths, strings, etc.)

Creative Possibilities: What Can You Do With Stems?
Audio stems open up a vast array of creative and practical applications:
- Remixing & Mashups: DJs and producers can easily grab acapellas (vocals) or instrumental parts to create unique remixes and mashups.
- Karaoke Tracks: Remove the vocals from a song to create your own karaoke versions.
- Music Learning & Practice: Musicians can isolate specific instrument parts to learn them more easily or play along with a backing track minus their instrument.
- Sampling: Extract clean instrumental loops or vocal phrases for use in new compositions (always respect copyright!).
- Audio Post-Production: Enhance or rebalance elements in a mix if the original multitracks are unavailable.
- Educational Purposes: Analyze song structures and arrangements by listening to individual components.
- Content Creation: Obtain instrumental backing tracks for videos or podcasts.
Tips for Optimal Stem Separation
To get the best possible quality from our Demucs-powered separator, consider these important tips:
- Start with High-Quality Source Audio: This is the most crucial factor.
- Use lossless formats like WAV or FLAC if available.
- If using MP3 or AAC, ensure they are high bitrate (e.g., 256kbps or 320kbps).
- Avoid heavily compressed or low-quality audio, as this will significantly increase artifacts (like "warbling" or "swishy" sounds) in the separated stems.
- Choose Songs with Clear Mixes: Audio where instruments are well-defined and not overly "glued" together by heavy mastering compression or excessive reverb will generally separate more cleanly. Densely packed or "muddy" mixes are more challenging for any AI.
- Be Mindful of the File Size Limit: This service currently accepts files up to 100MB. This helps ensure fair usage and manageable processing times for all users. If your lossless file is too large, consider converting it to a high-quality MP3 (320kbps) or processing a shorter segment.
- Patience During Processing: Stem separation is complex. Depending on your audio's length and current server demand, it can take several minutes. Please keep the browser tab open and monitor the progress indicators.
While these tips can improve results, remember that all AI source separation tools, including Demucs, may still produce minor artifacts or slight "bleeding" between stems, especially on very complex material. This is a characteristic of the current technology.
Understanding Demucs Models (A Bit More Technical)
The Demucs project has evolved, offering several model architectures. Some notable ones include:
- Original Demucs: Based on a U-Net convolutional architecture.
- Hybrid Demucs (
htdemucs
): Combines convolutional layers with Transformers, often providing improved quality. This is what our service primarily uses, specifically a fine-tuned version (often denoted by_ft
). For instance,htdemucs_ft
is a popular choice, fine-tuned on the MusDB HQ dataset. - MDX Models (e.g.,
mdx
,mdx_extra
,mdx_q
): These were earlier models, sometimes trained or fine-tuned by the community, and were part of the broader Demucs ecosystem. While good, newer hybrid transformer models generally offer superior performance for the standard four stems.
The choice of model involves trade-offs between separation quality, speed, and the specific stems it's trained to isolate. We aim to use a model that provides a great balance for general-purpose use.

Important: Legal & Ethical Considerations
While stem separation technology is powerful, it's crucial to use it responsibly:
- Copyright: You should only process audio files for which you own the copyright or have explicit permission from the copyright holders. Uploading and processing copyrighted material without authorization may infringe on intellectual property rights.
- Fair Use/Dealing: Some jurisdictions have concepts of "fair use" or "fair dealing" for purposes like education, parody, or criticism, but these are complex and vary by region. Relying on fair use for distributing separated stems from copyrighted works is legally risky.
- Our Stance: This service is provided as a tool. The responsibility for ensuring copyright compliance rests entirely with you, the user. By using this service, you affirm that you have the necessary rights to the audio you upload.
Please respect artists and copyright law. Use this tool to enhance your own creativity with material you're authorized to use.