Invite-only beta

Clean up your audio. Automatically.

SHOOSH removes explicit words from any song in 20 to 35 seconds — keeping the music intact using AI stem separation and word-level transcription.

Join the waitlist
⚡ Sub-minute end-to-end 🎚️ Studio-grade stems 🎯 Word-level precision

A radio edit in three steps

Drop a song. AI does the dirty work. You pick the cleanup style. Out comes a polished, broadcast-ready track in under a minute.

Stem separation

AI splits your track into vocals + instrumental in seconds — the music keeps playing clean under every shoosh.

How it works: a Demucs neural network isolates the vocal stem from the rest of the mix. We only edit the vocal so the instrumental stays full-quality, untouched.

Bonus: opt in to extract all 6 stems (bass, drums, vocals, guitar, piano, other) as a background job. Costs +1 credit; doesn't slow your shoosh down.

Whisper-fast transcription

200×-realtime speech-to-text gives you word-level timestamps — every lyric pinpointed to the millisecond.

How it works: Groq Whisper Turbo runs the transcription in 1–3 seconds for a four-minute song. No waiting around for batch jobs.

Every word arrives with its start / end in seconds, accurate enough to surgically replace a single syllable.

Smart auto-bleep

Every word rated for offensiveness. Pick a censor method. Threshold slider tunes everything else.

How it works: a fast LLM rates each word's intensity 0–1. The threshold slider — or the MPAA-style rating cards (G / PG / PG-13 / R / NC-17) — controls how aggressive the cleanup is.

Click any flagged word to skip it. Click any normal word to add it manually. You're always in control.

Seven ways to shoosh

Each method runs in seconds. Hover any card to see what it sounds like.

Beep

Classic

The universal TV-censor tone. Instantly recognizable. Safe for any context, never surprises a listener.

Empty

Silent

Dead silence. The instrumental plays right through — the word is just gone. Cleanest possible result.

Scratch

Hip-hop

A quick vinyl scratch over the word. Hip-hop heritage. Adds character without disrupting flow.

Reverse

Subtle

The vocal plays backwards just for that word. Weird in the best way. Often goes unnoticed.

Pitch Down

Demon voice

Drops the vocal two octaves for the duration of the word. Unintelligible. Slightly menacing. Iconic.

Turntable

Wind-down

A turntable wind-down effect on the vocal. Like the record stopped for a beat. Cinematic.

Custom

Your sound

Drop your own MP3 — airhorn, meme clip, custom sting, anything. We splice it in at every flagged word.

Made for

Anywhere clean audio matters. Click the dots to flip through.

🎙️

Radio edits

Submit to terrestrial radio or playlist gatekeepers with confidence. No "clean version" takedowns, no last-minute edit panic. Cleans match what stations actually allow.

🎧

Podcasts

Get past Apple's explicit-content gates, YouTube's ad-friendly thresholds, and Spotify's family filters — without re-recording your interview.

📺

Live streamers

DMCA-conscious music for your Twitch, Kick, YouTube Live, or TikTok Live. Your music — clean enough to play unrestricted.

📱

TikTok & Reels

Algorithm-friendly cleans that keep the song recognizable. Higher organic reach, fewer flagged uploads, more shares.

🎤

Clean covers

Cover an explicit track for a family audience without rewriting lyrics. Pick the method, ship the cover, keep your channel safe.

👶

Kids content

School plays, kids' parties, family TikToks. Drop the song, set the rating to G, get a kid-safe version in under a minute.

Questions, answered

If we missed something, the waitlist form below is the fastest way to ask.

How accurate is the bad-word detection?

An LLM rates every transcribed word from 0 (clean) to 1 (extreme). The MPAA-style rating cards (G / PG / PG-13 / R / NC-17) map directly to threshold bands. You can also fine-tune with the slider, and manually add or skip any word with a click. Default settings catch the obvious stuff; you decide the rest.

What about non-English songs?

Transcription is multilingual — Whisper auto-detects the language. Bad-word rating works on the transcribed text, so any language with profanity in the LLM's training set is covered. English, Spanish, French, German, Portuguese are all well-supported. Niche languages may need manual review.

How long does a full shoosh take?

End-to-end (upload through final clean track) is typically 20–35 seconds for a 3–4 minute song. The bottleneck is upload speed, not processing.

Can I re-shoosh with a different method?

Yes. The original upload stays in storage — every "Finalize" call processes from the original, never from a previous shoosh. Switch from Beep to Scratch to Pitch Down freely, each gives a clean result against the original.

What audio formats do you accept?

MP3, WAV, FLAC, M4A on upload. Output is MP3 at the same sample rate as your input.

Is my audio private?

Yes. Audio is stored in your private S3 namespace, served only via signed URLs, and never used for training. Sessions are scoped to your email — only you (and SHOOSH admins for support) can access them.

What about copyright?

You're responsible for having the rights to upload and edit the audio you submit. SHOOSH doesn't grant you any new rights to the underlying composition or recording; it only edits the file you provide.

Why credits?

Each shoosh hits premium AI APIs (Whisper, Demucs, Claude). Credits cover the per-track cost. New accounts start with 20 free credits; a full shoosh costs 5, optional stems +1. Buy-credits is coming soon.

Can I use SHOOSH commercially?

Yes — once we're out of invite-only beta. Right now access is gated. Join the waitlist below and we'll get back to you.

The team

Built by people who actually make and ship music.

Cédric F. Jacob

Founder & Engineering

Builds the AI pipeline, the infrastructure, and most of what you see on this page. Long-time audio nerd, longer-time software engineer.

DJ Skizzbeats

Music & Audio Direction

Producer, DJ, and the ear behind every shoosh method. Makes sure the cleaned version still feels right.

Tarik Thornton

Product & Brand

Shapes what SHOOSH feels like — voice, visual identity, and which features actually matter to the people using it.

More info, photos, and a couple of new faces coming soon.

Join the waitlist

SHOOSH is invite-only while we ramp up. Drop your email and tell us what you'd use it for — we'll be in touch as we open access.