Speaker Diarization Cleanup
Correct auto-diarized transcripts by merging short speaker swaps and relabeling anonymous speakers from known-voice samples.
🧠 Why it works
Auto-diarization has two predictable failure modes that a post-processing pass can fix without retraining: short spurious speaker swaps, which can be filtered by duration, and generic speaker labels, which can be resolved by matching against known voices. Both are cheap to detect and cheap to correct.
⚙️ How it works
1. Parse the diarized transcript into turns.
2. Merge turns shorter than a configurable threshold (default 2s) into the neighboring dominant speaker.
3. For each anonymous speaker label, compute an average voice embedding from their turns.
4. Compute embeddings for each reference clip.
5. Match anonymous labels to reference labels by cosine similarity above a threshold; unmatched speakers keep their numeric label.
6. Emit the cleaned transcript + a diff showing what changed.
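The merge step (step 2) can be sketched roughly as follows. This is a minimal illustration, not the skill's actual code: the `Turn` class and `merge_short_turns` function are hypothetical names, and "neighboring dominant speaker" is interpreted here as the longer of the two adjacent turns that themselves meet the threshold, which is an assumption.

```python
from dataclasses import dataclass


@dataclass
class Turn:
    """One diarized turn (hypothetical structure, times in seconds)."""
    speaker: str
    start: float
    end: float
    text: str = ""

    @property
    def duration(self) -> float:
        return self.end - self.start


def merge_short_turns(turns, min_duration=2.0):
    """Relabel turns shorter than min_duration, then fuse adjacent same-speaker turns."""
    relabeled = []
    for i, turn in enumerate(turns):
        speaker = turn.speaker
        if turn.duration < min_duration:
            prev_turn = turns[i - 1] if i > 0 else None
            next_turn = turns[i + 1] if i + 1 < len(turns) else None
            # Only neighbors that are long enough themselves can absorb a short turn.
            candidates = [
                t for t in (prev_turn, next_turn)
                if t is not None and t.duration >= min_duration
            ]
            if candidates:
                # Assumption: the "dominant" neighbor is the longer one.
                speaker = max(candidates, key=lambda t: t.duration).speaker
        relabeled.append(Turn(speaker, turn.start, turn.end, turn.text))

    merged = []
    for turn in relabeled:
        if merged and merged[-1].speaker == turn.speaker:
            prev = merged[-1]
            merged[-1] = Turn(
                prev.speaker, prev.start, turn.end,
                f"{prev.text} {turn.text}".strip(),
            )
        else:
            merged.append(turn)
    return merged


turns = [
    Turn("Speaker 1", 0.0, 10.0, "a"),
    Turn("Speaker 0", 10.0, 11.2, "b"),  # 1.2 s swap inside Speaker 1's turn
    Turn("Speaker 1", 11.2, 20.0, "c"),
]
cleaned = merge_short_turns(turns)
```

A short turn with no qualifying neighbor keeps its original label in this sketch, so a transcript made entirely of short turns passes through unchanged.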
Description
Takes a diarized transcript (with generic labels such as Speaker 0 and Speaker 1) and a folder of reference audio clips for each known speaker. Cleans up two classes of error: spurious short swaps (for example, Speaker 0 appearing for 1.2 seconds in the middle of Speaker 1's turn) and anonymous speaker labels (for example, renaming Speaker 0 to 'Dana' based on voice-embedding similarity to a reference clip).
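The relabeling by voice-embedding similarity might look something like the sketch below. It assumes the embeddings have already been computed by some speaker-embedding model (not shown) and are passed in as plain lists of floats. The function names and the 0.75 threshold are illustrative assumptions; the source does not state a default similarity threshold.

```python
import math


def mean_vector(vectors):
    """Element-wise average of equal-length embedding vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def cosine_similarity(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def match_speakers(turn_embeddings, reference_embeddings, threshold=0.75):
    """Map each anonymous label to the best reference name above threshold.

    turn_embeddings: {"Speaker 0": [vec, vec, ...], ...}  (one vector per turn)
    reference_embeddings: {"Dana": vec, ...}              (one vector per known speaker)
    Labels with no match above the threshold map to themselves.
    """
    mapping = {}
    for label, vectors in turn_embeddings.items():
        centroid = mean_vector(vectors)
        best_name, best_score = None, threshold
        for name, ref in reference_embeddings.items():
            score = cosine_similarity(centroid, ref)
            if score > best_score:
                best_name, best_score = name, score
        mapping[label] = best_name if best_name is not None else label
    return mapping


# Toy 2-dimensional embeddings, purely for illustration.
turn_embeddings = {
    "Speaker 0": [[1.0, 0.0], [0.9, 0.1]],
    "Speaker 1": [[-1.0, 0.2]],
}
reference_embeddings = {"Dana": [1.0, 0.05]}
mapping = match_speakers(turn_embeddings, reference_embeddings)
```

This sketch does not stop two anonymous labels from matching the same reference; a real pass would probably need a tie-breaking rule for that case.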
Install this skill
A Claude skill is a skill.md file with YAML frontmatter and a markdown body.
Drop the file into your tool of choice — or pick a different format if you use Cursor, Windsurf, Copilot, or something else.
mkdir -p ~/.claude/skills/speaker-diarization-cleanup \
&& curl -L https://aidiveforge.com/skill/speaker-diarization-cleanup.skill-md \
-o ~/.claude/skills/speaker-diarization-cleanup/skill.md
Save to ~/.claude/skills/speaker-diarization-cleanup/skill.md