Vector Index Health Check
Probe a vector index for near-duplicate embeddings, orphaned chunks, and stale source docs — the three silent killers of RAG quality.
🧠 Why it works
RAG quality degrades silently over time as duplicates accumulate (the same FAQ reindexed from three sources), as source docs get deleted but chunks don't, and as content becomes stale without being refreshed. Users see 'worse answers' and don't know why. A periodic audit surfaces the cause instead of waiting for complaints.
⚙️ How it works
1. Pull metadata for every vector: source URL, last_modified, chunk_id.
2. Near-dup scan: for each chunk, get its top-5 nearest neighbors; flag any pair with similarity above the threshold whose text differs only in whitespace and boilerplate.
3. Orphan scan: resolve each source URL (HEAD request) and flag 404s or redirects to different content.
4. Stale scan: flag chunks whose source last_modified is older than the configured threshold.
5. Emit a markdown report with counts, example entries, and suggested remediations (dedup, delete, refresh).
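The near-dup scan in step 2 can be sketched roughly as follows. This is a minimal illustration, not the skill's actual implementation: it assumes chunks are already loaded in memory as dicts with `chunk_id`, `vector`, and `text` keys, it finds neighbors by brute force instead of calling the index's own query API, and the `BOILERPLATE` patterns are hypothetical placeholders you would tune for your corpus.

```python
import math
import re

# Hypothetical boilerplate patterns (an assumption, not from the skill itself).
BOILERPLATE = [r"last updated:.*", r"©.*", r"share this (page|article)"]


def normalize(text):
    """Lowercase, strip boilerplate fragments, and collapse whitespace."""
    text = text.lower()
    for pattern in BOILERPLATE:
        text = re.sub(pattern, " ", text)
    return " ".join(text.split())


def cosine(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def near_duplicates(chunks, threshold=0.98, k=5):
    """Return sorted (id_a, id_b, similarity) tuples for likely duplicates.

    A pair is flagged when one chunk is among the other's top-k neighbors,
    their similarity exceeds the threshold, and their normalized text matches.
    """
    flagged = {}
    for chunk in chunks:
        # Brute-force neighbor search; a real index would answer this query.
        scored = [
            (cosine(chunk["vector"], other["vector"]), other)
            for other in chunks
            if other is not chunk
        ]
        scored.sort(key=lambda pair: pair[0], reverse=True)
        for sim, other in scored[:k]:
            if sim > threshold and normalize(chunk["text"]) == normalize(other["text"]):
                pair = tuple(sorted((chunk["chunk_id"], other["chunk_id"])))
                flagged[pair] = round(sim, 4)
    return sorted((a, b, sim) for (a, b), sim in flagged.items())
```

Comparing normalized text is a deliberately strict stand-in for "differs only in whitespace and boilerplate"; a looser check (for example a diff ratio) may suit noisier sources better.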
Description
Runs against a Pinecone, Qdrant, Weaviate, or FAISS index and reports three classes of problem: near-duplicates (chunks with cosine similarity > 0.98 that differ only by boilerplate), orphans (chunks whose source doc no longer exists), and stale chunks (last_modified older than a configurable threshold). Each problem class includes a suggested remediation.
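The orphan and stale checks, plus the report step, might look something like the sketch below. It rests on several assumptions: chunk metadata arrives as dicts with `chunk_id`, `source_url`, and a timezone-aware `last_modified`; the 180-day default is an arbitrary placeholder; and any redirect to a different URL is treated as suspect, which is a simplification of "redirects to different content" (a trailing-slash redirect would also be flagged). The function names `head_status`, `audit`, and `to_markdown` are illustrative only.

```python
from datetime import datetime, timedelta, timezone
from urllib.error import HTTPError
from urllib.request import Request, urlopen


def head_status(url, timeout=5.0):
    """Return (status_code, final_url) for a HEAD request; status 0 means unreachable."""
    try:
        with urlopen(Request(url, method="HEAD"), timeout=timeout) as resp:
            return resp.status, resp.geturl()
    except HTTPError as err:
        return err.code, url
    except (OSError, ValueError):
        return 0, url


def audit(chunks, max_age_days=180, now=None, fetch=head_status):
    """Classify chunks as orphans (source gone or redirected) or stale (too old)."""
    now = now or datetime.now(timezone.utc)
    cutoff = now - timedelta(days=max_age_days)
    orphans, stale = [], []
    cache = {}  # one request per distinct source URL
    for chunk in chunks:
        url = chunk["source_url"]
        if url not in cache:
            cache[url] = fetch(url)
        status, final_url = cache[url]
        if status == 404 or final_url != url:
            orphans.append(chunk["chunk_id"])
        elif chunk["last_modified"] < cutoff:
            stale.append(chunk["chunk_id"])
    return {"orphans": orphans, "stale": stale}


def to_markdown(findings, examples=3):
    """Render counts, a few example IDs, and a suggested remediation per class."""
    remedies = {"orphans": "delete", "stale": "refresh"}
    lines = ["# Vector index health report", ""]
    for kind, ids in findings.items():
        lines.append(f"## {kind}: {len(ids)} (suggested: {remedies[kind]})")
        lines.extend(f"- {cid}" for cid in ids[:examples])
        lines.append("")
    return "\n".join(lines)
```

Passing `fetch` as a parameter keeps the audit testable offline: a stub that returns canned `(status, url)` pairs can stand in for real HEAD requests.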
Install this skill
A Claude skill is a skill.md file with YAML frontmatter and a markdown body.
Drop the file into your tool of choice — or pick a different format if you use Cursor, Windsurf, Copilot, or something else.
mkdir -p ~/.claude/skills/vector-index-health-check \
&& curl -L https://aidiveforge.com/skill/vector-index-health-check.skill-md \
-o ~/.claude/skills/vector-index-health-check/skill.md
Save to ~/.claude/skills/vector-index-health-check/skill.md