Skip to main content
AIDiveForge AIDiveForge
🩺

Vector Index Health Check

Search & Retrieval · by AIDiveForge · Apr 20, 2026 · Advanced

Probe a vector index for near-duplicate embeddings, orphaned chunks, and stale source docs — the three silent killers of RAG quality.

🧠 Why it works

RAG quality degrades silently over time as duplicates accumulate (the same FAQ reindexed from three sources), as source docs get deleted but chunks don't, and as content becomes stale without being refreshed. Users see 'worse answers' and don't know why. A periodic audit surfaces the cause instead of waiting for complaints.

⚙️ How it works

  1. Pull metadata for every vector — source URL, last_modified, chunk_id. 2. Near-dup scan: for each chunk, get its top-5 nearest neighbors; flag any pair with similarity > threshold where the text diff is only whitespace + boilerplate. 3. Orphan scan: resolve each source URL (HEAD request) and flag 404s or redirects to different content. 4. Stale scan: flag chunks whose source last_modified exceeds the threshold. 5. Emit a markdown report with counts, example entries, and suggested remediations (dedup, delete, refresh).

Description

Runs against a Pinecone, Qdrant, Weaviate, or FAISS index and reports three classes of problem: near-duplicates (chunks with cosine similarity > 0.98 that differ only by boilerplate), orphans (chunks whose source doc no longer exists), and stale chunks (last_modified older than a configurable threshold). Each problem class includes a suggested remediation.

Install this skill

A Claude skill is a skill.md file with YAML frontmatter and a markdown body. Drop the file into your tool of choice — or pick a different format if you use Cursor, Windsurf, Copilot, or something else.

Download skill.md
mkdir -p ~/.claude/skills/vector-index-health-check \
  && curl -L https://aidiveforge.com/skill/vector-index-health-check.skill-md \
       -o ~/.claude/skills/vector-index-health-check/skill.md

Save to ~/.claude/skills/vector-index-health-check/skill.md

Recommended Use

Tools and workflow packs this skill pairs well with. Forge picks are auto-generated from category + capability signals; Community picks are added by people who've used the pairing.

No matches yet. Be the first to suggest a pairing, or the Forge will populate suggestions as signals align.