Embedding Models With an API
As of June 2026, AIDiveForge tracks 4 embedding models with an api. Curated embedding models with an api tracked by AIDiveForge. Listings are verified against each tool's live website and re-checked regularly.
Last updated June 9, 2026 · 4 tools

1. BGE-M3
BGE is a family of open-source embedding and reranking models from BAAI, released under MIT license with weights available on Hugging Face and PyPI, designed to run entirely on your own infrastructure. The core workflow is straightforward: generate dense embeddings, index them in a vector database, and optionally layer in sparse or multi-vector retrieval for hybrid search. Multi-lingual retrieval is a documented strength, with cross-lingual matching working across language pairs without requiring parallel training data. The ceiling appears when your domain is highly specialized — out-of-the-box embeddings on narrow technical corpora produce ranking quality that requires fine-tuning to fix, and that fine-tuning work lands entirely on your team.
FreeOpen Source
2. Cohere Embed v4
Cohere Embed v4 transforms text, images, and mixed content into unified vector representations for semantic search, RAG, document clustering, and similarity matching. The model supports 1,536-dimensional embeddings with flexible compression via Matryoshka embeddings (256, 512, 1024, 1536 dimensions). Priced at $0.12/1M text tokens and $0.47/1M image tokens, it delivers multimodal capabilities competitive with text-only alternatives. The API supports batch processing up to 128,000 tokens per request with asymmetric search optimization. Limitation: incompatible with v3 embeddings; corpus re-embedding required for upgrades.
PaidFree Trial · 0 days
3. embed-english-v3.0
embed-english-v3.0 generates semantic embeddings from English text, producing 1,024-dimensional vectors suitable for retrieval-augmented generation, classification, clustering, and semantic search tasks. It achieves state-of-the-art performance on MTEB and BEIR benchmarks and was trained on approximately 1 billion English training pairs. The model supports batches of up to 96 inputs with 512 tokens maximum per input, and supports both text and image embedding. Pricing is $0.10 per million tokens. A notable limitation is that it requires explicit input_type specification to differentiate between search documents, queries, classification, and clustering tasks.
Paid
4. jina-embeddings-v3
Fast multilingual embeddings that outperform OpenAI on MTEB, but LoRA adapters complicate efficient serving and newer models have widened the gap.
Paid
Listings on this page are sourced and verified by the AIDiveForge data pipeline. AIDiveForge is editorially independent — no money changes hands for inclusion.