Screenshots 5
embed-english-v3.0
Summary
Cohere's English-optimized text embedding model with 1,024-dimensional outputs for semantic search, RAG, and classification.
embed-english-v3.0 generates semantic embeddings from English text, producing 1,024-dimensional vectors suitable for retrieval-augmented generation, classification, clustering, and semantic search tasks. It achieves state-of-the-art performance on MTEB and BEIR benchmarks and was trained on approximately 1 billion English training pairs. The model supports batches of up to 96 inputs with 512 tokens maximum per input, and supports both text and image embedding. Pricing is $0.10 per million tokens. A notable limitation is that it requires explicit input_type specification to differentiate between search documents, queries, classification, and clustering tasks.
Bottom line: *Use for high-quality English semantic search and RAG applications with cost-efficient API pricing; avoid if you need multilingual support or extensive context windows beyond 512 tokens.*
Pricing Plans
Per-token- Price
- $0.10 per million tokens
- Cost per 1M Input
- $0.10
- Cost per 1M Output
- N/A (embedding model)
- Free Tier
- Rate-limited access for learning and prototyping
Free Tier
Rate-limited access for learning and prototyping
- Rate-limited API access
- Suitable for experimentation
Pay-As-You-Go
Usage-based billing per token
- $0.10 per million input tokens
- Scaling for production use
- No minimum commitments
View full pricing on cohere.com →
Pricing may have changed since last verified. Check the official site for current plans.
Community Performance Report Card
No community ratings yet. Be the first to rate this tool!
Community Benchmarks Community
Sign in to submit a benchmarkNo community benchmarks yet. Be the first to share a real-world data point.
Pros
Sign in to edit- State-of-the-art performance on MTEB and BEIR benchmarks
- Highly cost-efficient at $0.10 per million tokens
- Supports multimodal input (text and images) with unified embeddings
- Batch processing up to 96 inputs per request
- Multiple embedding output formats (float, int8, uint8, binary, base64)
Cons
Sign in to edit- English-optimized only; use embed-multilingual-v3.0 for multilingual needs
- 512-token limit per input may truncate long documents
- Requires explicit input_type specification for optimal results
Community Reviews
Sign in to write a reviewNo reviews yet. Be the first to share your experience.
About
- Platforms
- Cohere API, AWS SageMaker, Azure AI Foundry, OCI Generative AI
- Languages
- English (primary); text-image multimodal support
- API Available
- Yes
- Self-Hosted
- No
- Last Updated
- 2026-04-11T19:38:58.529Z
Best For
Who it's for
- English-only semantic search applications
- RAG systems requiring cost efficiency
- Production-scale document retrieval
- Text classification with semantic embeddings
What it does well
- Semantic search and document retrieval
- Retrieval-augmented generation (RAG)
- Text classification and clustering
- Similarity comparison between sentences
Integrations
Discussion Community
Sign in to commentNo discussion yet. Sign in to start the conversation.
Similar Tools
Compare embed-english-v3.0
Spotted incorrect or missing data? Join our community of contributors.
Sign Up to ContributeCommunity Notes & Tips Community
Sign in to contributeBe the first to contribute. General notes, observations, gotchas, and tips from people who use this tool day-to-day.
Frequently Asked Questions
- Is embed-english-v3.0 free?
- embed-english-v3.0 is a paid tool ($0.10 per million tokens). No permanent free tier is offered.
- Is embed-english-v3.0 open source?
- No — embed-english-v3.0 is a closed-source tool. Source code is not publicly available.
- Does embed-english-v3.0 have an API?
- Yes. embed-english-v3.0 exposes a developer API. See the official documentation at https://cohere.com for details.
- When was embed-english-v3.0 released?
- embed-english-v3.0 was first released in 2024.
- What platforms does embed-english-v3.0 support?
- embed-english-v3.0 is available on: Cohere API, AWS SageMaker, Azure AI Foundry, OCI Generative AI.
Hours Saved & ROI Stories Community
Sign in to contributeBe the first to contribute. Concrete time/cost savings, with context. e.g. "Cut my code review backlog from 4h to 45m per week."
Curated lists that include this category
embed-english-v3.0 is a specialized text embedding model from Cohere designed specifically for English language inputs. It transforms text phrases, sentences, and paragraphs into 1,024-dimensional dense vectors that capture semantic meaning, enabling applications to perform similarity searches, retrieve relevant documents, classify text, and cluster similar items.
The model was trained on approximately 1 billion English language pairs and achieves state-of-the-art performance on widely-used benchmarks including the Massive Text Embedding Benchmark (MTEB) across 56 datasets for retrieval, classification, and clustering, as well as the BEIR benchmark for zero-shot dense retrieval tasks. These benchmark results demonstrate strong generalization across diverse domains.
API access supports batch processing of up to 96 texts per request, with each input limited to 512 tokens maximum. The model requires users to specify an input_type parameter that distinguishes between search_document, search_query, classification, clustering, and image inputs to optimize embeddings for specific use cases. For image inputs, both text and image can be embedded together. Output embeddings are available in multiple formats including float, int8, uint8, binary, ubinary, and base64 representations to optimize for different storage and retrieval requirements.
The model is available through Cohere’s API with usage-based pricing of $0.10 per million tokens, with a free tier available for learning and prototyping subject to rate limits. Deployment options include Cohere’s managed API platform, AWS SageMaker, Azure AI Foundry, and OCI Generative AI service.
