
Cohere's English-optimized text embedding model with 1,024-dimensional outputs for semantic search, RAG, and classification.
embed-english-v3.0 generates semantic embeddings from English text, producing 1,024-dimensional vectors suitable for retrieval-augmented generation, classification, clustering, and semantic search tasks. It achieves state-of-the-art performance on the MTEB and BEIR benchmarks and was trained on approximately 1 billion English training pairs. The model accepts batches of up to 96 inputs, each capped at 512 tokens, and can embed both text and images. Pricing is $0.10 per million tokens. One notable constraint: the input_type parameter must be set explicitly to differentiate between search documents, search queries, classification, and clustering tasks.
Bottom line: *Use for high-quality English semantic search and RAG applications with cost-efficient API pricing; avoid if you need multilingual support or extensive context windows beyond 512 tokens.*
Free tier: rate-limited access for learning and prototyping
Paid tier: usage-based billing per token
Pricing may have changed since last verified; check cohere.com for current plans.
embed-english-v3.0 is a specialized text embedding model from Cohere designed specifically for English language inputs. It transforms text phrases, sentences, and paragraphs into 1,024-dimensional dense vectors that capture semantic meaning, enabling applications to perform similarity searches, retrieve relevant documents, classify text, and cluster similar items.
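The downstream tasks above typically rank items by cosine similarity between embedding vectors. A minimal sketch in pure Python, using toy 4-dimensional vectors as stand-ins for the model's 1,024-dimensional outputs:

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

# Toy stand-ins for the 1,024-dim embeddings the model would return.
query_vec = [0.1, 0.9, 0.0, 0.2]
doc_vecs = {
    "doc_a": [0.1, 0.8, 0.1, 0.3],  # semantically close to the query
    "doc_b": [0.9, 0.0, 0.4, 0.0],  # unrelated
}

# Rank documents by similarity to the query, highest first.
ranked = sorted(doc_vecs,
                key=lambda d: cosine_similarity(query_vec, doc_vecs[d]),
                reverse=True)
print(ranked)  # ['doc_a', 'doc_b']
```

In production, the vectors would come from the embed API and the ranking would usually be delegated to a vector database rather than computed in a Python loop.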
The model was trained on approximately 1 billion English language pairs and achieves state-of-the-art performance on widely used benchmarks, including the Massive Text Embedding Benchmark (MTEB) across 56 datasets for retrieval, classification, and clustering, as well as the BEIR benchmark for zero-shot dense retrieval. These results demonstrate strong generalization across diverse domains.
API access supports batch processing of up to 96 texts per request, with each input limited to a maximum of 512 tokens. The model requires an input_type parameter that distinguishes between search_document, search_query, classification, clustering, and image inputs so that embeddings are optimized for the intended use case. Image inputs are embedded into the same vector space as text, enabling cross-modal retrieval. Output embeddings are available in multiple formats, including float, int8, uint8, binary, ubinary, and base64 representations, to trade off storage cost against retrieval quality.
The model is available through Cohere’s API with usage-based pricing of $0.10 per million tokens, with a free tier available for learning and prototyping subject to rate limits. Deployment options include Cohere’s managed API platform, AWS SageMaker, Azure AI Foundry, and OCI Generative AI service.
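At $0.10 per million tokens, embedding cost scales linearly with corpus size. A quick back-of-envelope estimate (the corpus size and average token count below are illustrative, not from the source):

```python
PRICE_PER_MILLION_TOKENS = 0.10  # USD, per Cohere's published pricing

def embed_cost_usd(total_tokens):
    """Estimated cost of embedding a corpus of the given total token count."""
    return total_tokens * PRICE_PER_MILLION_TOKENS / 1_000_000

# e.g. 100,000 documents averaging 300 tokens each
tokens = 100_000 * 300
print(f"${embed_cost_usd(tokens):.2f}")  # $3.00
```

Note that each input is capped at 512 tokens, so per-document token counts above that limit do not increase cost unless documents are split into multiple chunks.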