<cite index="1-1">Grok Code Fast 1 is a speedy and economical reasoning model that excels at agentic coding.</cite>
<cite index="2-1">Released in late August 2025, the xAI Grok Code Fast 1 model is a coding-focused AI model that excels at common, high-volume coding task and is designed especially for agentic coding workflows.</cite> <cite index="1-6,1-7,1-8">Built from scratch with a brand-new model architecture, it was trained on a pre-training corpus rich with programming-related content, and curated high-quality datasets that reflect real-world pull requests and coding tasks.</cite> <cite index="1-23">The model is particularly adept at TypeScript, Python, Java, Rust, C++, and Go.</cite> <cite index="1-13">The model is generally available via the xAI API, priced at $0.20 / 1M input tokens, $1.50 / 1M output tokens, and $0.02 / 1M cached input tokens.</cite>
Bottom line:*A lightweight, cost-efficient reasoning model optimized for speed in agentic coding workflows, trading some accuracy for rapid iteration.*
Pricing Plans
Per-token
Price
<cite index="1-13">$0.20 per 1M input tokens, $1.50 per 1M output tokens, and $0.02 per 1M cached input tokens</cite>
Free Tier
<cite index="1-20">Free access available for a limited time through launch partners; complimentary access through GitHub Copilot ended on September 10, 2025</cite>
Most Popular
API Pricing
Usage-basedper month
<cite index="1-13">Standard API pricing: $0.20 per 1M input tokens, $1.50 per 1M output tokens, $0.02 per 1M cached input tokens</cite>
Per-token billing
Cached token discount
API access for developers
Free Trial (Limited Duration)
Free
<cite index="1-20">Free access for a limited time through select launch partners including GitHub Copilot, Cursor, Cline, Roo Code, Kilo Code, opencode, and Windsurf</cite>
Pricing may have changed since last verified. Check the official site for current plans.
Community Performance Report Card
No community ratings yet. Be the first to rate this tool!
Best For: <cite index="1-4">Agentic coding workflows requiring fast loops of reasoning and tool calls</cite>, <cite index="6-8,6-12">Cost efficiency and rapid iteration due to fast token throughput of approximately 100 tokens per second</cite>, <cite index="1-23">Development in TypeScript, Python, Java, Rust, C++, and Go</cite>, Rapid prototyping and iterative development, <cite index="8-4">Processing larger codebases with its 256k token context window</cite>
<cite index="2-25,2-26">Massive throughput of approximately 90-100 tokens per second, delivering dozens of tool calls and edits before you finish reading its initial plan in IDE integrations</cite>
<cite index="1-13">Economical pricing at $0.20/1M input tokens and $1.50/1M output tokens</cite>
<cite index="2-27,2-28,2-29">Visible reasoning traces that provide real-time, summarized view of its reasoning process, helping developers catch logic errors early</cite>
<cite index="1-22">Prompt caching optimizations regularly achieving cache hit rates above 90% when used with launch partners</cite>
<cite index="6-31,6-34">Potential gaps in training on specific frameworks; poor performance on Tailwind CSS v3 tasks, suggesting possible smaller model size limitations</cite>
<cite index="6-36">Its reasoning model nature makes it unsuitable for interactive workflows requiring fast responses despite fast token throughput</cite>
No reviews yet. Be the first to share your experience.
About
Platforms
<cite index="30-1">Available through xAI API and integrated with launch partners including GitHub Copilot, Cursor, Cline, Roo Code, Kilo Code, opencode, and Windsurf</cite>
<cite index="2-23,2-24">Agentic coding workflows where the model is optimized for tool-use and can autonomously use the terminal, run commands, and perform multi-step edits across a repository</cite>
<cite index="1-24">Providing insightful answers to codebase questions and performing surgical bug fixes</cite>
Grok Code Fast 1 has been reliable for quick code refactors in my workflow. It nails small Python edits in under 2 seconds, which makes iterating on tests feel instant. The one caveat is that it sometimes over-trims comments when compressing code, so review diffs before committing.
Grok Code Fast 1 is a paid tool (<cite index="1-13">$0.20 per 1M input tokens, $1.50 per 1M output tokens, and $0.02 per 1M cached input tokens</cite>). A 0-day free trial is available.
Is Grok Code Fast 1 open source?
No — Grok Code Fast 1 is a closed-source tool. Source code is not publicly available.
Does Grok Code Fast 1 have an API?
Yes. Grok Code Fast 1 exposes a developer API. See the official documentation at https://x.ai for details.
What are the alternatives to Grok Code Fast 1?
Common alternatives include Claude Opus 4.1, Claude Sonnet 4.5, GPT-5, Gemini 2.5 Pro. Compare them on AIDiveForge for pricing, features, and platform support.
When was Grok Code Fast 1 released?
Grok Code Fast 1 was first released in 2025.
What platforms does Grok Code Fast 1 support?
Grok Code Fast 1 is available on: <cite index="30-1">Available through xAI API and integrated with launch partners including GitHub Copilot, Cursor, Cline, Roo Code, Kilo Code, opencode, and Windsurf</cite>.
Spotted incorrect or missing data? Join our community of contributors.
Be the first to contribute. Concrete time/cost savings, with context. e.g. "Cut my code review backlog from 4h to 45m per week."
Released in late August 2025, the xAI Grok Code Fast 1 model is a coding-focused AI model that excels at common, high-volume coding task and is designed especially for agentic coding workflows. With its speed, efficiency, and low cost, this model is built to handle the loop of modern software development (planning, writing, testing, and debugging), offers real-time, summarized trace of its reasoning, and is proficient in TypeScript, Python, Java, Rust, C++, and Go. Internally, it uses a mixture-of-experts architecture with an estimated 314 billion parameters, designed to balance speed with coding capability, and delivers approximately 92 tokens per second in practical usage. The model addresses the challenge that powerful AI models often don’t feel purpose-built for agentic coding workflows where loops of reasoning and tool calls can feel frustratingly slow, as engineers saw room for a more nimble, responsive solution optimized for day-to-day tasks.
Edit section
0 / 4000
Submit a benchmark
Share a real-world data point. Plausibility-checked by our AI moderator before publishing.
We use first-party cookies for our own analytics and Google Analytics (via Google Site Kit) to understand how you use our site.
See our Privacy Policy.