Supported LLMs
Chat and Prompts
Cody supports a variety of cutting-edge large language models for use in chat and prompts, allowing you to select the best model for your use case.
In newer versions of Sourcegraph Enterprise (v5.6 and later), adding support for new models and providers is even easier; see Model Configuration for more information.
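On instances that use model configuration, enabling the Sourcegraph-supplied models and choosing defaults typically looks like the sketch below. This is a minimal illustration, not a complete configuration; the modelRef values are assumptions and should be replaced with identifiers your instance actually serves (see Model Configuration).

```jsonc
{
  "cody.enabled": true,
  "modelConfiguration": {
    // Use the Sourcegraph-supplied models and providers (served via Cody Gateway).
    "sourcegraph": {},
    // Optional: pick the defaults offered in the model dropdown.
    // The modelRef values below are illustrative assumptions.
    "defaultModels": {
      "chat": "anthropic::2024-10-22::claude-sonnet-4-latest",
      "fastChat": "anthropic::2024-10-22::claude-3-5-haiku-latest"
    }
  }
}
```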
| Provider | Model | Status | Vision Support |
|---|---|---|---|
| OpenAI | GPT-5 | ✅ | ✅ |
| OpenAI | GPT-5-mini | ✅ | ✅ |
| OpenAI | GPT-5-nano | ✅ | ✅ |
| OpenAI | GPT-4-Turbo | ✅ | ❌ |
| OpenAI | GPT-4o | ✅ | ✅ |
| OpenAI | GPT-4o-mini | ✅ | ✅ |
| OpenAI | o3-mini-medium | ✅ (experimental) | ❌ |
| OpenAI | o3-mini-high | ✅ (experimental) | ❌ |
| OpenAI | o3 | ✅ | ❌ |
| OpenAI | o4-mini | ✅ | ❌ |
| OpenAI | GPT-4.1 | ✅ | ✅ |
| OpenAI | GPT-4.1-mini | ✅ | ✅ |
| OpenAI | GPT-4.1-nano | ✅ | ✅ |
| Anthropic | Claude 3.5 Haiku | ✅ | ❌ |
| Anthropic | Claude Haiku 4.5 | ✅ | ✅ |
| Anthropic | Claude Haiku 4.5 w/ Thinking | ✅ | ✅ |
| Anthropic | Claude 3.7 Sonnet | ✅ | ✅ |
| Anthropic | Claude Sonnet 4 | ✅ | ✅ |
| Anthropic | Claude Sonnet 4 w/ Thinking | ✅ | ✅ |
| Anthropic | Claude Opus 4.1 | ✅ | ✅ |
| Anthropic | Claude Opus 4 | ✅ | ✅ |
| Anthropic | Claude Opus 4 w/ Thinking | ✅ | ✅ |
| Google | Gemini 1.5 Pro | ✅ (beta) | ✅* |
| Google | Gemini 2.0 Flash | ✅ | ✅* |
| Google | Gemini 2.5 Pro | ✅ | ✅* |
| Google | Gemini 2.5 Flash | ✅ | ✅* |
\* While Gemini models support vision capabilities, Cody clients do not currently support image uploads to Gemini models.
To use Claude 3 Sonnet models with Cody Enterprise, make sure you've upgraded your Sourcegraph instance to the latest version.
Site admins can configure vision support using the chatVision setting in site configuration and by adding the vision capability to model configurations. See Model Configuration for more details.
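A minimal sketch of what that can look like in site configuration is shown below. The placement of chatVision, the modelRef, and the omission of other required override fields are assumptions for illustration; follow Model Configuration for the authoritative shape.

```jsonc
{
  // Assumption: the exact placement of "chatVision" may vary by Sourcegraph version.
  "experimentalFeatures": {
    "chatVision": true
  },
  "modelConfiguration": {
    "modelOverrides": [
      {
        // Illustrative modelRef; other required fields (modelName, contextWindow, ...)
        // are omitted here for brevity.
        "modelRef": "openai::2024-02-01::gpt-4o",
        "displayName": "GPT-4o",
        "capabilities": ["chat", "vision"] // "vision" marks the model as image-capable
      }
    ]
  }
}
```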
Claude 3.7 and 4 Models
Claude 3.7 Sonnet and Claude Sonnet 4 have two variants: the base version, and an extended thinking version that supports deep reasoning and fast, responsive edit workflows. Claude Haiku 4.5 also comes in both variants. Cody enables both and lets users select which to use in the model dropdown selector, so they can choose whether to use extended thinking depending on the task at hand.
Support for Claude 4 models is available with Sourcegraph versions v6.4+ and v6.3.4167. Claude Haiku 4.5 requires v6.9.2509+. Model availability also depends on the deployment type and on whether the base or extended thinking variant is used.
Claude 3.7 and 4 via Google Vertex and AWS Bedrock
Starting with Sourcegraph v6.4+ and v6.3.4167, Claude 3.7 Extended Thinking, as well as the Claude 4 base and extended thinking variants, are available when using Claude through either Google Vertex or AWS Bedrock. Claude Haiku 4.5 is available starting in v6.9.2509.
See Model Configuration: Reasoning models for more information.
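For self-configured deployments, exposing an extended thinking variant is done through a model override. The sketch below is illustrative only; the modelRef, the reasoning capability, and the reasoningEffort field should be checked against Model Configuration: Reasoning models for your version.

```jsonc
{
  "modelConfiguration": {
    "modelOverrides": [
      {
        // Illustrative override for an extended thinking variant; other required
        // fields (modelName, contextWindow, category, ...) are omitted for brevity.
        "modelRef": "anthropic::2024-10-22::claude-3-7-sonnet-extended-thinking",
        "displayName": "Claude 3.7 Sonnet (Extended Thinking)",
        "capabilities": ["chat", "reasoning"],
        "reasoningEffort": "high" // assumption: consult Model Configuration: Reasoning models
      }
    ]
  }
}
```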
Autocomplete
Cody uses a set of autocomplete models that are suited to the low-latency use case.
| Provider | Model | Status |
|---|---|---|
| Fireworks.ai | DeepSeek-Coder-V2 | ✅ |
| Anthropic | Claude Instant | ✅ |
The default autocomplete model for Enterprise users is DeepSeek-Coder-V2.
The DeepSeek model used by Sourcegraph is hosted by Fireworks.ai, and is hosted as a single-tenant service in a US-based data center. For more information see our Cody FAQ.
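On instances with model configuration, the autocomplete model can also be pinned explicitly via the codeCompletion default. The modelRef below is an assumption about how the Fireworks-hosted DeepSeek model is identified; use the identifier listed for your instance.

```jsonc
{
  "modelConfiguration": {
    "sourcegraph": {},
    "defaultModels": {
      // "codeCompletion" selects the autocomplete model; the value is illustrative.
      "codeCompletion": "fireworks::v1::deepseek-coder-v2-lite-base"
    }
  }
}
```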
Smart Apply
| Provider | Model | Status |
|---|---|---|
| Fireworks.ai | Qwen 2.5 Coder | ✅ |
The Fireworks.ai-hosted model is the default for Cody Gateway, but site admins who wish to switch to Claude models can do so by following these steps:
- Go to Site admin
- Click on Feature flags
- Search for the cody-smart-apply-instant-mode-enabled feature flag
- Turn off or delete the cody-smart-apply-instant-mode-enabled feature flag