Technologies Overview
- BigCodeBench | Benchmarks & Evaluation | Trial
- Claude 3.5 Sonnet | Frontier Models | Hold
- Claude Opus 4.6 | Frontier Models | Adopt
- Claude Sonnet 4.6 | Frontier Models | Adopt
- Cohere Command R/A | Frontier Models | Assess
- DeepSeek Coder V2 | Code-Specialized Models | Hold
- DeepSeek R1 | Open Weights Models | Trial
- DeepSeek V3.1 Terminus | Open Weights Models | Trial
- DeepSeek V3.2 | Open Weights Models | Trial
- Gemini 1.5 Pro | Frontier Models | Hold
- Gemini 3.1 Pro | Frontier Models | Trial
- GLM-4.7-Flash | Open Weights Models | Trial
- GLM-5 | Open Weights Models | Trial
- Google Gemma | Open Weights Models | Trial
- GPT-4o | Frontier Models | Hold
- GPT-5.4 | Frontier Models | Adopt
- Grok 4.2 | Frontier Models | Trial
- HumanEval | Benchmarks & Evaluation | Hold
- Kimi K2 / K2.5 | Open Weights Models | Trial
- LiveCodeBench | Benchmarks & Evaluation | Trial
- Llama 3 (Meta) | Open Weights Models | Hold
- Llama 4 (Meta) | Open Weights Models | Trial
- Microsoft Phi | Open Weights Models | Assess
- MiniMax M2 / M2.5 | Open Weights Models | Assess
- Mistral | Open Weights Models | Trial
- NVIDIA Nemotron | Open Weights Models | Assess
- OpenAI GPT-OSS | Open Weights Models | Trial
- Qwen | Open Weights Models | Trial
- SWE-bench Verified | Benchmarks & Evaluation | Adopt
- Voyage Code 3 | Code-Specialized Models | Trial
- Xiaomi MiMo | Open Weights Models | Assess