Tencent Hy3 Preview

llm open-source agentic reasoning coding

Jun 2026

Assess

Tencent Hy3 Preview, open-sourced April 23, 2026, is a 295B-parameter Mixture-of-Experts model with 21B active parameters that fuses fast and slow thinking in a single model. Tencent reports 74.4% on SWE-bench Verified and 54.4% on Terminal-Bench 2.0 (both self-reported), a 256K-token context window, and production deployments of 495-step agentic workflows in its own CodeBuddy and WorkBuddy products.

Why It's in Assess

Hy3 Preview is Tencent's first major open-weight release since it rebuilt its pre-training and reinforcement-learning infrastructure from scratch in early 2026, and it reached open-source release in under three months from a cold start. That pace is impressive, but the benchmarks are self-reported and the model is untested outside Tencent's own products:

74.4% SWE-bench Verified (self-reported), closing ground on Kimi K2.6 (80.2%) and GPT-5.5 (82.6%) but without independent reproduction
Strong agentic production evidence: In CodeBuddy and WorkBuddy deployments, Hy3 Preview achieved a 99.99% success rate over 495-step agent workflows, a 54% reduction in time-to-first-token, and 47% lower end-to-end response time, but these figures are internal Tencent measurements
Cost-efficient architecture: activating 21B of 295B total parameters per token (about 7%) makes inference significantly cheaper than dense models of similar reported capability
Rebuilt infrastructure: Tencent reset its AI stack entirely in early 2026, so Hy3 Preview is the first output of the new pipeline. It is early days, but there is serious engineering investment behind it
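
The cost-efficiency point above can be made concrete with a back-of-the-envelope calculation. The sketch below uses only the 295B and 21B figures from this entry; the "about 2 FLOPs per active parameter per token" rule is a common approximation assumed here, not something Tencent has published for Hy3 Preview, and it ignores attention cost, routing overhead, and memory bandwidth.

```python
# Parameter counts come from this entry; everything else is an assumption.
TOTAL_PARAMS = 295e9   # all experts combined
ACTIVE_PARAMS = 21e9   # parameters used for any single token


def active_fraction(active: float, total: float) -> float:
    """Share of the parameter pool that participates in one forward pass."""
    return active / total


def approx_flops_per_token(params: float) -> float:
    """Rough forward-pass cost: ~2 FLOPs per parameter touched (assumed rule of thumb)."""
    return 2.0 * params


fraction = active_fraction(ACTIVE_PARAMS, TOTAL_PARAMS)
moe_flops = approx_flops_per_token(ACTIVE_PARAMS)
dense_flops = approx_flops_per_token(TOTAL_PARAMS)

print(f"Active fraction per token: {fraction:.1%}")
print(f"Compute ratio vs. a hypothetical 295B dense model: {dense_flops / moe_flops:.1f}x")
```

Under these assumptions the saving applies to per-token compute only. It says nothing about how Hy3 Preview compares with a smaller dense model of similar quality, which is the comparison the bullet actually makes and which still needs independent measurement.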

Move to Trial when: independent evaluators (Princeton SWE-bench.com, Artificial Analysis, or equivalent) confirm the SWE-bench and Terminal-Bench scores.

Capabilities

Context: 256K tokens
Reasoning modes: Fast-and-slow thinking fused into a single model (no separate chain-of-thought variant required)
Agentic tool use: MCP toolchain orchestration; document processing, data analysis, knowledge retrieval
Long-horizon task completion: 495-step autonomous workflows run in Tencent's own production deployments (internally reported)
Benchmark performance (all self-reported):

Benchmark	Hy3 Preview	Notes
SWE-bench Verified	74.4%	Self-reported
Terminal-Bench 2.0	54.4%	Self-reported
BrowseComp	67.1%	Self-reported
WideSearch	70.2%	Self-reported
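
The 99.99% success rate over 495-step workflows is worth reading carefully, because the entry does not say whether the figure is measured per step or per completed workflow. The sketch below shows why that matters, assuming steps fail independently (a simplification; real agent failures are often correlated). The function names are illustrative only.

```python
def workflow_success(per_step_success: float, steps: int) -> float:
    """Probability that every step succeeds, assuming independent steps."""
    return per_step_success ** steps


def required_per_step(workflow_target: float, steps: int) -> float:
    """Per-step success rate needed to reach a whole-workflow target."""
    return workflow_target ** (1.0 / steps)


STEPS = 495     # workflow length quoted in this entry
RATE = 0.9999   # success rate quoted in this entry

# Reading 1: 99.99% is a per-step rate, so whole workflows succeed less often.
print(f"Whole-workflow success if per-step: {workflow_success(RATE, STEPS):.4f}")

# Reading 2: 99.99% is a whole-workflow rate, so each step must be far more reliable.
print(f"Per-step rate needed if per-workflow: {required_per_step(RATE, STEPS):.7f}")
```

Under the first reading roughly one workflow in twenty would fail somewhere along the way; under the second, each individual step would have to be nearly flawless. An independent evaluation should state which quantity it measures.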

Cautions

All public benchmarks are self-reported by Tencent; no independent reproductions as of May 2026
Same data sovereignty considerations as other Chinese-origin models (Tencent is subject to Chinese data law)
"Preview" status signals the family is not yet at production release quality for general use
Limited Western ecosystem presence and community adoption compared to DeepSeek or Kimi

Key Characteristics

Property	Value
Provider	Tencent
License	Hy3 Preview License
Pricing	Open weights (free to download and run)
Context window	256,000 tokens
Parameters	295B total / 21B active (MoE)
Architecture	Hybrid fast-and-slow MoE; 3.8B MTP layer parameters
Status	Preview (April 23, 2026)
GitHub	Tencent-Hunyuan/Hy3-preview
Website	hy3ai.com
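
"Free to download and run" still implies a hardware budget. The estimate below is a rough sketch of the memory needed to hold the weights alone, assuming all 295B parameters must be resident even though only 21B are active per token (typical for MoE serving without expert offloading). The precision options are illustrative assumptions rather than formats confirmed for this release, and the figures exclude KV cache, activations, and runtime overhead.

```python
TOTAL_PARAMS = 295e9  # total parameter count from the table above

# Assumed storage widths in bytes per parameter; not confirmed release formats.
BYTES_PER_PARAM = {"bf16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gb(params: float, bytes_per_param: float) -> float:
    """Weights-only footprint in decimal gigabytes (1 GB = 1e9 bytes)."""
    return params * bytes_per_param / 1e9


for precision, width in BYTES_PER_PARAM.items():
    print(f"{precision}: {weight_memory_gb(TOTAL_PARAMS, width):.1f} GB")
```

Even at the most aggressive width assumed here, the weights exceed the memory of a single commodity GPU, so self-hosting likely means a multi-accelerator node.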
