moonshotai/Kimi-K2.5
Kimi K2.5 is Moonshot AI's most powerful open-source model to date — a native multimodal agentic model built through continual pretraining on 15 trillion mixed visual and text tokens atop Kimi-K2-Base. With 1T total parameters (32B active), it seamlessly integrates vision, language, and advanced agentic capabilities including an Agent Swarm paradigm that coordinates up to 100 parallel sub-agents, reducing execution time by 4.5x on parallelizable tasks.
api_example.sh
Technical Specifications
Model Architecture & Performance
Pricing
Pay-per-use, no commitments
API Reference
Complete parameter documentation
| Parameter | Type | Default | Description |
|---|---|---|---|
| stream | boolean | true | Enable streaming responses for real-time output. |
| temperature | number | 1 | Recommended 1.0 for Thinking mode, 0.6 for Instant mode. |
| max_tokens | number | 16384 | Maximum number of tokens to generate. |
| top_p | number | 0.95 | Controls nucleus sampling. |
| thinking_mode | select | thinking | Thinking mode enables deep reasoning traces. Instant mode provides fast direct responses. |
Explore the full request and response schema in our external API documentation
Performance
Strengths & considerations
| Strengths | Considerations |
|---|---|
| Native multimodal — trained jointly on 15T vision+text tokens Agent Swarm coordinates up to 100 parallel sub-agents 4.5x execution time reduction on parallelizable tasks 1T MoE with 32B active per token 76.8% SWE-bench Verified 50.2% HLE (Humanity's Last Exam) at 76% lower cost than Claude Opus 4.5 256K context window Supports Thinking and Instant modes | 1T parameter model requires significant infrastructure Video input is experimental 630GB full model size Agent Swarm requires official API for full functionality |
Use cases
Recommended applications for this model
Enterprise
Platform Integration
Docker Support
Official Docker images for containerized deployments
Kubernetes Ready
Production-grade KBS manifests and Helm charts
SDK Libraries
Official SDKs for Python, Javascript, Go, and Java
Don't let your AI control you. Control your AI the Qubrid way!
Have questions? Want to Partner with us? Looking for larger deployments or custom fine-tuning? Let's collaborate on the right setup for your workloads.
"Qubrid's medical OCR and research parsing cut our document extraction time in half. We now have traceable pipelines and reproducible outputs that meet our compliance requirements."
Clinical AI Team
Research & Clinical Intelligence
