Model Catalog

One API for all models. Search our library, deploy and run inference on NVIDIA GPUs in seconds

Get API Key Book a demo

Unlock $1 free API credit on first recharge - generate up to ~4M tokens

Showing 73 of 73 models

Chat

No models match your search. Try a different keyword or category.

Sign up to get $1.00 free API credit. Test out the latest models now.

Access enterprise-grade open-source AI models including Llama 3, DeepSeek, Qwen, and more via our high-performance serverless API. Experience low-latency inference on the latest NVIDIA GPUs optimized for production workloads.

Get API Key

"Qubrid's medical OCR and research parsing cut our document extraction time in half. We now have traceable pipelines and reproducible outputs that meet our compliance requirements."

Clinical AI Team

Research & Clinical Intelligence

Qubrid AI - The Full AI Stack is designed to give developers, researchers, and enterprises the GPU performance, AI-ready software, and cost-efficiency needed to unlock the full potential of AI.

Navigations

AI Appliances
GPU Virtual Machine
AI/ML Templates
Playground
Pricing
Model Catalog
Returns & Refunds
Contact Us

Developers

Documentation
Platform Updates
Model Updates
GitHub
Cookbook

Solutions

Enterprise OCR & RAG
AI Automation & Workflows
Custom Built AI Agents for Production
Clinical & Research Analysis
AI-Powered Marketing & Prospect Outreach

Company

About Us
Partners
Blog & News
Brand Kit
Privacy Policy
Terms & Conditions
Acceptable Use
Safety & Responsible Use
Returns & Refunds

Model Catalog

nvidia/NVIDIA-Nemotron-3-Super-120B-A12B

deepseek-ai/DeepSeek-V3.2

zai-org/GLM-5

MiniMaxAI/MiniMax-M2.5

moonshotai/Kimi-K2.5

Qwen/Qwen3-Coder-Plus

anthropic/claude-opus-4-6

openai/gpt-5.4

google/gemini-3.1-pro-preview

Qwen/WAN 2.7 Image

Qwen/Qwen3.6-Plus

MiniMaxAI/MiniMax-M2.1

moonshotai/Kimi-K2-Thinking

Qwen/Qwen3.5-Flash

Qwen/Qwen3.5-27B

Qwen/Qwen3.5-35B-A3B

Qwen/Qwen3.5-122B-A10B

Qwen/Qwen3.5-397B-A17B

deepseek-ai/DeepSeek-R1-0528

Qwen/Qwen3-Max

Qwen/Qwen3-VL-235B-A22B-Thinking

Qwen/Qwen3-Coder-480B-A35B-Instruct

Qwen/Qwen3-Next-80B-A3B-Thinking

nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16

meta-llama/Llama-3.3-70B-Instruct

tencent/HunyuanOCR

deepseek-ai/deepseek-r1-distill-llama-70b

microsoft/Fara-7B

Qwen/Qwen3-Coder-30B-A3B-Instruct

openai/gpt-oss-120b

openai/whisper-large-v3

Qwen/Qwen3-TTS-Flash

Qwen/Qwen3-Coder-Next

openai/gpt-oss-20b

mistralai/Mistral-7B-Instruct-v0.3

p-image

p-image-edit

stabilityai/stable-diffusion-3.5-large

Tongyi-MAI/Z-Image-Turbo

Z-Image-Turbo [LoRA]

FLUX.1 [dev]

FLUX.2 [klein] 4B

Qwen/Qwen-Image

p-video

p-image-lora

p-image-edit-lora

Qwen/Qwen3-VL-8B-Instruct

Qwen/Qwen3-Coder-Flash

Qwen/Qwen3-Plus

Qwen/Qwen3-VL-235B-A22B-Instruct

Qwen/Qwen3-VL-Flash

Qwen/Qwen3-VL-Plus

Qwen/Qwen3-VL-30B-A3B-Instruct

Qwen/Qwen-Image-2.0

Qwen/Qwen-Image-2.0-Pro

Qwen/Qwen-Image-2.0-Edit

Qwen/Qwen-Image-2.0-Pro-Edit

deepseek-ai/DeepSeek-V3

Qwen/Qwen3.5-Plus

zai-org/GLM-4.7

moonshotai/Kimi-K2-Instruct

google/gemini-3-flash-preview

google/gemini-2.5-pro

google/gemini-2.5-flash

anthropic/claude-opus-4-5

anthropic/claude-sonnet-4-6

anthropic/claude-sonnet-4-5

anthropic/claude-haiku-4-5-20251001

openai/gpt-4o

openai/gpt-4o-mini

openai/gpt-4.1

openai/gpt-5.4-mini

openai/gpt-5.4-nano

Sign up to get $1.00 free API credit. Test out the latest models now.