Chat
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B
Input: $0.10 per 1M tokens
Output: $0.50 per 1M tokens
Parameters: 120B (12B active)
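The listed per-token prices can be turned into a rough per-request estimate. The sketch below is illustrative only: `estimate_cost` is a hypothetical helper, not part of any Qubrid SDK, and it assumes billing is a flat linear rate on input and output tokens, with no minimum charge, rounding rule, or discount.

```python
from decimal import Decimal

# Prices taken from the listing, in USD per 1M tokens.
INPUT_PRICE_PER_M = Decimal("0.10")
OUTPUT_PRICE_PER_M = Decimal("0.50")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of a request (hypothetical helper).

    Assumes flat linear pricing; actual billing rules may differ.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_PRICE_PER_M
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    )
    return total / TOKENS_PER_M


if __name__ == "__main__":
    # Example workload: 2M input tokens and 500k output tokens.
    print(estimate_cost(2_000_000, 500_000))
```

Decimal arithmetic is used instead of floats so that small per-token amounts do not accumulate binary rounding error when summed across many requests.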
One API for all models. Search our library, then deploy and run inference on NVIDIA GPUs in seconds.
Unlock $1 of free API credit on your first recharge, enough to generate up to about 4M tokens.
Access enterprise-grade open-source AI models, including Llama 3, DeepSeek, and Qwen, through our high-performance serverless API. Get low-latency inference on the latest NVIDIA GPUs, optimized for production workloads.
"Qubrid's medical OCR and research parsing cut our document extraction time in half. We now have traceable pipelines and reproducible outputs that meet our compliance requirements."
Clinical AI Team
Research & Clinical Intelligence