GLM 4 32B 0414 128K API

from openai import OpenAI  # Initialize the OpenAI client with Qubrid base URL client = OpenAI(  base_url="https://platform.qubrid.com/v1",  api_key="QUBRID_API_KEY", )  stream = client.chat.completions.create(  model="zai-org/GLM-4-32B-0414-128K",  messages=[  {  "role": "user",  "content": "Explain quantum computing in simple terms"  }  ],  max_tokens=4096,  temperature=1,  top_p=1,  stream=True,  extra_body={  "enable_thinking": True,  } )  for chunk in stream:  if chunk.choices and chunk.choices[0].delta.content:  print(chunk.choices[0].delta.content, end="", flush=True)  print("\n")

GLM 4 32B 0414 128K

EnterprisePlatform Integration

Docker Support

Kubernetes Ready

SDK Libraries

Don't let your AI control you. Control your AI the Qubrid way!

Enterprise
Platform Integration