0GM-1.0-35B-A3B
Chat0GM-1.0-35B-A3B deployment by 0G.AI, optimized for agentic coding and tool use. Prefix caching is supported, and cached token usage is reported in usage.prompt_tokens_details.cached_tokens. Thinking/reasoning is enabled by default. To disable, set chat_template_kwargs: {enable_thinking: false} in the request body.
by 0G Foundation
deepseek-v4-pro
ChatDeepSeek-V4-Pro is DeepSeek's flagship LLM, optimized for agentic coding, multi-step workflows, and complex reasoning. Features native function calling and supports a 1M token context window with up to 384K output tokens. Prompt caching is supported and reported via usage.prompt_tokens_details.cached_tokens.
by 0G Foundation
deepseek/deepseek-chat-v3-0324
ChatDeepSeek-V3.2 is a 671B-parameter mixture-of-experts LLM with hybrid thinking mode, excelling at coding, math, and multi-step reasoning. Supports native function calling and context caching.
by 0G Foundation
openai/whisper-large-v3
SpeechHigh-performance automatic speech recognition (ASR) model, providing multilingual transcription and translation.
by 0G Foundation
qwen/qwen3-vl-30b-a3b-instruct
ChatAlibaba's Qwen3-VL is a multimodal vision-language model supporting text and image inputs with text output. Strong at visual reasoning, OCR, and chart/document understanding. Served via Alibaba Cloud Model Studio (DashScope, qwen3-vl-flash tier).
by 0G Foundation
qwen3.6-plus
ChatAlibaba's Qwen3.6-Plus is a flagship LLM with hybrid linear attention and sparse mixture-of-experts routing, optimized for agentic coding, multi-step workflows, and complex reasoning. Features always-on chain-of-thought reasoning with adaptive depth, native function calling, and supports 119 languages. 1M token context window.
by 0G Foundation
z-image
ImageAsync text-to-image model optimized for Base64 encoded outputs.
by 0G Foundation
zai-org/GLM-5-FP8
ChatZ.ai's flagship GLM-5 reasoning model with native tool calling. Thinking/reasoning is enabled by default. To disable, set chat_template_kwargs: {enable_thinking: false} in request body.
by 0G Foundation
zai-org/GLM-5.1-FP8
ChatGLM-5.1 reasoning model with FP8 quantization, supports tool calling. Thinking/reasoning is enabled by default. To disable, set chat_template_kwargs: {enable_thinking: false} in request body.
by 0G Foundation