by 0G Foundation
0GM-1.0-35B-A3B deployment by 0G.AI, optimized for agentic coding and tool use. Prefix caching is supported, and cached token usage is reported in usage.prompt_tokens_details.cached_tokens. Thinking/reasoning is enabled by default. To disable, set chat_template_kwargs: {enable_thinking: false} in the request body.
262K
262,144 tokens
33K
32,768 tokens
0.2900 0G
per 1M tokens
1.7700 0G
per 1M tokens
1
chatbot