by 0G Foundation
DeepSeek-V4-Flash is an efficient lightweight MoE model (284B total / 13B active parameters) with native 1M-token context support. Optimized for fast, low-latency, low-cost inference; well-balanced general capability tuned for high-throughput everyday chat, content creation, basic RAG, and batch text processing. Features native function calling and prompt caching reported via usage.prompt_tokens_details.cached_tokens.
1M
1,000,000 tokens
393K
393,216 tokens
0.3100 0G
per 1M tokens
0.6200 0G
per 1M tokens
1
chatbot
0G Router normalizes requests and responses across providers for you. All endpoints follow the OpenAI API format.
Endpoint
Code Examples
curl https://router-api.0g.ai/v1/chat/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer <YOUR_API_KEY>" \
-d '{
"model": "deepseek-v4-flash",
"messages": [
{"role": "user", "content": "Hello!"}
],
"stream": true
}'TEE Verification
If the provider runs inside a TEE (Trusted Execution Environment), you can request on-chain signature verification by setting verify_tee. The response will include a tee_verified field in the trace.
# Add verify_tee in the JSON body
{
"model": "deepseek-v4-flash",
"messages": [{"role": "user", "content": "Hello!"}],
"verify_tee": true
}Provider Routing
Control which provider handles your request using the provider parameter.
{
"model": "deepseek-v4-flash",
"messages": [{"role": "user", "content": "Hello!"}],
"provider": {
"sort": "latency",
"allow_fallbacks": true
}
}