Agent Hosting · self-hosted
Llama 3.1 hosting
Asia Inference Grid · Singapore
Input price / 1M tokens: $0.08
Output price / 1M tokens: $0.16
Latency: 115 ms
Uptime: 99.2%
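As a rough illustration of how the listed per-token prices translate into a bill, the sketch below estimates cost in USD for a given token volume. The function name `estimate_cost_usd` and the example workload are hypothetical, and volume-tier discounts are not modeled because tiers are available by inquiry only.

```python
from decimal import Decimal

# Listed prices in USD per 1M tokens (taken from this listing; verify with the provider).
INPUT_PRICE_PER_M = Decimal("0.08")
OUTPUT_PRICE_PER_M = Decimal("0.16")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate cost in USD at the listed rates, ignoring any volume discounts."""
    total = (Decimal(input_tokens) * INPUT_PRICE_PER_M
             + Decimal(output_tokens) * OUTPUT_PRICE_PER_M)
    return total / TOKENS_PER_M


# Hypothetical workload: 50M input tokens and 10M output tokens per month.
monthly = estimate_cost_usd(50_000_000, 10_000_000)
print(f"Estimated monthly cost: ${monthly:.2f}")
```

Decimal is used instead of float so that small per-token amounts do not pick up binary rounding error when summed.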
Listing Details
- Region: Singapore
- Currency: USD
- Rate limits: Volume tiers available by inquiry
- Last checked: May 8, 2026
- Last updated: May 8, 2026
- Notes: Dedicated inference endpoint with volume discounts.
Comparison notice
AI API prices change frequently. Always verify with the provider before purchasing.
Asia Inference Grid
Self-hosted inference and GPU capacity for APAC teams needing regional routing.