Back
The Infe API Ecosystem
Precision-engineered model offerings split across two distinct layers: Infe Edge AI for instantaneous global responses, and Infe Core for heavy-duty infrastructural reasoning.
Priority Edge Network
Infe Edge AI
Deployed across 300+ global PoPs. Our edge models are tuned for sub-50ms TTFT (Time To First Token). Zero-cold-start architecture leveraging our proprietary global CDN.
300+ Global Locations
Sub-50ms Latency
Serverless Architecture
High-Margin Economy
Infrastructural Core
Infe Core
The heavy-duty tier. Optimized for raw throughput and massive reasoning. Powered by specialized inference technology clusters for the biological limit of speed.
Specialized LPU Clusters
High Reasoning Density
Ultra-High Throughput
Enterprise Grade SLA
Edge Catalogue
Pricing per 1k TokensInfe Pulse 8B
Language
Optimized for global edge deployment via CDN nodes.
< 50ms
50 IU
Infe Echo Edge
Speech-to-Text
High-fidelity transcription at the browser edge.
< 100ms
80 IU
Infe Vision Nano
Vision
Real-time object detection and scene analysis.
< 80ms
60 IU
Core Infrastructure
Pricing per 1k TokensInfe Titan 70B
Reasoning
High-parameter model running on specialized LPUs.
Sub-100ms
250 IU
Infe Titan 120B
Deep Inference
Massive reasoning capabilities for enterprise logic.
Fast-Batch
450 IU
Infe Forge Pro
Coding
SOTA code generation with 128k context window.
Sub-100ms
200 IU
Infe Vision Ultra
Multimodal
High-resolution visual reasoning and OCR.
< 150ms
300 IU
The Unit Economy
Infe.io operates on a fixed-value currency called Infe Units (IU). 1,000,000 IU is precisely calibrated to $20.00 USD. This allows for high-granularity billing across multimodal requests without complex decimal logic.
Exchange Rate
50k IU = $1.00