Simple pricing
You only pay when you save
Sleev earns its keep by cutting your LLM bill. Start free, upgrade when the savings speak for themselves.
Free
$0 /mo
For trying the proxy with light usage.
Get started- 500 requests per month
- Context compression for supported coding tools
- Anthropic, OpenAI, Codex, Kimi, Moonshot, and opencode-go routing
- Savings dashboard
- Community support
Recommended
Pro
$20 /mo
Higher limits with the full optimization pipeline.
Start free trial- Higher request volume
- Advanced context management
- All current supported providers
- Detailed usage analytics
- Priority support
- Team features coming later
Frequently asked questions
- How does the savings model work?
- Sleev reduces tokens sent upstream by compressing stale context, replaying summaries, and stripping redundant content. You pay your provider directly for fewer tokens, and Sleev charges separately for the proxy service.
- Do you store my code or conversations?
- Sleev processes requests as a proxy and stores only account, token, session, and usage metadata needed to operate the service and dashboard. Provider credentials stay in your local harness or provider config and are supplied on each request.
- What happens if I cancel?
- Remove the Sleev base URL and Sleeve headers from your coding tool and you are back to direct provider calls. You can delete your account and sleeve tokens from the dashboard.
- Which providers and tools are supported?
- The proxy currently registers Anthropic, OpenAI, Codex, Kimi Coding, Moonshot AI, and opencode-go upstreams. Claude Code, opencode, and Codex CLI have setup snippets in the dashboard.
- Do you collect provider credentials?
- No. Your provider credential remains with your coding tool or provider configuration. The tool sends it to Sleev with each proxied request, and Sleev forwards it upstream unchanged without storing it.
- Is there an SLA?
- During early access, we do not offer formal SLA guarantees. Uptime commitments may come with a future enterprise tier.