Live
OpenAI-compatible proxy
Meter chat and completions. Tag sessions and agents. Export usage for billing.
Infrastructure · India · INR settlement
MetricAI sits in front of your model calls. It meters every request, binds spend to agents and sessions, and produces numbers you can bill against. Built for production ledgers, not demos.
// Route the SDK through MetricAI. One line.
client = metricai.wrap(openai, { project_key: "pk_live__" })
→ POST /v1/chat/completions intercepted session=ses_8k2m₹ figures illustrative — wiring is real.
500+
Teams
Production agent workloads tracked
₹10Cr+
AI Spend Tracked
Metered across providers and sessions
99.9%
Uptime
Proxy and settlement infrastructure
A static API key and a monthly OpenAI invoice tell you nothing about who burned what. Loops and tool chains turn small mistakes into large bills.
Before
Agent economy
session, agent_id, and end_user.Your application keeps the same SDK surface. MetricAI terminates TLS, normalizes provider quirks, and emits metering events you can bill on.
Your app
Agents, backends, workers. Anything that calls chat or completions today.
MetricAI proxy
Auth, metering, attribution, settlement. One choke point for policy.
LLM providers
OpenAI-compatible and first-party routes. Provider keys stay server-side.
Platform
Observability, routing, and billing infrastructure built for India's agent economy.
Real-time spend by agent, session, and model. INR-native ledgers finance can reconcile.
OpenAI, Anthropic, Bedrock, and more through one proxy. Keys stay server-side.
Auto-select the best model per request with fallbacks, rules, and A/B testing.
Cost, error, and latency thresholds with email, Slack, and dashboard notifications.
Invite members, assign roles, and share dashboards across your organization.
Margin analysis, latency trends, tool overhead, and session-level debugging.
Billing
Tokens are inputs. Customers pay for results. MetricAI keeps both: provider cost from the wire, and commercial outcomes you define. Finance gets a line that maps to a contract clause.
// Bill on completion, not raw tokens
await metricai.record_outcome({
session_id: "ses_8k2m",
outcome: "ticket_resolved",
value_inr: 49.00,
});Surface vs. depth
Economics
Same product surface, different burn profiles. MetricAI rolls provider cost to the agent that triggered it. You see who subsidizes whom.
agent_support_tier1
₹+18,420
Recommendation
Ship more. Margin covers retries.
agent_onboarding
₹0
Recommendation
Breaking even. Cap prompts or shorten the system message.
agent_research
₹−6,900
Recommendation
Disable browse tool or require approval.
Protocol
Tool vendors need pricing that survives partial failures and double posts. MetricAI treats MCP calls like payment intents: correlated, auditable, and priced in INR before they hit your general ledger.
// Agent A pays Agent B for an MCP tool call (sketch)
POST /v1/mcp/transfer
{
"payer_agent": "agent_sales",
"payee_server": "mcp_crm_acme",
"correlation_id": "corr_9x1",
"amount_inr": "12.50",
"memo": "lookup_customer_record"
}We are not optimizing for a hero demo. The work is plumbing: identifiers on every call, deterministic metering, and contracts that survive an audit.
India adds GST discipline and rupee settlement expectations. MetricAI is written with that operating context from day one.
Live
Meter chat and completions. Tag sessions and agents. Export usage for billing.
Building
INR line items, tax metadata, and hooks into invoicing. Fewer spreadsheet bridges.
Roadmap
Net positions between internal agents and external MCP vendors on one statement.
Roadmap
Hard budgets per agent, circuit breakers on runaway loops, approval gates on spend.
Integration is deliberately boring. If you already call OpenAI from the server, you are one wrapper away from metered traffic.
You get a project key and proxy host. Keys are rotatable without redeploying provider secrets to clients.
One wrapper call. Headers carry session and agent identifiers. Streaming responses are still supported.
Point staging first. Compare provider invoices to MetricAI totals. Discrepancies should be explainable in one query.
Pull usage exports or webhooks into your billing system. Outcome lines are explicit events, not guesses.
pip install metricai-sdk
from metricai import wrap
from openai import OpenAI
base = OpenAI()
client = wrap(base, project_key="pk_live__")
# All completions are metered and attributed
r = client.chat.completions.create(
model="gpt-4o",
messages=[{"role": "user", "content": "..."}],
extra_headers={
"X-MetricAI-Session": "ses_8k2m",
"X-MetricAI-Agent": "agent_support_tier1",
},
)Full observability access. No credit card required. Five-minute setup with our SDK.
No credit card required · 5-minute setup
We are onboarding teams that already run agent workloads in production and need metering they can take to finance. Short form below. No automated drip.