Skip to content

AI provider integration

Beta

Every Claude token, attributed to the work it did.

CloudMonitor reads your Anthropic usage into Fabric and turns a single API invoice into cost per team, per feature, and per model, with anomalies caught the day they start.

Anthropic bills per token, and that bill grows fast once Claude is in production. CloudMonitor breaks the invoice apart by model, by token type, and by the team or feature that drove it, so spend has an owner and a unit cost rather than one opaque number on a monthly statement.

  • Breaks Claude spend down by model (Haiku, Sonnet, Opus) and by input versus output tokens.
  • Attributes usage to teams, features, and environments, not just one account total.
  • Catches token-spend spikes from runaway loops or prompt changes within the day.
  • Builds a showback ledger so AI cost lands on the budget that owns it.

Provider breakdown

See where the Claude bill actually goes.

CloudMonitor splits Anthropic spend by model and token type so you can tell heavy generation from large-context work, and steer routine calls toward cheaper models with the evidence to back it.

  • Model mix across Haiku, Sonnet, and Opus
  • Input versus output token split per workload
  • Prompt caching and batch usage made visible
CloudMonitor AI spend view in Microsoft Fabric breaking Anthropic Claude usage down by model and token type

Anomaly detection

A runaway agent should not wait for the invoice.

Because token cost is incurred the moment a request runs, CloudMonitor forecasts daily Claude spend and fires an alert when usage jumps, routed to the team behind it with the dollar impact attached.

  • Daily token-spend forecast per team and feature
  • Spike alerts for prompt changes and retry loops
  • Routed to Teams with owner and impact attached
CloudMonitor anomaly inbox in Microsoft Fabric showing severity-ranked AI token-spend anomalies with dollar impact

Showback

Put AI cost on the budget that owns it.

A per-team ledger reconciles Claude spend against the feature and cost center it belongs to, so finance can charge it back and product can see cost per unit of value next to the volume it shipped.

  • Per-team and per-feature cost ledger
  • Cost per unit of value alongside volume
  • Chargeback-ready for finance
CloudMonitor cost ledger in Microsoft Fabric attributing AI spend to teams and features for chargeback

Token economics

Why Anthropic spend needs more than a SaaS line item.

AI cost is set the moment a request runs. A longer prompt, an extra retry, or a switch to a larger model can move spend in seconds, not billing cycles. CloudMonitor treats every token as a unit of cost you can attribute, forecast, and tie back to the work it produced.

The cost unit changed

Cloud spend is a vCPU-hour or a GB-month. Anthropic spend is a token, an inference call, and an agent session, measured per request and billed as it happens.

Allocate before you spend

Infrastructure FinOps reports after the bill lands. AI spend has to be attributed at the point of the call, so a runaway loop is caught in minutes, not on next month's invoice.

Unit economics, not totals

A single invoice number tells finance nothing. CloudMonitor reduces Anthropic usage to cost per team, per feature, and per unit of value: the conversation, the task, the pull request.

AI cost management is now near-universal: 98% of practitioners report governing AI spend, up from 31% two years earlier, and it ranks as the top skill FinOps teams are building. Source: State of FinOps 2026 Report

Billing model

API-direct token billing (you pay Anthropic per token), plus Claude subscription seats for Claude Code.

What drives the bill

  • Model tier: Opus costs many times more per token than Haiku for the same prompt.
  • Token type: output tokens are priced well above input tokens.
  • Prompt caching: cache reads are cheap, but cache writes carry a premium.
  • Long context: crossing the large-context threshold reprices the whole request.

What CloudMonitor does show

  • Cost per team, feature, and environment, not just one account total.
  • Model mix and input-versus-output split for every workload.
  • Daily forecast with spike alerts before the invoice arrives.
  • A chargeback-ready ledger tying spend to the work it produced.

CloudMonitor's approach aligns with the FinOps Foundation's Token Economics & SaaS working group, the emerging discipline for governing pay-per-token cost.

A Claude API key is easy to share and hard to govern. Usage spreads across teams, the invoice climbs, and finance is left with a single number nobody can explain. CloudMonitor reads Anthropic usage into your Fabric workspace and rebuilds that number into cost per team, per feature, and per model.

The same engine that allocates and forecasts your Azure spend applies to tokens, so AI cost is governed with the same discipline as the rest of the estate rather than living in a spreadsheet on the side.

Govern Anthropic spend on real data.

The beta runs against your own tenant: your spend, your allocation, your alerts.