How to read CloudMonitor's AI cost reports
This guide walks through the AI cost dashboard CloudMonitor exposes once at least one AI provider connector is configured (GitHub Copilot, OpenAI, Anthropic Claude, or Cursor). The dashboard appears in two places: the AI cost page in your Power BI workspace, and the AI providers tab of the Admin App.
Five KPIs at the top
Section titled “Five KPIs at the top”The header strip shows the same five numbers in both surfaces.
1. Total AI spend
Section titled “1. Total AI spend”Sum of dollar cost across every connected provider for the selected period (default: month-to-date). Click the figure to drill into provider → account → workspace → model.
2. Cost per developer
Section titled “2. Cost per developer”Total spend divided by distinct active developers during the period. CloudMonitor counts a developer once even if they appear in multiple providers — say Copilot for IDE completions, Claude Code for terminal work, and Cursor for editor chat. The figure normalizes for the team size you have today, not the team size on the first of the month.
3. Model mix
Section titled “3. Model mix”Share of dollar spend by model tier: frontier, mid, entry. Frontier means Opus, GPT-5, and Gemini Ultra-tier models. Entry means Haiku, GPT-5 Nano, and Gemini Flash. Rule of thumb: every percentage point of frontier spend you shift to entry without quality loss is roughly 10× cheaper.
4. Cache hit rate
Section titled “4. Cache hit rate”Share of input tokens served from prompt cache (Anthropic and OpenAI both expose this). Anthropic charges cache reads at 0.1× the standard input rate; OpenAI’s cached input is similarly heavily discounted. A cache hit rate above 30% is healthy on workflows with long system prompts.
5. Batch percentage
Section titled “5. Batch percentage”Share of spend run through batch APIs. Both OpenAI and Anthropic offer 50% off on batch. If you run asynchronous work — back-fills, evals, document summarization — and batch percentage is under 10%, you’re leaving money on the table.
Drill-downs
Section titled “Drill-downs”Below the header strip, the dashboard has four tabs.
- Spend — provider, account, workspace, model, day. Date-range slicer at the top.
- Developers — per-developer cost, model mix, acceptance rate (Copilot and Cursor). Top spenders and unused seats sit on the same page.
- Efficiency — cache hit rate, batch percentage, and context-cliff exposure (requests over 200 K tokens that trigger 2× input pricing on most providers).
- Forecast — month-end projection vs budget per cost center.
Anomaly alerts
Section titled “Anomaly alerts”CloudMonitor opens an alert when daily spend exceeds the 30-day mean by 3 sigma for any (provider, account, model) combination. Alerts land in the Microsoft Teams channel you wired up at installation. Acknowledge from the Admin App Anomalies tab — acknowledged anomalies stop notifying but stay in the audit log.
Allocation tagging
Section titled “Allocation tagging”To attribute AI spend to a cost center, tag the connector account at setup or open Settings → AI providers → choose the connector → Allocation. Tags propagate to the cost center column on the Spend tab and feed the Allocation Coverage KPI on the Quantify domain dashboard.
For finer-grained per-PR or per-feature attribution, see the v2 LiteLLM proxy integration (currently in private preview).