AI workloads on Azure OpenAI and Foundry deployed across product teams. Total spend visible at tenant level but not attributed by workload, model, or team. Tags inconsistent. No token-level metering by workload. Every workload chose the largest model "to be safe". No optimisation cadence. CFO sees the bill, cannot challenge specific decisions.
Typical concerns
- ·AI spend growing faster than feature delivery
- ·Cannot attribute cost to specific workloads or teams
- ·Model-fit analysis absent — overspecified models everywhere
- ·No anomaly detection on AI consumption
- ·No optimisation cadence
Capability gaps
- ·Per-workload AI cost attribution
- ·Token-level metering via Azure Monitor
- ·Model-fit analysis per workload
- ·AI cost dashboards consumable by product owners
- ·Monthly AI optimisation cadence