Featured instruments
start here · 5MCP Server Playground
What actually goes over the wire when an agent calls a tool?
Decision: Expose N internal systems via MCP vs bespoke integrations — the crossover.
MCP Server Playground
What actually goes over the wire when an agent calls a tool?
Decision: Expose N internal systems via MCP vs bespoke integrations — the crossover.
Open the labMulti-Agent Orchestration Board
Is multi-agent worth the extra cost, or a party trick?
Decision: When multi-agent's quality gain justifies its cost multiplier.
Multi-Agent Orchestration Board
Is multi-agent worth the extra cost, or a party trick?
Decision: When multi-agent's quality gain justifies its cost multiplier.
Open the labProtocol Selection Lab
Function calling, MCP, A2A, or hybrid for this integration?
Decision: The number of producers and consumers picks the protocol.
Protocol Selection Lab
Function calling, MCP, A2A, or hybrid for this integration?
Decision: The number of producers and consumers picks the protocol.
Open the labAI Initiative Portfolio Dashboard
Which initiatives to kill, scale, or hold this quarter?
Decision: Portfolio kill/scale/hold via risk-adjusted ROI thresholds.
AI Initiative Portfolio Dashboard
Which initiatives to kill, scale, or hold this quarter?
Decision: Portfolio kill/scale/hold via risk-adjusted ROI thresholds.
Open the labAdoption & Change Readiness Instrument
Are the people ready, or just the model?
Decision: Scale / scale-with-conditions / hold the rollout.
Adoption & Change Readiness Instrument
Are the people ready, or just the model?
Decision: Scale / scale-with-conditions / hold the rollout.
Open the labEnterprise AI Lifecycle (spine)
Can this person run an AI program end to end, with gates?
Decision: FRAME → DATA → BUILD(RAG) → DEPLOY → GOVERN & REALIZE, gated.
Enterprise AI Lifecycle (spine)
Can this person run an AI program end to end, with gates?
Decision: FRAME → DATA → BUILD(RAG) → DEPLOY → GOVERN & REALIZE, gated.
Open the labBacklog Generator
Turn business requirements into user stories with backlog hygiene.
Decision: Requirements → sequenced, estimable backlog.
Backlog Generator
Turn business requirements into user stories with backlog hygiene.
Decision: Requirements → sequenced, estimable backlog.
Open the labRAG Quality Evaluator
Does retrieval + generation actually stay faithful and cite sources?
Decision: Score faithfulness/citations/hallucination before trusting a RAG build.
RAG Quality Evaluator
Does retrieval + generation actually stay faithful and cite sources?
Decision: Score faithfulness/citations/hallucination before trusting a RAG build.
Open the labGovern — guardrails & risk tiering
Which use cases need which controls before they ship?
Decision: Risk-tier a use case; map required guardrails.
Govern — guardrails & risk tiering
Which use cases need which controls before they ship?
Decision: Risk-tier a use case; map required guardrails.
Open the labAgent & Protocol Labs
the toolkit · 8MCP Server Playground
What actually goes over the wire when an agent calls a tool?
Decision: Expose N internal systems via MCP vs bespoke integrations — the crossover.
MCP Server Playground
What actually goes over the wire when an agent calls a tool?
Decision: Expose N internal systems via MCP vs bespoke integrations — the crossover.
Open the labAgent Loop & Failure Inspector
How do agents fail, and what catches it?
Decision: How much to budget for the observability harness around agents.
Agent Loop & Failure Inspector
How do agents fail, and what catches it?
Decision: How much to budget for the observability harness around agents.
Open the labMulti-Agent Orchestration Board
Is multi-agent worth the extra cost, or a party trick?
Decision: When multi-agent's quality gain justifies its cost multiplier.
Multi-Agent Orchestration Board
Is multi-agent worth the extra cost, or a party trick?
Decision: When multi-agent's quality gain justifies its cost multiplier.
Open the labTool-Use & Structured Output
How do messy inputs become schema-valid outputs, reliably?
Decision: Where to place the validation gate before outputs hit systems of record.
Tool-Use & Structured Output
How do messy inputs become schema-valid outputs, reliably?
Decision: Where to place the validation gate before outputs hit systems of record.
Open the labContext & Memory Engineering
Which context strategy — dump, summarize, compress, hand off?
Decision: Set the cost-fidelity dial per use case, not per platform.
Context & Memory Engineering
Which context strategy — dump, summarize, compress, hand off?
Decision: Set the cost-fidelity dial per use case, not per platform.
Open the labPrompt Cost & Token Simulator
What will this actually cost per month at volume?
Decision: Build-vs-buy on unit economics before architecture.
Prompt Cost & Token Simulator
What will this actually cost per month at volume?
Decision: Build-vs-buy on unit economics before architecture.
Open the labProtocol Selection Lab
Function calling, MCP, A2A, or hybrid for this integration?
Decision: The number of producers and consumers picks the protocol.
Protocol Selection Lab
Function calling, MCP, A2A, or hybrid for this integration?
Decision: The number of producers and consumers picks the protocol.
Open the labHuman-in-the-Loop Approval Simulator
How much autonomy before an edge case slips through?
Decision: Set autonomy level per risk tier, not per enthusiasm.
Human-in-the-Loop Approval Simulator
How much autonomy before an edge case slips through?
Decision: Set autonomy level per risk tier, not per enthusiasm.
Open the labBusiness of AI Delivery
the gallery · 5AI Initiative Portfolio Dashboard
Which initiatives to kill, scale, or hold this quarter?
Decision: Portfolio kill/scale/hold via risk-adjusted ROI thresholds.
AI Initiative Portfolio Dashboard
Which initiatives to kill, scale, or hold this quarter?
Decision: Portfolio kill/scale/hold via risk-adjusted ROI thresholds.
Open the labBuild-vs-Buy-vs-Fine-Tune Evaluator
Build it, buy it, or fine-tune it?
Decision: 3-year TCO across all three, with the flip condition.
Build-vs-Buy-vs-Fine-Tune Evaluator
Build it, buy it, or fine-tune it?
Decision: 3-year TCO across all three, with the flip condition.
Open the labInference Cost Forecaster
When does self-hosting undercut API spend?
Decision: The API-vs-self-host crossover, driven by utilization assumptions.
Inference Cost Forecaster
When does self-hosting undercut API spend?
Decision: The API-vs-self-host crossover, driven by utilization assumptions.
Open the labVendor Evaluation & Risk Monitor
Which vendor — and what does concentration cost if we're wrong?
Decision: Weighted vendor pick + concentration/exit-cost exposure.
Vendor Evaluation & Risk Monitor
Which vendor — and what does concentration cost if we're wrong?
Decision: Weighted vendor pick + concentration/exit-cost exposure.
Open the labBusiness Case / ROI Builder
What's the payback — and how fragile is it?
Decision: Fund/defer on an NPV range, not a single point.
Business Case / ROI Builder
What's the payback — and how fragile is it?
Decision: Fund/defer on an NPV range, not a single point.
Open the labEngagement Leadership
the control room · 10Adoption & Change Readiness Instrument
Are the people ready, or just the model?
Decision: Scale / scale-with-conditions / hold the rollout.
Adoption & Change Readiness Instrument
Are the people ready, or just the model?
Decision: Scale / scale-with-conditions / hold the rollout.
Open the labStakeholder & Sponsor Alignment Cockpit
Which sponsor is quietly drifting before the next steering?
Decision: Who needs to hear what, from whom, before the meeting.
Stakeholder & Sponsor Alignment Cockpit
Which sponsor is quietly drifting before the next steering?
Decision: Who needs to hear what, from whom, before the meeting.
Open the labCapacity & Resourcing Planner
Do 30 people actually cover this portfolio's skills?
Decision: Hire / contract / upskill per gap, with date + cost impact.
Capacity & Resourcing Planner
Do 30 people actually cover this portfolio's skills?
Decision: Hire / contract / upskill per gap, with date + cost impact.
Open the labDelivery Health & RAID Radar
Which 'green' workstream is actually trending into trouble?
Decision: Report trajectory, not snapshot — surface the reported-vs-actual gap.
Delivery Health & RAID Radar
Which 'green' workstream is actually trending into trouble?
Decision: Report trajectory, not snapshot — surface the reported-vs-actual gap.
Open the labAI Compliance Readiness Navigator
What tier is this AI, and what controls does it owe?
Decision: Risk-tier + required-controls map (EU AI Act + finserv overlays).
AI Compliance Readiness Navigator
What tier is this AI, and what controls does it owe?
Decision: Risk-tier + required-controls map (EU AI Act + finserv overlays).
Open the labTalent & Upskilling Pathway Planner
How do we get the team to agentic-era skills in time?
Decision: Build/hire/partner pathway per role, with time-to-productive.
Talent & Upskilling Pathway Planner
How do we get the team to agentic-era skills in time?
Decision: Build/hire/partner pathway per role, with time-to-productive.
Open the labRFP/RFI Response War Room
Should we even bid this — and where's the response weak?
Decision: Bid / no-bid as a portfolio decision (fit × win-prob × capacity × margin).
RFP/RFI Response War Room
Should we even bid this — and where's the response weak?
Decision: Bid / no-bid as a portfolio decision (fit × win-prob × capacity × margin).
Open the labEstimation & Scoping Studio
What's the real estimate — and what happens when scope moves?
Decision: Deliverable estimate range + staffing + change-control impact.
Estimation & Scoping Studio
What's the real estimate — and what happens when scope moves?
Decision: Deliverable estimate range + staffing + change-control impact.
Open the labResource Onboarding & KT Tracker
Why does it take 40 days to make a new hire productive?
Decision: Onboarding critical path + KT capture before senior roll-off.
Resource Onboarding & KT Tracker
Why does it take 40 days to make a new hire productive?
Decision: Onboarding critical path + KT capture before senior roll-off.
Open the labExecutive Communication Studio
Does this exec update force a decision, or just report status?
Decision: What decision to force this week, framed per audience.
Executive Communication Studio
Does this exec update force a decision, or just report status?
Decision: What decision to force this week, framed per audience.
Open the lab