Agent Toolkit

Monte Carlo's Agent Toolkit enables an agentic experience, allowing your AI tools to interact with your Monte Carlo Data account and bring data observability into your development workflow.

Why the Agent Toolkit exists

AI agents are powerful, but without the right context they'll confidently work with stale tables, ship changes that break downstream dashboards, or miss the alert that explains the bug you're chasing. Monte Carlo already has that context β€” freshness, lineage, active alerts, monitor coverage, query history, cost signals.

The Agent Toolkit is the bridge. It's a bundle of skills, hooks, and an MCP server that teaches your coding agent when and how to reach for Monte Carlo, so the observability data you already have shows up at the moment it's useful. The result: fewer broken merges, faster incident response, and monitoring coverage that keeps up with your data estate β€” all without you having to remember which Monte Carlo feature to call.

How the toolkit works

The toolkit is delivered as a single mc-agent-toolkit plugin that bundles everything β€” install once, get everything, and new capabilities arrive on the next update. Under the hood there are three building blocks that work together:

  • An MCP server that gives the agent live, authenticated access to your Monte Carlo account (alerts, lineage, monitors, query history, and more).
  • Skills β€” focused instructions that teach the agent how to do specific Monte Carlo jobs well.
  • Hooks β€” event-driven triggers that activate skills at the right moment (for example, before you edit a dbt model, or before a commit).

Quick start

Prerequisites

  • A Monte Carlo account with Editor role or above
  • A supported AI coding agent (see Supported agents below)

Install the mc-agent-toolkit plugin

Claude Code (recommended β€” best support):

  1. Add the marketplace:
/plugin marketplace add monte-carlo-data/mc-agent-toolkit
  1. Install the plugin:
/plugin install mc-agent-toolkit@mc-marketplace
  1. Authenticate β€” run /mcp, select the Monte Carlo server, and complete the OAuth flow in your browser.

  2. Verify β€” ask Claude: "Test my Monte Carlo connection."

  3. Enable auto-updates β€” run /plugin, select the monte-carlo-data/mc-agent-toolkit marketplace, and choose Enable auto-update. This keeps the toolkit current as new skills ship.

Cursor:

bash <(curl -fsSL https://raw.githubusercontent.com/monte-carlo-data/mc-agent-toolkit/main/plugins/cursor/scripts/install.sh)

Then restart Cursor. The Monte Carlo MCP server will prompt for OAuth on first use.

GitHub Copilot CLI:

git clone https://github.com/monte-carlo-data/mc-agent-toolkit.git

# Install hooks into your dbt project
./mc-agent-toolkit/plugins/copilot/scripts/install.sh /path/to/your/dbt-project

# Install the plugin (skills + MCP)
copilot plugin install ./mc-agent-toolkit/plugins/copilot

For other agents (OpenCode, Codex), see the plugins page on GitHub.

πŸ“˜

Keeping the toolkit up to date

Third-party plugin marketplaces in Claude Code don't auto-update by default. After installing, we recommend enabling auto-updates: In Claude Code: /plugin β†’ select the monte-carlo-data/mc-agent-toolkit marketplace β†’ Enable auto-update

For Cursor and other agents, re-run the install script periodically or watch the GitHub repo for releases.

Orchestrated workflows

For common, multi-step situations β€” responding to an incident, deciding what to monitor, or sorting through an ambiguous "something feels off with my data" β€” the agent proactively recommends and chains the right skills in the right order. You don't have to know which individual skill to call; the workflow handles the sequencing.

You can also launch a workflow explicitly with a slash command:

CommandWhat it orchestrates
/mcRoutes an ambiguous data question to the right skill or workflow. Asks clarifying questions first when the ask is vague.
/mc-incident-responseFull incident lifecycle β€” triage β†’ root cause β†’ remediation β†’ prevent recurrence.
/mc-proactive-monitoringCoverage discovery β†’ gap analysis β†’ prioritization β†’ monitor creation.

Workflows are smart proactive ways to sequence the individual skills below.

Individual skills

Every capability is also available on its own. Ask your agent a direct question ("check the health of the orders table", "tune this monitor to reduce noise", "what's costing us the most in storage?") and the matching skill activates. You can mix and match β€” the workflows are a convenience, not a cage.

What the skills do

Skills are grouped by the job they help you do. All skills ship with the plugin and are available in every supported editor.

Trust β€” is this data safe to use?

Foundational pre-query and pre-build checks so the agent doesn't reach for data that isn't ready.

SkillWhat it does
asset-healthSingle-table health report: freshness, active alerts, monitor coverage, importance, and upstream issues. Great as a pre-query check before you build on a table.

Incident Response β€” something is broken, fix it

Reactive skills for when alerts are firing, a table looks stale, or a pipeline failed.

SkillWhat it does
automated-triageFetches, scores, and prioritizes active alerts. Runs interactively or as a scheduled workflow.
analyze-root-causeInvestigates why an incident happened β€” lineage tracing, ETL checks, query analysis, and data profiling.
remediationProposes and executes fixes for data-quality alerts after assessing blast radius, or escalates with full context when uncertain.

Monitoring β€” plan coverage, create monitors, reduce noise

Skills for reasoning about monitor coverage and keeping monitors healthy.

SkillWhat it does
monitoring-advisorUses Monte Carlo's use-case and coverage data to pinpoint gaps in your data estate, then creates monitors for data tables or AI agents β€” validating tables and fields against your live workspace and emitting monitors-as-code YAML. See How Monitor Creation works below.
tune-monitorRecommends sensitivity, segment, and schedule changes to reduce alert noise on an existing metric monitor.

Prevent β€” catch issues before they ship

Silent, hook-driven skills that shape code changes before and after they happen. Auto-activates on dbt/SQL edits.

SkillWhat it does
preventSurfaces blast radius, lineage, alerts, and monitor coverage before an edit β€” and generates monitors-as-code for new transformation logic.
generate-validation-notebookGenerates targeted SQL validation queries for a dbt PR or local repo to verify a change behaved as intended.

See How MC Prevent works below for the full flow.

Optimize β€” cost and performance

Skills for cleaning up waste and diagnosing slowness in existing data assets.

SkillWhat it does
storage-cost-analysisIdentifies storage waste (unread, zombie, and dead-end tables) and estimates cleanup savings β€” using lineage to verify safety first.
performance-diagnosisDiagnoses slow pipelines and expensive queries across Airflow, dbt, Databricks, and other platforms.

Setup β€” onboarding and integrations

Admin-oriented skills for getting data into Monte Carlo and configuring connections.

SkillWhat it does
push-ingestionGenerates collection scripts to push metadata, lineage, or query logs to Monte Carlo from any data source.
connection-auth-rulesBuilds Connection Auth Rules JSON for a Monte Carlo connection type using live connector schemas.

Full details and source are in the agent toolkit repo.


Using skills standalone (without the plugin)

If your editor isn't listed above or you prefer to install skills individually:

  1. Configure the Monte Carlo MCP server:
claude mcp add --transport http monte-carlo-mcp https://mcp.getmontecarlo.com/mcp
  1. Install a skill:
npx skills add monte-carlo-data/mc-agent-toolkit --skill prevent
  1. Authenticate via /mcp and complete the OAuth flow.

See the MCP server docs for advanced configuration options.

Supported agents

The mc-agent-toolkit plugin is available for multiple AI coding agents:

AgentStatusInstallation
Claude CodeFull support (hooks, skills, MCP, slash commands)Setup guide
CursorFull support (hooks, skills, MCP)Setup guide
OpenCodeFull support (hooks, skills, MCP)Setup guide
Copilot CLIPreliminary (skills, MCP; hooks installed separately)Setup guide
CodexPreliminary (skills only)Setup guide

Editor compatibility: VS Code users can use Copilot CLI or Claude Code. JetBrains users can use Copilot CLI. Cursor is a standalone editor. Claude Code, Copilot CLI, and OpenCode also run in any terminal.

How MC Prevent works

MC Prevent is the flagship feature of the toolkit. When installed via the plugin, it uses hooks to enforce a safety-first workflow:

  1. You edit a dbt model β€” the pre-edit hook blocks until a change impact assessment runs. The agent surfaces downstream blast radius, active alerts, monitor coverage, and a risk-tiered recommendation.
  2. You confirm and edit β€” the post-edit hook records the change. The skill offers to generate monitors-as-code for new logic.
  3. You commit β€” the pre-commit hook checks for unresolved monitor coverage gaps.
  4. You validate β€” run /mc-validate or ask the agent to generate validation queries. Targeted SQL checks are saved to validation/<table>_<timestamp>.sql.

Without the plugin, the skill still works β€” it just relies on the agent following instructions rather than hooks enforcing the workflow.

How Monitor Creation works

Asking an AI agent to "add a monitor" without guardrails usually produces monitors that reference nonexistent tables, invalid fields, or the wrong monitor category for the problem. The monitoring-advisor skill prevents this by grounding every step in your live Monte Carlo workspace:

  1. Category selection β€” the agent matches the user's intent to the right Monte Carlo monitor category (table, metric, validation, comparison, custom SQL, or one of the agent monitor types) rather than defaulting to whatever it saw most in training.
  2. Table validation β€” the agent confirms the target table is ingested into Monte Carlo before proposing anything, using the MCP server to check in real time.
  3. Field verification β€” for metric and validation monitors, the agent checks that referenced fields exist and that their types are compatible with the chosen category.
  4. Monitors-as-code output β€” the agent emits monitors-as-code YAML, so the monitor is version-controlled and deployable through your existing CI/CD instead of clicked into the UI.

If you want to start from coverage instead of a specific table, run /mc-proactive-monitoring β€” the workflow analyzes gaps across your warehouses and use cases first, then hands off to monitoring-advisor to create the recommended monitors.

Resources