Automated Triage, Root Cause Analysis, and Remediation Skills
Three new skills in the MC Agent Toolkit bring the full alert lifecycle into your coding agent.
Automated Triage fetches recent alerts, scores each one by confidence and impact, runs deep troubleshooting on high-signal alerts, and recommends actions. Start from a built-in example workflow or customize to match how your team responds.
Root Cause Analysis systematically investigates data incidents across freshness, volume, schema, field metrics, and ETL failures. It walks the lineage chain upstream, checks ETL jobs across Airflow, dbt, and Databricks, detects query changes, and profiles actual data when a database connector is available.
Remediation picks up where investigation ends. It discovers available tools, proposes a fix with risk assessment and rollback plan, executes with safety rails, and documents everything on the alert. All three skills compose naturally: triage surfaces alerts, root cause analysis investigates them, and remediation fixes them.
