Generic Agent: GCP GKE Deployment

Deploy the Generic Agent on GCP using GKE

📝
Prerequisites

You have a GCP project with administrative access.

Terraform (>= 1.9) is installed with GCP authentication configured.

kubectl is installed for cluster access.

You are an Account Owner in Monte Carlo.

This guide walks through deploying the Generic Agent on GCP using GKE (Google Kubernetes Engine) with the monte-carlo-data/mcd-k8s-agent/google Terraform module. The module provisions all required infrastructure — including VPC, GKE cluster, GCS storage, Secret Manager, and IAM — and deploys the agent via Helm.

Architecture

The diagram below shows the GCP-specific deployment architecture. For a general overview of how the Generic Agent works, see the Architecture section on the main page.

The agent runs as a pod in a GKE cluster and connects to:

GCS Bucket — object storage for sampling and temporary data during operations.
Secret Manager — stores the agent token and integration credentials, synced to the cluster via External Secrets Operator.
Customer Integrations — the data sources (e.g. BigQuery, Snowflake) being monitored by Monte Carlo.

Provider Configuration

This module does not configure the google provider — your root module must do so. At minimum, set the target project and region:

provider "google" {
  project = "my-gcp-project"
  region  = "us-central1"
}

The module applies Monte Carlo agent labels to resources directly. To add your own labels alongside these, use the custom_default_tags variable.

Steps

1. Register the Agent

In Monte Carlo, navigate to Settings > Deployments and click Add.
Select Generic, enter a name for the agent, and choose an Authentication method:
- Key/Token (default) — the agent authenticates with a Monte Carlo-generated key pair (mcd_id / mcd_token).
- OAuth — the agent authenticates using OAuth 2.0 Client Credentials (client_id / client_secret). The agent automatically obtains and refreshes short-lived JWT tokens. See Authentication for more details.
The authentication method is permanent and cannot be changed after provisioning.
Click Provision.
Copy the credentials — they are displayed only once and cannot be retrieved later.
- Key/Token: copy the Key ID and Key Secret — these are used as mcd_id and mcd_token in the deployment steps below.
- OAuth: copy the Client ID and Client Secret — these are used as client_id and client_secret in the deployment steps below.

2. Deploy the Agent

2.1 Configure the Terraform Module

📘
Set backend_service_url to the Public endpoint shown in the Agent Service section of the Account Information page in Monte Carlo.

📘
Check Docker Hub for the latest available chart version and update the helm.chart_version value accordingly.

Agent authentication secret

The authentication secret depends on the method you chose during registration.

You must configure the agent token secret using one of two options:

Option 1 — Provide credentials at deploy time (recommended): The module creates and populates the secret in GCP Secret Manager.

token_credentials = {
  mcd_id    = var.mcd_id
  mcd_token = var.mcd_token
}

Define the variables in a terraform.tfvars file:

mcd_id    = "your-mcd-id"
mcd_token = "your-mcd-token"

Option 2 — Use a pre-existing secret: Point the module to an existing secret in GCP Secret Manager by name. The secret value must be a JSON object with the following format:

{
  "mcd_id": "<your-mcd-id>",
  "mcd_token": "<your-mcd-token>"
}

token_secret = {
  create = false
  name   = "my-existing-secret-name"
}

Full deployment (new VPC and cluster)

This is the simplest option — the module creates all infrastructure from scratch:

provider "google" {
  project = "my-gcp-project"
  region  = "us-central1"
}

module "mcd_agent" {
  source = "monte-carlo-data/mcd-k8s-agent/google"

  project_id          = "my-gcp-project"
  location            = "us-central1"
  backend_service_url = "<backend_service_url>"

  token_credentials = {
    mcd_id    = var.mcd_id
    mcd_token = var.mcd_token
  }

  helm = {
    chart_version = "0.0.2"
  }
}

Existing VPC

If you already have a VPC, provide the network and subnetwork IDs:

provider "google" {
  project = "my-gcp-project"
  region  = "us-central1"
}

module "mcd_agent" {
  source = "monte-carlo-data/mcd-k8s-agent/google"

  project_id          = "my-gcp-project"
  location            = "us-central1"
  backend_service_url = "<backend_service_url>"

  helm = {
    chart_version = "0.0.2"
  }

  networking = {
    create_network         = false
    existing_network_id    = "projects/my-gcp-project/global/networks/my-vpc"
    existing_subnetwork_id = "projects/my-gcp-project/regions/us-central1/subnetworks/my-subnet"
  }
}

Existing GKE cluster

If you already have a GKE cluster, the module can deploy the agent into it:

provider "google" {
  project = "my-gcp-project"
  region  = "us-central1"
}

module "mcd_agent" {
  source = "monte-carlo-data/mcd-k8s-agent/google"

  project_id          = "my-gcp-project"
  location            = "us-central1"
  backend_service_url = "<backend_service_url>"

  helm = {
    chart_version = "0.0.2"
  }

  cluster = {
    create                = false
    existing_cluster_name = "my-cluster"
  }

  networking = {
    create_network = false
  }
}

Infrastructure only (manual Helm deployment)

To provision GCP infrastructure without deploying the agent via Helm — useful if you want to manage the Helm release separately:

provider "google" {
  project = "my-gcp-project"
  region  = "us-central1"
}

module "mcd_agent" {
  source = "monte-carlo-data/mcd-k8s-agent/google"

  project_id          = "my-gcp-project"
  location            = "us-central1"
  backend_service_url = "<backend_service_url>"

  helm = {
    chart_version = "0.0.2"
    deploy_agent  = false
  }
}

output "helm_values" {
  value     = module.mcd_agent.helm_values
  sensitive = true
}

After applying, use the helm_values output as your values.yaml and follow the Helm deployment steps in the Kubernetes deployment guide.

2.2 Deploy

terraform init && terraform apply

Additional module inputs, options, and defaults can be found in the module documentation.

2.3 Verify

Configure kubectl access to the cluster:

gcloud container clusters get-credentials <cluster_name> --region <location> --project <project_id>

Check the agent pod is running:

kubectl get pods -n mcd-agent
kubectl logs -n mcd-agent -l app=mcd-agent --tail=30

Run the reachability test to confirm the agent can communicate with the Monte Carlo platform:

kubectl exec -n mcd-agent deploy/mcd-agent-deployment -- \
  curl -s -X POST localhost:8080/api/v1/test/reachability

A successful response contains "ok": true.

3. Enable the Agent

After verifying the agent is running, click Enable on the agent registration screen.

📘
If you've navigated away from the registration screen, go to Settings > Deployments, select your agent, and click Enable.

Once validations pass, click Enable in the validations dialog to activate the agent.

Outputs

The module exposes the following outputs:

Name	Description
`cluster_name`	GKE cluster name
`cluster_endpoint`	Endpoint for GKE control plane
`project_id`	GCP project ID
`location`	GCP region
`storage_bucket_name`	GCS bucket name for agent storage
`service_account_email`	Service account email for the agent
`namespace`	Kubernetes namespace for the agent
`helm_values`	Helm values for manual deployment (sensitive)

FAQs

Which Kubernetes versions are supported?

The agent is regularly tested against current Kubernetes releases on managed cloud platforms (EKS, AKS, GKE). The chart uses standard, non-deprecated Kubernetes APIs (v1, apps/v1, autoscaling/v2, rbac.authorization.k8s.io/v1), which require Kubernetes 1.23 or later — in practice, all currently supported versions on the major managed platforms.

When your cluster is upgraded to a newer Kubernetes version, no agent-side changes are typically required. If you encounter an issue specific to a particular version, please open a support case.

Does the agent run as root?

No. The agent and the metrics-collector daemonset run with runAsNonRoot: true:

The main agent deployment (mcd-agent-deployment) — runs as UID 1000
The metrics-collector daemonset (OpenTelemetry) — runs as UID 10001
All init containers used during pod startup (CA bundle build, credential extraction)

The optional Fluentd logs-collector daemonset is the exception — see How do I monitor the agent?. No container uses privileged: true, requests extra Linux capabilities, or attaches host namespaces.

Is SELinux supported?

Yes. The agent has been validated against Kubernetes nodes running SELinux in enforcing mode and triggers no AVC denials during normal operation. This includes the agent deployment, the logs-collector and metrics-collector daemonsets, and all init containers used by the chart.

Does the agent require inbound network access?

No. The Generic Agent is egress-only. All connections are initiated from the agent to the Monte Carlo platform. No inbound connectivity to the agent is required.

What outbound destinations does the agent connect to?

See the Outbound Destinations section in the architecture overview for the full list of egress targets and ports.

Can I customize the Helm values?

Yes. Use the custom_values variable to merge additional values with the module-generated Helm configuration. This accepts any map matching the chart's values.yaml schema.

How do I add credentials for data source integrations?

Create a secret in GCP Secret Manager with the integration credentials. See the Self-Hosted Credentials documentation for the JSON format for each integration type.
```
echo -n '{"connect_args": { ... }}' | \
  gcloud secrets create mcd-integrations-<integration> --data-file=-
```

Grant the agent's service account access to the secret. You can find the service account email in the Terraform output service_account_email.

gcloud secrets add-iam-policy-binding mcd-integrations-<integration> \
  --member="serviceAccount:<service_account_email>" \
  --role="roles/secretmanager.secretAccessor"

montecarlo integrations add-self-hosted-credentials-v2 \
  --connection-type <integration> \
  --self-hosted-credentials-type GCP_SECRET_MANAGER \
  --gcp-secret <secret_name>

How do I connect the agent to my data sources privately?

When the agent is deployed in a separate VPC from your data sources, you need to establish network connectivity between them. Common options include:

VPC Network Peering — connect two VPC networks directly. See GCP VPC Network Peering documentation.
Private Service Connect — access Google Cloud services (e.g. Cloud SQL, BigQuery) privately. See Private Service Connect documentation.
Shared VPC — share a VPC network across multiple projects. See Shared VPC documentation.

If your data sources are on-premises or in another cloud, you can use Cloud Interconnect or Cloud VPN to establish connectivity.

The agent itself does not require any special configuration for networking — connectivity to data sources must be configured at the infrastructure level.

How do I scale the agent?

There are several ways to scale the agent:

Manual replicas: Set agent.replica_count in your module configuration:

agent = {
  replica_count = 3
}

Thread count: Increase the number of concurrent operations a single replica can process via custom_values:

custom_values = {
  container = {
    opsRunnerThreadCount = "36"
  }
}

The default is 18. Increasing this value allows each replica to handle more operations in parallel, which can be useful before adding more replicas.

Pod resources: Set CPU and memory requests/limits to ensure replicas have adequate resources, especially when increasing thread count:

custom_values = {
  container = {
    resources = {
      requests = {
        cpu    = "500m"
        memory = "512Mi"
      }
      limits = {
        cpu    = "2"
        memory = "2Gi"
      }
    }
  }
}

Autoscaling (HPA): Enable the Horizontal Pod Autoscaler to scale replicas automatically based on CPU (and optionally memory) utilization:

custom_values = {
  autoscaling = {
    enabled                          = true
    minReplicas                      = 1
    maxReplicas                      = 5
    targetCPUUtilizationPercentage   = 70
  }
}

When autoscaling is enabled, replica_count is ignored — the HPA manages replicas. container.resources.requests must be set for the HPA to calculate utilization, and metrics-server must be installed in the cluster (standard in EKS, AKS, and GKE).

How do I upgrade the agent?

To upgrade the agent, update both the Helm chart version and the agent image tag in your module configuration, then run terraform apply:

agent = {
  image = "montecarlodata/agent:1.0.0-generic"
}

helm = {
  chart_version = "0.1.0"
}

Available agent images are published to montecarlodata/agent on Docker Hub (use tags ending with -generic, e.g. latest-generic or 1.0.0-generic). Helm chart versions are available at montecarlodata/generic-agent-helm.

You will not receive notifications when an update is available from Monte Carlo, but you can subscribe to the repositories for new updates. Keeping the agent up to date is part of your shared responsibility.

How do I monitor the agent?

You can monitor the agent using Google Cloud Monitoring for GKE. The agent exposes a health endpoint at /api/v1/test/healthcheck that can be used for liveness and readiness probes.

Using your platform's logs and metrics pipeline

By default the chart ships agent logs in-process — the agent buffers and POSTs them itself, no DaemonSet required — and deploys a metrics-collector daemonset (OpenTelemetry) for runtime metrics. If your platform already collects pod logs and container metrics centrally and you don't want a parallel pipeline, you can opt out of both:

logShipping: none
metricsCollector:
  enabled: false

The agent itself continues to run normally; only the Monte Carlo log-shipping path and the metrics-collector daemonset are removed. The agent's stdout JSON logs still flow to your cluster's standard log driver and any pipeline subscribed to it.

logShipping accepts three values:

in-process (default) — agent ships its own logs to Monte Carlo
fluentd — opt in to a logs-collector daemonset (Fluentd) that tails container logs from the host. Requires root pods. Mostly useful for troubleshooting or specialized log-routing setups.
none — disable Monte Carlo log shipping entirely; rely on your own pipeline

Trade-off: with logShipping: none and metricsCollector.enabled: false, Monte Carlo support won't have direct visibility into agent logs and metrics during troubleshooting — you may be asked to share logs manually if you open a support case.

How do I rotate agent credentials?

Update the secret in GCP Secret Manager:

echo -n '{"mcd_id":"<new-mcd-id>","mcd_token":"<new-mcd-token>"}' | \
  gcloud secrets versions add mcd-agent-token --data-file=-

Force sync the Kubernetes secret from External Secrets Operator:

kubectl annotate externalsecret -n mcd-agent --all \
  force-sync=$(date +%s) --overwrite

Restart the agent services:

kubectl rollout restart deployment mcd-agent-deployment -n mcd-agent
kubectl rollout restart daemonset metrics-collector -n mcd-agent

Updated 18 days ago

Did this page help you?