Presto
Prerequisites
To complete this guide, you will need permissions to create a read-only user on Presto.
Before configuring your Presto query engine, make sure your metastore (e.g., Glue or Hive Metastore) is connected, as Monte Carlo requires it to map query engine activity to data lake assets.
To connect Monte Carlo to a Presto cluster so that it can run data health SQL queries, follow these steps:
- Create a Presto cluster for Monte Carlo's data health queries using AWS EMR. Alternatively, you may use a cluster that already exists in your environment.
- Ensure that Monte Carlo's data collector has network connectivity to the cluster (VPC peering is required in most cases).
- Create a read-only service account on Presto if necessary.
- Provide service account credentials to Monte Carlo.
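Before handing the cluster details to Monte Carlo, it can help to confirm that the Presto coordinator answers over HTTP. The following is a minimal sketch, not an official check: the hostname is a hypothetical placeholder, the port mirrors the `--port` default of the CLI, and it assumes `curl` is installed. It probes the coordinator's `/v1/info` endpoint. To say anything about the data collector's connectivity, it would have to run from a host on the same network path as the collector (for example, inside the peered VPC).

```shell
#!/bin/sh
# Sketch: probe a Presto coordinator over HTTP.
# PRESTO_HOST is a hypothetical placeholder; replace it with your own value.
set -eu

PRESTO_HOST="${PRESTO_HOST:-presto.example.internal}"
PRESTO_PORT="${PRESTO_PORT:-8889}"
PRESTO_SCHEME="${PRESTO_SCHEME:-http}"

# Build the URL of the coordinator's info endpoint.
presto_info_url() {
  printf '%s://%s:%s/v1/info\n' "$1" "$2" "$3"
}

URL=$(presto_info_url "$PRESTO_SCHEME" "$PRESTO_HOST" "$PRESTO_PORT")
echo "Probe URL: $URL"

# Only touch the network when explicitly asked to (RUN=1).
if [ "${RUN:-0}" = "1" ]; then
  # Any HTTP response, even an error status, shows the network path works;
  # curl exits non-zero only on connection, DNS, TLS, or timeout failures.
  if curl --silent --show-error --max-time 10 --output /dev/null "$URL"; then
    echo "Coordinator reachable"
  else
    echo "Coordinator not reachable from this host" >&2
    exit 1
  fi
fi
```

Run it as-is to print the URL that would be probed, or with `RUN=1` to perform the request. A cluster that serves HTTPS with a private certificate may additionally need curl's `--cacert` option.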
Creating a service account on Presto
Monte Carlo supports basic authentication or custom certificates for Presto connections. Please create a read-only user for Monte Carlo, or obtain a certificate file that enables authentication to Presto.
Providing account credentials to Monte Carlo
You will provide connection details for Presto using Monte Carlo's CLI:
- Please follow this guide to install and configure the CLI.
- Please use the command [montecarlo integrations add-presto](https://clidocs.getmontecarlo.com/#montecarlo-integrations-add-presto) to set up Presto connectivity. For reference, see help for this command below:
    $ montecarlo integrations add-presto --help
    Usage: montecarlo integrations add-presto [OPTIONS]

      Setup a Presto SQL integration. For health queries.

    Options:
      --host TEXT                  Hostname. [required]
      --port INTEGER               HTTP port. [default: 8889]
      --user TEXT                  Username with access to catalog/schema.
      --password TEXT              User's password. If you prefer a prompt (with
                                   hidden input) enter -1
      --catalog TEXT               Mount point to access data source.
      --schema TEXT                Schema to access.
      --http-scheme [http|https]   Scheme for authentication. [required]
      --cert-file FILE             Local SSL certificate file to upload to
                                   collector. This option cannot be used with
                                   'cert-s3'.
      --cert-s3 TEXT               Object path (key) to a certificate already
                                   uploaded to the collector. This option cannot be
                                   used with 'cert-file'.
      --skip-cert-verification     Skip SSL certificate verification.
      --name TEXT                  Friendly name of the warehouse which the
                                   connection will belong to.
      --collector-id UUID          ID for the data collector. To disambiguate
                                   accounts with multiple collectors.
      --skip-validation            Skip all connection tests. This option cannot be
                                   used with 'validate-only'.
      --validate-only              Run connection tests without adding. This option
                                   cannot be used with 'skip-validation'.
      --auto-yes                   Skip any interactive approval.
      --option-file FILE           Read configuration from FILE.
      --help                       Show this message and exit.
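As an illustration of how these options might be combined, the sketch below first runs the connection tests with `--validate-only` and then adds the integration with the same arguments. The hostname, username, port, and scheme are hypothetical placeholders, and only flags listed in the help output are used. By default the script prints the two commands for review; it calls the CLI only when `RUN=1` is set.

```shell
#!/bin/sh
# Sketch: validate a Presto connection first, then register it.
# All values below are hypothetical placeholders; replace them with your own.
set -eu

PRESTO_HOST="${PRESTO_HOST:-presto.example.internal}"
PRESTO_PORT="${PRESTO_PORT:-8889}"
PRESTO_USER="${PRESTO_USER:-montecarlo_ro}"
PRESTO_SCHEME="${PRESTO_SCHEME:-https}"

# Shared arguments. "--password -1" asks the CLI to prompt with hidden input,
# so the password never appears in shell history.
set -- integrations add-presto \
  --host "$PRESTO_HOST" \
  --port "$PRESTO_PORT" \
  --user "$PRESTO_USER" \
  --password -1 \
  --http-scheme "$PRESTO_SCHEME"

VALIDATE_CMD="montecarlo $* --validate-only"
ADD_CMD="montecarlo $*"

# Print both commands for review.
echo "$VALIDATE_CMD"
echo "$ADD_CMD"

# Execute only when explicitly requested (RUN=1).
if [ "${RUN:-0}" = "1" ]; then
  montecarlo "$@" --validate-only   # run connection tests without adding
  montecarlo "$@"                   # add the integration
fi
```

Because each invocation uses `--password -1`, the CLI would prompt for the password twice when run with `RUN=1`. If authentication relies on a certificate, `--cert-file` or `--cert-s3` can be appended to the shared arguments, but not both.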
