A datasource is a configuration for accessing your database from Cluvio. It consists of the connection method, the database user credentials, and any additional options.
Cluvio supports the following three ways to connect to your database:
- connecting via the Cluvio agent, which you can install and run on your machine or server to provide easy and secure database connectivity
- connecting via an SSH tunnel through a hop server with a publicly reachable IP address, from which your database can be reached (typically over a private network)
- connecting directly, which can be used when your database is a cloud-native database (such as BigQuery) or has a publicly reachable IP address.
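To illustrate the idea behind the SSH tunnel option, the following minimal sketch builds a standard OpenSSH local port-forwarding command (`ssh -N -L`). It shows the general technique only and is not a description of how Cluvio establishes its tunnel internally. The function name, user, host names and ports are all hypothetical.

```python
from typing import List


def build_tunnel_command(
    hop_user: str,
    hop_host: str,
    db_host: str,
    db_port: int,
    local_port: int,
) -> List[str]:
    """Build an OpenSSH command that forwards a local port to a database
    that is only reachable from the hop server.

    All argument values are illustrative assumptions, not Cluvio settings.
    """
    # -L local_port:db_host:db_port forwards connections made to local_port
    # on this machine to db_host:db_port as seen from the hop server.
    forward = f"{local_port}:{db_host}:{db_port}"
    # -N tells ssh not to run a remote command (port forwarding only).
    return ["ssh", "-N", "-L", forward, f"{hop_user}@{hop_host}"]


if __name__ == "__main__":
    # Hypothetical hop server with a public address and a PostgreSQL
    # database on a private network behind it.
    command = build_tunnel_command(
        "tunnel", "hop.example.com", "10.0.0.5", 5432, 15432
    )
    print(" ".join(command))
```

While such a tunnel is open, a client on the same machine would connect to `localhost:15432` and the traffic would reach the database through the hop server.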
The Security Whitepaper describes in detail how Cluvio treats your data and keeps it secure.
At the moment, Cluvio supports the following databases as data sources:
- Amazon Redshift / Panoply
- PostgreSQL / Heroku Postgres
- MySQL / MariaDB / Amazon Aurora
- Google BigQuery
- Google Cloud Spanner
- Presto
- Snowflake
- Exasol
- Microsoft SQL Server
- Oracle Database
- MongoDB BI Connector
- Vertica
When you configure a datasource with connectivity to your database, Cluvio fetches only the database metadata (version, timezone, maximum number of simultaneous connections), schema information (tables and columns of the selected database), and table row counts. This information is refreshed nightly, so Cluvio displays up-to-date information even after you change the database tables or columns.
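As a rough illustration of what schema information and row counts look like, the sketch below collects table names, column names and row counts from a database without reading any row contents. It uses Python's built-in `sqlite3` module purely as a stand-in. SQLite is not one of the supported databases, and the actual queries Cluvio runs are not documented here. The function name and sample table are hypothetical.

```python
import sqlite3
from typing import Any, Dict


def collect_schema_info(conn: sqlite3.Connection) -> Dict[str, Dict[str, Any]]:
    """Return column names and row counts per table.

    Only catalog information and COUNT(*) are queried; no row values
    are read. SQLite is an assumption made for the sake of a runnable
    example.
    """
    info: Dict[str, Dict[str, Any]] = {}
    tables = conn.execute(
        "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
    ).fetchall()
    for (table,) in tables:
        # PRAGMA table_info returns one row per column; index 1 is the name.
        columns = [row[1] for row in conn.execute(f'PRAGMA table_info("{table}")')]
        (row_count,) = conn.execute(f'SELECT COUNT(*) FROM "{table}"').fetchone()
        info[table] = {"columns": columns, "row_count": row_count}
    return info


if __name__ == "__main__":
    # Hypothetical sample database held in memory.
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE orders (id INTEGER, total REAL)")
    conn.executemany("INSERT INTO orders VALUES (?, ?)", [(1, 9.5), (2, 20.0)])
    print(collect_schema_info(conn))
```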
Cluvio does not extract any data from your tables. All reports, dashboards, SQL alerts, etc. run their queries directly against your database, and the results are cached for a limited time.
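The idea of running queries against the database and caching results for a limited time can be sketched as a small time-to-live cache. This is a generic illustration under stated assumptions, not Cluvio's implementation. The class name, the `run_query` callable and the 60-second lifetime are all hypothetical.

```python
import time
from typing import Any, Callable, Dict, Tuple


class QueryResultCache:
    """Cache query results for a limited time (a generic TTL cache sketch)."""

    def __init__(
        self,
        run_query: Callable[[str], Any],
        ttl_seconds: float,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self._run_query = run_query  # executes SQL against the database
        self._ttl = ttl_seconds      # how long a cached result stays valid
        self._clock = clock          # injectable so the cache can be tested
        self._entries: Dict[str, Tuple[float, Any]] = {}

    def fetch(self, sql: str) -> Any:
        now = self._clock()
        entry = self._entries.get(sql)
        if entry is not None and now - entry[0] < self._ttl:
            # Still fresh: serve the cached result without touching the database.
            return entry[1]
        # Missing or expired: run the query again and remember when it ran.
        result = self._run_query(sql)
        self._entries[sql] = (now, result)
        return result


if __name__ == "__main__":
    def fake_database(sql: str) -> str:
        # Stand-in for a real database call.
        return f"result of {sql!r}"

    cache = QueryResultCache(fake_database, ttl_seconds=60)
    print(cache.fetch("SELECT 1"))  # runs the query
    print(cache.fetch("SELECT 1"))  # served from the cache
```

With this design, a repeated query within the lifetime never reaches the database, while a query issued after the lifetime has elapsed is always re-run against it.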
For further information on configuring and securing database access with Cluvio, see also:
- Securing your database connection
- Connecting through an SSH tunnel
- Creating a read-only user in the database
- Connecting to a local or test database