Add a Databricks connection

Configuring OAuth

If you need to use OAuth with Databricks, you must configure OAuth in Microsoft Azure. For details, see OAuth for a Databricks connection.

Adding a Databricks connection

To connect to Databricks:

  1. Select Data in the top navigation bar.

  2. Click the Connections tab in the left navigation bar, and select the Others tile. Alternatively, select + Create new in the left navigation bar, select Connection, and select the Databricks tile.

  3. Create a name for your connection, a description (optional), then select the Databricks connection type, and click Continue.

    Add a Databricks connection
  4. Enter the connection details for your Databricks data source using Service Account, or OAuth.

    Connection details

    For Service Account authentication, do the following:

    • Enter Host, HTTP Path, User, and Password

    For OAuth authentication, do the following:

    • Enter Host, HTTP Path, OAuth Client ID, OAuth Client Secret, Scope, Auth Url, and Access token Url.

    Refer to the Databricks connection reference for more information on each of the specific attributes you must enter for your connection.

  5. (Optional) Provide additional key-value pairs that you need to set up your connection to Databricks or customize additional JDBC parameters supported by Databricks, by doing the following:

    • Select the Advanced Config menu to reveal the Key and Value fields.

    • Enter your key and value information. Refer to the installation and configuration guide that comes with the Databricks JDBC driver for information on Databricks’s supported keys and values.

    • To add more keys and values, select the plus sign (+), and enter them.

      Any key-value pairs that you enter must be defined in your Databricks data source. Key-value pairs are case-sensitive.
  6. Select Continue.

  7. Select tables (on the left) and the columns from each table (on the right), and then click Create connection.

    Select tables and columns for your connection

    A message appears indicating the number of tables and columns that will be added to your connection.

  8. Select Confirm.

After you add the connection, you can search your Databricks database.

The "Connection created" screen

Your new connection appears on the Data > Connections page. You can select the name of your connection to view the tables and columns in your connection.

The connection you just created is a link to the external data source. If there are any joins in the selected tables of the external data source, those are imported into ThoughtSpot.

You can now perform a live query on the selected tables and columns of your connection. Because the selected tables and columns in your connection are linked, it may take a while to initially render the search results. This is because ThoughtSpot does not cache linked data. With linked data, ThoughtSpot queries the external database directly, which is slower than querying data that is stored in ThoughtSpot’s database.

You can modify a Databricks connection in the following ways:

See the Connection reference for details of connection parameters.