How DataFlow works

With DataFlow, you can move data into ThoughtSpot from most databases.

Follow the steps in this checklist to connect to your data source and establish data load.

Follow the steps in this checklist to connect to your data source and establish data load:
  • Step 1: Enable DataFlow on the cluster by running tscli dataflow enable. Refer to tscli dataflow commands.

  • Step 2: Add a connection to the data source.

  • Step 3: Select the source table or file.

  • Step 4: Specify the sync schedule: hourly, daily, weekly, monthly, or one-time only (does not repeat).

  • Step 5: Map tables or files from the data source to tables in the internal ThoughtSpot database.

  • Step 6: Map columns from the data source to columns in the internal ThoughtSpot database.

  • Step 7: [Optional] Set sync properties: conditions, sync mode (append or overwrite), additional scripts to run before or after the sync operation, specify additional sync properties.

  • Step 8: [Optional] Create table joins.