How DataFlow works

With DataFlow, you can move data into ThoughtSpot from most databases.

Follow the steps in this checklist to connect to your data source and establish data load:
  • Step 1: Enable DataFlow on the cluster by running tscli dataflow enable. Refer to tscli dataflow commands.

  • Step 2: Add a connection to the data source.

  • Step 3: Select the source table or file.

  • Step 4: Specify the sync schedule: hourly, daily, weekly, monthly, or one-time only (does not repeat).

  • Step 5: Map tables or files from the data source to tables in the internal ThoughtSpot database.

  • Step 6: Map columns from the data source to columns in the internal ThoughtSpot database.

  • Step 7: [Optional] Set sync properties: conditions, sync mode (append or overwrite), additional scripts to run before or after the sync operation, specify additional sync properties.

  • Step 8: [Optional] Create table joins.