How DataFlow works
With DataFlow, you can move data into ThoughtSpot from most databases.
Follow the steps in this checklist to connect to your data source and establish data load.
-
Step 1: Enable DataFlow on the cluster by running
tscli dataflow enable
. Refer to tscli dataflow commands. -
Step 2: Add a connection to the data source.
-
Step 3: Select the source table or file.
-
Step 4: Specify the sync schedule: hourly, daily, weekly, monthly, or one-time only (does not repeat).
-
Step 5: Map tables or files from the data source to tables in the internal ThoughtSpot database.
-
Step 6: Map columns from the data source to columns in the internal ThoughtSpot database.
-
Step 7: [Optional] Set sync properties: conditions, sync mode (append or overwrite), additional scripts to run before or after the sync operation, specify additional sync properties.
-
Step 8: [Optional] Create table joins.