DataFlow requirements and guidelines

Before you work with DataFlow, make sure your environment meets the following requirements, and review the guidelines.

Requirements

  • You must open port 9000. See Network ports.

  • Allocate a minimum of approximately 5 GB of disk space in the /etc/thoughtspot/ directory.
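
The two requirements above can be checked before you start with a short script. The following is a minimal sketch, not part of DataFlow: the function names has_enough_disk and port_is_reachable are hypothetical, and the 3-second timeout is an arbitrary choice. Only the port number, the directory, and the approximate 5 GB figure come from the requirements.

```python
import shutil
import socket

REQUIRED_FREE_BYTES = 5 * 1024**3  # approximately 5 GB, per the requirement above
DATAFLOW_PORT = 9000               # port named in the requirement above


def has_enough_disk(path: str = "/etc/thoughtspot/",
                    required: int = REQUIRED_FREE_BYTES) -> bool:
    """Return True if the filesystem holding `path` has at least `required` free bytes.

    Returns False if the path does not exist or cannot be inspected.
    """
    try:
        return shutil.disk_usage(path).free >= required
    except OSError:
        return False


def port_is_reachable(host: str, port: int = DATAFLOW_PORT,
                      timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within `timeout` seconds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False
```

For example, run has_enough_disk() on the ThoughtSpot node, and run port_is_reachable("<your-node-hostname>") from a machine that needs to reach DataFlow. A successful connection shows only that something is listening on the port and is reachable. A failure can mean either that the port is blocked or that the service is not running.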

Guidelines

  • Some data sources, such as Snowflake, Amazon S3, Google Cloud Storage, and Azure Blob Storage, temporarily store data as local files before loading it into the internal ThoughtSpot database. These sources require additional disk space; how much depends on the amount of data in the source.

  • The DataFlow TQL editor supports the following commands:

    • ALTER TABLE: Rename, add, and remove columns; modify column data types; add and remove primary and foreign keys.

    • CREATE DATABASE

    • CREATE SCHEMA

    • DELETE FROM <table> [WHERE ...]

    • UPDATE <table> ... SET ... [WHERE ...]

    For details on these TQL commands, see the TQL reference.
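
    If you generate TQL scripts programmatically, you could screen each statement against the list above before you paste it into the editor. The following is a rough sketch under stated assumptions: is_supported is a hypothetical helper, not a DataFlow or TQL API, and it checks only the leading keywords of a statement. It does not validate TQL syntax.

```python
import re

# Command prefixes taken from the list above; nothing else is assumed to be supported.
SUPPORTED_PREFIXES = (
    "ALTER TABLE",
    "CREATE DATABASE",
    "CREATE SCHEMA",
    "DELETE FROM",
    "UPDATE",
)


def is_supported(statement: str) -> bool:
    """Return True if the statement starts with one of the listed commands.

    Whitespace is collapsed and case is ignored, so "delete\n from t" matches
    "DELETE FROM". This is a prefix check only, not a parser.
    """
    normalized = re.sub(r"\s+", " ", statement.strip()).upper()
    return any(
        normalized == prefix or normalized.startswith(prefix + " ")
        for prefix in SUPPORTED_PREFIXES
    )
```

    With this sketch, is_supported("create schema sales;") returns True, and is_supported("SELECT * FROM t;") returns False because SELECT is not in the list above.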