ThoughtSpot configuration and licensing vary by memory availability. Other considerations also affect the performance of your solution. Because some solutions perform better than others, consider the following boundaries before implementation.
Each node in a ThoughtSpot cluster performs best when it holds less than 250GB of data and fewer than 0.25 billion total rows. For more complex schemas, we recommend even fewer rows per node. To reduce the total data volume and row count, you can limit the data range to the relevant years or months, or combine long and narrow tables into wider tables when possible.
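The per-node guidelines above can be turned into a rough node-count estimate. The following is a minimal sketch, not an official ThoughtSpot sizing method: the function name `estimate_min_nodes` is hypothetical, and it assumes that data is spread evenly across nodes and that both guidelines are strict ("less than" and "fewer than").

```python
# Per-node guidelines quoted in the text above.
MAX_GB_PER_NODE = 250
MAX_ROWS_PER_NODE = 250_000_000  # 0.25 billion rows


def estimate_min_nodes(total_gb: float, total_rows: int) -> int:
    """Smallest node count that keeps every node strictly under both guidelines.

    Assumes an even distribution of data across nodes (an illustrative
    simplification; real clusters may be skewed).
    """
    # floor(x / limit) + 1 keeps the per-node share strictly below the limit.
    nodes_for_size = int(total_gb // MAX_GB_PER_NODE) + 1
    nodes_for_rows = total_rows // MAX_ROWS_PER_NODE + 1
    return max(nodes_for_size, nodes_for_rows)


if __name__ == "__main__":
    # 600GB and 400 million rows: size is the limiting factor here.
    print(estimate_min_nodes(600, 400_000_000))
```

Complex schemas call for fewer rows per node, so treat the result as a lower bound rather than a recommendation.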
The total number of rows in a join result can affect performance. In general, we recommend that you have fewer than 10 billion rows in a many-to-many join. Also, consider these boundaries:
|Maximum number of rows that can be downloaded||10M (default: 1M)|
|Size in CSV format||250GB per node|
|Total number of rows across all tables||0.25B per node|
|Many-to-Many (Generic) join cardinality||2.5B per node|
|Load frequency||Once every hour|
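To compare a planned join against the cardinality boundary, you can estimate the number of rows it produces before loading the data. The sketch below is an illustration only: `join_cardinality` is a hypothetical helper, and it assumes an inner join on a single key column whose values you can extract from each table.

```python
from collections import Counter
from typing import Hashable, Iterable

# Many-to-many join boundary from the table above.
MAX_JOIN_ROWS_PER_NODE = 2_500_000_000


def join_cardinality(left_keys: Iterable[Hashable],
                     right_keys: Iterable[Hashable]) -> int:
    """Number of rows an inner join on these key columns would produce."""
    left_counts = Counter(left_keys)
    right_counts = Counter(right_keys)
    # Each key contributes (matches on the left) x (matches on the right).
    return sum(n * right_counts[key] for key, n in left_counts.items())


def within_join_boundary(rows: int, nodes: int) -> bool:
    """True if the join result stays within the per-node boundary."""
    return rows <= MAX_JOIN_ROWS_PER_NODE * nodes


if __name__ == "__main__":
    left = ["a", "a", "b"]
    right = ["a", "a", "a", "b", "c"]
    rows = join_cardinality(left, right)  # "a": 2 * 3, "b": 1 * 1
    print(rows, within_join_boundary(rows, nodes=1))
```

On large tables, the same counts can come from a grouped count of the join key on each side instead of from in-memory lists.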
Worksheets must have fewer than 1000 columns. For aggregated worksheets, follow these guidelines:
- Number of columns not to exceed 50
- Number of rows not to exceed 10 million
You can use an ETL process to circumvent these limitations. Speak with ThoughtSpot Customer Support to learn more.
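The worksheet guidelines above can be checked in a script before you build a worksheet. This is a minimal sketch with hypothetical names (`WorksheetStats`, `check_worksheet`); it assumes you already know the column and row counts and does not call any ThoughtSpot API.

```python
from dataclasses import dataclass
from typing import List

# Limits quoted in the text above.
MAX_WORKSHEET_COLUMNS = 1000       # worksheets must have fewer than this
MAX_AGGREGATED_COLUMNS = 50        # aggregated worksheets: not to exceed
MAX_AGGREGATED_ROWS = 10_000_000   # aggregated worksheets: not to exceed


@dataclass
class WorksheetStats:
    """Hypothetical summary of a planned worksheet."""
    name: str
    columns: int
    rows: int
    aggregated: bool = False


def check_worksheet(ws: WorksheetStats) -> List[str]:
    """Return a list of guideline violations; an empty list means none found."""
    problems = []
    if ws.columns >= MAX_WORKSHEET_COLUMNS:
        problems.append(f"{ws.name}: {ws.columns} columns, must be fewer than {MAX_WORKSHEET_COLUMNS}")
    if ws.aggregated:
        if ws.columns > MAX_AGGREGATED_COLUMNS:
            problems.append(f"{ws.name}: aggregated worksheet exceeds {MAX_AGGREGATED_COLUMNS} columns")
        if ws.rows > MAX_AGGREGATED_ROWS:
            problems.append(f"{ws.name}: aggregated worksheet exceeds {MAX_AGGREGATED_ROWS} rows")
    return problems


if __name__ == "__main__":
    sales = WorksheetStats("sales_summary", columns=62, rows=4_000_000, aggregated=True)
    for problem in check_worksheet(sales):
        print(problem)
```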
Aggregated Worksheets and Joins
To join an aggregated worksheet with a base table, you must configure your installation to allow this behavior. The following limits also apply:
- The aggregated worksheet cannot use more than 5 component tables.
- The number of rows in the final aggregated worksheet cannot be greater than 1000.
Chasm Trap Worksheets
For chasm trap scenarios where two or more fact tables join through a shared dimension, we recommend the following boundaries:
|Maximum number of fact tables in a worksheet||5|
|Maximum number of shared dimensions||2|
|Maximum number of rows in a non co-sharded shared dimension table of chasm trap||1B|
|Maximum number of rows in a co-sharded shared dimension table of chasm trap||1B|
Row-level Security Boundaries
The number of unique row-level security (RLS) rules with search data suggestions should not exceed 15K.
Scheduled Pinboards
For ideal performance of scheduled pinboards, do not exceed 50 scheduled pinboard jobs.