Datasets

Creating a Dataset

Datasets are encrypted tables stored in ThoughtSpot and refreshed at a user-defined interval. Datasets can be created by either querying a database or uploading a Google Sheet. To create a new Dataset, select the Create button in the top right corner of your Analyst Studio home screen and choose Make a reusable Dataset.


Alternatively, go to My Work in the left navigation and click the shortcut tile for Make a reusable Dataset.


This opens the Datasets editor, where you can write a new SQL query or insert SQL snippets using Definitions.
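As a rough illustration of the kind of query a Dataset might be built on, the sketch below runs an aggregate query against an in-memory SQLite database. The `orders` table, its columns, the sample rows, and the use of SQLite are all assumptions made for this example; in Analyst Studio the query runs against your own connected database and its SQL dialect may differ.

```python
import sqlite3

# Hypothetical Dataset query: one row per day with an order count and revenue.
# The table and column names are illustrative, not part of Analyst Studio.
DATASET_QUERY = """
SELECT order_date,
       COUNT(*)    AS order_count,
       SUM(amount) AS revenue
FROM orders
GROUP BY order_date
ORDER BY order_date
"""


def run_dataset_query(conn):
    """Run the query and return the resulting rows as a list of tuples."""
    return conn.execute(DATASET_QUERY).fetchall()


def demo_rows():
    """Build a tiny sample table in memory and run the query against it."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE orders (order_date TEXT, amount REAL)")
    conn.executemany(
        "INSERT INTO orders VALUES (?, ?)",
        [("2024-01-01", 10.0), ("2024-01-01", 5.0), ("2024-01-02", 7.5)],
    )
    rows = run_dataset_query(conn)
    conn.close()
    return rows


if __name__ == "__main__":
    for row in demo_rows():
        print(row)
```

Aggregating to one tidy row per entity, as here, tends to make a Dataset easier to reuse across several reports than a query shaped for a single chart.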


Adding calculated fields to the Dataset

You can add new calculated fields to the Dataset from the Fields tab. Doing so will add the calculated field to the list of available fields, and also make it available in any reports created from the Dataset.

To add a new calculated field, first select the New field button.


Then enter a name and the formula for your calculated field. To save the calculated field, select the Apply and Close button.


Viewing the source syntax

To view the source syntax of the query that was run, open the Source tab.


Adding a name and description to the Dataset

You can add a name and description to the Dataset. To do so, select the caret next to the placeholder name, “Untitled Dataset.” From the dropdown, select Rename.


Enter the desired Dataset name and description. Then select Save.


We recommend using consistent naming conventions and adding detailed descriptions to your Datasets. Doing so will help other team members find and understand how to use the Dataset.
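One way a team might keep names consistent is to agree on a pattern and check proposed names against it before saving. The sketch below assumes a purely hypothetical convention (lowercase snake_case with at least two parts, such as `team_subject`) and a minimum description length; neither rule comes from Analyst Studio, and `check_dataset_metadata` is an illustrative helper, not a product feature.

```python
import re

# Hypothetical team convention: lowercase snake_case with at least two parts,
# for example "finance_monthly_revenue". Adjust to whatever your team agrees on.
NAME_PATTERN = re.compile(r"[a-z]+(_[a-z0-9]+)+")

# Hypothetical minimum number of words for a description to count as detailed.
MIN_DESCRIPTION_WORDS = 5


def check_dataset_metadata(name, description):
    """Return a list of problems found; an empty list means both checks pass."""
    problems = []
    if not NAME_PATTERN.fullmatch(name):
        problems.append(f"name {name!r} does not follow the team_subject pattern")
    if len(description.split()) < MIN_DESCRIPTION_WORDS:
        problems.append(
            f"description has fewer than {MIN_DESCRIPTION_WORDS} words"
        )
    return problems


if __name__ == "__main__":
    print(check_dataset_metadata(
        "finance_monthly_revenue",
        "Monthly revenue by region, refreshed daily.",
    ))
    print(check_dataset_metadata("Untitled Dataset", ""))
```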

Scheduling a Dataset

You can set a schedule for your Dataset to refresh. When a Dataset refreshes, all associated Reports built using that Dataset will receive a prompt to pull in the fresh data.

To create a new schedule, select the caret next to the Dataset name and choose Schedule.


Then, select Create New Schedule to open the scheduling options. From here, you can set the refresh frequency, as well as the specific time and timezone.
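To make the interaction between the refresh time and the timezone concrete, the sketch below works out when a daily schedule would next fire, expressed in UTC. It assumes a simple once-a-day schedule and uses Python's standard `zoneinfo` module; the function `next_daily_refresh` is hypothetical and does not reflect how Analyst Studio implements scheduling internally.

```python
from datetime import datetime, time, timedelta, timezone
from zoneinfo import ZoneInfo


def next_daily_refresh(now_utc, local_time, tz_name):
    """Return the next occurrence of local_time in tz_name, as a UTC datetime.

    now_utc must be a timezone-aware datetime. The schedule is assumed to be
    a plain daily refresh at one fixed local wall-clock time.
    """
    tz = ZoneInfo(tz_name)
    now_local = now_utc.astimezone(tz)

    # Try today's slot in the schedule's timezone first.
    candidate = datetime.combine(now_local.date(), local_time, tzinfo=tz)
    if candidate <= now_local:
        # Today's slot has already passed, so use tomorrow's.
        candidate = datetime.combine(
            now_local.date() + timedelta(days=1), local_time, tzinfo=tz
        )
    return candidate.astimezone(timezone.utc)


if __name__ == "__main__":
    now = datetime(2024, 1, 15, 12, 0, tzinfo=timezone.utc)
    print(next_daily_refresh(now, time(6, 0), "America/New_York"))
```

Because the schedule is anchored to a local wall-clock time, the equivalent UTC time shifts when the chosen timezone enters or leaves daylight saving time.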


Publishing a Dataset to ThoughtSpot

Once you’ve created a Dataset and set a schedule, follow these steps to publish to ThoughtSpot:

  1. Click the blue Publish button in the upper right side of the top menu.

    You can only publish Datasets you created.
  2. In the Publish to the Data workspace window, enter the Dataset name and click Publish.

  3. Your Dataset is published to ThoughtSpot as a table. You can access it from the Datasets section in the Data workspace.

    Only users with the “Can manage data” privilege can access the Data workspace.
  4. Unlike items from ThoughtSpot Connections, which are live, a Dataset is an extract that runs on a schedule. Your Dataset in ThoughtSpot will refresh according to the schedule(s) you set in Analyst Studio.

Note that editing an existing, published Dataset may break the content built on it. We recommend creating a new Dataset with the desired edits and publishing that instead. If you delete a Dataset after publishing, you must also delete the Dataset in the ThoughtSpot Data workspace.

