Steps to train Spotter

Plan for training Spotter

In order to effectively train Spotter, you must first start by building a good understanding of who the end users are and what their data requirements are. Let’s take an example of the Salesforce dataset of a Software as a Service company to understand how you should plan for training Spotter.

Step 1: Identify the target users

As discussed in the Data modeling article, you have to start the training process by identifying who will be the target users for Spotter, then listing the types of queries those users will ask.

The Salesforce dataset can be useful for many different teams in your organization. For example, the table below provides a quick summary of the different teams in the organization, and the various user personas and the corresponding use-cases these users may be interested in.

Team	Persona	Use-cases
Sales	Sales Leadership (CRO & Directors)	Sales Pipeline & Performance Analysis Regional Performance Metrics Team Performance & Productivity Customer Segmentation & Portfolio Analysis Sales Process Efficiency
Customer Success	Customer Success managers	Customer Relationship Metrics Product adoption & Usage
Marketing	Marketing team	Marketing Attribution Marketing channel analysis Marketing Spend & ROI Analysis

Team

Persona

Use-cases

Sales

Sales Leadership (CRO & Directors)

Sales Pipeline & Performance Analysis Regional Performance Metrics Team Performance & Productivity Customer Segmentation & Portfolio Analysis Sales Process Efficiency

Customer Success

Customer Success managers

Customer Relationship Metrics Product adoption & Usage

Marketing

Marketing team

Marketing Attribution Marketing channel analysis Marketing Spend & ROI Analysis

Narrowing down to a specific persona and the use-cases you want to cater to for these personas is extremely important during the initial stages. You can prioritize the persona and use-cases based on following factors :

Demand for insights: The teams that are most underserved may also be a good candidate for Spotter to start with as these are the teams that are ready to take data-driven decisions and are mainly blocked by ability to get the insights in a timely manner. Once self-serve analytics are available, these teams will be inclined to use it.
Value delivered by enabling self-serve analytics: Another important factor to consider is the value generated by the organization once their analytics needs are served. For example, in the table above, if you think that the Sales team will be able to close more deals if they get the insights faster, then you should focus on the Sales team.
Data available for analysis: The availability of data for conducting the analysis is the last factor which plays a critical role in deciding the target users and the use-cases.

Sample data set

For the purpose of this example, a quick glance at the Sample Salesforce dataset reveals that the persona we should target is Sales Leadership, and the use-cases we can target are Sales Pipeline & Performance Analysis.

Step 2: Identify the user needs

Once you have identified the target user and the important use cases, the next step is to create a list of questions these users will be interested in. We recommend you to identify the various question groups and then list down a few questions users may have for each of these question groups. For example, the table below provides a quick starting point.

Use case table

In order to estimate the amount of training required for Spotter please make sure that you are doing a good job of ensuring the following things : Question Groups are exhaustive to cater all the different variations of the questions users may ask. Questions selected under each question group cover the various ways in which business users may end up asking the questions to Spotter. It is not necessary to create an exhaustive list of questions for each group, however, we recommend that you try to cover the different variations users may use while asking the questions.

Once you have identified the list of questions, you can use the guidelines listed above for reference questions and business terms to identify the key business terms and reference questions you should add. Conducting this analysis will help you answer questions like, what training should I provide to Spotter, and how much training is needed for my use case.

Step 3: Train Spotter for the identified users and needs

Let’s use a sample question to understand the training process for Spotter.

Question: What is the revenue generated this quarter?

You can start using the Add Coaching option on top of the Reference question page to start the training. Select the (Sample) Salesforce data source and add this question into the training experience.

Ask a question to coach Spotter

Once you have entered the question, press Next. Once you press Next, we generate the answer for this question. The answer generated by AI looks reasonable, but in order for an amount to qualify as revenue, we have to apply a filter on “Status = booked”.

Spotter question revenue

We can modify the query to add the required filter. Once it’s done, we can click on Done to save this question as a reference question.

spotter coach status

After the reference question and its answer are submitted, we can proceed to the next step for reviewing the business terms.

The system has suggested three business terms. The first and last business terms seem meaningful and will have the same meaning when used in different contexts, hence these are ideal candidates for business terms. However, we can see that the system has not picked up the filter for “Status = booked” in both these cases. Hence, we should use the X option to add these business terms. Selecting X ensures that the business terms are added in the business terms list but they are not available for use yet.

Review business terms

We can now navigate to the business term section and identify “the revenue” and click on it to open it for editing. In this experience, we can change the business term to “revenue” instead of “the revenue” and modify the mapped search tokens to include the filter on “Status=booked”. After making these changes, we can click on Submit to save the business term with the modified definition.

Fix business term

After submitting the correct definition, you may also need to change access to Global so that the training is applied for all users.
We generally recommend that you avoid training only for the date terms business terms because in a different context, “this quarter” may have to use a different date column. However, one way to overcome this limitation is to train date in association with a measure.
For example, the third fragment, “revenue this quarter”, uses a combination of a measure and date column in the business term. Such a combination ensures that business terms defined will continue to hold the same meaning in different contexts. This can be an effective way to coach Spotter on which date columns to use with specific measures if there are too many date columns.
Once you have trained for a business term, try asking a few other questions in your list to evaluate whether the system is able to apply the learnings.

Training Spotter for complex questions

Let’s see the approach to train Spotter for more complex business terms like “Win rate”. The “win rate” is defined as the ratio of the booked deals by the total number of deals. However, while calculating the total numbers of deals, we have to exclude the open deals.

Keep the following guidelines in mind while training Spotter in such cases :

Complex computations can be broken into a combination of simple computations: Create complex formulas using smaller, simpler ones with very clear names while training/fixing the AI-generated answer. This way, LLMs will generalize them better to similar queries.
Provide human-friendly names for the formula: Create formulas with clear names so that the LLM understands the computation process and it can generalize these computations in various scenarios.

For example, to compute win rate, you can create the following three formulas :

Total concluded deals: Count the total number of deals whose status is not Open.
Deals won: Count the number of deals won.
Win rate: Deals won divided by Total concluded deals.

Total concluded deals formula

Deals won formula

Win rate formula

Once these formula definitions are defined, you can correct the generated answer for the whole query and then submit.

Win rate question

Then on the Review business terms screen, accept the definition of win rate.

Business term win rate

Once you have completed this exercise, Spotter has learned how to compute the win rate. Try a few of the other sample questions to validate that the training provided to Spotter is working for the remaining set of queries.

What should I do if the training provided doesn’t work for similar queries?

In case you come across scenarios where you are trying to define a specific business term but Spotter is not applying the same to other business terms, don’t try to train Spotter for more queries. In general, Spotter is sensitive to the training provided and one to two examples should be sufficient to train Spotter on a specific concept. However, if after providing one to two examples, Spotter is still not applying the training to other queries, try checking the following things:

Reference questions are exactly applied to the same query only: If the question framing is changed slightly then the query feedback isn’t applied as is. However, it may still be used as a template for answering the question. In such cases, the visualization setting may not get applied and the answer generated by Spotter may be different as well if the interpretation of the new question can be different from the one used for training.
Business terms are only applied if the exact same term is used in the question: Business terms are only selected when the exact same spelling is used in the questions asked. This means that if you have trained Spotter for “win rate” it may not apply to “win frequency” ,”win ratio”, “winning-rate”, etc..

In cases where you are unable to train Spotter after providing one to two examples, please seek help from our experts instead of adding more training examples for the same thing.

Step 4: Validate the training

We recommend two steps for validating the training provided:

Validate with a set of test queries: Make sure at step 2, you are also identifying a set of questions from each question group which can be used to test how well Spotter is working for the use case. These questions can be used for evaluating the accuracy of Spotter after providing new training to Spotter.
Validate with early adopters: Identify a few early adopters of Spotter from the target user group and try to work with them to test the accuracy of Spotter. More often than not, you will uncover new questions from these users that can then be added to the questions identified in step 2. A couple of iterations with these users will help you make the training more robust before rolling out Spotter to all the users in the target audience.

Was this page helpful?Give us feedback!

ThoughtSpot is the Modern Analytics Cloud company. Our Live Analytics services deliver personalized, actionable insights at the point of impact for every user, at every level.

Product

By Role

By Department

Solutions

About Us

Connect

(800) 508-7008