Documentation Index

Fetch the complete documentation index at: https://help.dialpad.com/llms.txt

Use this file to discover all available pages before exploring further.

AI Scorecards

Prev Next

Let’s talk Quality Assurance (QA). Continuous coaching and performance monitoring drive better customer service, client retention, and employee motivation.

AI Scorecards automate call grading with objective, actionable insights—no manual effort required. With ready-to-use templates, you can get started instantly, all within the Dialpad app!

These include LLM-powered scorecard questions that use a more intelligent, contextual grading model.

Let's dive into the details.

Who can use this

AI Scorecards can be used in Contact Centers and Coaching Teams for users with Dialpad Support and Dialpad Sell accounts.

Contact us today to enable AI Scorecards for your team (an additional fee applies).

Create a scorecard

Create a traditional scorecard or add AI Questions to your scorecard. Using AI Questions automates call grading, reducing the time managers spend manually reviewing calls or shadowing agents.

To create an AI Scorecard, go to your Dialpad Admin Settings.

  1. Select AI Scorecards

  2. Select Create Scorecard

  3. Enter the name and description of your scorecard

    Note

    Every scorecard must have a unique name.

  4. Select which Contact Center(s) or Coaching Team(s) can use the scorecard

    Note

    Unless specified, a scorecard is assigned to all Contact Centers and Coaching Teams.

  5. Choose how you want the scorecard to be graded

    Grade by AI

    All calls are fully automated and graded by AI.

    Suggestions by AI

    AI will grade the call, but a supervisor will still need to review the grade and submit the final score.

    AI off (Manual grade)

    No AI assistance, the supervisor will need to manually review and grade each scorecard.

  6. Select the AI Questions you want to use

    Custom LLM questions

    When you create a new AI question for an AI Automated or AI Suggested scorecard, that question is created as a custom LLM question. These questions use a more advanced large language model and do not require trigger words or phrases in order to work.

    Note

    Trigger words and phrases have not been removed from question creation. They are now optional and can still be added to support more targeted detection.

    If you add trigger words to a custom LLM question, Dialpad uses both the LLM model and the trigger word model when evaluating the question. If you do not add trigger words, only the LLM model is used.

    Rewrite suggestions and clearer question setup

    The Rewrite option can suggest improved wording for a custom LLM question to help improve grading accuracy. Rewrite suggestions can also appear automatically if the system detects that the question may be unclear or could be improved.

    A support interface displaying questions and response options for customer identity confirmation.

    If a question is unintelligible or uses unclear internal jargon or acronyms, Dialpad warns the scorecard creator so they can revise the question before publishing. If the warning is ignored, the question can still be published, but grading quality may be reduced.

    Question about agent confirming customer's EAP information with response options and notes.

    Note

    If AI is off, you cannot add AI Questions.

  7. Select Create scorecard

  8. Review each question and edit as needed. You can:

    • Change the points allocated to each outcome.

    • Add a comment field and make it mandatory.

    • Allow a question to be skipped.

    • Automatically fail an entire scorecard for a certain response.

  9. Add additional questions as needed

    • To add more AI templated questions, select Add AI Questions

    • To create your own custom LLM question, select Create new question

      Mandatory question content

      Every question must have the following components:

      • Question title:  Question or statement to be considered when grading the call

      • Response type: Yes/no, agreement (Likert), frequency (Likert), importance (Likert), likelihood (Likert), or quality (Likert)

        • AI questions can only be yes/no. If you choose a different response type, the question will revert to a traditional (non-AI) question and will not be scored by AI.

        • AI Automated Scorecards provide a 'Yes' or 'No' answer, while AI Suggested Scorecards only give a 'Yes.' If no answer is detected, the grader scores it manually.

        • Responses are auto-filled to match the response type chosen but are customizable.

      • Points: Point value given for each response

      Optional question content

      Get granular by adding any of the following optional question features:

      • Add a comment: Creates an open text box allowing graders to add comments.

        • A comment field can be added for each question — all grading comments appear on the detailed scorecard exports.

        • The comment field can be mandatory or optional.

      • Allow question to be skipped: Marks the question as N/A without affecting the agent's overall score.

        • N/A responses now apply to all customer-created LLM questions, not just prebuilt or templated AI questions.

        • When you enable Allow question to be skipped, the When to skip option appears. Use this field to describe the specific situations where the question should be marked N/A so the model has enough context to make a better grading decision.

      • Automatically fail entire scorecard for certain responses: When selected, this feature automatically fails the entire scorecard, giving the agent a 0% grade.

        • A scorecard can have multiple questions marked to be automatically failed.

      • Save question as a template: Saves the AI question, allowing you to re-use it in different scorecards

  10. Once you've finished adding and editing questions, select Publish

  11. Select Yes, publish now to complete the scorecard creation

Graded scorecard answer with context

When Yes/No questions are graded by AI, you’ll see reasons provided for why these answers were graded as they were.

Note

Scorecard answers with context will only apply to new calls and will not be applied to calls already graded.

Customer service evaluation showing agent's performance and feedback on call handling.

To view a reason for a answers grade, hover over the answer and a context bubble displays with the reason.

Conversation summary with agent's responses and grading on customer service questions.

When the AI determines a question can’t be given a proper answer or didn’t detect an answer, it automatically answers No for that question.

A customer service interaction with questions about account access and agent responses.

When a question is marked as a question the AI can skip, a message is shown in that question explaining why it was skipped.

A customer service interaction with questions and grading feedback displayed on the screen.

Best practices for writing LLM questions

LLM questions are only as good as the context you provide. The clearer and more specific the evaluation criteria, the better the grading results tend to be.

To write stronger LLM questions:

  • Be explicit about what the question is checking and what should count as Yes, No, or N/A.

  • Use the When to skip field to explain when a question should be marked N/A.

  • Prefer observable behavior over vague or subjective wording.

  • Avoid internal shorthand and acronyms unless you also define them clearly.

Below are some examples of weak questions and strong questions:

Weak Questions

Strong Questions

Did the agent verify the appointment?

Did the agent confirm the appointment date and time with the customer?

Mark N/A if no appointment or meeting was discussed during the conversation.

Did the agent resolve the issue?

Did the agent provide a clear resolution or confirmed next action for the customer’s stated issue?

Mark N/A if the caller was a wrong number, the call ended before troubleshooting began, or no resolution was required for the interaction.

Multiple choice questions

Multiple choice questions must be graded manually and cannot be added to an AI Automated scorecard.

Within the response dropdown, scorecard creators can select multiple choice as a new response type.

How multiple choice questions work:

  • Each multiple choice question can include up to 10 total response options.

    • If a "Skip" option is included, it counts as one of the 10, allowing up to 9 additional custom response choices.

  • You can assign a maximum score to the question, but the individual response values do not need to add up to that score.

    • Multiple choice questions are designed to support more than one correct answer, which allows for different combinations of responses to reach the full score.

    • The system calculates a proportional score, meaning the agent’s score for that question will reflect the percentage of correct answers chosen—up to 100%.

Non-score impacting follow up questions

Use follow up questions to provide more clarity and context within scorecards. These non-score impacting questions enable graders to capture more detailed feedback when a primary question warrants additional explanation.

Each follow-up question can include:

  • A title

  • A response type:

    • Single Selection

    • Multiple Choice

    • Open Response

Change scorecard grading method

While the scorecard is in draft mode, you can change between the three different grading options.

To change a grading type:

  1. Select the scorecard (remember, it must be in draft mode)

  2. Select the current grading method

  3. Choose the new grading method from the dropdown

  4. Select Save

Notes

If you change the grade type from Grade by AI to Suggestions by AI, the scorecard will still use AI features, but it will only select a Yes response instead of Yes or No during the call. The graders will then need to submit the final grade manually.

If the scorecard grade type is changed from Grade by AI to AI off (manual grade), the scorecard will lose all AI functionality, and all questions must be answered manually by the supervisor or grader. If a scorecard with AI questions is switched to AI off (manual grade), you can change it back to Suggestions by AI again.

However, if you want to change the scorecard back to Grade by AI, you'll need to change all question response types to Yes or No and re-add any trigger words or phrases, as these are removed when you switch to AI off (manual grade).

LLM outage mitigation

If Dialpad detects an outage or missing grading events from the LLM provider, it avoids submitting an incorrect 0% grade. Instead, the call is left without a submitted grade so a supervisor or admin can review it manually.

Edit trigger words and phrases

The more a scorecard is used, you may realize you forgot a few trigger words and phrases. Don’t worry, these can be updated.

Note

Trigger words are still supported with the introduction of LLM questions and are not going away. Existing published scorecards can continue using them, and no action is required to keep them active.

To edit trigger words and phrases, go to your Dialpad Admin Settings.

  1. Select AI Scorecards

  2. Select AI Questions

  3. Beside the questions, select Options

  4. Select Edit
    View of the AI Scorecards page. The option to edit a scorecard is highlighted.

  5. Add new phrases or trigger words, or delete unwanted ones

  6. Select Save

Edit scorecard settings

You can edit scorecard settings for new or existing scorecards. Choose whether to grade calls based on call direction (inbound, outbound, or both) and call duration, providing you more control over which calls are graded.

Note

This applies only to AI Automated Scorecards, it does not apply to assisted or manual scorecards.

To edit the settings of a scorecard, go to your Dialpad Admin Settings.

  1. Select AI Scorecards

  2. Select an existing scorecard or create a scorecard

  3. Select the Settings icon

    View of a scorecard settings page. The Settings panel displays options to configure call length and direction for grading calls.

  4. In the Settings panel, configure these settings:

    Minimum length to grade

    The minimum length of a call that will be graded. The minimum length is 30 seconds.

    Note

    30 seconds is the default length. You can also add a maximum call length to be graded. The maximum call length is 99 minutes 59 seconds.

    Calls to grade

    The direction of the call that will be graded. You can grade calls for:

    • Inbound only

    • Outbound only

    • Inbound and outbound

  5. Select Save

  6. Select Publish

Archive a scorecard

Workflows adapt, and your scorecard needs will change as your company grows. AI Scorecards can be archived, which maintains the card's analytics data.

To archive a scorecard, go to your Dialpad Admin Settings.

  1. Navigate to AI Scorecards

  2. Beside the scorecard, select Options

  3. Select Archive
    View of the AI Scorecards page. The option to archive a scorecard is highlighted.

  4. When prompted, select Yes, Archive — remember this action is not reversible

Finding archived scorecards

Archived scorecards can be viewed at any time using the Archived filter.

View of the AI Scorecards page. The option to filter scorecards by archived status is highlighted.

Duplicate a scorecard

Need to create a copy of a scorecard? No problem.

To duplicate an AI Scorecard, go to your Dialpad Admin Settings.

  1. Select AI Scorecards

  2. Beside the scorecard, select Options

  3. Select Duplicate
    View of the AI Scorecards page. The option to duplicate an existing scorecard is highlighted.

  4. Review the copy of the scorecard, then select Publish

  5. When prompted, select Yes, publish now

Digital Scorecards

Digital Scorecards let supervisors and admins manually evaluate digital interactions across any supported channel in Dialpad. While AI grading isn’t available yet, this feature helps teams assess agent performance and maintain quality standards.

Learn more.

Frequently asked questions

Are LLM questions available to all customers?

AI Scorecards are available for Dialpad Support and Dialpad Sell customers, but custom LLM scorecard questions are currently supported in English only. Other languages are supported but are currently only available in an EAP. Please reach out to Dialpad Customer Care or your Account Manager for more information.

Do I still need trigger words and phrases?

No. Trigger words and phrases are now optional for custom LLM questions. You can still use them if you want to detect very specific wording or phrases in a conversation.

What languages are supported for custom LLM questions?

English is currently supported for custom LLM questions. Additional languages are expected later.

What happens to my scorecards that don’t use LLM questions?

Your existing scorecards continue to work, and trigger words remain active. This is so customers can benefit from the improved LLM model without needing to rebuild their scorecards from scratch.

What if the model can’t determine whether a question should be graded?

If the question is unclear, it will mark the question as No.

Use clear question wording and include When to skip guidance when appropriate. If the model can’t grade because of poor wording, vague criteria, or unclear jargon, you may see warnings during setup so you can improve the question before publishing.

More information

For more information on using AI Scorecards and the granular data they provide, be sure to read these Help Center articles.

Learn more

Check out Dialpad University's free courses covering our products and services, as well as insightful tips and best practices.