How to Build a Vendor Reliability Scorecard

A step-by-step guide to building a vendor reliability scorecard. Includes metrics, scoring methodology, a spreadsheet template, and a process for quarterly reviews.

You probably have opinions about which vendors are reliable and which are not. But opinions are not data. When it comes time to renew a contract, evaluate a new vendor, or explain to leadership why you need to migrate off a service, you need something more concrete than "it feels like they go down a lot."

A vendor reliability scorecard gives you that data. It is a structured, repeatable way to track how your SaaS vendors perform over time across the metrics that actually matter to your business. This guide walks you through why you need one, what to measure, how to score it, and how to use it for real decisions.

Why Track Vendor Reliability

Most teams do not track vendor reliability at all. They sign up for a service, use it until something breaks, and then react. The problems with this approach compound over time.

You lose negotiating leverage. When a vendor contract is up for renewal and you have six months of documented reliability data showing they missed their SLA three times, you have grounds to negotiate better terms or credits. Without data, you have nothing.

You make decisions based on recency. A vendor that had one bad week sticks in your memory more than a vendor that was quietly reliable for 11 months. A scorecard corrects for this bias by showing the full picture.

You cannot compare vendors objectively. If you are evaluating whether to stay with your current email provider or switch to a competitor, you need a consistent framework. Our guide on evaluating SaaS vendor reliability covers the initial evaluation. The scorecard extends that evaluation into an ongoing process.

You miss slow degradation. A vendor that goes from 99.95% uptime to 99.8% over the course of a year is getting worse, but the decline is gradual enough that you might not notice without tracking it.

What Metrics to Include

A useful scorecard covers five dimensions. Each one tells you something different about how well a vendor serves your business.

Uptime Percentage

This is the most straightforward metric. What percentage of the time was the vendor available during the measurement period?

Use data from your own monitoring tools, not just the vendor's self-reported status page. Third-party monitoring through tools like Is That Down gives you an independent view. For context on why self-reported and observed uptime often differ, see our guide on what vendor monitoring is.

Track uptime monthly and roll it up quarterly. A vendor might hit 99.9% in two out of three months but have a catastrophic week that drags their quarterly average down to 99.5%. Both the monthly and quarterly views are useful.

Incident Frequency

How many incidents did the vendor have during the measurement period? Count every event that appeared on their status page, whether it was a full outage, partial degradation, or elevated error rates.

Incident frequency matters because a vendor with 99.9% uptime and one long outage has a very different risk profile than a vendor with 99.9% uptime and twelve short outages. The first suggests a rare but serious failure. The second suggests systemic instability.

Mean Time to Resolve (MTTR)

When incidents happen, how long does it take the vendor to fix them? Calculate the average time between when an incident is first reported and when the vendor confirms resolution.

Track the average and the worst case. An average MTTR of 30 minutes is good, but if one incident lasted 8 hours, that worst-case number matters more for your contingency planning.

Communication Quality

This is the most subjective metric, but it is arguably the most important. When something goes wrong, does the vendor tell you what is happening?

Score communication quality on these factors:

  • Speed: How quickly did the vendor acknowledge the incident after it started?
  • Clarity: Were the status updates specific enough to understand what was affected and what was being done?
  • Frequency: Did the vendor provide regular updates during the incident, or did they go silent for long stretches?
  • Follow-up: Did the vendor publish a post-incident review with root cause and remediation steps?

A vendor with occasional outages and excellent communication is often a better partner than a vendor with fewer outages who goes dark when things break.

SLA Compliance

If the vendor has a published SLA, did they meet it? Track whether their actual uptime exceeded their committed uptime for each measurement period.

Also track whether SLA credits were claimed and received. Many teams are entitled to service credits after SLA breaches but never file the claim. According to Gartner's research on vendor management, a significant majority of eligible SLA credits go unclaimed simply because nobody tracks the breach.

Building the Scorecard

Here is a practical scorecard template you can adapt for your own use. Start with a spreadsheet and graduate to a more formal tool if your vendor list grows large enough to warrant it.

Vendor Reliability Scorecard Template

| Metric | Weight | Score (1-5) | Weighted Score | |--------|--------|-------------|---------------| | Uptime % | 30% | | | | Incident Frequency | 20% | | | | MTTR | 20% | | | | Communication Quality | 15% | | | | SLA Compliance | 15% | | | | Total | 100% | | [sum] |

Scoring Scale

Use a consistent 1-to-5 scale for each metric. Here is a reference for how to assign scores.

Uptime Percentage:

  • 5 = 99.99% or higher
  • 4 = 99.95% to 99.98%
  • 3 = 99.9% to 99.94%
  • 2 = 99.5% to 99.89%
  • 1 = Below 99.5%

Incident Frequency (per quarter):

  • 5 = 0 to 1 incidents
  • 4 = 2 to 3 incidents
  • 3 = 4 to 6 incidents
  • 2 = 7 to 10 incidents
  • 1 = More than 10 incidents

MTTR:

  • 5 = Average under 15 minutes
  • 4 = Average 15 to 30 minutes
  • 3 = Average 30 to 60 minutes
  • 2 = Average 1 to 4 hours
  • 1 = Average over 4 hours

Communication Quality:

  • 5 = Fast acknowledgment, clear updates, regular cadence, published post-mortems
  • 4 = Good communication with minor gaps
  • 3 = Adequate but slow or infrequent updates
  • 2 = Minimal communication, vague updates
  • 1 = No communication or actively misleading status

SLA Compliance:

  • 5 = SLA met every month, no breaches
  • 4 = One minor SLA breach, credits provided promptly
  • 3 = One significant breach or multiple minor breaches
  • 2 = Multiple significant breaches, credits required claiming
  • 1 = SLA consistently missed, no remediation offered

Weighting

The weights in the template above reflect a common prioritization, but you should adjust them based on what matters most to your business. If you are in e-commerce and payment processing uptime is everything, weight uptime higher. If you operate in a regulated industry where vendor communication is a compliance requirement, weight communication quality higher.

The key is to set the weights before you start scoring, not after. Adjusting weights after seeing the results defeats the purpose of having a structured framework.

Using the Scorecard for Vendor Decisions

A scorecard is only valuable if it drives action. Here are the ways it should inform your vendor strategy.

Renewal Decisions

Before renewing a vendor contract, pull their scorecard for the past year. A vendor consistently scoring 4 or above is performing well. A vendor scoring below 3 deserves a conversation about what needs to change, or an evaluation of alternatives.

Bring the scorecard data to your renewal negotiation. Documented reliability issues give you concrete grounds for requesting better pricing, improved SLA terms, or dedicated support.

New Vendor Evaluation

When evaluating a replacement vendor, use the same scorecard framework to assess their track record. Review their public status page history, incident reports, and SLA terms. Score them the same way you score existing vendors so the comparison is apples to apples.

Dependency Risk Assessment

Vendors with consistently low scores on your scorecard represent higher risk. Cross-reference scorecard data with your SaaS dependency map to identify where low-reliability vendors sit in your critical path. A vendor scoring 2 out of 5 that powers a non-essential internal tool is a different risk than a vendor scoring 2 out of 5 that handles your customer authentication.

Budget Justification

When you need to justify spending on monitoring tools, backup vendors, or migration projects, scorecard data makes the business case tangible. Instead of "we should probably have a backup payment processor," you can say "our primary payment processor has breached its SLA twice in the last six months and had an average MTTR of 2.5 hours, which costs us an estimated $X per incident." Our analysis of the real cost of vendor downtime can help you attach dollar figures to reliability scores.

Running Quarterly Reviews

The scorecard works best as part of a regular review cadence. Quarterly is the right frequency for most teams. Monthly is too often to show meaningful trends. Annually is too slow to catch problems before they compound.

What to Cover in a Quarterly Review

Score each vendor. Gather the data for the quarter, calculate scores, and update the scorecard.

Compare to previous quarters. Look for trends. Is a vendor improving or declining? A vendor that scored 4.2 last quarter and 3.1 this quarter needs attention even if 3.1 is technically "acceptable."

Review incidents. For any vendor that had significant incidents during the quarter, review what happened and how it affected your business. Reference your incident logs and any communications sent using your outage response playbook.

Identify action items. Decide what, if anything, needs to change. Do you need to have a conversation with a vendor? Start evaluating alternatives? Add a backup for a critical service? Invest in better monitoring?

Update your vendor list. Add any new vendors you started using during the quarter. Remove any you discontinued.

Quarterly Review Agenda

  1. Review and finalize scorecard data for each vendor (15 min)
  2. Identify vendors with declining scores or new SLA breaches (10 min)
  3. Discuss incidents that affected the business during the quarter (15 min)
  4. Decide on action items: vendor conversations, evaluations, or changes (10 min)
  5. Update vendor list and monitoring configuration (5 min)

Who Should Participate

The review should include whoever is responsible for vendor relationships and whoever is most affected by vendor outages. For most teams, this means someone from engineering or IT and someone from operations or product. Keep the group small. Three to five people is enough to cover the important perspectives without turning it into a committee meeting.

Getting Started

You do not need to score every vendor on day one. Start with your top five most critical dependencies. Set up a simple spreadsheet. Gather one quarter of data. Score it. See what the numbers tell you.

Over time, expand the scorecard to cover more vendors and refine your scoring criteria based on what you learn. The first version will not be perfect, and that is fine. A rough scorecard that exists is infinitely more useful than a perfect one that you never get around to building.

Pair the scorecard with automated vendor monitoring so the data collection happens continuously rather than requiring manual effort every quarter. The less work it takes to maintain, the more likely your team will keep doing it.

References

Beyond vendor monitoring, consider uptime monitoring for your own services and SSL monitoring to ensure your infrastructure health does not cloud vendor reliability assessments.

Get the reliability data your scorecard needs

Is That Down tracks vendor uptime and incident history over time, giving you the raw data to populate your reliability scorecard without manual monitoring.

Try Is That Down