Skip to main content

Blog  ·  Case Study  ·  Updated June 10, 2026

SEO Audit Case Studies: What We Find When We Look Under the Hood

Every business owner thinks their site is "basically fine." Then we run the crawl. Here is what a real site audit actually surfaces, the patterns we see in market after market, and how to read your own results.

We have run hundreds of SEO audits across plumbers, dentists, law firms, laundromats, process servers, flooring directories, and our own properties. The single most consistent finding is this: the problems holding a site back are almost never the ones the owner expected. People worry about the color of a button while Google quietly fails to index a third of their pages. This article is a tour of what a genuine site audit reveals once you stop guessing and start measuring.

To be clear about what this page is and is not: we are not going to invent a client, paste a fake "ranked #1 in 30 days" screenshot, or quote before-and-after numbers we made up. Instead we are going to describe honestly the categories of findings that show up over and over, drawn from real audits and a study of 228 SEO reports across multiple niches. If you want your own numbers, the place to get them is a free site audit on your actual domain. Patterns are useful; your data is the truth.

The Four Buckets Every Audit Falls Into

It helps to organize findings the way we organize the audit itself. Almost everything we surface lands in one of four buckets: crawl and indexing, keywords and content, on-page and technical, and the Google Business Profile. A finding in one bucket frequently explains a symptom in another. A page that will not rank for its target keyword is often not a content problem at all; it is an indexing problem two buckets over. Auditing in this structured order is how you find root causes instead of chasing symptoms.

Bucket One: Crawl and Indexing

This is where the biggest, most invisible wins hide. Before a page can rank it has to be crawled, and before it ranks well it has to be indexed cleanly under the URL you actually want. The most common failure we find through crawl analysis and Google Search Console is a quiet mismatch between the URL your sitemap advertises and the URL your server actually serves.

The trailing-slash and canonical traps

On one audit of a multi-page service site, dozens of URLs were listed in the sitemap without a trailing slash while the server forced a redirect to the slash version. Google sees the sitemap URL, requests it, gets a 301, and logs the original as "not indexed." Multiply that across a silo of pages and a meaningful slice of the site is effectively missing from the index. The same family of problems includes canonical tags pointing at the wrong version of a page, redirect chains that lose authority on every hop, and pages set to noindex long ago and never reverted. These are the kinds of issues indexing diagnostics in Search Console exist to catch.

What we actually pull from Search Console

When you have access to it, the GSC Coverage and Pages reports are the most honest mirror a site has. We read them for the count of valid versus excluded URLs, the specific exclusion reasons (crawled-not-indexed, discovered-not-indexed, redirect, duplicate-without-canonical), and the gap between what your sitemap submits and what Google actually keeps. On a recent property we diagnosed 88 affected URLs and traced them back to just six root causes. That ratio is typical: a handful of systemic mistakes generate a long, scary-looking list of symptoms.

The free reconnaissance trick

You do not always need GSC access to start. Google's site: operator returns the URLs Google has indexed for a domain plus a rough total. Comparing that count against your real page inventory is the fastest free way to spot a wholesale indexing gap. It is reconnaissance, not gospel, but it tells you in thirty seconds whether you have a crawl problem worth investigating.

Bucket Two: Keyword Gaps and Content

Once the pages are actually in the index, the question becomes whether they target the right things. The most common content finding is not thin writing; it is missing pages entirely. A business offers eight services and has three pages. The five unaddressed services represent pure keyword gaps — searches the business could rank for but has given Google nothing to rank.

How we find the gaps

We reverse-engineer what is already winning. Open the search results for a target keyword, look at the businesses ranking, and measure what they do that you do not. The pattern repeats across trades: the top-ranking profiles and pages address more of the long-tail, list individual services rather than lumping them under one generic label, and answer the questions searchers actually type. The gap between their coverage and yours is your roadmap.

Hidden content versus visible content

One counterintuitive finding from analyzing pages that rank: much of the SEO content on a high-ranking page is invisible to a casual reader. Structured data, descriptive alt text, semantically rich headings, and tightly scoped service sections do heavy lifting without producing a wall of text. A site can carry far more SEO signal than a competitor while looking cleaner, not busier. Audits that only eyeball the visible copy miss this entirely.

Bucket Three: On-Page and Technical SEO

This bucket is where the technical SEO findings cluster, and it is the one most owners assume is fine. It usually is not. Across our audits the recurring on-page problems are remarkably consistent:

Why orphan pages matter more than they look

An orphan page is a page that exists but is not linked from anywhere in your own navigation or body content. It is the technical-SEO equivalent of a store with no doors. Search engines lean heavily on internal links to discover and weight pages, so an orphaned page is both hard to find and starved of the internal authority your linked pages pass around. We find them constantly on sites that were built page-by-page over years without anyone maintaining the link graph. The fix is cheap; the finding is easy to miss without a full crawl.

The signal hiding in HTML tags

In our study of 228 SEO reports across multiple niches, the single strongest on-page signal was the variety of HTML tags used on a page (Spearman r = -0.26) — not a tactic any generic checklist emphasizes. Pages that use a fuller range of semantic markup tend to outperform pages built from a handful of divs. It is the kind of finding you only get from measuring across many sites, and it is why we benchmark rather than rely on memorized "best practices."

A note on the myths we audit against

Plenty of audit "findings" in this industry are wrong. Multiple H1s are fine. Meta descriptions are not capped at 160 characters. nofollow is a hint, not a death sentence, and has been since 2019. Unnecessary redirects hurt more than they help. We flag genuine problems and refuse to manufacture busywork out of folklore, because a checklist full of non-issues just buries the findings that actually move rankings.

The tools behind the crawl

The findings in this bucket come from a headless crawler that walks every internal link, records the status code and final URL for each, extracts the title, meta description, canonical, and heading structure, and validates every block of schema markup against the schema.org types it claims. Pairing that crawl with the live index from the site: operator and the exclusion reasons in Google Search Console is what lets us separate a page that is broken from a page that is simply not yet discovered. The crawler is also how we catch the unglamorous regressions that creep in over time — a robots.txt that started returning a 404, a sitemap that drifted out of sync with the live URL structure, favicons that quietly broke during a redesign. Individually these are small; together they tell Google a site is unmaintained, and unmaintained sites lose the benefit of the doubt on every other signal.

Page speed and core web vitals in context

Speed shows up in nearly every audit report, and it matters, but it is rarely the lever that unblocks rankings. We measure it, flag genuinely slow pages, and put it in its place on the priority list. A site that loads in a quarter of a second but has a third of its pages excluded from the index does not have a speed problem; it has an indexing problem wearing a speed problem's clothes. The discipline of an audit is refusing to let an easy-to-measure metric jump the queue ahead of a harder-to-measure one that actually controls visibility.

See your own technical findings

The fastest way to know which of these problems your site has is to run them against your real domain. Our free site audit checks crawlability, indexing signals, on-page tags, and schema in one pass.

Run a Free Site Audit See Our Audit Service

Bucket Four: The Google Business Profile

For local businesses the Google Business Profile is often the single biggest driver of visibility, and it is also the most consistently under-built asset we audit. The map pack sits above the regular results, so a weak profile caps your ceiling no matter how good the website is.

The GBP gaps we see in nearly every audit

When we scrape a profile and benchmark it against the local winners, the same shortfalls appear again and again:

None of these require new software or budget. They require someone to actually look, benchmark against the market, and work the list in priority order.

How We Run an Audit, Step by Step

The reason the findings above are reliable is that the process behind them is the same every time. Here is the pipeline we run, whether it is one location or a multi-location brand:

  1. Data collection. Pull a full Google Business Profile intelligence scrape on the top competitors and run a headless crawl of their sites for broken links, missing titles, schema markup, and page speed.
  2. Indexing review. Read Google Search Console coverage, compare the index against the sitemap, and run crawl analysis to find orphaned, redirected, and excluded URLs.
  3. Benchmarking. Measure your numbers against the niche — word counts, heading structure, review counts, photo counts, services listed, and structured-data completeness — to find the real bar in your market.
  4. Gap planning. Turn the difference between where you are and where the winners are into a prioritized action list, hardest-hitting items first.

That last step is the whole point. An audit that produces a 200-item undifferentiated list is not an audit; it is a way to look thorough while being useless. The value is in the ordering — knowing that for this business, fixing the indexing trap matters more this month than rewriting a homepage that already ranks.

How to Read Your Own Audit Without Panicking

If you run a tool against your own site, the report can look terrifying. A few things to keep in mind so you act on the right items:

The Bottom Line

An audit is not a verdict on your site; it is a map of where the leverage is. The recurring story across every market we have worked is the same: a small number of systemic indexing and technical SEO issues, a handful of obvious keyword gaps, a link graph with a few orphan pages, and a Google Business Profile built to maybe a third of its potential. None of it is exotic. All of it is findable. The businesses that pull ahead are simply the ones who looked, measured against their actual competitors, and worked the list in order.

If you want that map for your own site, start with a free site audit, or look at how the full SEO audit service turns these findings into a ranked, executed plan.

SEO Audit FAQ

What does an SEO audit actually check?

A real audit covers four areas: crawl and indexing health through Google Search Console and crawl analysis, keyword and content gaps, on-page and technical SEO issues like broken schema markup and orphan pages, and the completeness of your Google Business Profile. The goal is a prioritized list of root causes, not a 200-item dump of symptoms.

Why are some of my pages not indexed by Google?

The most common cause is a mismatch between the URL in your sitemap and the URL your server serves, such as a trailing-slash redirect that makes Google log the page as excluded. Other frequent causes are wrong canonical tags, leftover noindex directives, and redirect chains. Search Console's coverage report names the specific exclusion reason for each URL.

What is an orphan page and why does it hurt rankings?

An orphan page exists on your site but has no internal links pointing to it, so search engines struggle to discover it and it receives none of the internal authority your linked pages pass around. Orphan pages are common on sites built page-by-page over years, and they are easy to miss without a full crawl. The fix is usually just adding internal links.

Do I need a paid audit, or is a free one enough?

A free site audit is enough to surface the big systemic problems — indexing gaps, missing tags, broken schema markup, and obvious keyword gaps. A full paid audit adds competitor benchmarking, a Google Business Profile intelligence scrape, and a prioritized, executed action plan. Start free to see where you stand, then decide.

Want to Know What's Hiding Under Your Hood?

Run a free crawl of your own domain, or let us run the full audit and hand you a ranked plan.

Run a Free Site Audit See the Audit Service