Glossary · 2 min read

XML sitemap

A structured file listing your site's important URLs so crawlers can discover them, along with hints about when each page last changed.

You submitted a sitemap and felt done. Here is the catch: a sitemap tells a crawler where your pages are, not whether any of them can actually be read once it gets there.

An XML sitemap solves discovery. It does nothing about legibility, and the two get confused constantly.

What an XML sitemap is

An XML sitemap is a structured file, usually at yoursite.com/sitemap.xml, that lists the URLs you want crawlers to find. It often includes a last-modified date so a crawler knows which pages changed recently.

It is a directory, not a doorway. It helps a crawler discover pages instead of relying only on following links from one page to the next. That is genuinely useful on a large or loosely linked site.

Old way, new way

The old way: you maintained a sitemap so Google could find every page and index your links for human searchers.

The new way: the crawlers that read your sitemap also feed AI engines. Good discovery means your pages enter the candidate pool an engine reads from. But entering the pool is not being used. The sitemap gets you found, not chosen.

What a sitemap cannot do

A sitemap is easy to overrate. It does not:

  • Force a crawler to fetch a listed URL.
  • Make a listed page readable if the answer is buried or unlabeled.
  • Help if the URLs it lists are broken, redirected, or blocked by robots.txt.
  • Promise an engine will surface or cite anything it points to.

A tidy sitemap full of unreadable pages is a clean index of pages a machine still cannot use.

The damaging admission

Citedon does not treat a present sitemap as a passing grade. Discovery is the easy part. The hard part is whether the discovered pages read cleanly to a machine, and a sitemap says nothing about that.

We will tell you when your real problem is not discovery at all but the pages themselves, even though that is the harder thing to fix.

How to check yours

Take a page your sitemap lists and ignore the fact that it is listed. Ask the only question that matters next: if a machine fetched this exact URL, could it parse a clear, labeled answer. Discovery without readability is a map to a locked room.

Run a free scan on that URL to see how an engine reads it, or read the guide on how to fix crawlability issues.

See whether the pages in your sitemap actually read cleanly, free.
Run a free scan. No signup. You get a readiness score and the gaps to fix, in about a minute.