sitemap.xml Guide

A sitemap.xml file is a discovery file. It tells search engines which public URLs you want them to know about. It is not a ranking guarantee and it does not automatically make low-quality or blocked pages indexable.

Good sitemap habits

  • List canonical public URLs only.
  • Use one host consistently: either www or non-www.
  • Avoid visible index.html URLs when the canonical page is the folder URL.
  • Remove deleted, redirected, noindex, private, login-only, staging, or blocked URLs.
  • Use lastmod only when a page meaningfully changed.
  • Reference the sitemap from robots.txt and submit it in Google Search Console and Bing Webmaster Tools.

Common sitemap problems

Mixed hosts, duplicate trailing-slash versions, redirected URLs, old pages, noindex pages, and mechanically updated lastmod dates can make the sitemap less trustworthy as a discovery aid.

Official references

Google Search Central explains sitemap basics in its sitemap overview and build a sitemap guidance.

Important limitation

This guide is a practical publishing checklist. It does not guarantee indexing, ranking, AdSense approval, legal compliance, accessibility certification, or security. Always confirm current requirements in the relevant official tools and policy pages.