Digital Marketing

XML Sitemaps: How to Build and Submit for Maximum Crawl Efficiency

XML Sitemaps: How to Build and Submit for Maximum Crawl Efficiency

An XML sitemap is a file that lists all the pages on your website. It’s not for humans—it’s for Google and other search engines. Think of it as a roadmap to your content.

For small sites (50 pages), a sitemap is nice-to-have. For large sites (1,000+ pages), it’s essential. For e-commerce and news sites with constantly changing content, it’s mandatory.

This guide explains what sitemaps are, how to build them, and how to submit them for maximum SEO impact.


What Is an XML Sitemap?

An XML sitemap is a file that lists URLs on your website in a machine-readable format. It looks like this:

xml https://yoursite.com/ 2026-04-13 weekly 1.0 https://yoursite.com/blog/article-one/ 2026-04-10 monthly 0.8

Each entry contains:

  • loc: The page URL (required)
  • lastmod: When the page was last modified (optional)
  • changefreq: How often the page changes (optional; hint only, Google may ignore)
  • priority: Relative importance (optional; all pages default to 0.5)

Why Sitemaps Matter

For large sites: Google crawls a limited number of pages per day (crawl budget). A sitemap tells Google which pages are important so it prioritises them.

For new sites: A sitemap gets your pages indexed faster. Without a sitemap, a brand-new site might take weeks to index all pages. With a sitemap, days.

For sites with poor internal linking: If some pages don’t have internal links (orphaned pages), a sitemap is the only way to tell Google they exist.

For sites with frequently updated content: A sitemap with lastmod dates tells Google when to re-crawl pages.

What Sitemaps DON’T Do

  • Sitemaps don’t guarantee indexation. Even if a page is in a sitemap, Google might not index it if it’s low-quality or blocked by robots.txt.
  • Sitemaps don’t improve rankings. A sitemap doesn’t help your content rank better; it just makes crawling more efficient.
  • Sitemaps don’t pass link juice. Listings in a sitemap don’t contribute to page authority the way internal links do.

What Should Be in Your Sitemap

Include:

  • All public, indexable pages (blog posts, product pages, category pages, main pages)

Exclude:

  • Noindex pages (pages with )
  • Thin pages (duplicate content, very short pages with little value)
  • Login pages, account pages, checkout pages (not meant to be indexed)
  • Redirect pages (301/302 redirects should not appear in sitemaps)
  • Parameterised URLs (avoid URLs with ?id=123 style parameters; use clean URLs instead)
  • Duplicate URLs (if a page is accessible via two URLs, include only the canonical version)

A bloated sitemap confuses Google. If you have 10,000 pages but 8,000 are thin/noindex, include only the 2,000 quality pages.


How to Create an XML Sitemap

Option 1: WordPress (Recommended for Most Australian Businesses)

WordPress plugins handle sitemaps automatically. Choose one:

RankMath (free version)

  1. Install RankMath plugin (search “RankMath” in Plugins > Add New)
  2. Activate
  3. Go to RankMath > Sitemap Settings
  4. Enable “Sitemap”
  5. Choose which post types to include (Posts, Pages, Products if e-commerce)
  6. Save
  7. Your sitemap is at yourdomain.com/sitemap.xml

Yoast SEO (free version)

  1. Install Yoast (search “Yoast” in Plugins > Add New)
  2. Activate
  3. Go to SEO > Tools > Sitemap
  4. Click the sitemap link—it auto-generates
  5. Sitemap is at yourdomain.com/sitemap_index.xml (index file)

Tip: Both plugins auto-exclude noindex pages. If a page is set to noindex in the plugin, it won’t appear in the sitemap.

Option 2: Non-WordPress Sites (Custom Code)

If you’re not using WordPress, generate a sitemap using:

  • Screaming Frog (free; crawl your site, export as XML sitemap)
  • XML Sitemap Generator (free online tool; xmlsitemapgenerator.org)
  • PHP script (if you have development resources; create a dynamic sitemap that queries your database)

Using Screaming Frog:

  1. Download Screaming Frog (free version)
  2. Crawl your site (mode: Crawl)
  3. After crawl completes, go to Sitemaps > XML Sitemap
  4. Click “Save sitemaps”
  5. Choose location and save
  6. Upload the XML file to your site root

Option 3: Sitemap Index (For Very Large Sites)

If you have >50,000 URLs, create a sitemap index. This is a file that lists multiple sitemaps:

xml https://yoursite.com/sitemap-1.xml https://yoursite.com/sitemap-2.xml https://yoursite.com/sitemap-posts.xml

RankMath and Yoast auto-create sitemap indexes. You submit the index file (e.g., sitemap_index.xml) instead of individual sitemaps.


How to Submit a Sitemap to Google

Submitting a sitemap tells Google where to find your page list. Google will crawl the sitemap and, over time, crawl the pages listed in it.

Step 1: Verify Your Site in Google Search Console

  1. Go to Google Search Console
  2. Click “Start now” or add a property
  3. Choose “URL prefix” and enter your site URL (e.g., https://yoursite.com)
  4. Verify ownership (HTML file upload or DNS record, usually)

Step 2: Submit the Sitemap

  1. In GSC, click your property (your site)
  2. Go to Sitemaps (left sidebar)
  3. Click “New sitemap”
  4. Enter the sitemap URL (e.g., sitemap.xml or sitemap_index.xml)
  5. Click “Submit”

Step 3: Monitor Submission Status

  1. Return to the Sitemaps section
  2. Click your submitted sitemap
  3. You’ll see:
  • Submitted: Number of URLs in the sitemap
  • Indexed: Number of those URLs Google has indexed
  1. This updates weekly or monthly

Good: Indexed ≥ 80% of submitted Concerning: Indexed < 50% of submitted (some pages aren't indexable)


How to Submit a Sitemap to Bing

Bing also uses XML sitemaps. Bing uses “Bing Webmaster Tools” (now “Microsoft Clarity”).

  1. Go to Bing Webmaster Tools
  2. Add your site
  3. Go to Sitemaps (left sidebar)
  4. Enter your sitemap URL
  5. Submit

Bing will crawl your sitemap and index pages from it.


XML Sitemap Best Practices

1. Use Clean, Consistent URLs

Your sitemap should list only clean, canonical URLs:

“`xml https://yoursite.com/blog/article-title/

https://yoursite.com/index.php?post_id=123&cat=5

http://yoursite.com/blog/article/ “`

2. Include lastmod Only If Accurate

tells Google when to re-crawl. If you include it, keep it accurate:

“`xml 2026-04-13

2020-01-01 “`

If you can’t maintain accurate dates, omit entirely.

3. Avoid Including Noindex Pages

Don’t list pages you’ve marked as noindex. Example:

xml https://yoursite.com/private-page/

A good plugin (RankMath, Yoast) automatically excludes noindex pages. If you’re building a custom sitemap, ensure noindex pages are excluded.

4. Update Sitemap When Content Changes

For WordPress, your plugin handles this automatically. For custom sites:

  • If you add new pages, regenerate the sitemap
  • If pages are deleted or unpublished, regenerate
  • Regenerate weekly or monthly depending on update frequency

5. Validate Your Sitemap

Use Google’s Sitemap Validator or the built-in validator in GSC:

  1. In GSC, go to Sitemaps
  2. Click the sitemap you submitted
  3. If there are errors (invalid XML, unreachable URLs, etc.), they’ll show here
  4. Fix and resubmit

Common Sitemap Issues and Fixes

Issue 1: “Submitted but Not Indexed”

Many pages are in your sitemap but not indexed by Google.

Possible causes:

  • Pages are noindex (intentional or accidental)
  • Pages have canonicals pointing elsewhere
  • Pages are low quality
  • Pages are too new (index not yet complete)

Fix:

  1. In GSC, click the sitemap
  2. Go to Indexed and Not indexed tabs
  3. Check why pages aren’t indexed (GSC will show reasons)
  4. If a page should be indexed: remove noindex, fix canonicals, or improve content

Issue 2: Sitemap Has Duplicate URLs

Google’s error: “Multiple sitemaps contain the same URL.”

Cause: You’re submitting multiple sitemaps that list overlapping URLs.

Fix:

  • For WordPress: ensure you’re submitting only the main sitemap (or the sitemap index), not individual sitemaps
  • For custom sites: ensure each URL appears in only one sitemap
  • If you have multiple sitemaps, create a sitemap index that lists them all, and submit the index only

Issue 3: Sitemap Is Too Large

Google limits sitemaps to 50,000 URLs and 50MB file size.

Fix:

  • Create multiple sitemaps (e.g., sitemap-posts.xml, sitemap-pages.xml)
  • Create a sitemap index listing all sitemaps
  • Submit the index

Sitemap Types Beyond Standard XML

News Sitemap

If your site publishes news articles, a news sitemap tells Google about recent articles:

xml https://yoursite.com/article-title/ Your Site Name en 2026-04-13 Article Headline

Used for: news sites, blogs, any site publishing time-sensitive content.

Image Sitemap

Include images with your pages:

xml https://yoursite.com/page/ https://yoursite.com/image.jpg Image Title

Used for: image-heavy sites, e-commerce, galleries.

Video Sitemap

Include videos:

xml https://yoursite.com/video-page/ https://yoursite.com/video.mp4 https://yoursite.com/thumb.jpg Video Title

Used for: video-heavy sites.

Most WordPress plugins (RankMath, Yoast) auto-generate these sitemaps if you configure them.


Sitemaps and Site Speed

Do sitemaps slow down your site?

No. Sitemaps are static files that Google crawls. They don’t affect your site’s performance or user experience.

Do sitemaps count towards crawl budget?

No. Google crawls sitemaps separately from regular page crawling. A sitemap doesn’t consume crawl budget.


Sitemaps Aren’t Enough

A sitemap makes crawling more efficient, but it’s not a substitute for:

  • Internal linking (links between your pages help Google understand site structure)
  • Good content (pages listed in a sitemap must be indexable and valuable)
  • Authority (pages need backlinks or topical authority to rank)

A sitemap removes one barrier; it doesn’t build authority or improve content quality.


Checking Your Sitemap Status

  1. Visit your sitemap directly: https://yoursite.com/sitemap.xml
  • Does it load? Is it valid XML?
  1. In GSC, check submission status: How many indexed vs. submitted?
  2. In Bing Webmaster Tools: Is it submitted and crawled?
  3. Use an online validator: Test your XML for errors

What Anitech Does

Anitech audits and fixes sitemaps as part of technical SEO. We:

  1. Check if a sitemap exists and is valid
  2. Ensure it excludes noindex/thin pages
  3. Verify it’s submitted to Google and Bing
  4. Monitor indexed vs. submitted ratio
  5. For WordPress sites, configure RankMath or Yoast properly

If you’re unsure whether your sitemap is set up correctly, we can review it.

Request a technical SEO audit


Related Articles

  • May 25, 2026

Apollo.io Australia: B2B Prospecting & Outreach Guide

Apollo.io Australia: B2B Prospecting & Outreach Guide If you’re trying to build a B2B...

  • May 25, 2026

Landing Page CRO for Lead Generation: A Practical Guide

Landing Page CRO for Lead Generation: A Practical Guide A 2% conversion rate on...

  • May 24, 2026

How to Build a Lead Generation Strategy from Scratch

How to Build a Lead Generation Strategy from Scratch You can’t build a lead...

  • May 24, 2026

Best Lead Generation Software Australia 2026

Best Lead Generation Software Australia 2026 If you’re running a business in Queensland or...

  • May 24, 2026

Bing Ads Australia 2026 | Worth It? Strategy + Setup Guide

Bing Ads Australia: Is Microsoft Advertising Worth It in 2026? Here’s the question most...

Need SEO Help?

Get a free SEO audit and discover how we can help improve your rankings.