Shopify Auto-Generated Sitemap Limitations and Missing Products
Search intent: diagnose · Updated February 2026
Shopify automatically generates a sitemap at yourdomain.com/sitemap.xml, but it has four limitations that affect crawl efficiency. It excludes products that sit only in hidden or password-protected collections. It outputs lastmod timestamps that do not track meaningful content changes (often the same date for many products, regardless of when their content was last edited). It splits product entries into files of up to 5,000 URLs each, with no warning and no control over which products land in which file. And it does not include priority or changefreq tags; Google has stated that it ignores both, so this omission matters far less than the first three. Stores with large catalogues or complex collection structures may have products that Google never discovers through the sitemap and that rely entirely on internal linking for discoverability.
Quick Diagnostic Checklist
- Count products in sitemap_products_1.xml and compare against Shopify Admin product count
- Check Google Search Console → Sitemaps for Discovered URLs count vs. actual product count
- Verify 5-10 recently added products appear in the sitemap within 48 hours of publishing
- Check lastmod dates across sitemap - confirm they are not all identical
- Ensure all products are published to the Online Store sales channel in product availability settings
What This Issue Means
Your sitemap is Google's roadmap to your store. If products are missing from it, Google can still discover them through internal links, but discovery may take significantly longer, particularly for new products in deep collection hierarchies. For stores at or near the 5,000-entry-per-file limit, recently added products are the ones most likely to be affected, meaning your freshest inventory may be the slowest to appear in Google search.
What Causes It (Shopify-Specific)
Products in hidden collections are excluded from the sitemap
Shopify's sitemap only includes products that are published to the Online Store sales channel. Products exclusively in collections that are set to "hidden" from the Online Store, or products assigned to custom sales channels only, are excluded from the auto-generated sitemap regardless of whether their /products/[handle] URL is publicly accessible.
The 5,000 product entry cap per sitemap file
Each sitemap_products_1.xml file generated by Shopify lists up to 5,000 product URLs. Stores with more than 5,000 products get a sitemap_products_2.xml and so on - but the order in which products appear is not transparent, and there is no guarantee that all products are consistently included across regeneration cycles.
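To see how a catalogue is actually split across files, the sitemap index can be parsed for its product sub-sitemaps and each file's entries counted. The following is a minimal sketch that works on XML text you have already downloaded; the function names are illustrative, and matching on "sitemap_products_" assumes the file naming described above.

```python
from __future__ import annotations

import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def list_product_sitemaps(index_xml: str) -> list[str]:
    """Return the <loc> of every child sitemap that looks like a product file."""
    root = ET.fromstring(index_xml)
    locs = [
        el.text.strip()
        for el in root.findall("sm:sitemap/sm:loc", SITEMAP_NS)
        if el.text
    ]
    # Assumption: product files are named sitemap_products_N.xml.
    return [loc for loc in locs if "sitemap_products_" in loc]


def count_url_entries(sitemap_xml: str) -> int:
    """Count the <url> entries in a single sitemap file."""
    root = ET.fromstring(sitemap_xml)
    return len(root.findall("sm:url", SITEMAP_NS))
```

Summing count_url_entries over every file returned by list_product_sitemaps gives a total to compare against the published product count in Shopify Admin. Product files may also list a non-product URL such as the homepage, so a difference of one per file is not necessarily a missing product.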
lastmod timestamps are not accurate indicators of content change
Shopify's sitemap outputs the product's updated_at timestamp as lastmod, but updated_at changes on any product modification, including inventory adjustments, price changes, and tag edits, not just meaningful content changes. Google's documentation says it uses lastmod only when the value is consistently and verifiably accurate, so noisy timestamps reduce the signal's usefulness for flagging important content updates.
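One rough way to quantify this is to measure how many entries share the single most common lastmod date. The sketch below assumes lastmod values begin with a YYYY-MM-DD date (as W3C datetime values do) and compares only that date portion; the function name is illustrative.

```python
from __future__ import annotations

from collections import Counter
import xml.etree.ElementTree as ET

SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def lastmod_concentration(sitemap_xml: str) -> tuple[str | None, float]:
    """Return the most common lastmod date and the share of entries using it."""
    root = ET.fromstring(sitemap_xml)
    days = [
        el.text.strip()[:10]  # keep only the YYYY-MM-DD prefix
        for el in root.findall("sm:url/sm:lastmod", SITEMAP_NS)
        if el.text and el.text.strip()
    ]
    if not days:
        return None, 0.0
    day, hits = Counter(days).most_common(1)[0]
    return day, hits / len(days)
```

A high share on one date does not prove the timestamps are wrong (a bulk import or price update legitimately touches every product), but it is a reasonable prompt to compare a few lastmod values against when the product content last changed.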
Paginated collection URLs are absent from the sitemap
Shopify's sitemap includes individual product URLs and collection root URLs but does not include paginated collection pages (/collections/shirts?page=2). Products that appear only on deeper pagination may be crawled less often as a result.
How to Detect It Manually
1. Visit yourdomain.com/sitemap.xml and click through to sitemap_products_1.xml. Count the entries and compare against your total published product count in Shopify Admin.
2. Take 5-10 products from deep in your catalogue, or recently added ones, and check whether their /products/[handle] URLs appear in your sitemap.
3. In Google Search Console → Sitemaps, check the "Discovered URLs" count against your total product count; a significant gap confirms missing products.
4. Compare the lastmod dates across your sitemap. If hundreds of products all show the same lastmod date, the timestamps are not reflecting genuine content changes.
5. Check for products you know you recently edited or added and verify they appear in the sitemap within 24-48 hours.
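The spot check on individual products can be scripted once the product sitemap has been saved locally. This is a minimal sketch, assuming product URLs follow the /products/[handle] pattern (optionally behind a locale prefix); the function names and the list of handles you pass in are your own.

```python
from __future__ import annotations

from urllib.parse import urlparse
import xml.etree.ElementTree as ET

SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def sitemap_handles(sitemap_xml: str) -> set[str]:
    """Collect the product handles found in a product sitemap file."""
    root = ET.fromstring(sitemap_xml)
    handles: set[str] = set()
    for el in root.findall("sm:url/sm:loc", SITEMAP_NS):
        path = urlparse((el.text or "").strip()).path
        parts = [p for p in path.split("/") if p]
        # Take the segment right after "products"; skip non-product URLs.
        if "products" in parts[:-1]:
            handles.add(parts[parts.index("products") + 1])
    return handles


def missing_handles(expected: list[str], sitemap_xml: str) -> list[str]:
    """Return the expected handles that are absent from the sitemap."""
    return sorted(set(expected) - sitemap_handles(sitemap_xml))
```

For stores with more than one product file, run the check against each file and treat a handle as missing only if no file contains it.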
How to Fix It (Step-by-Step)
Ensure all products are published to the Online Store sales channel
In Shopify Admin → Products, filter by sales channel availability. Any product not published to "Online Store" will be excluded from the sitemap. Update availability for products that should be indexable.
Submit your sitemap to Google Search Console and monitor coverage
In Google Search Console → Sitemaps → enter your sitemap URL. After submission, monitor the "Discovered URLs" count against your actual product count. A persistent gap after 2-3 weeks indicates missing products.
For large catalogues, supplement with a custom sitemap app
If your store is approaching or exceeding 5,000 products, install a sitemap management app (e.g., SEO Manager, Sitemap XML) that generates more complete sitemaps with more accurate lastmod handling. These apps can also include collection-product cross-listings and give you control over how entries are split across files. Some also add priority values, although Google ignores them.
Strengthen internal linking for products excluded from the sitemap
For products that cannot easily be added to the sitemap (e.g., exclusive sales channel products), ensure they are linked from at least one publicly accessible, crawlable collection page. Google can discover products through internal links even if they are not in the sitemap.
Create a custom robots.txt.liquid to add sitemap index directives
If you use a custom sitemap solution, reference its sitemap index URL in your robots.txt via the Sitemap: directive so Googlebot can discover it even if it has not been submitted manually.
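{% comment %} robots.txt.liquid: keep Shopify's default rules, then append your custom sitemap. The URL on the last line is a placeholder; use the one your sitemap app provides. {% endcomment %}
{% for group in robots.default_groups %}
  {{- group.user_agent }}

  {%- for rule in group.rules -%}
    {{ rule }}
  {%- endfor -%}

  {%- if group.sitemap != blank -%}
    {{ group.sitemap }}
  {%- endif -%}
{% endfor %}

Sitemap: https://yourdomain.com/your-custom-sitemap.xml
How SEOScan Detects This Issue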
SEOScan fetches and parses the store's sitemap index and all referenced sub-sitemaps. It counts the total product URLs found and compares this against the store's published product count (obtained from the Shopify API or by crawling collection pages). Products present in the store but absent from the sitemap are flagged individually. The tool also checks lastmod consistency - if more than 80% of products share the same lastmod timestamp, an inaccurate lastmod warning is raised.
Example Scan Result
Description
Store has 412 published products. Sitemap includes 269 product URLs (65% coverage). 143 products with publicly accessible /products/ URLs are absent from sitemap. 11 products added in the last 30 days are missing. lastmod dates for 89% of sitemap products are identical (2024-11-03), suggesting inaccurate timestamps.
Impact
Google's primary roadmap for your store is incomplete. New products may take weeks longer to appear in search results. Products added after a catalogue expansion may never be systematically surfaced if internal linking is shallow.
Recommended Fix
Audit product sales channel availability. For stores near 5,000 products, install a dedicated sitemap management app. Submit updated sitemap to Google Search Console and monitor coverage gap weekly.
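The coverage figures in a result like this are simple arithmetic on two counts. As a small worked sketch (the function name is illustrative, not part of SEOScan), using the numbers from the example above:

```python
from __future__ import annotations


def coverage_gap(published: int, in_sitemap: int) -> tuple[int, float]:
    """Return (missing product count, sitemap coverage as a fraction)."""
    if published <= 0:
        raise ValueError("published product count must be positive")
    missing = max(published - in_sitemap, 0)
    return missing, in_sitemap / published


missing, share = coverage_gap(published=412, in_sitemap=269)
print(f"{missing} missing, {share:.0%} coverage")  # 143 missing, 65% coverage
```

Tracking these two numbers weekly makes it easy to see whether a fix is closing the gap.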
Why It Matters for SEO
New Product Discovery Speed
For e-commerce stores, the speed at which new products appear in Google search directly affects revenue. A product not in the sitemap and not linked prominently from crawlable collection pages may take 3-8 weeks to be discovered and indexed, versus 2-5 days for a properly sitemapped product.
Crawl Budget Efficiency
An accurate, complete sitemap helps Googlebot prioritise crawling correctly. Missing products mean Googlebot must rediscover them through link following rather than the more efficient sitemap-driven discovery, wasting crawl budget on navigation and known pages.
Search Console Coverage Accuracy
Filtering Search Console's page indexing report by submitted sitemap is the simplest way to monitor product indexation. Products missing from the sitemap fall outside that filtered view, so indexation problems affecting them are much harder to spot.
Seasonal and Sale Product Timing
Seasonal products (Christmas ranges, summer sale items) that are added to the store late and not properly sitemapped may not appear in Google until after the seasonal window has passed, representing a direct revenue loss.
Real-World Validation Signals
- Shopify's sitemap documentation confirms the 5,000-entry-per-file limit but provides no guidance on which products are included when the limit is exceeded.
- Google's documentation states that accurate lastmod timestamps help Googlebot prioritise recrawling changed pages - Shopify's inventory-triggered lastmod updates undermine this mechanism.
- Stores that supplement Shopify's auto-generated sitemap with a custom solution consistently see 15-30% improvements in new product indexation speed measured in Google Search Console.
- Large Shopify stores (5,000+ products) that rely solely on the default sitemap frequently show thousands of products in a "Discovered - currently not indexed" state in Search Console.
Frequently Asked Questions
Q: How do I check how many products are in my Shopify sitemap?
Visit yourdomain.com/sitemap.xml in a browser and click the sitemap_products_1.xml link. Use Ctrl+F to search for <url> and count the instances - or right-click → Save As and open the file in a text editor to count entries. Compare this against your published product count in Shopify Admin → Products.
Q: Will Google still index products missing from my sitemap?
Yes, Google can discover products through internal links on collection pages, navigation menus, and any external links. However, it will take longer and is less reliable. A product only reachable through deep pagination with no sitemap entry may take weeks to be discovered rather than days.
Q: Can I edit Shopify's auto-generated sitemap?
No. Shopify's sitemap.xml is auto-generated and cannot be modified directly. You can, however, edit robots.txt.liquid to add a Sitemap: directive pointing to a custom sitemap URL provided by a sitemap app. This supplements Shopify's default sitemap rather than removing it; the default file is still served at /sitemap.xml.
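After editing robots.txt.liquid, it is worth confirming which sitemaps the live robots.txt actually advertises. The sketch below parses robots.txt text you have already downloaded and returns every Sitemap: value; the function name is illustrative.

```python
from __future__ import annotations


def sitemap_directives(robots_txt: str) -> list[str]:
    """Return the URLs named in Sitemap: lines of a robots.txt body."""
    urls: list[str] = []
    for raw in robots_txt.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop trailing comments
        key, sep, value = line.partition(":")
        # The directive name is matched case-insensitively.
        if sep and key.strip().lower() == "sitemap" and value.strip():
            urls.append(value.strip())
    return urls
```

If the custom sitemap URL is missing from the returned list, the template change has not taken effect or the directive was placed where it is not rendered.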
Q: Do draft products or hidden products affect the sitemap?
Draft products (not published) are excluded from the sitemap and are not publicly accessible, which is correct. Hidden products - those published to the Online Store but temporarily hidden via a collection setting - may still be accessible at their /products/[handle] URL but can be inconsistently included in the sitemap depending on how they are hidden.
Related Issues
Shopify /collections/all Duplicate Content Problem
Crawlability & Indexing · High
Thin Content on Shopify Product Pages: Diagnosis and Fix
Content Quality · Medium
The Shopify Internal Search Crawl Trap
Crawlability & Indexing · Medium
Shopify ?variant= URL Parameters Creating Near-Duplicate Product Pages
Crawlability & Indexing · High