
SEO Sitemap
Audit an existing XML sitemap or generate a standards-compliant one so crawlers discover every indexable URL.
Overview
SEO Sitemap is an agent skill for the Launch phase that analyzes or generates XML sitemaps and validates URLs, structure, and crawl alignment.
Install
npx skills add https://github.com/agricidaniel/claude-seo --skill seo-sitemap
What is this skill?
- Two modes: analyze existing sitemaps or generate new ones with industry templates
- Validation checks include XML validity, URL count under 50,000 per file, and HTTP 200 for listed URLs
- Flags deprecated priority and changefreq tags ignored by Google
- Common-issues table with severity (Critical, High, Medium, Low, Info) and fixes
- Quality signals: sitemap index splits, HTTPS-only URLs, no noindex or redirect URLs in sitemap
- Skill version 2.0.0
Adoption & trust: 2.3k installs on skills.sh; 8.5k GitHub stars; 2/3 security scanners passed (skills.sh audits).
What problem does it solve?
Crawlers miss pages or waste budget because your XML sitemap is broken, bloated, or full of noindexed and redirect URLs.
Who is it for?
Indie SaaS, blogs, and ecommerce sites at launch or post-migration needing technical sitemap fixes.
Skip if: Pure content strategy without a live site, or app-store-only products with no web crawl surface.
When should I use this skill?
When the user says "sitemap", "generate sitemap", "sitemap issues", or "XML sitemap".
What do I get? / Deliverables
You get a validated or freshly generated sitemap that respects the 50k URL limit, lists only canonical 200 URLs, and is referenced correctly in robots.txt.
- Sitemap validation report with severity-tagged issues
- Generated or corrected XML sitemap (and index split guidance when needed)
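The offline part of such a validation report can be sketched in a few lines of Python. This is a minimal illustration, not the skill's actual implementation: the function name `audit_sitemap` is hypothetical, the severity for invalid XML is an assumption (the skill's issue table does not assign one), and checks that need network access (HTTP 200, redirects, noindex) are deliberately left out.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "{http://www.sitemaps.org/schemas/sitemap/0.9}"
MAX_URLS_PER_FILE = 50_000  # per-file limit cited by the skill


def audit_sitemap(xml_text):
    """Return (severity, message) pairs for offline checks on one sitemap file."""
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError as exc:
        # The ParseError message includes the line and column of the failure.
        # "Critical" is an assumed severity; the skill's table does not list one.
        return [("Critical", f"Invalid XML: {exc}")]

    issues = []
    urls = root.findall(f"{SITEMAP_NS}url")

    if len(urls) > MAX_URLS_PER_FILE:
        issues.append(("Critical", f"{len(urls)} URLs in one file; split with a sitemap index"))

    # Identical <lastmod> on every entry suggests the dates are not real.
    lastmods = [u.findtext(f"{SITEMAP_NS}lastmod") for u in urls]
    if len(urls) > 1 and all(lastmods) and len(set(lastmods)) == 1:
        issues.append(("Low", "All <lastmod> values are identical"))

    # <priority> and <changefreq> are harmless but ignored by Google.
    deprecated = [
        tag for tag in ("priority", "changefreq")
        if root.find(f".//{SITEMAP_NS}{tag}") is not None
    ]
    if deprecated:
        issues.append(("Info", "Uses " + "/".join(deprecated) + " (ignored by Google)"))

    return issues
```

Calling `audit_sitemap(open("sitemap.xml", encoding="utf-8").read())` returns an empty list when none of these offline checks fire; the severities otherwise follow the skill's common-issues table.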
How it compares
A technical SEO checker and generator, not a full site audit suite or rank-tracking tool.
Common Questions / FAQ
Who is seo-sitemap for?
Solo builders shipping web products who own technical SEO and need sitemap validation or generation inside their agent workflow.
When should I use seo-sitemap?
At launch, when you ask for "sitemap", "generate sitemap", "sitemap issues", or "XML sitemap", or after a deploy when coverage reports show sitemap errors.
Is seo-sitemap safe to install?
Analysis may fetch your live URLs; review Security Audits on this Prism page before granting network access to production domains.
SKILL.md
MIT License - see repository root LICENSE file for complete terms.
Copyright (c) 2026 AgriciDaniel https://github.com/AgriciDaniel/claude-seo

---
name: seo-sitemap
description: >
  Analyze existing XML sitemaps or generate new ones with industry templates.
  Validates format, URLs, and structure. Use when user says "sitemap",
  "generate sitemap", "sitemap issues", or "XML sitemap".
user-invokable: true
argument-hint: "[url or generate]"
license: MIT
metadata:
  author: AgriciDaniel
  version: "2.0.0"
  category: seo
---

# Sitemap Analysis & Generation

## Mode 1: Analyze Existing Sitemap

### Validation Checks

- Valid XML format
- URL count <50,000 per file (protocol limit)
- All URLs return HTTP 200
- `<lastmod>` dates are accurate (not all identical)
- No deprecated tags: `<priority>` and `<changefreq>` are ignored by Google
- Sitemap referenced in robots.txt
- Compare crawled pages vs sitemap; flag missing pages

### Quality Signals

- Sitemap index file if >50k URLs
- Split by content type (pages, posts, images, videos)
- No non-canonical URLs in sitemap
- No noindexed URLs in sitemap
- No redirected URLs in sitemap
- HTTPS URLs only (no HTTP)

### Common Issues

| Issue | Severity | Fix |
|-------|----------|-----|
| >50k URLs in single file | Critical | Split with sitemap index |
| Non-200 URLs | High | Remove or fix broken URLs |
| Noindexed URLs included | High | Remove from sitemap |
| Redirected URLs included | Medium | Update to final URLs |
| All identical lastmod | Low | Use actual modification dates |
| Priority/changefreq used | Info | Can remove (ignored by Google) |

## Mode 2: Generate New Sitemap

### Process

1. Ask for business type (or auto-detect from existing site)
2. Load industry template from `../seo-plan/assets/` directory
3. Interactive structure planning with user
4. Apply quality gates:
   - ⚠️ WARNING at 30+ location pages (require 60%+ unique content)
   - 🛑 HARD STOP at 50+ location pages (require justification)
5. Generate valid XML output
6. Split at 50k URLs with sitemap index
7. Generate STRUCTURE.md documentation

### Safe Programmatic Pages (OK at scale)

- ✅ Integration pages (with real setup docs)
- ✅ Template/tool pages (with downloadable content)
- ✅ Glossary pages (200+ word definitions)
- ✅ Product pages (unique specs, reviews)
- ✅ User profile pages (user-generated content)

### Penalty Risk (avoid at scale)

- ❌ Location pages with only city name swapped
- ❌ "Best [tool] for [industry]" without industry-specific value
- ❌ "[Competitor] alternative" without real comparison data
- ❌ AI-generated pages without human review and unique value

## Sitemap Format

### Standard Sitemap

```xml
<?xml version="1.0" encoding="UTF-8"?>
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url>
    <loc>https://example.com/page</loc>
    <lastmod>2026-02-07</lastmod>
  </url>
</urlset>
```

### Sitemap Index (for >50k URLs)

```xml
<?xml version="1.0" encoding="UTF-8"?>
<sitemapindex xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <sitemap>
    <loc>https://example.com/sitemap-pages.xml</loc>
    <lastmod>2026-02-07</lastmod>
  </sitemap>
  <sitemap>
    <loc>https://example.com/sitemap-posts.xml</loc>
    <lastmod>2026-02-07</lastmod>
  </sitemap>
</sitemapindex>
```

## Error Handling

- **URL unreachable**: Report the HTTP status code and suggest checking if the site is live
- **No sitemap found**: Check common locations (/sitemap.xml, /sitemap_index.xml, robots.txt reference) before reporting "not found"
- **Invalid XML format**: Report specific parsing errors with line numbers
- **Rate limiting detected**: Back off and report partial results with a note about retry timing

## Output

### For Analysis

- `VALIDATION-REPORT.md`: analysis results
- Issues list with severity
- Recommendations

### For Generation

- `sitemap.xml` (or split files with index)
- `STRUCTURE.md`: site architecture documentation
- URL count and organization summary
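
To make the "split files with index" output concrete, here is a rough Python sketch of one way to produce it. It is an illustration under assumptions, not the skill's own generator: `build_sitemaps` and the numbered file names (`sitemap-1.xml`, ...) are hypothetical, and it splits purely by count, whereas the skill also recommends splitting by content type.

```python
from xml.sax.saxutils import escape

NS = "http://www.sitemaps.org/schemas/sitemap/0.9"
MAX_URLS = 50_000  # per-file limit cited by the skill


def _urlset(entries):
    """Render (loc, lastmod) pairs as one <urlset> document."""
    rows = "".join(
        f"  <url>\n    <loc>{escape(loc)}</loc>\n    <lastmod>{lastmod}</lastmod>\n  </url>\n"
        for loc, lastmod in entries
    )
    return f'<?xml version="1.0" encoding="UTF-8"?>\n<urlset xmlns="{NS}">\n{rows}</urlset>\n'


def build_sitemaps(entries, base_url, max_urls=MAX_URLS):
    """Return {filename: xml}: one file if it fits, else chunks plus an index."""
    entries = list(entries)
    if len(entries) <= max_urls:
        return {"sitemap.xml": _urlset(entries)}

    files = {}
    index_rows = ""
    for n, start in enumerate(range(0, len(entries), max_urls), start=1):
        chunk = entries[start:start + max_urls]
        name = f"sitemap-{n}.xml"  # hypothetical naming scheme
        files[name] = _urlset(chunk)
        newest = max(lastmod for _, lastmod in chunk)  # ISO dates sort lexically
        loc = escape(base_url.rstrip("/") + "/" + name)
        index_rows += (
            f"  <sitemap>\n    <loc>{loc}</loc>\n"
            f"    <lastmod>{newest}</lastmod>\n  </sitemap>\n"
        )

    # The index takes the canonical name so the robots.txt reference stays stable.
    files["sitemap.xml"] = (
        '<?xml version="1.0" encoding="UTF-8"?>\n'
        f'<sitemapindex xmlns="{NS}">\n{index_rows}</sitemapindex>\n'
    )
    return files
```

With the default limit, a site of 120,000 URLs would yield three chunk files plus a `sitemap.xml` index; passing a small `max_urls` makes the split easy to try on a handful of entries.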