Skip to main content

Extract sitemap URLs to CSV using chat input, XML parsing and HTTP upload

Workflow preview

Workflow preview
100%
Extract sitemap URLs to CSV using chat input, XML parsing and HTTP upload preview
Open on n8n.io

1. Workflow Overview

SEO Sitemap Extractor: Convert XML to CSV via Chat Interface This workflow provides a streamlined, no code solution to extract all nested URLs from any standard XML sitemap and instantly convert th...

Best for

  • Market Research automation workflows
  • AI Chatbot automation workflows
  • advanced n8n builders looking for reusable templates

Tools used

n8n-nodes-base.stickynote, @n8n/n8n-nodes-langchain.chattrigger, n8n-nodes-base.httprequest, n8n-nodes-base.if, @n8n/n8n-nodes-langchain.chat, n8n-nodes-base.xml, n8n-nodes-base.splitout, n8n-nodes-base.set

Source and attribution

This workflow is cataloged by N8N Workflows and links back to its original n8n.io source page by Siddharth Gupta.

Original n8n.io source

1.1 Workflow description

Title
Extract sitemap URLs to CSV using chat input, XML parsing and HTTP upload
Workflow name
Extract sitemap URLs to CSV using chat input, XML parsing and HTTP upload

SEO Sitemap Extractor: Convert XML to CSV via Chat Interface

This workflow provides a streamlined, no-code solution to extract all nested URLs from any standard XML sitemap and instantly convert them into a structured CSV file. Built entirely within n8n's native chat interface, it eliminates the need for manual data formatting or custom Python scripts during SEO audits and site migrations.

Typical Use Cases

  • SEO Audits: Quickly compile a comprehensive list of a website's published pages for bulk analysis.
  • Site Migrations: Extract legacy URLs to prepare 301 redirect mapping spreadsheets.
  • Content Scraping Prep: Generate a clean list of target URLs to feed into downstream scraping or web automation workflows.

How It Works

  1. Interactive Trigger: The workflow begins in the n8n chat window. Simply paste a valid sitemap URL (e.g., https://example.com/sitemap.xml).
  2. Validation & Fetching: An HTTP Request node fetches the raw XML data while conditional logic verifies the link is accessible (HTTP 200).
  3. Data Parsing: The native XML node parses the raw text into a structured JSON object, isolating the loc (URL) and lastmod (Last Modified) tags.
  4. File Generation & Delivery: The extracted data is compiled into a sanitized CSV binary file and temporarily uploaded to a file host. The workflow concludes by returning a one-click, secure download link directly in the chat.

Key Features

  • Smart Error Handling: Includes built-in routing to detect and reject Sitemap Index files (which require recursive crawling) and inaccessible URLs, returning user-friendly error messages in the chat.
  • Automated Data Mapping: Automatically flattens complex XML arrays into a clean, two-column spreadsheet format.
  • Extraction Summary: Calculates and outputs the total number of successfully extracted URLs before delivering the file.

Prerequisites & Limitations

  • Direct Sitemaps Only: This workflow is optimized for standard page sitemaps. If you need to process a Sitemap Index, you will need to input the underlying child sitemap URLs individually.
  • Third-Party Hosting: By default, this workflow relies on a public API (Uguu) to host the final CSV file for download. You can easily swap the final HTTP Request node to your preferred cloud storage provider (e.g., AWS S3, Google Drive, Dropbox) if you require private file handling.
  • Memory Limits: Extremely large sitemaps (50,000+ URLs) may require increased memory allocation depending on your n8n hosting environment.

1.2 Logical Blocks

This catalog entry is organized from the workflow JSON. The node-level section below shows the executable blocks available for review before importing the template.

2. Block-by-Block Analysis

Block 1 - Sticky Note

Type / Role
n8n-nodes-base.stickyNote - stickyNote
Config choices
Version 1

Block 2 - Sticky Note 1

Type / Role
n8n-nodes-base.stickyNote - stickyNote
Config choices
Version 1

Block 3 - Sticky Note 2

Type / Role
n8n-nodes-base.stickyNote - stickyNote
Config choices
Version 1

Block 4 - Sticky Note README

Type / Role
n8n-nodes-base.stickyNote - stickyNote
Config choices
Version 1

Block 5 - Sticky Note Limitations

Type / Role
n8n-nodes-base.stickyNote - stickyNote
Config choices
Version 1

Block 6 - Listen for Sitemap URL

Type / Role
@n8n/n8n-nodes-langchain.chatTrigger - chatTrigger
Config choices
Version 1.4

Block 7 - Fetch Sitemap XML

Type / Role
n8n-nodes-base.httpRequest - httpRequest
Config choices
Version 4.3

Block 8 - Check if URL is Accessible

Type / Role
n8n-nodes-base.if - if
Config choices
Version 2.3

Block 9 - Alert User: Invalid URL

Type / Role
@n8n/n8n-nodes-langchain.chat - chat
Config choices
Version 1

Block 10 - Parse XML to JSON Object

Type / Role
n8n-nodes-base.xml - xml
Config choices
Version 1

Block 11 - Check for Sitemap Index

Type / Role
n8n-nodes-base.if - if
Config choices
Version 2.3

Block 12 - Alert User: Index Not Supported

Type / Role
@n8n/n8n-nodes-langchain.chat - chat
Config choices
Version 1

Block 13 - Extract URLs Array

Type / Role
n8n-nodes-base.splitOut - splitOut
Config choices
Version 1

Block 14 - Format URL Data

Type / Role
n8n-nodes-base.set - set
Config choices
Version 3.4

Block 15 - Convert Data to CSV

Type / Role
n8n-nodes-base.convertToFile - convertToFile
Config choices
Version 1.1

Block 16 - Upload File to Host

Type / Role
n8n-nodes-base.httpRequest - httpRequest
Config choices
Version 4.3

Block 17 - Format Final Output

Type / Role
n8n-nodes-base.code - code
Config choices
Version 2

Block 18 - Send Download Link

Type / Role
@n8n/n8n-nodes-langchain.chat - chat
Config choices
Version 1

Block 19 - Summarize Extraction Stats

Type / Role
n8n-nodes-base.summarize - summarize
Config choices
Version 1.1

Block 20 - Send Summary

Type / Role
@n8n/n8n-nodes-langchain.chat - chat
Config choices
Version 1

Block 21 - Alert User: Upload Failed

Type / Role
@n8n/n8n-nodes-langchain.chat - chat
Config choices
Version 1

Block 22 - Sticky Note - Node 1

Type / Role
n8n-nodes-base.stickyNote - stickyNote
Config choices
Version 1

Block 23 - Sticky Note - Node 2

Type / Role
n8n-nodes-base.stickyNote - stickyNote
Config choices
Version 1

Block 24 - Sticky Note - Node 3

Type / Role
n8n-nodes-base.stickyNote - stickyNote
Config choices
Version 1

Showing the first 24 of 37 workflow blocks. Download the JSON for the full node graph.

3. Summary Table

Workflow Extract sitemap URLs to CSV using chat input, XML parsing and HTTP upload
Complexity advanced
Nodes 37
Categories Market Research, AI Chatbot
Author Siddharth Gupta
Published 02 May 2026

4. Reproducing the Workflow from Scratch

  1. 1. Download the workflow JSON

    Use the JSON export at /data/workflows/15443/15443.json as the source template for this automation.

  2. 2. Import the template into n8n

    Open n8n, import the downloaded JSON, and review each node before activating the workflow.

  3. 3. Configure credentials and variables

    Replace placeholder credentials, API keys, webhook URLs, account IDs, and environment-specific values with your own settings.

  4. 4. Test with sample data

    Run the workflow manually or in a staging workspace, inspect node output, and confirm downstream systems receive the expected data.

  5. 5. Activate and monitor

    Enable the workflow only after testing, then monitor executions, errors, and rate limits during the first production runs.

5. General Notes & Resources

Review imported nodes carefully before activation. This catalog entry is intended to help you inspect the workflow structure, understand required services, and find related templates faster.

Node names, credentials, schedules, webhook paths, and external service limits may need adjustment for your workspace.

Frequently asked questions

What does Extract sitemap URLs to CSV using chat input, XML parsing and HTTP upload do?

SEO Sitemap Extractor: Convert XML to CSV via Chat Interface This workflow provides a streamlined, no code solution to extract all nested URLs from any standard XML sitemap and instantly convert th...

What do I need before importing this workflow?

Review the workflow JSON, configure any required credentials in n8n, and test the automation in a safe workspace before using it in production.

Can I customize this workflow?

Yes. Use the block-by-block analysis and the downloadable JSON to inspect each node, then adjust credentials, prompts, schedules, filters, or destinations for your Market Research, AI Chatbot use case.