Convert HTML to PDF & extract text from PDFs with CustomJS API
This n8n workflow shows how to convert HTML to PDF and extract text from PDF files with the PDF Toolkit from www.customjs.space.
@custom-js/n8n-nodes-pdf-toolkit
Notice
Community nodes can only be installed on self-hosted instances of n8n.
What this workflow does
- Convert the requested HTML to PDF.
- Extract text from the PDF.
- Use a Code node to handle URLs that point to PDF files.
- Convert the PDF to text.
Requirements
- Self-hosted n8n instance.
- A CustomJS API key for converting PDF to text.
- HTML data to convert to PDF.
- A Code node for handling URLs that point to PDF files.
Workflow Steps:
Manual Trigger:
- Runs when a user starts the workflow manually.
HTML to PDF:
- Request the HTML data.
- Convert the HTML to PDF.
Convert PDF to Text:
- Extract the text from the generated PDF.
Usage
Get an API key from CustomJS
- Sign up on the CustomJS platform.
- Navigate to your profile page.
- Press the "Show" button to reveal your API key.
Set credentials for the CustomJS API in n8n
Copy the API key generated by CustomJS and paste it into the CustomJS API credential in n8n.
Design the workflow
- A Manual Trigger for starting the workflow.
- HTTP Request nodes for downloading PDF files.
- A Code node for handling URLs that point to PDF files.
- A node that converts the PDF to text.
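The check done by the Code node can be sketched roughly as follows. This is a minimal illustration in Python, not the template's actual Code node logic. It assumes each item carries its address in a hypothetical `url` field under `json`, and that a URL "points to a PDF" when its path ends in `.pdf`.

```python
from urllib.parse import urlparse


def is_pdf_url(url: str) -> bool:
    """Return True when the URL path ends in .pdf (case-insensitive).

    Query strings and fragments are ignored, so
    https://example.com/a.pdf?download=1 still counts as a PDF link.
    """
    path = urlparse(url).path
    return path.lower().endswith(".pdf")


def select_pdf_items(items: list[dict]) -> list[dict]:
    """Keep only the items whose (assumed) json.url field points to a PDF."""
    return [
        item
        for item in items
        if is_pdf_url(item.get("json", {}).get("url", ""))
    ]


if __name__ == "__main__":
    # Example items; the "url" field name is an assumption for illustration.
    items = [
        {"json": {"url": "https://example.com/docs/report.pdf"}},
        {"json": {"url": "https://example.com/index.html"}},
    ]
    for item in select_pdf_items(items):
        print(item["json"]["url"])
```

A suffix check is only a heuristic: a server can deliver a PDF from a URL without a `.pdf` extension, so inspecting the `Content-Type` header of the downloaded response is a reasonable addition where that matters.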
You can replace the logic for triggering the workflow and returning results. For example, you can trigger the workflow by calling a webhook and return the result in the webhook response. To do so, replace the Manual Trigger and Write to Disk nodes.
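If the workflow is triggered by a webhook, a client call might look like the sketch below. The webhook URL and the `html` payload field are hypothetical; they depend on how you configure the Webhook node in your own n8n instance.

```python
import json
import urllib.request


def build_webhook_request(webhook_url: str, html: str) -> urllib.request.Request:
    """Build a JSON POST request carrying the HTML to convert.

    The "html" field name is an assumption; use whatever field your
    workflow reads from the incoming webhook body.
    """
    body = json.dumps({"html": html}).encode("utf-8")
    return urllib.request.Request(
        webhook_url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def trigger_workflow(webhook_url: str, html: str, timeout: float = 30.0) -> str:
    """Send the request and return the webhook response body as text."""
    request = build_webhook_request(webhook_url, html)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read().decode("utf-8")


if __name__ == "__main__":
    # Hypothetical URL: replace with the one shown on your Webhook node.
    url = "https://n8n.example.com/webhook/html-to-text"
    print(trigger_workflow(url, "<h1>Hello</h1>"))
```

Keeping request construction separate from sending makes it easy to inspect the payload before pointing the script at a live instance.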