Ebook to audiobook converter using MiniMax and FFmpeg
$20/month : Unlimited workflows
2500 executions/month
THE #1 IN WEB SCRAPING
Scrape any website without limits
HOSTINGER 🎉 Early Black Friday Deal
DISCOUNT 20% Try free
DISCOUNT 20%
Self-hosted n8n
Unlimited workflows - from $4.99/mo
#1 hub for scraping, AI & automation
6000+ actors - $5 credits/mo
Ebook to Audiobook Converter
What It Does
Turn any PDF ebook into a professional audiobook automatically. Upload a PDF, get an MP3 audiobook in your Google Drive. Perfect for listening to books, research papers, or documents on the go.
Example: Input PDF → Output Audiobook
Key Features
- Upload PDF via web form → Get MP3 audiobook in Google Drive
- Natural-sounding AI voices (MiniMax Speech-02-HD)
- Automatic text extraction, chunking, and audio merging
- Customizable voice, speed, and emotion settings
- Processes long books in batches with smart rate limiting
Perfect For
- Students: Turn textbooks into study audiobooks
- Professionals: Listen to reports and documents while commuting
- Content Creators: Repurpose written content as audio
- Accessibility: Make content accessible to visually impaired users
Requirements
| Component | Details |
|---|---|
| n8n | Self-hosted ONLY (cannot run on n8n Cloud) |
| FFmpeg | Must be installed in your n8n environment |
| Replicate API | For MiniMax TTS (Sign up here) |
| Google Drive | OAuth2 credentials + "Audiobook" folder |
⚠️ Important: This workflow does NOT work on n8n Cloud because FFmpeg installation is required.
Quick Setup
1. Install FFmpeg
Docker users:
docker exec -it <n8n-container-name> /bin/bash
apt-get update && apt-get install -y ffmpeg
Native installation:
sudo apt-get install ffmpeg # Linux
brew install ffmpeg # macOS
2. Get API Keys
- Replicate: Sign up at replicate.com and copy your API token
- Google Drive: Set up OAuth2 in n8n and create an "Audiobook" folder in Drive
3. Import & Configure
- Import
n8n.jsoninto your n8n instance - Replace the Replicate API token in the "MINIMAX TTS" node
- Configure Google Drive credentials and select your "Audiobook" folder
- Activate the workflow
Cost Estimate
| Component | Cost |
|---|---|
| MiniMax TTS API | |
| Google Drive Storage | Free (up to 15GB) |
| Processing Time | ~1-2 minutes per 10 pages |
How It Works

PDF Upload → Extract Text → Split into Chunks → Convert to Speech (batches of 5)
→ Merge Audio Files (FFmpeg) → Upload to Google Drive
The workflow uses four main modules:
- Extraction: PDF text extraction and intelligent chunking
- Conversion: MiniMax TTS processes text in batches
- Merging: FFmpeg combines all audio files seamlessly
- Upload: Final audiobook saved to Google Drive
Voice Settings (Customizable)
{
"voice_id": "Friendly_Person",
"emotion": "happy",
"speed": 1,
"pitch": 0
}
Available emotions: happy, neutral, sad, angry, excited
Limitations
- ⚠️ Self-hosted n8n ONLY (not compatible with n8n Cloud)
- PDF files only (not EPUB, MOBI, or scanned images)
- Large books (500+ pages) take longer to process
- Requires FFmpeg installation (see setup above)
Troubleshooting
FFmpeg not found?
- Docker: Run
docker exec -it <container> /bin/bashthenapt-get install ffmpeg - Native: Run
sudo apt-get install ffmpeg(Linux) orbrew install ffmpeg(macOS)
Rate limit errors?
- Increase wait time in the "WAITS FOR 5 SECONDS" node to 10-15 seconds
Google Drive upload fails?
- Make sure you created the "Audiobook" folder in your Google Drive
- Reconfigure OAuth2 credentials in n8n
Created by emp0 | More workflows: n8n Gallery
