Ebook to audiobook converter using MiniMax and FFmpeg

Name: Ebook to audiobook converter using MiniMax and FFmpeg
Availability: InStock
Rating: 4.5 (54 reviews)
Author: Jay Emp0

$20/month : Unlimited workflows

2500 executions/month

Try free

THE #1 IN WEB SCRAPING

Scrape any website without limits

Try free

HOSTINGER 🎉 Early Black Friday Deal
DISCOUNT 20%

Self-hosted n8n

Unlimited workflows - from $4.99/mo

Try free

#1 hub for scraping, AI & automation

6000+ actors - $5 credits/mo

Try free

Ebook to Audiobook Converter

▶️ Watch Full Demo Video

What It Does

Turn any PDF ebook into a professional audiobook automatically. Upload a PDF, get an MP3 audiobook in your Google Drive. Perfect for listening to books, research papers, or documents on the go.

Example: Input PDF → Output Audiobook

Key Features

Upload PDF via web form → Get MP3 audiobook in Google Drive
Natural-sounding AI voices (MiniMax Speech-02-HD)
Automatic text extraction, chunking, and audio merging
Customizable voice, speed, and emotion settings
Processes long books in batches with smart rate limiting

Perfect For

Students: Turn textbooks into study audiobooks
Professionals: Listen to reports and documents while commuting
Content Creators: Repurpose written content as audio
Accessibility: Make content accessible to visually impaired users

Requirements

Component	Details
n8n	Self-hosted ONLY (cannot run on n8n Cloud)
FFmpeg	Must be installed in your n8n environment
Replicate API	For MiniMax TTS (Sign up here)
Google Drive	OAuth2 credentials + "Audiobook" folder

⚠️ Important: This workflow does NOT work on n8n Cloud because FFmpeg installation is required.

Quick Setup

1. Install FFmpeg

Docker users:

docker exec -it &lt;n8n-container-name&gt; /bin/bash
apt-get update && apt-get install -y ffmpeg

Native installation:

sudo apt-get install ffmpeg  # Linux
brew install ffmpeg          # macOS

2. Get API Keys

Replicate: Sign up at replicate.com and copy your API token
Google Drive: Set up OAuth2 in n8n and create an "Audiobook" folder in Drive

3. Import & Configure

Import n8n.json into your n8n instance
Replace the Replicate API token in the "MINIMAX TTS" node
Configure Google Drive credentials and select your "Audiobook" folder
Activate the workflow

Cost Estimate

Component	Cost
MiniMax TTS API	~~$0.15 per 1000 characters (~~$3-5 for average book)
Google Drive Storage	Free (up to 15GB)
Processing Time	~1-2 minutes per 10 pages

How It Works

Workflow Diagram

PDF Upload → Extract Text → Split into Chunks → Convert to Speech (batches of 5)
→ Merge Audio Files (FFmpeg) → Upload to Google Drive

The workflow uses four main modules:

Extraction: PDF text extraction and intelligent chunking
Conversion: MiniMax TTS processes text in batches
Merging: FFmpeg combines all audio files seamlessly
Upload: Final audiobook saved to Google Drive

Voice Settings (Customizable)

{
  "voice_id": "Friendly_Person",
  "emotion": "happy",
  "speed": 1,
  "pitch": 0
}

Available emotions: happy, neutral, sad, angry, excited

Limitations

⚠️ Self-hosted n8n ONLY (not compatible with n8n Cloud)
PDF files only (not EPUB, MOBI, or scanned images)
Large books (500+ pages) take longer to process
Requires FFmpeg installation (see setup above)

Troubleshooting

FFmpeg not found?

Docker: Run docker exec -it <container> /bin/bash then apt-get install ffmpeg
Native: Run sudo apt-get install ffmpeg (Linux) or brew install ffmpeg (macOS)

Rate limit errors?

Increase wait time in the "WAITS FOR 5 SECONDS" node to 10-15 seconds

Google Drive upload fails?

Make sure you created the "Audiobook" folder in your Google Drive
Reconfigure OAuth2 credentials in n8n

Created by emp0 | More workflows: n8n Gallery

Jay Emp0

0 workflows

Nodes

set gmail telegram agent google-gemini

Complexity

advanced

Published 20 Oct 2025

Likes 0

View on n8n.io Download Workflow

✨

Share Your Workflow

Have a great workflow to share? Join the n8n Creator Hub and help the community!

Submit Your Template How to Submit

Related Workflows

Generate VEED AI talking head videos from sheet rows with OpenAI or ElevenLabs

A production-ready n8n workflow that generates AI avatar videos from images and text using **VEED Fabric 1.0**, with flexible multi-platform publishing capabilities. ## Key Capabilities ### Unlimited Scale - **Process any number of videos**: Sequential processing ensures each video is fully generated and published before moving to the next - **Batch processing**: Add multiple video requests to Google Sheet and let the workflow process them automatically - **No context mixing**: Each video maintains its own configuration throughout the entire pipeline ### Flexible Publishing - **Per-video platform selection**: Each video can target different platforms (e.g., Video 1 → Instagram+YouTube, Video 2 → Telegram only) - **Optional publishing**: Leave PLATFORMS column empty to generate videos without publishing (videos saved to Drive) - **Supported platforms**: Instagram Reels, YouTube/Shorts, Facebook, Telegram, Threads - **Platform-specific formatting**: Automatic optimization for each platform's requirements ### Smart Processing - **Two TTS providers**: Choose OpenAI or ElevenLabs per video - **Configurable quality**: Select resolution (480p/720p) and aspect ratio (9:16, 16:9, 1:1) per video - **Approval workflow**: Review videos before publishing with email approve/reject buttons - **Error handling**: Automatic error detection with detailed email notifications ### Status Tracking - **Real-time status updates**: Google Sheet updates as workflow progresses (new → processing → published) - **Detailed results**: Per-platform success/failure tracking with post URLs - **Email reports**: Comprehensive publishing reports with links to all posted content ## How It Works 1. **Input**: Add rows to Google Sheet with video details 2. **TTS**: Generate speech using OpenAI or ElevenLabs 3. **Video**: VEED Fabric 1.0 creates talking head video 4. **Approval**: Email with video preview and approve/reject buttons 5. **Publish**: Sequential publishing to selected platforms 6. **Report**: Status update in sheet + email with results ## Requirements - Fal.ai API Key (for VEED) - Google OAuth (Sheets, Drive, Gmail) - TTS: OpenAI or ElevenLabs API Key - Social Media credentials (optional, only for platforms you use) - Telegram Bot Token (optional, only for Telegram) **Node:** n8n-nodes-veed **Author:** VEED.io

View

Translate 🎙️and upload dubbed YouTube videos 📺 using ElevenLabs AI Dubbing

This workflow automates the end-to-end process of **video dubbing** using **ElevenLabs**, storage on Google Drive, and publishing on **Youtube**. This workflow is ideal for creators, agencies, and media teams that need to **TRANSLATE process** and publish large volumes of video content consistently. For this workflow, I started from my [Italian YouTube Short](https://iframe.mediadelivery.net/play/580928/c445daec-e3fe-4019-b035-58ac3bf386dd), and by applying the same workflow, the result was this [English version](https://iframe.mediadelivery.net/play/580928/2179db44-e7e2-43e6-82a1-13b12e18ba8b). --- ### Key Advantages #### 1. ✅ Full Automation of Video Localization The entire process—from video download to AI dubbing and publishing—is automated, eliminating manual steps and reducing human error. #### 2. ✅ Fast Multilingual Content Scaling With AI-powered dubbing, the same video can be quickly localized into different languages, enabling global audience expansion. #### 3. ✅ Efficient Time Management The workflow intelligently waits for the dubbing process to finish using dynamic timing, avoiding unnecessary retries or failures. #### 4. ✅ Centralized Content Distribution A single workflow handles storage, social posting, and YouTube uploads, simplifying content operations across platforms. #### 5. ✅ Reduced Operational Costs Automating dubbing and publishing significantly lowers costs compared to manual voiceovers, video editing, and uploads. #### 6. ✅ Easy Customization & Reusability Parameters like video URL, language, title, and platform can be easily changed, making the workflow reusable for different projects or clients. --- ### **How It Works** 1. The workflow begins with a manual trigger that sets input parameters: a video URL and the target language for dubbing (e.g., `en` for English). 2. The video is fetched from the provided URL via an HTTP request. 3. The video file is sent to the **ElevenLabs Dubbing API**, which initiates audio dubbing in the specified target language. 4. The workflow then waits for a calculated duration (video length + 120 seconds) to allow the dubbing process to complete. 5. After the wait, it checks the dubbing status using the `dubbing_id` and retrieves the final dubbed audio file. 6. The dubbed video is then processed in parallel: - Uploaded to **Google Drive** in a designated folder. - Uploaded to **Postiz** for social media management. - Uploaded via **Upload-Post.com API** for YouTube publishing. 7. Finally, the workflow triggers a **Postiz** node to schedule or publish the content to YouTube with the prepared metadata. --- ### **Set Up Steps** 1. **Configure Input Parameters** In the *Set params* node, define: - `video_url`: Direct URL to the source video. - `target_audio`: Language code (e.g., `en`, `es`, `fr`) for dubbing. 2. **Set Up Credentials** Ensure the following credentials are configured in n8n: - **[ElevenLabs API](https://try.elevenlabs.io/ahkbf00hocnu)** (for dubbing) - **Google Drive OAuth2** (for file upload) - **[Postiz API](https://affiliate.postiz.com/n3witalia)** (for social media scheduling) - **[Upload-Post.com API](https://www.upload-post.com/?linkId=lp_144414&sourceId=n3witalia&tenantId=upload-post-app)** (for YouTube upload) 3. **Adjust Wait Time** Modify the *Wait* node if needed: `expected_duration_sec + 120` ensures enough time for dubbing. Adjust based on video length. 4. **Customize Upload Destinations** Update folder IDs (Google Drive) and platform settings (Upload-Post.com) as needed. 5. **Set Post Content** In the *Youtube Postiz* and *Youtube Upload-Post* nodes, replace `YOUR_CONTENT` and `YOUR_USERNAME` with actual titles, descriptions, and channel details. 6. **Activate and Test** Activate the workflow in n8n, click *Execute workflow*, and monitor execution for errors. Ensure all API keys and permissions are valid. --- 👉 [Subscribe to my new **YouTube channel**](https://youtube.com/@n3witalia). Here I’ll share videos and Shorts with practical tutorials and **FREE templates for n8n**. [![image](https://n3wstorage.b-cdn.net/n3witalia/youtube-n8n-cover.jpg)](https://youtube.com/@n3witalia) --- ### **Need help customizing?** [Contact me](mailto:[email protected]) for consulting and support or add me on [Linkedin](https://www.linkedin.com/in/davideboizza/).

View

Create a daily AI & automation content digest from YouTube, Reddit, X and Perplexity with OpenAI and Airtable

What It Does This workflow automates the creation of a daily AI and automation content digest by aggregating trending content from four sources: YouTube (n8n-related videos with AI-generated transcript summaries), Reddit (rising posts from r/n8n), X/Twitter (tweets about n8n, AI automation, AI agents, and Claude via Apify scraping), and Perplexity AI (top 3 trending AI news stories). The collected data is analyzed using OpenAI models to extract key insights, stored in Airtable for archival, and then compiled into a beautifully formatted HTML email report that includes TL;DR highlights, content summaries, trending topics, and AI-generated content ideas—delivered straight to your inbox via Gmail. --- Setup Guide Prerequisites You will need accounts and API credentials for the following services: ┌──────────────────┬───────────────────────────────────────────────┐ │ Service │ Purpose │ ├──────────────────┼───────────────────────────────────────────────┤ │ YouTube Data API │ Fetch video metadata and search results │ ├──────────────────┼───────────────────────────────────────────────┤ │ Apify │ Scrape YouTube transcripts and X/Twitter data │ ├──────────────────┼───────────────────────────────────────────────┤ │ Reddit API │ Pull trending posts from subreddits │ ├──────────────────┼───────────────────────────────────────────────┤ │ Perplexity AI │ Get real-time AI news summaries │ ├──────────────────┼───────────────────────────────────────────────┤ │ OpenAI │ Content analysis and summarization │ ├──────────────────┼───────────────────────────────────────────────┤ │ OpenRouter │ Report generation (GPT-4.1) │ ├──────────────────┼───────────────────────────────────────────────┤ │ Airtable │ Store collected content │ ├──────────────────┼───────────────────────────────────────────────┤ │ Gmail │ Send the daily report │ └──────────────────┴───────────────────────────────────────────────┘ Step-by-Step Setup 1. Import the workflow into your n8n instance 2. Configure YouTube credentials: - Set up YouTube OAuth2 credentials - Replace YOURAPIKEY in the "Get Video Data" HTTP Request node with your YouTube Data API key 3. Configure Apify credentials: - In the "Get Transcripts" and "Scrape X" HTTP Request nodes, replace YOURAPIKEY in the Authorization header with your Apify API token 4. Configure Reddit credentials: - Set up Reddit OAuth2 credentials (see note below) 5. Configure AI service credentials: - Add your Perplexity API credentials - Add your OpenAI API credentials - Add your OpenRouter API credentials 6. Configure Airtable: - Create a base called "AI Content Hub" with three tables: YouTube Videos, Reddit Posts, and Tweets - Update the Airtable nodes with your base and table IDs 7. Configure Gmail: - Set up Gmail OAuth2 credentials - Replace YOUREMAIL in the Gmail node with your recipient email address 8. Customize search terms (optional): - Modify the YouTube search query in "Get Videos" node - Adjust the subreddit in "n8n Trending" node - Update Twitter search terms in "Scrape X" node Important Note: Reddit API Access The Reddit node requires OAuth2 authentication. If you do not already have a Reddit developer account, you will need to submit a request for API access: 1. Go to https://www.reddit.com/prefs/apps 2. Click "create another app..." at the bottom 3. Select "script" as the application type 4. Fill in the required fields (name, redirect URI as http://localhost) 5. Important: Reddit now requires additional approval for API access. Visit https://www.reddit.com/wiki/api to review their API terms and submit an access request if prompted 6. Once approved, use your client ID and client secret to configure the Reddit OAuth2 credentials in n8n API approval can take 1-3 business days depending on your use case. --- Recommended Schedule Set up a Schedule Trigger to run this workflow daily (e.g., 7:00 AM) for a fresh content digest each morning.

View

👨‍💻

Need Custom Automation?

N8N Automation Expert

Specialized in N8N automation, I design custom workflows that connect your tools and automate your processes.