Mathis

Free intermediate

Convert PDF documents to AI podcasts with Google Gemini and text-to-speech

# Convert PDF documents to AI-generated podcasts with Google Gemini and Text-to-Speech

Transform any PDF document into an engaging, natural-sounding podcast using Google's Gemini AI and advanced Text-to-Speech technology. This automated workflow extracts text content, generates conversational scripts, and produces high-quality audio files.

## Who is this for?

This workflow template is perfect for **content creators**, **educators**, **researchers**, and **marketing professionals** who want to repurpose written content into audio format. Ideal for creating podcast episodes, educational content, or making documents more accessible.

## What problem does this solve?

Converting written documents to engaging audio content manually is time-consuming and requires scriptwriting skills. This workflow automates the entire process, turning static PDFs into dynamic, conversational podcasts that sound natural and engaging.

## What this workflow does

1. **Extracts text** from uploaded PDF documents
2. **Generates podcast script** using Google Gemini AI with conversational tone
3. **Converts script to speech** using Google's advanced TTS with customizable voices
4. **Processes audio** into properly formatted WAV files
5. **Saves final podcast** ready for distribution

## Setup

1. **Obtain API credentials:**
   - Get Google Gemini API key from [AI Studio](https://aistudio.google.com/)
   - Configure credentials in n8n as "Google Gemini(PaLM) Api account"
2. **Configure voice settings:**
   - Choose from available voices: Kore (professional), Aoede (conversational), Laomedeia (energetic)
   - Customize script generation prompts if needed
3. **Test the workflow:**
   - Upload a sample PDF file
   - Verify audio output quality
   - Adjust voice settings as preferred

## How to customize this workflow

- **Modify script style:** Edit the prompt in the "Generate Podcast Script" node to change tone, length, or format
- **Change voice:** Update the voice name in "Prepare TTS Request" node
- **Add preprocessing:** Insert text cleaning nodes before script generation
- **Integrate with storage:** Connect to Google Drive, Dropbox, or other storage services
- **Add notifications:** Include Slack or email notifications when podcasts are ready

**Note:** This template requires Google Gemini API access and works best with text-based PDF files under 10MB.
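
The template does not show what happens inside the "Prepare TTS Request" node or the audio-processing step, so the following is only a rough Python sketch of what those two steps might look like. It assumes the request body follows the Gemini API's `generateContent` speech format and that the audio comes back as base64-encoded raw PCM (16-bit, mono, 24 kHz). The function names `build_tts_request` and `pcm_to_wav`, the `TEMPLATE_VOICES` list, and the default sample rate are illustrative choices, not part of the workflow itself.

```python
import base64
import io
import wave

# Voices named in the template's setup notes; the API may accept others.
TEMPLATE_VOICES = ("Kore", "Aoede", "Laomedeia")


def build_tts_request(script: str, voice: str = "Kore") -> dict:
    """Build a JSON body for a Gemini text-to-speech call (assumed payload shape)."""
    if voice not in TEMPLATE_VOICES:
        raise ValueError(f"voice not listed in the template: {voice}")
    return {
        "contents": [{"parts": [{"text": script}]}],
        "generationConfig": {
            "responseModalities": ["AUDIO"],
            "speechConfig": {
                "voiceConfig": {"prebuiltVoiceConfig": {"voiceName": voice}}
            },
        },
    }


def pcm_to_wav(pcm_b64: str, sample_rate: int = 24000) -> bytes:
    """Wrap base64-encoded raw PCM in a WAV container and return the file bytes."""
    pcm = base64.b64decode(pcm_b64)
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(1)            # mono (assumption)
        wav.setsampwidth(2)            # 16-bit samples (assumption)
        wav.setframerate(sample_rate)  # 24 kHz by default (assumption)
        wav.writeframes(pcm)
    return buf.getvalue()
```

Changing the voice then amounts to passing a different name, for example `build_tts_request(script, "Aoede")`. If the audio returned by your TTS call uses a different sample rate or bit depth, adjust the corresponding values in `pcm_to_wav`, otherwise the resulting file will play at the wrong speed or as noise.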

Content Creation
12 Jun 2025