Analyze images with OpenAI Vision while preserving binary data for reuse

Name: Analyze images with OpenAI Vision while preserving binary data for reuse
Availability: InStock
Author: Robert Breen

Analyze images with OpenAI Vision while preserving binary data for reuse preview

Open on n8n.io

$20/month : Unlimited workflows

2500 executions/month

Try free

THE #1 IN WEB SCRAPING

Scrape any website without limits

Try free

HOSTINGER

Early Deal
DISCOUNT 20%

Self-hosted n8n

Unlimited workflows - from $4.99/mo

Try free

#1 hub for scraping, AI & automation

6000+ actors - $5 credits/mo

Try free

Important notice

This workflow is provided as-is. Please review and test before using in production.

Overview

Use this template to upload an image, run a first-pass OpenAI Vision analysis, then re-attach the original file (binary/base64) to the next step using a Merge node. The pattern ensures your downstream AI Agent (or any node) can access both the original file (data) and the first analysis result (content) at the same time.

✅ What this template does

Collects an image file via Form Trigger (binary field labeled data)
Analyzes the image with OpenAI Vision (GPT-4o) using base64 input
Merges the original upload and the analysis result (combine by position) so the next node has both
Re-analyzes/uses the image alongside the first analysis in an AI Agent step

🧩 How it works (Node-by-node)

Form Trigger
- Presents a simple upload form and emits a binary/base64 field named data.
Analyze image (OpenAI Vision)
- Reads the same data field as base64 and runs image analysis with GPT-4o.
- The node outputs a text content (first-pass analysis).
Merge (combine by position)
- Combines the two branches so the next node receives both the original upload (data) and the analysis (content) on the same item.
AI Agent
- Receives data + content together.
- Prompt includes the original image (=data) and the first analysis ({{$json.content}}) to compare or refine results.
OpenAI Chat Model
- Provides the language model for the Agent (wired as ai_languageModel).

🛠️ Setup Instructions (from the JSON)

> Keep it simple: mirror these settings and you’re good to go.

1) Form Trigger (n8n-nodes-base.formTrigger)

Path: d6f874ec-6cb3-46c7-8507-bd647c2484f0 (you can change this)
Form Title: Image Document Upload
Form Description: Upload a image document for AI analysis
Form Fields:
- Label: data
- Type: file
Output: emits a binary/base64 field named data.

2) Analyze image (@n8n/n8n-nodes-langchain.openAi)

Resource: image
Operation: analyze
Model: gpt-4o
Text: =data (use the uploaded file field)
Input Type: base64
Credentials: OpenAI (use your stored OpenAI API credential)

3) Merge (n8n-nodes-base.merge)

Mode: combine
Combine By: combineByPosition
- Connect Form Trigger → Merge (input 2)
- Connect Analyze image → Merge (input 1)
- This ensures the original file (data) and the analysis (content) line up on the same item.

4) AI Agent (@n8n/n8n-nodes-langchain.agent)

Prompt Type: define
Text:
System Message: analyze the image again and see if you get the same result.
Receives: merged item containing data + content.

5) OpenAI Chat Model (@n8n/n8n-nodes-langchain.lmChatOpenAi)

Model: gpt-4.1-mini
Wiring: connect as ai_languageModel to the AI Agent
Credentials: same OpenAI credential as above

> Security Note: Store API keys in Credentials (do not hardcode keys in nodes).

🧠 Why “Combine by Position” fixes the binary issue

Some downstream nodes lose access to the original binary once a branch processes it.
By merging the original branch (with data) and the analysis branch (with content) by position, you restore a single item with both fields—so the next step can use the image again while referencing earlier analysis.

🧪 Test Tips

Upload a JPG/PNG and execute the workflow from the Form Trigger preview.
Confirm Merge output contains both data (binary/base64) and content (text).
In the AI Agent, log or return both fields to verify availability.

🔧 Customize

Swap GPT-4o for another Vision-capable model if needed.
Extend the AI Agent to extract structured fields (e.g., objects detected, text, brand cues).
Add a Router after Merge to branch into storage (S3, GDrive) or notifications (Slack, Email).

📝 Requirements

n8n (cloud or self-hosted) with web UI access
OpenAI credential configured (Vision support)

🩹 Troubleshooting

Binary missing downstream? Ensure Merge receives both branches and is set to combineByPosition.
Wrong field name? The Form Trigger upload field must be labeled data to match node expressions.
Model errors? Verify your OpenAI credential and that the chosen model supports image analysis.

💬 Sticky Note (included in the workflow)

> “Use Binary Field after next step” — This workflow demonstrates how to preserve and reuse an uploaded file (binary/base64) after a downstream step by using a Merge node (combineByPosition). A user uploads an image via Form Trigger → the image is analyzed with OpenAI Vision → results are merged back with the original upload so the next AI Agent step can access both the original file (data) and the first analysis (content) at the same time.

📬 Contact

Need help customizing this (e.g., filtering by campaign, sending reports by email, or formatting your PDF)?

📧 [email protected]
🔗 https://www.linkedin.com/in/robert-breen-29429625/
🌐 https://ynteractive.com

Robert Breen

90 workflows

Nodes

n8n-nodes-base.formtrigger @n8n/n8n-nodes-langchain.openai n8n-nodes-base.merge n8n-nodes-base.stickynote @n8n/n8n-nodes-langchain.agent @n8n/n8n-nodes-langchain.lmchatopenai

Complexity

intermediate

Published 23 Sept 2025

Likes 0

View on n8n.io Download Workflow

Install path: /data/workflows/8867/8867.json

Share Your Workflow

Have a useful automation to share? Publish it and help the community.

Submit Your Template How to Submit

Related Workflows

Forecast property CAPEX and ROI weekly using Google Sheets and GPT-4o

## How It Works This workflow automates weekly capital expenditure (CAPEX) forecasting for property portfolios using a multi-agent AI architecture. It targets property managers, asset managers, and facilities finance teams who need data-driven maintenance budgeting without manual spreadsheet analysis. Three Google Sheets sources, namely: maintenance records, property data, and tenant feedback, are merged into a unified dataset. A Main Prediction Agent orchestrates three specialist sub-agents: a CAPEX Prioritizer that ranks spending needs, an ROI Simulator that models return scenarios, and a Quote Requester that fetches vendor estimates. Each agent is backed by dedicated AI models, memory, and tools including a Calculator and Financial Modeling Tool. Structured predictions are parsed, split by category, formatted, saved back to Google Sheets, and pushed to an external budgeting system via POST, delivering a fully automated, auditable CAPEX planning pipeline every week. ## Setup Steps 1. Connect Google Sheets credentials to all three read nodes and the Save Predictions node. 2. Set correct Sheet IDs for maintenance, property, and tenant feedback tabs. 3. Add Claude or OpenAI API credentials to all Chat Model nodes. 4. Configure the Financial Modeling Tool with your cost rate assumptions. 5. Replace the POST placeholder URL in Update Budgeting System with your actual endpoint. ## Prerequisites - Google Sheets account with populated maintenance, property, and tenant data - Claude or OpenAI API credentials - External budgeting system with a POST-compatible API endpoint ## Use Cases - Weekly CAPEX forecasting for multi-property real estate portfolios - Automated ROI modelling for planned renovations or equipment replacement ## Customization Add more data sources (e.g., IoT sensors, ERP exports). ## Benefits Eliminates manual CAPEX spreadsheet work with autonomous AI forecasting.

View

Turn support tickets into developer insights with OpenAI, Postgres, Slack and Jira

## Overview This workflow transforms raw support tickets into actionable developer insights using AI and data processing. It automatically detects recurring issues, identifies root causes, ranks severity, and generates a structured engineering report. By combining embeddings, clustering, and AI analysis, it helps teams prioritize bugs, understand user pain points, and take data-driven product decisions. --- ## How It Works 1. **Scheduled Trigger** - Runs automatically at a defined time (e.g., daily). 2. **Workflow Configuration** - Defines time window, similarity threshold, scoring weights, and delivery options. 3. **Fetch Feedback Data** - Retrieves recent support tickets (bugs and feature requests) from Postgres. 4. **Preprocessing** - Cleans, normalizes, and removes duplicate messages. 5. **Embedding & Clustering** - Generates embeddings using OpenAI. - Groups similar tickets using cosine similarity. 6. **Cluster Aggregation** - Combines related tickets into structured clusters. 7. **Root Cause Analysis** - AI agent analyzes clusters to identify: - Root cause - Impacted module - Severity - Debug steps - Fix direction 8. **Severity Scoring** - Calculates weighted score based on: - Frequency - Sentiment - Churn risk - Enterprise impact 9. **Report Generation** - Generates a developer-focused report including: - Executive summary - Ranked bugs - Feature requests - Risk analysis - Sprint priorities 10. **Delivery** - Sends report to Slack - Optionally creates Jira issues - Optional email delivery --- ## Setup Instructions 1. **Database Setup** - Configure Postgres credentials - Ensure `support_tickets` table exists with required fields 2. **OpenAI Configuration** - Add API key for: - Embeddings (text-embedding-3-small) - AI analysis agents 3. **Slack Integration** - Add Slack credentials - Set channel ID 4. **Email Setup (Optional)** - Configure SMTP or email service 5. **Jira Integration (Optional)** - Add Jira credentials - Set project key and issue type 6. **Customize Parameters** - Adjust: - Similarity threshold - Scoring weights - Time window 7. **Schedule Configuration** - Modify trigger timing as needed --- ## Use Cases - Product teams analyzing user feedback at scale - Engineering teams prioritizing bug fixes - SaaS companies tracking churn-related issues - Customer support insights automation - AI-driven product intelligence dashboards --- ## Requirements - OpenAI API key - Postgres database with support ticket data - Slack (optional) - Email service (optional) - Jira account (optional) - n8n instance --- ## Key Features - Automated feedback clustering using embeddings - AI-driven root cause analysis - Weighted severity scoring system - Developer-ready intelligence reports - Multi-channel delivery (Slack, Email, Jira) - Fully customizable scoring and thresholds --- ## Summary A powerful AI-driven workflow that converts raw support tickets into structured developer intelligence. It automates clustering, root cause detection, prioritization, and reporting helping teams fix the right problems faster and build better products.

View

Track LLM costs and usage across OpenAI, Anthropic, Google and more

## Installation Steps 1. Go to **Settings → n8n API** and create an API key 2. Add it as credential for the **Get Execution Data** node 3. Review model mappings in **Standardize Names** node 4. Review pricing in **Model Prices** node ## To Monitor a Workflow 1. Add **Execute Workflow** node at the end of your target workflow 2. Select this monitoring workflow 3. **Turn OFF** "Wait For Sub-Workflow Completion" 4. Pass `{ "executionId": "{{ $execution.id }}" }` as input ## Prerequisites Enable **"Return Intermediate Steps"** in your AI Agent settings for best results. ## Supported Providers **OpenAI** · **Anthropic** · **Google** · **DeepSeek** · **Meta** · **Mistral** · **xAI** · **Cohere** · **Alibaba Qwen** · **Moonshot Kimi** ### 120+ Model Variations Mapped Includes all versioned variants (e.g., gpt-4o-2024-08-06 → gpt-4o) Prices sourced from official provider pages (March 2026) ## Output Data ### Per LLM Call - Cost Breakdown (prompt, completion, total USD) - Token Metrics (prompt, completion, total) - Performance (execution time, finish reason) - Content Preview (first 100 chars I/O) - Model Parameters (temp, max tokens, timeout) - Execution Context (workflow, node, status) - Flow Tracking (previous nodes chain) ### Summary Statistics - Total executions and costs - Breakdown by model type - Breakdown by node - Average cost per call - Total execution time ## 💡 You can do anything with this data! - Store in a database for historical tracking - Send to Teams as a cost alert - Build dashboards with the summary data - Set budget thresholds and trigger warnings - Export to Google Sheets for reporting

View

Need Custom Automation?

Get help designing a custom n8n workflow that connects your stack and fits your process.

Analyze images with OpenAI Vision while preserving binary data for reuse

Workflow preview

Important notice

Overview

✅ What this template does

🧩 How it works (Node-by-node)

🛠️ Setup Instructions (from the JSON)

🧠 Why “Combine by Position” fixes the binary issue

🧪 Test Tips

🔧 Customize

📝 Requirements

🩹 Troubleshooting

💬 Sticky Note (included in the workflow)

📬 Contact