Maintain RAG embeddings with OpenAI, Postgres and auto drift rollback
Overview
This workflow implements a self-healing Retrieval-Augmented Generation (RAG) maintenance system that automatically updates document embeddings, evaluates retrieval quality, detects embedding drift, and safely promotes or rolls back embedding updates.
Maintaining high-quality embeddings in production RAG systems is difficult. When source documents change or embedding models evolve, updates can accidentally degrade retrieval quality or introduce semantic drift.
This workflow solves that problem by introducing an automated evaluation and rollback pipeline for embeddings.
It periodically checks for document changes, regenerates embeddings for updated content, evaluates the new embeddings against a set of predefined golden test questions, and compares the results with the currently active embeddings.
Quality metrics such as Recall@K, keyword similarity, and answer variance are calculated, while embedding vectors are also analyzed for semantic drift using cosine distance.
If the new embeddings outperform the current ones and remain within acceptable drift limits, they are automatically promoted to production. Otherwise, the system safely rolls back or flags the update for manual review.
This creates a robust, production-safe RAG lifecycle automation system.
How It Works
1. Workflow Trigger
The workflow can start in two ways:
- Scheduled trigger running daily
- Webhook trigger when source documents change
Both paths lead to a centralized configuration node that defines parameters such as chunk size, thresholds, and notification settings.
2. Document Retrieval & Change Detection
Documents are fetched from the configured source (GitHub, Drive, Confluence, or other APIs).
The workflow then:
- Splits documents into deterministic chunks
- Computes SHA-256 hashes for each chunk
- Compares them with previously stored hashes in Postgres
Only new or modified chunks proceed to embedding generation, which significantly reduces processing cost.
3. Embedding Generation
Changed chunks are processed through:
- Recursive text splitting
- Document loading
- OpenAI embedding generation
These embeddings are stored as a candidate vector store rather than immediately replacing the production embeddings.
Metadata about the embedding version is stored in Postgres.
4. Golden Question Evaluation
A set of golden test questions stored in the database is used to evaluate retrieval quality.
Two AI agents are used:
- One queries the candidate embeddings
- One queries the current production embeddings
Both generate answers using retrieved context.
5. Quality Metrics Calculation
The workflow calculates several evaluation metrics:
- Recall@K to measure retrieval effectiveness
- Keyword similarity between generated answers and expected answers
- Answer length variance to detect inconsistencies
These are combined into a weighted quality score.
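The three metrics and their combination can be sketched as below. The weights (0.5 / 0.3 / 0.2) and the `maxVariance` normalizer are illustrative assumptions, not the workflow's actual configuration:

```javascript
// Sketch of the quality metrics and their weighted combination.

function recallAtK(retrievedIds, expectedIds, k) {
  // Fraction of expected passages that appear in the top-k retrieved results.
  const topK = new Set(retrievedIds.slice(0, k));
  const hits = expectedIds.filter((id) => topK.has(id)).length;
  return expectedIds.length ? hits / expectedIds.length : 0;
}

function keywordSimilarity(answer, expectedKeywords) {
  // Fraction of expected keywords that occur in the generated answer.
  const text = answer.toLowerCase();
  const hits = expectedKeywords.filter((kw) => text.includes(kw.toLowerCase())).length;
  return expectedKeywords.length ? hits / expectedKeywords.length : 0;
}

function lengthVariance(answers) {
  // Variance of answer lengths; high variance can signal inconsistent retrieval.
  const lengths = answers.map((a) => a.length);
  const mean = lengths.reduce((s, l) => s + l, 0) / lengths.length;
  return lengths.reduce((s, l) => s + (l - mean) ** 2, 0) / lengths.length;
}

function qualityScore({ recall, similarity, variance }, maxVariance = 10000) {
  // Weighted combination; variance is inverted so lower variance scores higher.
  const variancePenalty = Math.min(variance / maxVariance, 1);
  return 0.5 * recall + 0.3 * similarity + 0.2 * (1 - variancePenalty);
}
```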
6. Embedding Drift Detection
The workflow compares embedding vectors between versions using cosine distance.
This identifies semantic drift, which may occur due to:
- embedding model updates
- chunking changes
- document structure changes
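The drift check amounts to cosine distance between corresponding vectors of the two versions, aggregated into one score. A minimal sketch (in practice the vectors come from the candidate and active stores in Postgres):

```javascript
// Cosine distance between two embedding vectors: 0 means identical
// direction, values near 1 mean the meanings have diverged.
function cosineDistance(a, b) {
  let dot = 0, normA = 0, normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  return 1 - dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

function meanDrift(candidateVectors, activeVectors) {
  // Average per-chunk distance: a single aggregate drift score per version.
  const distances = candidateVectors.map((v, i) => cosineDistance(v, activeVectors[i]));
  return distances.reduce((s, d) => s + d, 0) / distances.length;
}
```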
7. Promotion or Rollback
The workflow checks two conditions:
- Quality score exceeds the configured threshold
- Embedding drift remains below the drift threshold
If both conditions pass:
- The candidate embeddings are promoted to active
If not:
- The system rolls back to the previous embeddings
- Or flags the update for human review
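The decision logic above can be sketched as follows. The workflow itself does not specify when it rolls back versus flagging for review; routing drift-only failures to manual review is one plausible policy, shown here as an assumption, and the threshold defaults are illustrative:

```javascript
// Sketch of the promote/rollback decision. Thresholds come from the
// configuration node in the real workflow; the values here are examples.
function decide(
  { candidateScore, activeScore, drift },
  config = { qualityThreshold: 0.75, driftThreshold: 0.15 }
) {
  const qualityOk =
    candidateScore >= config.qualityThreshold && candidateScore >= activeScore;
  const driftOk = drift <= config.driftThreshold;

  if (qualityOk && driftOk) return "promote";
  // Assumption: quality is fine but drift is high, so a human should look
  // at whether the semantic shift is intended (e.g. a model upgrade).
  if (qualityOk && !driftOk) return "manual_review";
  // Otherwise the candidate is worse; keep the previous embeddings active.
  return "rollback";
}
```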
8. Notifications
A webhook notification is sent with:
- update status
- quality score
- drift score
- timestamp
This allows teams to monitor embedding health automatically.
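A notification payload built from those four fields might look like this; the exact key names used by the workflow may differ:

```javascript
// Illustrative shape of the webhook notification body.
function buildNotification(status, qualityScore, driftScore) {
  return {
    status, // e.g. "promoted", "rolled_back", or "manual_review"
    qualityScore,
    driftScore,
    timestamp: new Date().toISOString(),
  };
}
```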
Setup Instructions
- Configure Document Source
Edit the Workflow Configuration node and set:
documentSourceUrl: the API endpoint or file source containing your documents.
Examples include:
- GitHub repository API
- Google Drive export API
- Confluence REST API
- Configure Postgres Database
Create the following tables in your Postgres database:
- document_chunks
- embeddings
- embedding_versions
- golden_questions
These tables store chunk hashes, embedding vectors, version metadata, and evaluation questions.
Connect the Postgres nodes using your database credentials.
- Add OpenAI Credentials
Configure credentials for:
- OpenAI Embeddings
- OpenAI Chat Model
These are used for generating embeddings and answering evaluation questions.
- Populate Golden Questions
Insert evaluation questions into the golden_questions table.
Each record should include:
- question_text
- expected passages
- expected answer keywords
These questions represent critical queries your RAG system must answer correctly.
- Configure Notification Webhook
Add a Slack or Teams webhook URL in the configuration node.
Notifications will be sent whenever:
- embeddings are promoted
- embeddings are rolled back
- manual review is required
- Adjust Quality Thresholds
In the configuration node you can modify:
- qualityThreshold
- driftThreshold
- chunkSize
- chunkOverlap
These parameters control the sensitivity of the evaluation system.
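Put together, the configuration node holds an object along these lines; the values shown are placeholders, not the workflow's defaults:

```javascript
// Illustrative configuration object; actual defaults live in the
// Workflow Configuration node.
const config = {
  qualityThreshold: 0.75, // minimum weighted quality score to promote
  driftThreshold: 0.15,   // maximum acceptable mean cosine distance
  chunkSize: 1000,        // characters per chunk
  chunkOverlap: 200,      // overlap between adjacent chunks
};
```

Raising qualityThreshold or lowering driftThreshold makes promotion stricter; chunkSize and chunkOverlap must match the values used when the golden questions were last validated, since changing them alters hashes and retrieval behavior.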
Use Cases
Production RAG Monitoring
Automatically evaluate and update embeddings in production knowledge systems without risking degraded results.
Continuous Knowledge Base Updates
Keep embeddings synchronized with frequently changing documentation, repositories, or internal knowledge bases.
Safe Embedding Model Upgrades
Test new embedding models against production data before promoting them.
AI System Reliability
Detect retrieval regressions before they affect end users.
Enterprise AI Governance
Provide automated evaluation and rollback capabilities for mission-critical RAG deployments.
Requirements
This workflow requires the following services:
- n8n
- Postgres Database
- OpenAI API
Recommended integrations:
- Slack or Microsoft Teams (for notifications)
Required nodes include:
- Schedule Trigger
- Webhook
- HTTP Request
- Postgres
- Compare Datasets
- Code nodes
- OpenAI Embeddings
- OpenAI Chat Model
- Vector Store nodes
- AI Agent nodes
Summary
This workflow provides a fully automated self-healing RAG infrastructure for maintaining embedding quality in production systems.
By combining change detection, golden-question evaluation, embedding drift analysis, and automatic rollback, it ensures that retrieval performance improves safely over time.
It is ideal for teams running production AI assistants, knowledge bases, or internal search systems that depend on high-quality vector embeddings.