Evaluate AI workflows using Google Sheets, Gemini, Claude, GPT, and Perplexity

Important notice

This workflow is provided as-is. Please review and test before using in production.

Overview

This template and the accompanying YouTube video walk through five evaluation types you can implement in n8n:

  • Categorization
  • Correctness
  • Tools used
  • String similarity
  • Helpfulness
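As a rough illustration of what the deterministic metrics in this list measure, here is a minimal Python sketch. The function names and scoring rules are assumptions made for this example and are not taken from the template; n8n's built-in metrics may normalize or score differently. Correctness and helpfulness are left out because they typically rely on a judge model rather than a fixed formula.

```python
def categorization_score(expected: str, actual: str) -> int:
    """Return 1 when the labels match after trimming and lowercasing, else 0."""
    return int(expected.strip().lower() == actual.strip().lower())


def levenshtein(a: str, b: str) -> int:
    """Edit distance between two strings (row-by-row dynamic programming)."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete a character from a
                current[j - 1] + 1,      # insert a character into a
                previous[j - 1] + cost,  # substitute (free if equal)
            ))
        previous = current
    return previous[-1]


def string_similarity(expected: str, actual: str) -> float:
    """Assumed normalization: 1.0 for identical strings, 0.0 for fully different."""
    longest = max(len(expected), len(actual))
    if longest == 0:
        return 1.0  # two empty strings are treated as identical
    return 1.0 - levenshtein(expected, actual) / longest


def tools_used_score(expected_tools: list[str], called_tools: list[str]) -> float:
    """Fraction of the expected tools that the agent actually called."""
    expected = set(expected_tools)
    if not expected:
        return 1.0  # nothing was required, so nothing was missed
    return len(expected & set(called_tools)) / len(expected)
```

For example, `tools_used_score(["search", "calculator"], ["search"])` gives 0.5 under these assumptions, because only one of the two expected tools was called.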

You’ll learn when to use each type, how to set up test datasets in Google Sheets or data tables, and how to track your results over time. I also explain best practices such as changing only one variable at a time, documenting your prompts and model settings, and building proper datasets with enough examples to validate your workflow with confidence.
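The "one variable at a time" practice can be sketched in a few lines of Python. This is only an illustration under assumed names: `RunConfig`, its fields, and `compare_runs` are hypothetical and are not part of the template or of n8n. The idea is to record the settings of each evaluation run and refuse to compare two runs unless exactly one setting differs, so any change in the average score can be attributed to that setting.

```python
from dataclasses import dataclass, asdict


@dataclass(frozen=True)
class RunConfig:
    """Hypothetical record of the settings documented for one evaluation run."""
    model: str
    prompt_version: str
    temperature: float


def changed_fields(baseline: RunConfig, candidate: RunConfig) -> list[str]:
    """List the setting names whose values differ between two runs."""
    before, after = asdict(baseline), asdict(candidate)
    return [name for name in before if before[name] != after[name]]


def average(scores: list[float]) -> float:
    """Mean score over all test cases in a run."""
    return sum(scores) / len(scores)


def compare_runs(baseline, baseline_scores, candidate, candidate_scores) -> dict:
    """Compare two runs, but only if exactly one setting was changed."""
    changed = changed_fields(baseline, candidate)
    if len(changed) != 1:
        raise ValueError(f"expected exactly one changed setting, got {changed}")
    return {
        "changed": changed[0],
        "delta": average(candidate_scores) - average(baseline_scores),
    }
```

A usage example with made-up placeholder values: comparing `RunConfig("model-a", "v1", 0.2)` against `RunConfig("model-a", "v2", 0.2)` reports `prompt_version` as the changed setting, while changing the model and the prompt together raises an error instead of producing a misleading comparison.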

YouTube Video: https://www.youtube.com/watch?v=-4LXYOhQ-Z0

Thank you for downloading our free n8n Evaluations template. If you enjoyed the template and tutorial, please subscribe to the YouTube channel. We upload new AI and n8n content every week.

Connect With Us

Check out the links below. If you need help with this template, want 1:1 coaching, or have an n8n project you want to build, reach out at [email protected]

Free Skool AI/n8n Group: https://www.skool.com/data-and-ai
LinkedIn: https://www.linkedin.com/in/ryan-p-nolan/
Twitter/X: https://x.com/RyanMattDS
Website: https://ryanandmattdatascience.com/