Turn a GitHub knowledge base into a Telegram RAG bot with Qwen via OpenRouter

Workflow preview

100%

Open on n8n.io

$20/month : Unlimited workflows

2500 executions/month

Try free

THE #1 IN WEB SCRAPING

Scrape any website without limits

Try free

HOSTINGER

Early Deal
DISCOUNT 20%

Self-hosted n8n

Unlimited workflows - from $4.99/mo

Try free

#1 hub for scraping, AI & automation

6000+ actors - $5 credits/mo

Try free

1. Workflow Overview

WHAT IT DOES This workflow turns a plain JSON file sitting in a GitHub repository into a fully functional Telegram chatbot with retrieval augmented generation (RAG) — no Pinecone, no Qdrant, no vec...

Best for

Internal Wiki automation workflows
AI RAG automation workflows
advanced n8n builders looking for reusable templates

Tools used

n8n-nodes-base.stickynote, n8n-nodes-base.telegramtrigger, n8n-nodes-base.if, n8n-nodes-base.telegram, n8n-nodes-base.code, n8n-nodes-base.github, @n8n/n8n-nodes-langchain.lmchatopenai, @n8n/n8n-nodes-langchain.chainllm

Source and attribution

This workflow is cataloged by N8N Workflows and links back to its original n8n.io source page by Do Thanh Vinh.

Original n8n.io source

1.1 Workflow description

Title: Turn a GitHub knowledge base into a Telegram RAG bot with Qwen via OpenRouter
Workflow name: Turn a GitHub knowledge base into a Telegram RAG bot with Qwen via OpenRouter

WHAT IT DOES

This workflow turns a plain JSON file sitting in a GitHub repository into a fully functional Telegram chatbot with retrieval-augmented generation (RAG) — no Pinecone, no Qdrant, no vector database, no extra subscription.
A user sends /ask <question> to your Telegram bot. The workflow pulls the knowledge base from GitHub, runs a local keyword-matching engine to find the most relevant chunks, feeds them as context to a Qwen 3 model via OpenRouter, and sends the answer back as a reply to the original message.

HOW IT WORKS

1. Telegram Trigger Listens for messages starting with /ask. Anything shorter than 7 characters is rejected with an error message explaining the correct format.

2. Input Validation If a user sends just /ask without a question, or sends a message shorter than 7 characters, the workflow catches it immediately and replies with a clear instruction: "Please use: /ask <your question>". This prevents unnecessary API calls to GitHub and the LLM, and teaches the user the correct format on the first try.

3. GitHub File Fetch Pulls a JSON file from a GitHub repository using a Personal Access Token. If the file doesn't exist or the token is invalid, the user gets a specific error message instead of a silent failure or a generic n8n error. Same applies when the LLM returns an empty response — the user always gets a message, never silence.

4. Binary Decode & Parse Reads the raw binary output from the GitHub node, decodes it to UTF-8, and parses the JSON array. Each entry in the array is expected to have a "text" field (other field names like "content", "answer", "title" also work).

5. Rough Keyword Match Splits the user's question into individual words, scores every knowledge base entry by counting how many words appear in it, and picks the top 2 matches. This is intentionally simple — no embeddings, no vector math, no external API calls for retrieval. It works well for small-to-medium knowledge bases (up to a few hundred entries) where exact keyword overlap is a reliable signal.

6. LLM Call (Qwen 3 via OpenRouter) Sends the matched context and the original question to Qwen 3 235B through OpenRouter. The prompt instructs the model to answer strictly from the provided context. If the context doesn't contain relevant information, the model says so instead of hallucinating.

7. Output Formatting Strips any thinking tags from the model response, capitalizes the first letter, enforces Telegram's 4000-character limit, and appends a branded footer.

8. Telegram Reply Sends the formatted answer as a reply to the user's original message, so conversations stay threaded.

WHY THIS APPROACH

Most RAG setups require a vector database, an embedding model, and ongoing infrastructure costs. This workflow skips all of that. The trade-off is precision — keyword matching won't catch semantic synonyms the way vector search does. But for knowledge bases that are focused on a specific topic (FAQ, product info, internal docs, local business info), keyword overlap is surprisingly effective and the setup cost is zero.
The LLM handles the fuzzy part. Even if the keyword match pulls in slightly noisy context, the model can usually extract the right answer. This division of labor - simple retrieval plus smart generation - keeps the workflow fast, cheap, and easy to maintain.

SETUP (5-10 minutes)

Create a GitHub Fine-grained Personal Access Token with read access to the repository containing your knowledge base file.
Add the GitHub credential in n8n and configure the "get gh file" node with your repository owner, name, and file path.
Prepare your knowledge base as a JSON array: [ { "text": "your knowledge entry here" }, { "text": "another knowledge entry" } ]
Add your Telegram Bot credential to all four Telegram nodes.
Add your OpenRouter credential to the Qwen model node. You can swap Qwen for any model supported by OpenRouter by changing the model name field.
Activate the workflow and send /ask <your question> to your Telegram bot.

CUSTOMIZATION

Swap the LLM: Change the model field in the "qwen model" node to any OpenRouter-compatible model (GPT-4o, Claude, Gemini, etc.). You can also point to a self-hosted model by changing the base URL.
Change retrieval depth: Edit the "rough match" node to return top 3 or top 5 chunks instead of 2. More chunks = more context for the LLM but higher token cost per request.
Add more knowledge: Just edit the JSON file on GitHub. The workflow fetches fresh data on every request — no redeployment needed.
Multi-language: The workflow works with any language in the knowledge base. The LLM will respond in the same language as the question.

WHAT'S NEXT

This workflow is designed as a foundation. The keyword matching engine works well for small knowledge bases, but here's where it can grow:

Vector search: Replace the rough keyword match node with an embedding-based retrieval step (OpenAI embeddings + Qdrant or Pinecone) for larger knowledge bases where keyword overlap isn't enough.
Multi-source KB: Pull from multiple GitHub files, Notion databases, or Google Sheets instead of a single JSON file.
Conversation memory: Add a short-term memory buffer so the bot remembers the last 3-5 messages in a conversation.
Web interface: Add a Webhook trigger alongside the Telegram trigger to serve a chat widget on any website.
Auto-sync: Watch the GitHub file for changes and rebuild the search index automatically instead of fetching on every request.

The current version is intentionally simple. Simple means it runs cheap, breaks rarely, and is easy to debug. Complexity can be added when the use case demands it.

REQUIREMENTS

n8n instance (self-hosted or cloud)
Telegram Bot token
GitHub Personal Access Token (fine-grained, read-only, scoped to the KB repository)
OpenRouter API key (or OpenAI-compatible endpoint)

1.2 Logical Blocks

This catalog entry is organized from the workflow JSON. The node-level section below shows the executable blocks available for review before importing the template.

2. Block-by-Block Analysis

Block 1 - Sticky Note

Type / Role: n8n-nodes-base.stickyNote - stickyNote
Config choices: Version 1

Block 2 - Sticky Note1

Type / Role: n8n-nodes-base.stickyNote - stickyNote
Config choices: Version 1

Block 3 - Sticky Note2

Type / Role: n8n-nodes-base.stickyNote - stickyNote
Config choices: Version 1

Block 4 - Sticky Note3

Type / Role: n8n-nodes-base.stickyNote - stickyNote
Config choices: Version 1

Block 5 - Sticky Note4

Type / Role: n8n-nodes-base.stickyNote - stickyNote
Config choices: Version 1

Block 6 - Sticky Note5

Type / Role: n8n-nodes-base.stickyNote - stickyNote
Config choices: Version 1

Block 7 - Sticky Note6

Type / Role: n8n-nodes-base.stickyNote - stickyNote
Config choices: Version 1

Block 8 - When Message Received

Type / Role: n8n-nodes-base.telegramTrigger - telegramTrigger
Config choices: Version 1.2

Block 9 - Validate Input

Type / Role: n8n-nodes-base.if - if
Config choices: Version 2.3

Block 10 - Send Telegram Error

Type / Role: n8n-nodes-base.telegram - telegram
Config choices: Version 1.2

Block 11 - Handle GitHub Error

Type / Role: n8n-nodes-base.code - code
Config choices: Version 2

Block 12 - Send GitHub Error

Type / Role: n8n-nodes-base.telegram - telegram
Config choices: Version 1.2

Block 13 - Fetch GitHub File

Type / Role: n8n-nodes-base.github - github
Config choices: Version 1.1

Block 14 - Initialize Variables

Type / Role: n8n-nodes-base.code - code
Config choices: Version 2

Block 15 - Perform Rough Match

Type / Role: n8n-nodes-base.code - code
Config choices: Version 2

Block 16 - OpenAI Qwen Model

Type / Role: @n8n/n8n-nodes-langchain.lmChatOpenAi - lmChatOpenAi
Config choices: Version 1.3

Block 17 - Request AI Assistance

Type / Role: @n8n/n8n-nodes-langchain.chainLlm - chainLlm
Config choices: Version 1.9

Block 18 - Format AI Output

Type / Role: n8n-nodes-base.code - code
Config choices: Version 2

Block 19 - Check AI Output

Type / Role: n8n-nodes-base.if - if
Config choices: Version 2.3

Block 20 - Build Telegram Message

Type / Role: n8n-nodes-base.code - code
Config choices: Version 2

Block 21 - Send Telegram Message

Type / Role: n8n-nodes-base.telegram - telegram
Config choices: Version 1.2

3. Summary Table

Workflow	Turn a GitHub knowledge base into a Telegram RAG bot with Qwen via OpenRouter
Complexity	advanced
Nodes	21
Categories	Internal Wiki, AI RAG
Author	Do Thanh Vinh
Published	09 May 2026

4. Reproducing the Workflow from Scratch

1. Download the workflow JSON

Use the JSON export at /data/workflows/15570/15570.json as the source template for this automation.
2. Import the template into n8n

Open n8n, import the downloaded JSON, and review each node before activating the workflow.
3. Configure credentials and variables

Replace placeholder credentials, API keys, webhook URLs, account IDs, and environment-specific values with your own settings.
4. Test with sample data

Run the workflow manually or in a staging workspace, inspect node output, and confirm downstream systems receive the expected data.
5. Activate and monitor

Enable the workflow only after testing, then monitor executions, errors, and rate limits during the first production runs.

5. General Notes & Resources

Review imported nodes carefully before activation. This catalog entry is intended to help you inspect the workflow structure, understand required services, and find related templates faster.

Node names, credentials, schedules, webhook paths, and external service limits may need adjustment for your workspace.

Download workflow JSON Original n8n.io source Internal Wiki workflows AI RAG workflows

Frequently asked questions

What does Turn a GitHub knowledge base into a Telegram RAG bot with Qwen via OpenRouter do?

What do I need before importing this workflow?

Review the workflow JSON, configure any required credentials in n8n, and test the automation in a safe workspace before using it in production.

Can I customize this workflow?

Yes. Use the block-by-block analysis and the downloadable JSON to inspect each node, then adjust credentials, prompts, schedules, filters, or destinations for your Internal Wiki, AI RAG use case.

Do Thanh Vinh

3 workflows

Nodes

n8n-nodes-base.stickynote n8n-nodes-base.telegramtrigger n8n-nodes-base.if n8n-nodes-base.telegram n8n-nodes-base.code n8n-nodes-base.github @n8n/n8n-nodes-langchain.lmchatopenai @n8n/n8n-nodes-langchain.chainllm

Complexity

advanced

Published 09 May 2026

Likes 0

View on n8n.io Download Workflow

Install path: /data/workflows/15570/15570.json

Share Your Workflow

Have a useful automation to share? Publish it and help the community.

Submit Your Template How to Submit

Related Workflows

Build a RAG knowledge base from PDFs with Gemini, Supabase and Google Sheets

## Quick Overview This workflow ingests educational PDF URLs from Google Sheets, extracts and chunks their text, generates embeddings with Google Gemini, and stores them in a Supabase pgvector table for retrieval, while also exposing a public chat webhook that answers questions using Gemini and the same Supabase knowledge base. ## How it works 1. Runs every hour on a schedule and reads rows from a Google Sheets document, keeping only entries where the status is empty. 2. Processes each queued URL one at a time and routes it based on whether it is a seraj-uae.com page or a direct Google Drive file link. 3. For seraj-uae.com URLs, fetches the HTML page to extract the PDF download link (or embedded Google Drive file ID) and the document title. 4. Downloads the PDF from either the source website via HTTP or from Google Drive, waits briefly, and extracts text from the PDF binary. 5. Cleans and validates the extracted text, then splits it into overlapping chunks and generates embeddings using Google Gemini. 6. Inserts the embedded chunks into a Supabase pgvector table with file/title/source metadata and updates the Google Sheets row to “Embedded” or “Not Text Based PDF” if no text is found. 7. Separately, exposes a public n8n Chat webhook that receives student questions and uses Gemini Flash with Supabase vector retrieval to return answers. ## Setup 1. Create a Supabase project with pgvector enabled, create the seraj_documents table, and add the match_seraj_documents function used for vector search. 2. Add Supabase API credentials in n8n and ensure the table name (seraj_documents) and query function name (match_seraj_documents) match your Supabase setup. 3. Add Google Sheets OAuth2 credentials and update the Google Sheet document ID/sheet reference, ensuring columns include source_url, status, and row_number. 4. Add Google Drive OAuth2 credentials for downloading PDFs hosted in Drive. 5. Add a Google Gemini (PaLM) API credential for embeddings and Gemini Flash, then activate the workflow and copy the public chat webhook URL for your website chat widget.

View

Build a Slack knowledge graph with Claude, Neo4j and Google Sheets

## Quick Overview This workflow ingests chat messages via a webhook or a 15-minute schedule, filters out low-signal content, uses Anthropic Claude to extract entities and relationships, and stores the resulting knowledge graph in Neo4j while appending an audit log row to Google Sheets. ## How it works 1. Receives a chat message via an n8n webhook or runs every 15 minutes to process polled Slack data. 2. Normalizes the incoming payload into consistent fields like channel, sender, timestamp, and message text. 3. Uses a Python script to tag message signals (questions, decisions, action items) and filter out noise such as greetings, bots, and very short messages. 4. Sends each relevant message to Anthropic Claude to extract entities, relationships, a summary, message type, and importance. 5. Sends the extracted graph to Anthropic Claude again to add implicit relationships, weights, decision-chain metadata, and a thread category. 6. Merges both AI passes into a single nodes-and-edges knowledge graph, validates references and edge weights, and drops invalid graphs. 7. Upserts the graph into Neo4j via its transactional HTTP endpoint, appends an audit row to Google Sheets, and returns a JSON success response to the webhook caller. ## Setup 1. Add an Anthropic API credential and select the Claude model used by the two AI steps. 2. Configure Neo4j access by replacing the Neo4j HTTP endpoint URL and setting up HTTP Basic Auth credentials for your Neo4j instance. 3. Configure Google Sheets OAuth2 credentials and replace the placeholder Google Sheet ID, sheet tab name/range (KnowledgeGraph), and any required API permissions. 4. If using the webhook ingest path, copy the production webhook URL from n8n and configure your Slack/app integration to POST message payloads to `/chat-ingest` with the expected fields (text, channel, sender, timestamps).

View

Answer voice queries from a webhook over Google Drive docs using GPT-4o-mini and Supabase

## Quick overview Placetel AI – RAG Voice Assistant with Google Drive & Supabase ## How it works 1. Runs on a daily schedule at 02:00 or via manual start to reindex documents. 2. Lists files from a specified Google Drive folder and iterates through each file. 3. Downloads each Google Drive file, loads its text content, creates embeddings with OpenAI, and stores the resulting chunks in a Supabase vector table. 4. Receives a question via a POST webhook with a JSON body containing `chatInput`. 5. Generates an answer with GPT-4o-mini by semantically retrieving relevant passages from the Supabase vector store using the same OpenAI embeddings model. 6. Returns the generated, source-cited response to the webhook caller for voice output. ## Setup 1. Add Google Drive OAuth2 credentials and replace `DEINE_ORDNER_ID` in the Drive query with the folder you want to index. 2. Add an OpenAI API credential and ensure the same embeddings model/settings are used for both indexing and querying. 3. Create/configure a Supabase project with a `documents` table and the `match_documents` RPC/query used for vector search, then add your Supabase credentials. 4. Copy the webhook URL from the webhook trigger and configure your calling system to POST `{ "chatInput": "..." }` to it.

View

Need Custom Automation?

Get help designing a custom n8n workflow that connects your stack and fits your process.

Turn a GitHub knowledge base into a Telegram RAG bot with Qwen via OpenRouter

Workflow preview

1. Workflow Overview

Best for

Tools used

Source and attribution

1.1 Workflow description

WHAT IT DOES

HOW IT WORKS

WHY THIS APPROACH

SETUP (5-10 minutes)

CUSTOMIZATION

WHAT'S NEXT

REQUIREMENTS

1.2 Logical Blocks

2. Block-by-Block Analysis

Block 1 - Sticky Note

Block 2 - Sticky Note1

Block 3 - Sticky Note2

Block 4 - Sticky Note3

Block 5 - Sticky Note4

Block 6 - Sticky Note5

Block 7 - Sticky Note6

Block 8 - When Message Received

Block 9 - Validate Input

Block 10 - Send Telegram Error

Block 11 - Handle GitHub Error

Block 12 - Send GitHub Error

Block 13 - Fetch GitHub File

Block 14 - Initialize Variables

Block 15 - Perform Rough Match

Block 16 - OpenAI Qwen Model

Block 17 - Request AI Assistance

Block 18 - Format AI Output

Block 19 - Check AI Output

Block 20 - Build Telegram Message

Block 21 - Send Telegram Message

3. Summary Table

4. Reproducing the Workflow from Scratch

1. Download the workflow JSON

2. Import the template into n8n

3. Configure credentials and variables

4. Test with sample data

5. Activate and monitor

5. General Notes & Resources

Frequently asked questions