Automated academic paper metadata & variable extraction with Gemini to Google Sheets
Workflow preview
$20/month : Unlimited workflows
2500 executions/month
THE #1 IN WEB SCRAPING
Scrape any website without limits
HOSTINGER
Early Deal
DISCOUNT 20% Try free
DISCOUNT 20%
Self-hosted n8n
Unlimited workflows - from $4.99/mo
#1 hub for scraping, AI & automation
6000+ actors - $5 credits/mo
Important notice
This workflow is provided as-is. Please review and test before using in production.
Overview
πIn the social and behavioral sciences (e.g., psychology, sociology, economics, management), researchers and students often need to normalize academic paper metadata and extract variables before any literature review or meta-analysis.
π§©This workflow automates the busywork. Using an LLM, it processes CSV/XLSX/XLS files (exported from WoS, Scopus, EndNote, Zotero, or your own spreadsheets) into normalized metadata and extracted variables, and writes a neat table to Google Sheets.
π Example Google Sheet:Β click me
π₯ Who is this for?
- π Undergraduate and graduate students or researchers in soft-science fields (psychology, sociology, economics, business)
- β±οΈ People who donβt have time to read full papers and need quick overviews
- π Anyone who wants to automate academic paper metadata normalization and variable extraction to speed up a literature review
βοΈ How it works
- π€ Upload an academic paper file (CSV/XLSX/XLS) in chat.
- π The workflow creates a Google Sheets spreadsheet with two tabs:
CheckpointandFinalResult. - π A structured-output LLM normalizes core metadata (title, abstract, authors, publication date, source) from the uploaded file and writes it to
Checkpoint; π§ a Gmail notification is sent when finished. - π§ͺ A second structured-output LLM uses the metadata above to extract variables (Independent Variable, Dependent Variable) and writes them to
FinalResult; π§ youβll get a second Gmail notification when done.
π οΈ How to set up
π Credentials
- Google Sheets OAuth2 (read/write)
- Gmail OAuth2 (send notifications)
- Google Gemini (or any LLM you prefer)
π Quick start
- Connect Google Sheets, Gmail, and Gemini (or your LLM) credentials.
- Open
File Upload Triggerβ upload your CSV/XLSX/XLS file and type a name in chat (used as the Google Sheets spreadsheet title). - Watch your inbox for status emails and open the Google Sheets spreadsheet to review Checkpoint and FinalResult.
π Customization
- ποΈ Journal lists: Edit the Journal Rank Classifier code node to add/remove titles. The default list is for business/management journalsβswap it for a list from your own field.
- π Notifications: Replace Gmail with Slack, Teams, or any channel you prefer.
- π§ LLM outputs: Need different metadata or extracted data? Edit the LLMβs system prompt and Structured Output Parser.
π Note
- π Make sure your file includes abstracts. If the academic paper data you upload doesnβt contain an abstract, the extracted results will be far less useful.
- π§© CSV yields no items? Encoding mismatches can break the workflow. If this happens, convert the CSV to
.xlsor.xlsxand try again.
π© Help
Contact: [email protected]