Research Productivity

Ultimate Guide to tidedra/zotero-arxiv-daily in 2025

Stay ahead in your field. Our 2025 guide to zotero-arxiv-daily shows you how to automate your literature search and never miss a key paper again.

D

Dr. Alistair Finch

Computational biologist and productivity enthusiast focused on streamlining the research workflow.

7 min read3 views

The Ultimate Guide to tidedra/zotero-arxiv-daily in 2025

Feeling overwhelmed by the daily flood of new research on arXiv? You're not alone. In 2025, the pace of discovery is faster than ever, and staying on top of the literature can feel like a full-time job. What if you could automate the entire process, ensuring you never miss a critical paper in your field again?

Enter tidedra/zotero-arxiv-daily, a powerful and elegant solution that acts as your personal research assistant. This brilliant tool uses a simple GitHub Action to automatically scan arXiv for papers matching your interests and adds them directly to your Zotero library. It’s the set-it-and-forget-it system you've been dreaming of. This guide will walk you through everything you need to know to get it running and supercharge your research workflow for the year ahead.

What Exactly is zotero-arxiv-daily?

At its core, zotero-arxiv-daily is a script powered by GitHub Actions. Let's break that down:

  • Zotero: Your favorite open-source reference manager, where you organize papers, citations, and research notes.
  • arXiv: The essential preprint server for fields like physics, mathematics, computer science, and quantitative biology, where new research appears daily.
  • GitHub Actions: An automation platform built into GitHub. It lets you run code automatically in response to events, like on a daily schedule.

This tool, created by GitHub user tidedra, combines these three things. It's a pre-configured workflow that runs on a schedule (typically daily), searches arXiv for new preprints based on keywords and categories you define, and then automatically adds any new finds directly into a specific Zotero collection. It’s brilliant in its simplicity and powerful in its impact.

Why You Need This in Your 2025 Research Workflow

The manual process of checking for literature is tedious and prone to error. You might forget to check one day, use inconsistent search terms, or simply get distracted. Automating this chore isn't just a convenience; it's a strategic advantage. Let's compare the old way versus the new way.

TaskManual Method (The Old Way)Automated Method (The zotero-arxiv-daily Way)
Daily CheckRemember to visit arXiv.org every day.The script runs automatically on a schedule.
SearchingManually type in various keywords and filter by date.Uses a predefined, consistent set of keywords and categories.
ReviewingSkim through dozens of titles and abstracts on the website.Papers appear in a dedicated Zotero collection, ready for review.
SavingClick the Zotero browser connector for each interesting paper.All matching papers are saved automatically with metadata.
Time Spent15-30 minutes per day.~2 minutes to set up, 0 minutes per day.

By automating, you free up valuable time and mental energy that can be better spent on actual research—reading, thinking, and writing. You create a consistent, reliable pipeline of relevant literature, ensuring you're always aware of the latest advancements in your niche.

Getting Started: A Step-by-Step Installation Guide

Don't be intimidated by GitHub! This process is surprisingly straightforward, even if you're not a developer. Follow these steps carefully.

Step 1: Prerequisites

Before you start, make sure you have:

  1. A GitHub account. If you don't have one, sign up for free.
  2. A Zotero account and the Zotero desktop app installed.
  3. A dedicated Zotero collection where you want your new papers to go. Create a new one and call it something like "arXiv Daily Digest".

Step 2: Fork the Repository

Go to the tidedra/zotero-arxiv-daily repository on GitHub. In the top-right corner, click the Fork button. This creates a personal copy of the repository under your own GitHub account.

Step 3: Generate a Zotero API Key

You need to give the script permission to add items to your library.

  1. Go to your Zotero API settings page.
  2. Click Create new private key.
  3. Give the key a description (e.g., "arXiv Daily Script").
  4. Under Personal Library, check "Allow write access".
  5. Click Save Key.
  6. Important: Copy the generated key immediately. You will not be able to see it again.

Step 4: Find Your Zotero User ID and Collection ID

  • Your User ID is also on the API settings page, right above where you created the key.
  • To find the Collection ID, open the Zotero desktop app, right-click on the collection you created (e.g., "arXiv Daily Digest"), and select "Generate Report from Collection". In the report, look at the URL. The string of letters and numbers after /collections/ is your Collection ID.

Step 5: Add Your Credentials as GitHub Secrets

Secrets are the secure way to store sensitive information in GitHub.

  1. In your forked repository on GitHub, go to Settings > Secrets and variables > Actions.
  2. Click the New repository secret button for each of the following:
    • Name: ZOTERO_USER_ID, Secret: Your Zotero User ID.
    • Name: ZOTERO_COLLECTION_ID, Secret: Your Zotero Collection ID.
    • Name: ZOTERO_API_KEY, Secret: The API key you generated.

Step 6: Configure Keywords and Enable the Action

  1. Back in your forked repository, find the config.yaml file and click the pencil icon to edit it. This is where you tell the script what to search for. We'll cover this more in the next section. For now, add a test keyword.
  2. After editing, go to the Actions tab in your repository. You'll see a workflow called "arXiv daily".
  3. Click the Enable workflow button.

That's it! The script is now set to run on its daily schedule. You can also trigger a manual run from the Actions tab to test it immediately.

Configuration Deep Dive: Tailoring Your Daily Digest

The real power of this tool lies in the config.yaml file. This is where you fine-tune your search to get highly relevant papers and avoid noise. Let's look at a well-commented example:

# A list of arXiv categories you want to search in. # Find the full list on the arXiv website. e.g., 'cs.AI' for AI. categories:  - 'cs.LG'  # Machine Learning  - 'stat.ML' # Statistics Machine Learning# A list of keywords to search for in titles and abstracts. # Use quotes for multi-word phrases. keywords:  - 'reinforcement learning'  - 'causal inference'  - 'graph neural network'  - 'self-supervised'# Keywords to exclude. If a paper contains any of these, it will be skipped. # This is great for filtering out noise. exclude_keywords:  - 'review'  - 'survey'  - 'medical'# The maximum number of results to fetch from arXiv in one run. # Keep it reasonable to avoid API limits. max_results: 50# The field to sort results by. Options: 'relevance', 'lastUpdatedDate', 'submittedDate' sort_by: 'submittedDate'# The sort order. Options: 'ascending', 'descending' sort_order: 'descending'

Take time to craft your keywords. Think about the specific terms, techniques, and authors that define your research area. Using more specific, multi-word phrases in quotes will yield much better results than broad, single-word terms.

Best Practices & Pro Tips for 2025

You're set up, but let's take it to the next level.

  1. Use Boolean Operators: The underlying arXiv API supports boolean operators. You can structure your keywords like '("graph neural network" OR GNN) AND causal' to build highly specific queries. Test these directly on the arXiv advanced search page first.
  2. Create Multiple Workflows: Have two distinct research projects? Don't mix them! Fork the repository a second time (giving it a new name) and create a separate, dedicated workflow with its own Zotero collection and keywords. This keeps your projects organized.
  3. Integrate with Note-Taking Apps: This is the ultimate power move. Use a Zotero plugin like zotero-obsidian-citations or the built-in Markdown export to connect your new literature to your 'second brain' in Obsidian, Logseq, or Roam Research. Your daily digest papers can be seamlessly turned into literature notes.
  4. Prune Regularly: Your Zotero collection will grow quickly. Once a week, spend 15 minutes triaging the new additions. Read the abstracts, and move the truly interesting papers to a "To Read" collection, and delete the rest. This keeps your digest collection as a manageable inbox.

Troubleshooting Common Issues

  • My Action failed to run: Check your GitHub Secrets. A typo here is the most common culprit. Also, ensure your config.yaml file has valid syntax. Use a YAML linter online if you're unsure.
  • No papers are being added: Your keywords might be too narrow, or there were simply no new papers on a given day. Try broadening your keywords or running the action manually on a different day to test. You can also check the logs in the Actions tab for any error messages from the arXiv API.
  • Too many irrelevant papers are being added: Your keywords are too broad. Refine them by using more specific phrases and make liberal use of the exclude_keywords list to filter out common but irrelevant terms.

Conclusion: The Future of Your Research Workflow is Automated

In 2025, the most valuable resource for any researcher is focused time. By delegating the repetitive task of literature discovery to a tool like tidedra/zotero-arxiv-daily, you reclaim that time. You transform a daily chore into a seamless, automated background process that feeds you a curated list of the most relevant new science.

You are no longer just keeping up; you are staying ahead. So go ahead, fork the repository, spend a few minutes setting it up, and welcome your new automated research assistant. Your future self will thank you.