Slack AI Paper Bot

AI-powered tool that curates personalized ML paper digests (and future podcast-style content) for researchers.

Roadmap:

GitHub Actions

Requirements

A machine with at least 16GB of RAM is sufficient. It must be running at the scheduled time so it can send messages to Slack. An Apple Silicon (MPS) or CUDA GPU is recommended to accelerate embedding generation and vector similarity calculation.

Features

Paper fetching: Fetch recent papers from arXiv by category (with optional affiliations).
Semantic filtering: SPECTER2-based matching against a user profile (topics, keywords, past papers).
Slack digests: Daily top papers posted to a Slack channel, with reaction-based profile updates.
LLM summary: Optional GPT-4o-powered digest summary footer for Slack.
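
The semantic filtering step can be illustrated with a minimal sketch. It assumes that papers and the profile are already embedded as plain vectors and that similarity is measured with cosine similarity; the function names, the toy vectors, and the 0.3 threshold are illustrative assumptions, not the bot's actual implementation (which uses SPECTER2 embeddings).

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_papers(profile_vec, paper_vecs, top_k=5, threshold=0.3):
    """Return up to top_k (paper_id, score) pairs scoring at least threshold, best first.

    paper_vecs maps a paper id to its embedding (hypothetical data layout).
    """
    scored = [(pid, cosine_similarity(profile_vec, vec)) for pid, vec in paper_vecs.items()]
    kept = [(pid, score) for pid, score in scored if score >= threshold]
    kept.sort(key=lambda item: item[1], reverse=True)
    return kept[:top_k]


if __name__ == "__main__":
    # Toy 2-d vectors stand in for real embeddings.
    papers = {"a": [1.0, 0.0], "b": [0.0, 1.0], "c": [1.0, 1.0]}
    print(rank_papers([1.0, 0.0], papers, top_k=2))
```

Papers below the threshold are dropped before the top_k cut, so a quiet day can yield fewer than top_k results.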

Install

uv venv
source .venv/bin/activate
uv pip install -e .

Configure Slack bot

Create Slack app
- Go to https://api.slack.com/apps
- Create a new app from scratch
- In OAuth & Permissions, add bot scopes: chat:write, chat:write.public, channels:history, groups:history, reactions:read
- Install the app and copy the Bot User OAuth Token (xoxb-...)
Find channel ID
- Click the channel title in Slack to open the channel details, then copy the Channel ID (C...) from the bottom of that page
Create config

cp config/config.example.json config/config.json

Edit config/config.json:

{
  "bot_token": "xoxb-your-token-here",
  "channel_id": "C0123456789",
  "profile_path": "profiles/efficient_ml.json",
  "categories": ["cs.LG", "cs.AI", "cs.CL"],
  "days": 1,
  "top_k": 5,
  "openai_api_key": "sk-... (optional for summaries)"
}

If you don't want LLM-generated summaries, remove openai_api_key from the config.
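
As a rough illustration of how such a config could be read, the sketch below validates the keys shown above and treats openai_api_key as optional. The helper names, the choice of which keys are required, and the fallback defaults for days and top_k are assumptions made for this example, not the bot's actual loader.

```python
import json
from pathlib import Path

# Assumption: these four keys are treated as mandatory in this sketch.
REQUIRED_KEYS = ("bot_token", "channel_id", "profile_path", "categories")


def validate_config(raw):
    """Return a copy of the config with defaults filled in, or raise ValueError."""
    config = dict(raw)
    missing = [key for key in REQUIRED_KEYS if key not in config]
    if missing:
        raise ValueError("missing config keys: " + ", ".join(missing))
    # Assumption: fall back to the values used in the example config.
    config.setdefault("days", 1)
    config.setdefault("top_k", 5)
    # Summaries are only attempted when an API key is present.
    config["summaries_enabled"] = bool(config.get("openai_api_key"))
    return config


def load_config(path="config/config.json"):
    """Read the JSON config file and validate it."""
    return validate_config(json.loads(Path(path).read_text(encoding="utf-8")))
```

With this shape, deleting openai_api_key simply turns summaries_enabled off rather than causing an error.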

Paper Interest Profiles

Paper interest profiles live in profiles/*.json. See profiles/example_profile.json or profiles/efficient_ml.json. Each profile has the following fields:

name: Profile name.
topics: Free-text research interests.
keywords: Short key phrases to emphasize.
past_papers: Optional list of {title, abstract, arxiv_id}.
preferred_authors: Optional list of author names.

Reaction Learning: Papers you react to with :fire: (🔥) in Slack are automatically added to your profile's past_papers. The bot checks for reactions on papers posted within the last 3 days before each run and updates your profile to improve future recommendations.
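
The reaction-learning rule above can be sketched as follows. The `ts` and `reactions` fields mirror the shape of Slack message objects, but the `paper` field attached to each entry, the function names, and the deduplication by arxiv_id are hypothetical choices for illustration only.

```python
import time

FIRE = "fire"          # Slack reaction name for the fire emoji
WINDOW_DAYS = 3        # look-back window described above
SECONDS_PER_DAY = 86400


def has_fire(message):
    """True if the message carries at least one :fire: reaction."""
    return any(r.get("name") == FIRE for r in message.get("reactions", []))


def update_profile(profile, posted, now=None):
    """Append recently fire-reacted papers to profile['past_papers'].

    posted is a list of dicts with 'ts' (epoch seconds as a string),
    'reactions', and a hypothetical 'paper' record
    ({title, abstract, arxiv_id}). Returns the number of papers added.
    """
    now = time.time() if now is None else now
    cutoff = now - WINDOW_DAYS * SECONDS_PER_DAY
    past = profile.setdefault("past_papers", [])
    known = {p.get("arxiv_id") for p in past}
    added = 0
    for entry in posted:
        if float(entry["ts"]) < cutoff or not has_fire(entry):
            continue  # too old, or nobody reacted with fire
        paper = entry["paper"]
        if paper["arxiv_id"] in known:
            continue  # already part of the profile
        past.append(paper)
        known.add(paper["arxiv_id"])
        added += 1
    return added
```

Skipping papers that are already in past_papers keeps repeated runs within the 3-day window from adding the same paper twice.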

Core commands

# **MAIN** Run Slack bot
python -m ai_pod.slack_bot --dry-run  # test without Slack configuration
python -m ai_pod.slack_bot  # full run

# Import historic arXiv links from a channel into paper history to avoid duplicates
python -m ai_pod.slack_bot --import-from-channel --import-channel C01234ABCDE --import-days 30

Scheduling (Daily Paper Digest)

crontab -e

# Daily 8am digest
0 8 * * * cd /path/to/repo && .venv/bin/python -m ai_pod.slack_bot >> logs/slack_bot.log 2>&1

Other Debug Commands:

# Fetch papers from arXiv (shows affiliations when present)
python -m ai_pod.get_papers -c cs.LG cs.AI -d 7 -n 20 --show-affiliations

# Filter papers by profile (fetching from arXiv)
python -m ai_pod.filter_papers -p profiles/example_profile.json --fetch -c cs.LG -d 3 -t 0.3

# Filter using an existing cached papers file
python -m ai_pod.filter_papers -p profiles/example_profile.json --papers-cache data/papers_*.json

Data & caching

data/posted_papers.json: Papers already posted to Slack (with timestamps, titles, abstracts).
data/paper_embeddings.json: Cached SPECTER2 embeddings for papers.
data/profile_embeddings_*.json: Cached profile embeddings by profile name.

Caches are used automatically and refreshed as needed.
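
A minimal sketch of how an embedding cache of this kind might behave is shown below. It assumes the cache is a JSON object keyed by arXiv ID and that `embed_fn` is any callable turning text into a vector; both the layout and the helper names are assumptions, not the actual format of data/paper_embeddings.json.

```python
import json
from pathlib import Path


def load_cache(path):
    """Load a JSON cache file, or start with an empty cache if it does not exist."""
    p = Path(path)
    return json.loads(p.read_text(encoding="utf-8")) if p.exists() else {}


def save_cache(path, cache):
    """Write the cache back to disk, creating the parent directory if needed."""
    p = Path(path)
    p.parent.mkdir(parents=True, exist_ok=True)
    p.write_text(json.dumps(cache), encoding="utf-8")


def embed_with_cache(papers, cache, embed_fn):
    """Return (embeddings in input order, number of newly computed embeddings).

    Only papers whose arxiv_id is absent from the cache are passed to embed_fn.
    """
    new_count = 0
    for paper in papers:
        if paper["arxiv_id"] not in cache:
            cache[paper["arxiv_id"]] = embed_fn(paper["title"] + " " + paper["abstract"])
            new_count += 1
    return [cache[paper["arxiv_id"]] for paper in papers], new_count
```

Because the model is only called for cache misses, re-running the bot over an overlapping date range costs little extra embedding time.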

Troubleshooting

Rate Limiting (HTTP 429 errors)

The bot tries several sources when fetching paper metadata, in this order:

Official arXiv API (primary, with retry logic)
arxiv-txt.org (fallback #1)
arXiv HTML scraping (fallback #2)

If you hit rate limits during channel import, the bot will:

Wait and retry with increasing backoff (5s, 10s, 15s)
Automatically try fallback sources
Continue processing remaining papers
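
The retry-then-fallback behavior described above could look roughly like this sketch. The `RateLimited` exception, the `sources` list of (name, callable) pairs, and the decision to retry every source (rather than only the primary one) are assumptions for illustration; only the 5s, 10s, 15s delays come from the description above.

```python
import time


class RateLimited(Exception):
    """Raised by a fetcher when the remote side answers with HTTP 429 (hypothetical)."""


def fetch_with_fallback(arxiv_id, sources, delays=(5, 10, 15), sleep=time.sleep):
    """Try each (name, fetch) source in order, retrying on rate limits.

    Returns (source_name, metadata), or (None, None) if every source fails,
    so the caller can continue with the remaining papers.
    """
    for name, fetch in sources:
        for attempt in range(len(delays) + 1):
            try:
                return name, fetch(arxiv_id)
            except RateLimited:
                if attempt == len(delays):
                    break  # retries exhausted, move on to the next source
                sleep(delays[attempt])
    return None, None
```

Passing `sleep` as a parameter makes it possible to exercise the retry logic in tests without actually waiting.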

For large imports, consider using --no-fetch-metadata for a faster initial import.

Missing Papers in Digest

Check data/posted_papers.json - papers already posted won't appear again
Use --allow-duplication flag for testing
Adjust top_k in config to see more papers

Profile Not Improving

Make sure reactions:read scope is enabled in your Slack app
React with 🔥 to papers you find interesting
Check logs to see if reactions are being detected

Architecture (high level)

ai_pod.get_papers: arXiv fetching + XML parsing + caching.
ai_pod.filter_papers: SPECTER2 model loading, profile & paper embeddings, similarity scoring.
ai_pod.slack_bot: Orchestrates fetch → filter → dedupe → post, plus optional summary.
ai_pod.slack_utils: Slack formatting, posting, reaction-based profile updates, channel import.
ai_pod.posted_papers: Tracking of posted papers with metadata for deduplication.
ai_pod.summary: GPT-4o-based digest summarization with contrastive analysis, using the OpenAI Python SDK.
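
To make the fetch → filter → dedupe → post flow concrete, here is a small sketch in which each stage is passed in as a callable. The function name `run_digest` and its parameters are hypothetical and do not reflect the real signatures in ai_pod.slack_bot.

```python
def run_digest(fetch, score, posted_ids, post, top_k=5, allow_duplication=False):
    """Run one digest cycle and return the papers that were posted.

    fetch() -> list of paper dicts with an 'arxiv_id' key
    score(paper) -> relevance score (higher is better)
    posted_ids -> set of arxiv_ids already posted; updated in place
    post(papers) -> sends the selected papers (e.g. to Slack)
    """
    papers = fetch()
    ranked = sorted(papers, key=score, reverse=True)              # filter / rank
    if not allow_duplication:
        ranked = [p for p in ranked if p["arxiv_id"] not in posted_ids]  # dedupe
    selected = ranked[:top_k]
    if selected:
        post(selected)
        posted_ids.update(p["arxiv_id"] for p in selected)
    return selected
```

Deduplicating before the top_k cut means already-posted papers never take up a slot in the digest.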

Acknowledgements

Some insights came from arXiv_recbot and ArxivDigest.

License

MIT License.
