- Oct 11, 2025 — Our paper is out on arXiv (2510.06579) and accepted to EMNLP 2025 (Demo Track) 🎉
Tiny-Scientist is a lightweight, user-friendly framework for automating the entire lifecycle of scientific research—from ideation to implementation, writing, and review. Designed for flexibility, it integrates smoothly with your favorite LLMs and search tools.
- 🧠 Think: Generate structured research ideas from an intent string.
- 💻 Code: Automatically generate and run experiments based on the idea.
- ✍️ Write: Convert your results and ideas into a conference-style paper.
- 📝 Review: Review any form of paper and output structured feedback in JSON.
- 🔧 MCP: Integrates with Anthropic's extensible Model Context Protocol for tool use.
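Because `review` emits structured JSON, downstream scripts can consume the feedback programmatically. A minimal sketch of parsing such output — note the field names here (`summary`, `strengths`, `weaknesses`, `score`) are illustrative placeholders, not TinyScientist's actual review schema:

```python
import json

# Hypothetical example of structured review feedback; the actual
# fields produced by TinyScientist's reviewer may differ.
raw = '''
{
  "summary": "Benchmarks adaptive step sizes on a convex quadratic.",
  "strengths": ["Clear experimental setup"],
  "weaknesses": ["No baselines beyond SGD"],
  "score": 6
}
'''

review = json.loads(raw)
print(review["score"])            # numeric score
print(len(review["weaknesses"]))  # number of weakness items
```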
Our codebase is structured around three core components to support an extensible framework: core, tools, and formatters. The core module provides essential functionalities, tools enhance and extend these core capabilities, and formatters handle input/output tasks such as LaTeX template rendering.
Install from PyPI:

```bash
pip install tiny-scientist
```

Or install from source:

```bash
# Create conda environment
conda create -n tiny-scientist python=3.10
conda activate tiny-scientist

# Install Poetry
curl -sSL https://install.python-poetry.org | python3
export PATH="$HOME/.local/bin:$PATH"

# Install dependencies
poetry install
```

TinyScientist now uses Claude Agent SDK-backed stages by default (think, code, write, review). The OpenAI Agents SDK remains available as an explicit runtime choice.
Set your model API key:

```bash
export OPENAI_API_KEY=your-key-here
# or DEEPSEEK_API_KEY / ANTHROPIC_API_KEY depending on your model
```

If you installed from source and see `ModuleNotFoundError: No module named 'claude_agent_sdk'`, install:

```bash
pip install claude-agent-sdk
```

If you want the OpenAI runtime as well, install:

```bash
pip install openai-agents
```

smolagents is no longer required for the main TinyScientist runtime; it is only needed for legacy compatibility helpers.
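To check which agent runtime packages are importable before picking a backend, a quick stdlib-only sketch (assuming the Claude SDK imports as `claude_agent_sdk` per the error message above, and the openai-agents package is commonly imported as `agents`):

```python
import importlib.util

def sdk_available(module_name: str) -> bool:
    """Return True if the given package is importable in this environment."""
    return importlib.util.find_spec(module_name) is not None

# The default runtime needs claude_agent_sdk; the OpenAI runtime's
# import name is assumed here to be "agents".
for name in ("claude_agent_sdk", "agents"):
    print(name, "installed" if sdk_available(name) else "missing")
```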
Tooling runs in strict provider mode (no automatic fallback). If a selected provider is unavailable, the tool call fails.
Core switches:

```bash
# Web search provider: duckduckgo | tavily | serpapi | brave
export WEB_SEARCH_PROVIDER=duckduckgo

# Diagram backend: llm_svg | nano-banana
export DRAWER_BACKEND=llm_svg
```

Optional provider keys:

```bash
export S2_API_KEY=...              # Semantic Scholar tools
export NEWSAPI_KEY=...             # news_search
export TAVILY_API_KEY=...          # if WEB_SEARCH_PROVIDER=tavily
export SERPAPI_API_KEY=...         # if WEB_SEARCH_PROVIDER=serpapi
export BRAVE_SEARCH_API_KEY=...    # if WEB_SEARCH_PROVIDER=brave

# nano-banana image generation (optional)
export NANO_BANANA_MODEL=gpt-image-1
```

The minimal Python API is still the same:
```python
from tiny_scientist import TinyScientist

scientist = TinyScientist(model="claude-3-5-sonnet-20241022", budget=1.0)

idea = scientist.think(
    intent="Benchmarking adaptive step size strategies using a convex quadratic optimization function"
)

status, experiment_dir = scientist.code(idea=idea)
if status:
    pdf_path = scientist.write(idea=idea, experiment_dir=experiment_dir)
    review = scientist.review(pdf_path=pdf_path)
```

Use `agent_sdk` to select the runtime explicitly when you want the OpenAI backend:

```python
scientist = TinyScientist(model="gpt-4o", agent_sdk="openai")
```

Supported values today are `claude` and `openai`. The default is `claude`.
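If you want to drive the runtime choice from an environment variable, a small helper can validate the selection against the two supported values. This is a sketch, not part of the TinyScientist API, and the variable name `TINY_SCIENTIST_AGENT_SDK` is hypothetical:

```python
import os

SUPPORTED_SDKS = {"claude", "openai"}  # per the docs; "claude" is the default

def resolve_agent_sdk(env=None) -> str:
    """Pick the agent runtime from TINY_SCIENTIST_AGENT_SDK (a hypothetical
    variable name), falling back to the documented default."""
    env = os.environ if env is None else env
    choice = env.get("TINY_SCIENTIST_AGENT_SDK", "claude").lower()
    if choice not in SUPPORTED_SDKS:
        raise ValueError(f"unsupported agent_sdk: {choice!r}")
    return choice

# Then construct the scientist with the resolved runtime, e.g.:
# scientist = TinyScientist(model="gpt-4o", agent_sdk=resolve_agent_sdk())
```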
TinyScientist now supports project skills from both `.claude/skills` and `.agents/skills`.

Claude backend:

- Uses Claude's native filesystem skill loading via `.claude/skills`
- Loads Claude skills from both user and project settings sources
- Does not inject `SKILL.md` contents into prompts; skills are consumed only through Claude's native `Skill` tool flow
- Uses a generated `.tiny_scientist.generated.mcp.json` to mount TinyScientist MCP research tools
OpenAI backend:

- Continues to inject local `SKILL.md` content into agent instructions
- Also supports official OpenAI shell-mounted skills when you provide skill specs as JSON:

```bash
export OPENAI_AGENT_SKILLS_JSON='[{"type":"skill_reference","skill_id":"skill_xxx","version":"1"}]'
```

You can also scope mounted OpenAI skills per stage:

```bash
export OPENAI_AGENT_SKILLS_THINKER_JSON='[...]'
export OPENAI_AGENT_SKILLS_CODER_JSON='[...]'
export OPENAI_AGENT_SKILLS_WRITER_JSON='[...]'
export OPENAI_AGENT_SKILLS_REVIEWER_JSON='[...]'
```

For OpenAI-compatible gateways:

```bash
export OPENAI_API_BASE=http://your-endpoint/v1
export OPENAI_API_KEY=your-key-here
```

Then use models like `openai/qwen3-30b-a3b`.
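Gateway model ids like `openai/qwen3-30b-a3b` follow a `provider/model` convention. If your own glue code needs to split them, a hypothetical helper (not a TinyScientist function) could look like:

```python
def split_model_id(model_id: str) -> tuple:
    """Split a "provider/model" id; ids without a slash get an empty provider."""
    provider, sep, name = model_id.partition("/")
    if not sep:
        return "", model_id
    return provider, name

print(split_model_id("openai/qwen3-30b-a3b"))  # ('openai', 'qwen3-30b-a3b')
```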
Agents can use these built-in tools during thinking, coding, writing, and review:

- `web_search`
- `paper_search`
- `scholar_graph_search`
- `benchmark_search`
- `dataset_search`
- `code_search`
- `repo_runtime_probe`
- `arxiv_daily_watch`
- `news_search`
- `patent_search`
- `table_extractor`
- `claim_verifier`
- `generate_diagram` (writer path)
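The strict provider mode described above amounts to a plain lookup with no fallback path: if the configured provider isn't registered, the tool call fails. A schematic sketch — the registry and backend functions here are illustrative stand-ins, not the library's internals:

```python
# Illustrative stand-ins for real search backends.
def _duckduckgo_search(query: str) -> str:
    return f"duckduckgo results for {query!r}"

def _tavily_search(query: str) -> str:
    return f"tavily results for {query!r}"

PROVIDERS = {
    "duckduckgo": _duckduckgo_search,
    "tavily": _tavily_search,
}

def web_search(query: str, provider: str) -> str:
    """Strict mode: an unknown or unavailable provider fails the call
    instead of silently falling back to another backend."""
    try:
        backend = PROVIDERS[provider]
    except KeyError:
        raise RuntimeError(f"web search provider unavailable: {provider!r}") from None
    return backend(query)
```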
Beyond exporting environment variables, you can configure keys in a `.toml` file for convenience:

```bash
cp config.template.toml config.toml
```

Edit `config.toml` to include your keys, such as:

```toml
[core]
llm_api_key = "xxxx"
```

No need to export environment variables manually; just set this once.
To develop the demo (both frontend and backend):

```bash
# Backend
python backend/app.py

# Frontend
cd frontend
npm install
npm start
```

If you see cairo-related errors: cairo is a system-level dependency, so run `conda install -c conda-forge cairo` or `brew install cairo`.

If you see errors related to `pdflatex`: it is also a system-level dependency for LaTeX rendering, so run `brew install --cask mactex` (macOS).
We’re working on extending support for more tools, models, and paper formats. Contributions welcome!
```bibtex
@misc{tinyscientist,
  author       = {Haofei Yu and Keyang Xuan and Fenghai Li and Kunlun Zhu and Zijie Lei and Jiaxun Zhang and Ziheng Qi and Jiaxuan You},
  title        = {TinyScientist: A Lightweight Framework for Building Research Agents},
  howpublished = {https://github.com/ulab-uiuc/tiny-scientist},
  note         = {Accessed: 2025-04-14},
  year         = {2025}
}
```

