Skip to content

EvoScientist/EvoSkills

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

53 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

🧬 EvoSkills

The official skill repository for EvoScientist. Each skill is an installable knowledge pack that extends EvoScientist with domain-specific expertise.

πŸ“¦ Installation

Important

These skills are purpose-built for EvoScientist β€” together they amplify each other, unlocking the full potential of both the agent and the skills. Under EvoScientist, skills evolve across research cycles through persistent memory (evo-memory).

In-session commands

Install all skills at once:

/install-skill EvoScientist/EvoSkills@skills

Or install a single skill:

/install-skill EvoScientist/EvoSkills@skills/paper-planning

Ask EvoScientist directly

Simply ask the agent in conversation:

"Install all skills from EvoScientist/EvoSkills@skills."

Tip

Not using EvoScientist? These skills are compatible with any coding agent. One command via skills.sh to install on Claude Code, OpenCode, Cursor, Codex, Gemini CLI, DeepAgents, and more:

npx skills add EvoScientist/EvoSkills

✨ Available Skills

Skill Description
research-ideation πŸ’‘ Literature grounding, tournament ranking & proposal generation
paper-planning πŸ“ Research paper planning & outline generation
experiment-pipeline πŸ§ͺ Structured 4-stage experiment execution
experiment-craft πŸ”§ Experiment debugging, logging & iteration
experiment-iterative-coder πŸ”„ Iterative code refinement (plan β†’ code β†’ evaluate β†’ refine)
paper-writing ✍️ End-to-end paper writing assistance
paper-review πŸ” Automated paper review & feedback
paper-rebuttal πŸ’¬ Rebuttal writing after peer review
academic-slides 🎀 Academic presentation & research talk creation
evo-memory 🧠 Persistent research memory & self-evolution
paper-navigator πŸ“š Academic paper discovery, evaluation & reading
research-survey πŸ“ Structured literature survey synthesis
nano-banana 🍌 AI-generated presentation slides & illustrations via Gemini

Paper Suite + Self-Evolution Suite: Each skill is self-contained β€” use them individually or combine freely. The self-evolution loop now runs through research-ideation, experiment-pipeline, and evo-memory.

πŸ”Œ MCP Server Marketplace

The mcp/ directory contains a curated collection of MCP servers that extend agents with external tools β€” web search, academic paper retrieval, documentation lookup, and more. Browse the full list or install directly:

/install-mcp              # interactive browser
EvoSci mcp install arxiv  # install by name

⛳️ Framework Overview

EvoScientist Framework

The diagram above shows the full EvoScientist pipeline. The Researcher Agent (top, blue) runs idea tree search and Elo tournament ranking to produce a research proposal. The Engineer Agent (bottom, green) executes the 4-stage experiment pipeline. The Evolution Manager Agent (right) manages three memory evolution mechanisms β€” IDE, IVE, and ESE β€” that feed learned knowledge back into Ideation Memory (M_I) and Experimentation Memory (M_E) for future cycles.

🎒 Skill Pipeline

flowchart LR
    A["<b>πŸ”¬ Research Phase</b><br/>πŸ’‘ research-ideation"]
    --> B["<b>βš™οΈ Experiment Phase</b><br/>πŸ“ paper-planning<br/>πŸ§ͺ experiment-pipeline<br/>πŸ”§ experiment-craft<br/>πŸ”„ experiment-iterative-coder"]
    --> C["<b>πŸ“ Writing Phase</b><br/>✍️ paper-writing<br/>πŸ” paper-review<br/>πŸ’¬ paper-rebuttal<br/>🎀 academic-slides"]

    D[("🧠 evo-memory<br/>(IDE · IVE · ESE)")] <--> A
    D <--> B
    E["πŸ“š paper-navigator<br/>(standalone)"] -.-> A
    E -.-> B
    F["🍌 nano-banana<br/>(standalone)"] -.-> C

    style A fill:#7C3AED,stroke:#5B21B6,stroke-width:2px,color:#fff
    style B fill:#D97706,stroke:#B45309,stroke-width:2px,color:#fff
    style C fill:#16A34A,stroke:#15803D,stroke-width:2px,color:#fff
    style D fill:#475569,stroke:#334155,stroke-width:2px,color:#fff
    style E fill:#0369A1,stroke:#075985,stroke-width:2px,color:#fff
    style F fill:#D97706,stroke:#B45309,stroke-width:2px,color:#fff
Loading

πŸ’‘ research-ideation β€” Literature Grounding, Tournament & Proposal

The starting point of the research pipeline. It now covers the full path from literature grounding to ranked ideas to a concrete proposal:

  • Load Prior Knowledge β€” Read evo-memory first to reuse feasible directions and avoid known dead ends
  • Literature Grounding β€” Use paper-navigator to collect and analyze papers before generating ideas
  • Multi-Track Ideation + Refinement β€” Generate candidates across multiple personas, then iteratively strengthen them
  • Elo Tournament β€” Rank refined ideas on novelty, feasibility, relevance, and clarity; present the top-3
  • Proposal Extension β€” Expand the selected winner into a manuscript-quality research proposal

πŸ“ research-survey β€” Literature Survey & Synthesis

Dedicated skill for turning a large paper collection into a structured survey report:

  • Adaptive Outline β€” Generate a field-specific outline based on the query type and literature set
  • Draft + Expansion Pipeline β€” Draft from top papers, then deepen each section with the full collection
  • Summary Refinement β€” Build section summaries before rewriting the abstract, introduction, and conclusion
  • Survey-Grade Output β€” Comparative tables, taxonomy-based method organization, dense citations, and references

πŸ“ paper-planning β€” Research Paper Planning & Outline Generation

Guides pre-writing planning before a single word is drafted. Covers four key activities:

  • Story Design β€” Reverse-engineer the narrative: task β†’ challenge β†’ insight β†’ contribution β†’ advantage
  • Experiment Planning β€” Plan comparisons, ablations, and demo scenarios with structured checklists
  • Figure Design β€” Pipeline figures that highlight novelty; teaser figures that hook reviewers
  • Timeline Management β€” 4-week countdown schedule from outline to submission

Includes counterintuitive tactics: write your rejection letter first, narrow claims before broadening, and plan fallback narratives.

πŸ§ͺ experiment-pipeline β€” 4-Stage Experiment Execution

A structured framework for executing research experiments with attempt budgets and gate conditions:

  • Stage 1: Initial Implementation β€” Get baseline code running and reproduce known results (≀20 attempts)
  • Stage 2: Hyperparameter Tuning β€” Optimize configuration for your setup (≀12 attempts)
  • Stage 3: Proposed Method β€” Implement and validate the novel method (≀12 attempts)
  • Stage 4: Ablation Study β€” Prove each component's contribution (≀18 attempts)
  • Code Trajectory Logging β€” Structured attempt logging that feeds into evo-memory
  • Counterintuitive Rules β€” Initial implementation is not wasted time; budget limits prevent rabbit holes; failed attempts are data

Integrates with experiment-craft for failure diagnosis within stages and evo-memory for cross-cycle learning.

πŸ”§ experiment-craft β€” Experiment Debugging & Iteration

A systematic approach to experiment debugging, logging, and iterative improvement:

  • 5-Step Diagnostic Flow β€” Collect failures β†’ find a working version β†’ bridge the gap β†’ hypothesize β†’ fix
  • Counterintuitive Rules β€” Change one variable at a time; effective experiments beat more experiments
  • Experiment Logging β€” 5-section structured log template for reproducible records
  • Handoff to Paper-Writing β€” Feed validated results and logs into paper-writing for drafting

πŸ”„ experiment-iterative-coder β€” Iterative Code Refinement

Structured plan β†’ code β†’ evaluate β†’ refine cycles for higher code quality:

  • Phase Decomposition β€” Break complex tasks into 1-5 sequential phases
  • Iteration Loop β€” Up to 3 iterations per phase (10 total): plan, code, run lint/tests, score, decide
  • Objective Evaluation β€” ruff lint + pytest with dynamic score weighting and hard caps
  • Failure Mode Guidance β€” Targeted responses for timeout, syntax, import, test, and lint failures

Integrates with experiment-craft for stuck diagnoses and evo-memory for loading prior strategies.

✍️ paper-writing β€” Section-by-Section Paper Drafting

A proven 11-step workflow for writing academic papers with LaTeX templates:

  • Structured Process β€” From pipeline sketch β†’ story design β†’ Method β†’ Experiments β†’ Related Work β†’ Abstract β†’ Title
  • Section Templates β€” Three Abstract templates, four Introduction openers, Method module structure, Experiments organization
  • LaTeX Assets β€” Annotated paper skeleton (paper-skeleton.tex) and booktabs table macros (table-style.tex)
  • Writing Principles β€” One message per paragraph, topic sentence first, terminology consistency, reverse-outlining
  • Counterintuitive Tactics β€” Underclaim in prose / overdeliver in evidence; lead with mechanism, not just metrics

πŸ” paper-review β€” Self-Review & Quality Assurance

Systematic self-review before submission using adversarial and counterintuitive review strategies:

  • 5-Aspect Checklist β€” Contribution sufficiency, writing clarity, results quality, testing completeness, method design
  • Reverse-Outlining β€” Extract the outline from finished paragraphs to verify logical flow
  • Figure & Table Quality Checks β€” Captions, resolution, booktabs, color-blind friendliness
  • Rejection Simulation β€” Force a reject summary first; attack your own novelty claim
  • Handoff to Rebuttal β€” After review, feed identified weaknesses into paper-rebuttal for response preparation

πŸ’¬ paper-rebuttal β€” Rebuttal Writing After Peer Review

Dedicated rebuttal skill for responding to reviewer feedback after peer review:

  • Score Diagnosis β€” Color-code every reviewer comment: red (critical), orange (important), gray (minor), green (positive)
  • Champion Strategy β€” Arm your most positive reviewer with evidence for the Area Chair discussion
  • Tactical Writing β€” 18 rules for structure, content, and tone in rebuttal responses
  • Counterintuitive Principles β€” Submit even with extreme scores; concede small points to win the big argument
  • Common Concerns β€” Response strategies for 12 frequently raised reviewer complaints

🎀 academic-slides β€” Presentation & Research Talk Creation

A structured approach to creating academic presentations and preparing research talks:

  • Narrative Arc β€” Define scope, audience, and key takeaway before touching slides
  • Slide Design β€” 10 design rules, visual hierarchy, one idea per slide, claim-style titles
  • Practical Creation β€” .pptx file generation with color palettes, layout code, charts, and figures
  • Delivery & Q&A β€” Rehearsal protocol, timing, and backup slide preparation
  • Counterintuitive Rules β€” Slides are not your paper; enthusiasm beats polish; related work builds motivation, not citation counts

🧠 evo-memory β€” Persistent Research Memory & Self-Evolution

The learning layer that accumulates knowledge across research cycles. Maintains two memory stores and implements three evolution mechanisms:

  • Ideation Memory (M_I) β€” Tracks feasible and unsuccessful research directions across ideation cycles
  • Experimentation Memory (M_E) β€” Stores reusable data processing and model training strategies (paper core), plus architecture and debugging (extensions)
  • IDE (Idea Direction Evolution) β€” Extracts promising directions after research-ideation
  • IVE (Idea Validation Evolution) β€” Classifies experiment failures as implementation vs fundamental direction failures
  • ESE (Experiment Strategy Evolution) β€” Distills reusable patterns from successful experiment pipelines

Read by research-ideation and experiment-pipeline at cycle start; updated after each cycle completes.

πŸ“š paper-navigator β€” Academic Paper Discovery & Reading

Focused paper workflow in four stages β€” from query to evaluated reading list:

  • Disambiguate β€” Analyze user intent, resolve ambiguous terms (project names, module names) to actual paper titles
  • Discover β€” 7 discovery paths: keyword search, citation traversal, recommendations, author tracking, arXiv monitoring, trending detection, GitHub search
  • Evaluate β€” Quick assessment via TLDR, citations, code availability (HuggingFace + GitHub), and top models by task
  • Read β€” Full-text retrieval via Jina Reader with 3-level reading strategy (Technical, Analytical, Contextual) Includes Python scripts powered by Semantic Scholar, HuggingFace, GitHub, arXiv, and Jina Reader APIs.

🍌 nano-banana β€” AI-Generated Slides & Illustrations

Generate professional presentation slides and high-quality illustrations using Gemini's image generation API, with an interactive browser-based review loop:

  • 7-Phase Workflow β€” Content planning conversation β†’ slides_plan.json β†’ style selection & batch generation β†’ browser review β†’ feedback editing β†’ PPTX packaging β†’ cleanup
  • 3 Visual Styles β€” Lineal Color (flat icons, educational), Gradient Glass (glassmorphism, premium), Vector Illustration (retro, approachable)
  • Interactive Review β€” Local HTTP server with per-slide feedback; edits are applied without regenerating the entire deck
  • Multi-Model Support β€” gemini-3-pro-image-preview (best quality), gemini-3.1-flash-image-preview (fast iteration), gemini-2.5-flash-image (rapid prototyping)
  • Counterintuitive Rules β€” More planning = better slides; edit don't regenerate; never read generated images yourself (use the review server)

πŸ”Back to top

🎯 α―“βž€ Roadmap

Completed:

  • 🧠 Self-Evolution Suite β€” research-ideation, experiment-pipeline, evo-memory
  • πŸ“š Literature Survey β€” Systematic literature search, filtering, and survey generation
  • πŸ”„ Iterative Coder β€” Iterative code refinement with plan β†’ code β†’ evaluate β†’ refine cycles
  • 🎨 Visual Generation β€” AI-generated slides & illustrations (nano-banana)

Coming soon:

  • πŸ… Math Olympiad β€” Advanced mathematical reasoning & problem solving
  • πŸ”¬ Paper Reproduction β€” Read a paper, reproduce its core results, and verify claims
  • πŸ’‘ Grant & Proposal Writing β€” Research proposal drafting with funding agency conventions
  • πŸ€– Peer Debate β€” Multi-agent adversarial discussion to stress-test research ideas
  • πŸ“ˆ Trend Radar β€” Analyze publication trends, identify emerging topics & research gaps
  • πŸ—£οΈ Paper QA β€” Interactive question-answering over paper collections, extracting key findings & cross-referencing claims

Stay tuned β€” more skills are on the way!

πŸ”Back to top

🌍 Project Roles

Xi Zhang
EvoScientist
Xi Zhang
Xi Zhang
Yougang Lyu
Yougang Lyu
Dinos Papakostas
Dinos Papakostas
Yuyue Zhao
Yuyue Zhao

Xiaoyi DeepResearch Xiaoyi DeepResearch Team and the wider open-source community contribute to this project.

For any enquiries or collaboration opportunities, please contact: EvoScientist.ai@gmail.com

πŸ”Back to top

🀝 Contributing

We welcome contributions! See the guides for skills and MCP servers, or start with the Contributing Guidelines.

Every contribution brings us one step closer to a future where AI accelerates scientific breakthroughs for all of humanity.

πŸ“ˆ Star History

Star History Chart

πŸ”Back to top

πŸ“ Citation

If you find our paper and code useful in your research and applications, please cite using this BibTeX:

@article{evoscientist2026, 
  title={EvoScientist: Towards Multi-Agent Evolving AI Scientists for End-to-End Scientific Discovery}, 
  author={Yougang Lyu and Xi Zhang and Xinhao Yi and Yuyue Zhao and Shuyu Guo and Wenxiang Hu and Jan Piotrowski and Jakub Kaliski and Jacopo Urbani and Zaiqiao Meng and Lun Zhou and Xiaohui Yan}, 
  journal={arXiv preprint arXiv:2603.08127}, 
  year={2026} 
}

πŸ”Back to top

πŸ“œ License

This project is licensed under the Apache License 2.0 - see the LICENSE file for details.

πŸ”Back to top

About

🧬 Extend EvoScientist with Installable Skill & Knowledge Packs

Topics

Resources

License

Contributing

Stars

Watchers

Forks

Packages

 
 
 

Contributors