The official skill repository for EvoScientist. Each skill is an installable knowledge pack that extends EvoScientist with domain-specific expertise.
Important
These skills are purpose-built for EvoScientist β together they amplify each other, unlocking the full potential of both the agent and the skills. Under EvoScientist, skills evolve across research cycles through persistent memory (evo-memory).
Install all skills at once:
/install-skill EvoScientist/EvoSkills@skillsOr install a single skill:
/install-skill EvoScientist/EvoSkills@skills/paper-planningSimply ask the agent in conversation:
"Install all skills from EvoScientist/EvoSkills@skills."
Tip
Not using EvoScientist? These skills are compatible with any coding agent. One command via skills.sh to install on Claude Code, OpenCode, Cursor, Codex, Gemini CLI, DeepAgents, and more:
npx skills add EvoScientist/EvoSkills| Skill | Description |
|---|---|
research-ideation |
π‘ Literature grounding, tournament ranking & proposal generation |
paper-planning |
π Research paper planning & outline generation |
experiment-pipeline |
π§ͺ Structured 4-stage experiment execution |
experiment-craft |
π§ Experiment debugging, logging & iteration |
experiment-iterative-coder |
π Iterative code refinement (plan β code β evaluate β refine) |
paper-writing |
βοΈ End-to-end paper writing assistance |
paper-review |
π Automated paper review & feedback |
paper-rebuttal |
π¬ Rebuttal writing after peer review |
academic-slides |
π€ Academic presentation & research talk creation |
evo-memory |
π§ Persistent research memory & self-evolution |
paper-navigator |
π Academic paper discovery, evaluation & reading |
research-survey |
π Structured literature survey synthesis |
nano-banana |
π AI-generated presentation slides & illustrations via Gemini |
Paper Suite + Self-Evolution Suite: Each skill is self-contained β use them individually or combine freely. The self-evolution loop now runs through
research-ideation,experiment-pipeline, andevo-memory.
The mcp/ directory contains a curated collection of MCP servers that extend agents with external tools β web search, academic paper retrieval, documentation lookup, and more. Browse the full list or install directly:
/install-mcp # interactive browser
EvoSci mcp install arxiv # install by nameThe diagram above shows the full EvoScientist pipeline. The Researcher Agent (top, blue) runs idea tree search and Elo tournament ranking to produce a research proposal. The Engineer Agent (bottom, green) executes the 4-stage experiment pipeline. The Evolution Manager Agent (right) manages three memory evolution mechanisms β IDE, IVE, and ESE β that feed learned knowledge back into Ideation Memory (M_I) and Experimentation Memory (M_E) for future cycles.
flowchart LR
A["<b>π¬ Research Phase</b><br/>π‘ research-ideation"]
--> B["<b>βοΈ Experiment Phase</b><br/>π paper-planning<br/>π§ͺ experiment-pipeline<br/>π§ experiment-craft<br/>π experiment-iterative-coder"]
--> C["<b>π Writing Phase</b><br/>βοΈ paper-writing<br/>π paper-review<br/>π¬ paper-rebuttal<br/>π€ academic-slides"]
D[("π§ evo-memory<br/>(IDE Β· IVE Β· ESE)")] <--> A
D <--> B
E["π paper-navigator<br/>(standalone)"] -.-> A
E -.-> B
F["π nano-banana<br/>(standalone)"] -.-> C
style A fill:#7C3AED,stroke:#5B21B6,stroke-width:2px,color:#fff
style B fill:#D97706,stroke:#B45309,stroke-width:2px,color:#fff
style C fill:#16A34A,stroke:#15803D,stroke-width:2px,color:#fff
style D fill:#475569,stroke:#334155,stroke-width:2px,color:#fff
style E fill:#0369A1,stroke:#075985,stroke-width:2px,color:#fff
style F fill:#D97706,stroke:#B45309,stroke-width:2px,color:#fff
The starting point of the research pipeline. It now covers the full path from literature grounding to ranked ideas to a concrete proposal:
- Load Prior Knowledge β Read
evo-memoryfirst to reuse feasible directions and avoid known dead ends - Literature Grounding β Use
paper-navigatorto collect and analyze papers before generating ideas - Multi-Track Ideation + Refinement β Generate candidates across multiple personas, then iteratively strengthen them
- Elo Tournament β Rank refined ideas on novelty, feasibility, relevance, and clarity; present the top-3
- Proposal Extension β Expand the selected winner into a manuscript-quality research proposal
Dedicated skill for turning a large paper collection into a structured survey report:
- Adaptive Outline β Generate a field-specific outline based on the query type and literature set
- Draft + Expansion Pipeline β Draft from top papers, then deepen each section with the full collection
- Summary Refinement β Build section summaries before rewriting the abstract, introduction, and conclusion
- Survey-Grade Output β Comparative tables, taxonomy-based method organization, dense citations, and references
Guides pre-writing planning before a single word is drafted. Covers four key activities:
- Story Design β Reverse-engineer the narrative: task β challenge β insight β contribution β advantage
- Experiment Planning β Plan comparisons, ablations, and demo scenarios with structured checklists
- Figure Design β Pipeline figures that highlight novelty; teaser figures that hook reviewers
- Timeline Management β 4-week countdown schedule from outline to submission
Includes counterintuitive tactics: write your rejection letter first, narrow claims before broadening, and plan fallback narratives.
A structured framework for executing research experiments with attempt budgets and gate conditions:
- Stage 1: Initial Implementation β Get baseline code running and reproduce known results (β€20 attempts)
- Stage 2: Hyperparameter Tuning β Optimize configuration for your setup (β€12 attempts)
- Stage 3: Proposed Method β Implement and validate the novel method (β€12 attempts)
- Stage 4: Ablation Study β Prove each component's contribution (β€18 attempts)
- Code Trajectory Logging β Structured attempt logging that feeds into
evo-memory - Counterintuitive Rules β Initial implementation is not wasted time; budget limits prevent rabbit holes; failed attempts are data
Integrates with experiment-craft for failure diagnosis within stages and evo-memory for cross-cycle learning.
A systematic approach to experiment debugging, logging, and iterative improvement:
- 5-Step Diagnostic Flow β Collect failures β find a working version β bridge the gap β hypothesize β fix
- Counterintuitive Rules β Change one variable at a time; effective experiments beat more experiments
- Experiment Logging β 5-section structured log template for reproducible records
- Handoff to Paper-Writing β Feed validated results and logs into
paper-writingfor drafting
Structured plan β code β evaluate β refine cycles for higher code quality:
- Phase Decomposition β Break complex tasks into 1-5 sequential phases
- Iteration Loop β Up to 3 iterations per phase (10 total): plan, code, run lint/tests, score, decide
- Objective Evaluation β ruff lint + pytest with dynamic score weighting and hard caps
- Failure Mode Guidance β Targeted responses for timeout, syntax, import, test, and lint failures
Integrates with experiment-craft for stuck diagnoses and evo-memory for loading prior strategies.
A proven 11-step workflow for writing academic papers with LaTeX templates:
- Structured Process β From pipeline sketch β story design β Method β Experiments β Related Work β Abstract β Title
- Section Templates β Three Abstract templates, four Introduction openers, Method module structure, Experiments organization
- LaTeX Assets β Annotated paper skeleton (
paper-skeleton.tex) and booktabs table macros (table-style.tex) - Writing Principles β One message per paragraph, topic sentence first, terminology consistency, reverse-outlining
- Counterintuitive Tactics β Underclaim in prose / overdeliver in evidence; lead with mechanism, not just metrics
Systematic self-review before submission using adversarial and counterintuitive review strategies:
- 5-Aspect Checklist β Contribution sufficiency, writing clarity, results quality, testing completeness, method design
- Reverse-Outlining β Extract the outline from finished paragraphs to verify logical flow
- Figure & Table Quality Checks β Captions, resolution, booktabs, color-blind friendliness
- Rejection Simulation β Force a reject summary first; attack your own novelty claim
- Handoff to Rebuttal β After review, feed identified weaknesses into
paper-rebuttalfor response preparation
Dedicated rebuttal skill for responding to reviewer feedback after peer review:
- Score Diagnosis β Color-code every reviewer comment: red (critical), orange (important), gray (minor), green (positive)
- Champion Strategy β Arm your most positive reviewer with evidence for the Area Chair discussion
- Tactical Writing β 18 rules for structure, content, and tone in rebuttal responses
- Counterintuitive Principles β Submit even with extreme scores; concede small points to win the big argument
- Common Concerns β Response strategies for 12 frequently raised reviewer complaints
A structured approach to creating academic presentations and preparing research talks:
- Narrative Arc β Define scope, audience, and key takeaway before touching slides
- Slide Design β 10 design rules, visual hierarchy, one idea per slide, claim-style titles
- Practical Creation β
.pptxfile generation with color palettes, layout code, charts, and figures - Delivery & Q&A β Rehearsal protocol, timing, and backup slide preparation
- Counterintuitive Rules β Slides are not your paper; enthusiasm beats polish; related work builds motivation, not citation counts
The learning layer that accumulates knowledge across research cycles. Maintains two memory stores and implements three evolution mechanisms:
- Ideation Memory (M_I) β Tracks feasible and unsuccessful research directions across ideation cycles
- Experimentation Memory (M_E) β Stores reusable data processing and model training strategies (paper core), plus architecture and debugging (extensions)
- IDE (Idea Direction Evolution) β Extracts promising directions after
research-ideation - IVE (Idea Validation Evolution) β Classifies experiment failures as implementation vs fundamental direction failures
- ESE (Experiment Strategy Evolution) β Distills reusable patterns from successful experiment pipelines
Read by research-ideation and experiment-pipeline at cycle start; updated after each cycle completes.
Focused paper workflow in four stages β from query to evaluated reading list:
- Disambiguate β Analyze user intent, resolve ambiguous terms (project names, module names) to actual paper titles
- Discover β 7 discovery paths: keyword search, citation traversal, recommendations, author tracking, arXiv monitoring, trending detection, GitHub search
- Evaluate β Quick assessment via TLDR, citations, code availability (HuggingFace + GitHub), and top models by task
- Read β Full-text retrieval via Jina Reader with 3-level reading strategy (Technical, Analytical, Contextual) Includes Python scripts powered by Semantic Scholar, HuggingFace, GitHub, arXiv, and Jina Reader APIs.
Generate professional presentation slides and high-quality illustrations using Gemini's image generation API, with an interactive browser-based review loop:
- 7-Phase Workflow β Content planning conversation β slides_plan.json β style selection & batch generation β browser review β feedback editing β PPTX packaging β cleanup
- 3 Visual Styles β Lineal Color (flat icons, educational), Gradient Glass (glassmorphism, premium), Vector Illustration (retro, approachable)
- Interactive Review β Local HTTP server with per-slide feedback; edits are applied without regenerating the entire deck
- Multi-Model Support β
gemini-3-pro-image-preview(best quality),gemini-3.1-flash-image-preview(fast iteration),gemini-2.5-flash-image(rapid prototyping) - Counterintuitive Rules β More planning = better slides; edit don't regenerate; never read generated images yourself (use the review server)
Completed:
- π§ Self-Evolution Suite β
research-ideation,experiment-pipeline,evo-memory - π Literature Survey β Systematic literature search, filtering, and survey generation
- π Iterative Coder β Iterative code refinement with plan β code β evaluate β refine cycles
- π¨ Visual Generation β AI-generated slides & illustrations (
nano-banana)
Coming soon:
- π Math Olympiad β Advanced mathematical reasoning & problem solving
- π¬ Paper Reproduction β Read a paper, reproduce its core results, and verify claims
- π‘ Grant & Proposal Writing β Research proposal drafting with funding agency conventions
- π€ Peer Debate β Multi-agent adversarial discussion to stress-test research ideas
- π Trend Radar β Analyze publication trends, identify emerging topics & research gaps
- π£οΈ Paper QA β Interactive question-answering over paper collections, extracting key findings & cross-referencing claims
Stay tuned β more skills are on the way!
|
EvoScientist |
Xi Zhang |
Yougang Lyu |
Dinos Papakostas |
Yuyue Zhao |
Xiaoyi DeepResearch Team and the wider open-source community contribute to this project.
For any enquiries or collaboration opportunities, please contact: EvoScientist.ai@gmail.com
We welcome contributions! See the guides for skills and MCP servers, or start with the Contributing Guidelines.
Every contribution brings us one step closer to a future where AI accelerates scientific breakthroughs for all of humanity.
If you find our paper and code useful in your research and applications, please cite using this BibTeX:
@article{evoscientist2026,
title={EvoScientist: Towards Multi-Agent Evolving AI Scientists for End-to-End Scientific Discovery},
author={Yougang Lyu and Xi Zhang and Xinhao Yi and Yuyue Zhao and Shuyu Guo and Wenxiang Hu and Jan Piotrowski and Jakub Kaliski and Jacopo Urbani and Zaiqiao Meng and Lun Zhou and Xiaohui Yan},
journal={arXiv preprint arXiv:2603.08127},
year={2026}
}This project is licensed under the Apache License 2.0 - see the LICENSE file for details.
