Skip to content

Commit 7ac5aa8

Browse files
realmarcinclaude
andcommitted
Add 7 new CultureBotAI repositories to resources page
Update resources page with detailed descriptions of: - MicroGrowLink: Knowledge graph-based media prediction framework - MicroGrowAgents: Multi-agent system for AI-driven cultivation - MicroMediaParam: Chemical compound mapping pipeline - CMM-AI: Lanthanide bioprocessing data pipeline - PFAS-AI: ML-enabled PFAS biodegradation pipeline - assay-metadata: BacDive API metadata extractor - microbe-rules: ML models for microbial data Repositories organized into three categories: Growth Media Prediction & Design, Specialized Research Pipelines, and Data Processing & Analysis. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
1 parent 39ef6d9 commit 7ac5aa8

1 file changed

Lines changed: 106 additions & 7 deletions

File tree

resources.md

Lines changed: 106 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -65,16 +65,115 @@ kg transform
6565
kg merge
6666
```
6767

68-
## 🔧 Computational Tools
68+
## 🔧 CultureBotAI Software & Tools
6969

70-
### CultureBot Predictor
71-
AI-powered tool for predicting optimal growth conditions for target microorganisms.
70+
### Growth Media Prediction & Design
71+
72+
#### MicroGrowLink
73+
**[GitHub Repository](https://github.com/CultureBotAI/MicroGrowLink)** | Python
74+
75+
Knowledge graph-based framework for predicting microbial growth media using advanced graph and transformer models. Integrates microbial, chemical, and environmental data into a heterogeneous knowledge graph and applies link prediction to forecast which media enable growth of given taxa.
76+
77+
**Supported Models:**
78+
- RGT (Relational Graph Transformer)
79+
- HGT (Heterogeneous Graph Transformer)
80+
- NBFNet (Neural Bellman-Ford Network)
81+
82+
**Key Features:**
83+
- Heterogeneous knowledge graph integration
84+
- Advanced transformer-based link prediction
85+
- Multi-modal data integration (microbial, chemical, environmental)
86+
87+
---
88+
89+
#### MicroGrowAgents
90+
**[GitHub Repository](https://github.com/CultureBotAI/MicroGrowAgents)** | Python | [Documentation](https://CultureBotAI.github.io/MicroGrowAgents)
91+
92+
Agent-based system for AI-driven microbial cultivation and growth media design. Bridges the microbial cultivation gap through AI-powered multi-agent systems that integrate knowledge graphs, machine learning, and experimental automation.
93+
94+
**Specialized Agents:**
95+
- **LiteratureAgent** - Mining 245+ papers for cultivation protocols
96+
- **AnalogyReasoningAgent** - Cross-organism comparison and reasoning
97+
- **GenomeFunctionAgent** - Auxotrophy detection from 57 Bakta-annotated genomes (667K features)
98+
- **MediaFormulationAgent** - Schema-driven media recommendation with evidence-based ingredient suggestions
99+
100+
**Key Achievements:**
101+
- 864,363 validated species across bacteria, archaea, fungi, and protozoa (GTDB + LPSN + NCBI)
102+
- Multi-modal reasoning combining literature mining, metabolic modeling (FBA/gap-filling), chemical similarity (208K+ embeddings)
103+
- Genome-guided design for organism-specific media formulation
104+
105+
---
106+
107+
#### MicroMediaParam
108+
**[GitHub Repository](https://github.com/CultureBotAI/MicroMediaParam)** | Python
109+
110+
Comprehensive chemical compound knowledge graph mapping pipeline for microbial growth media analysis. Extracts chemical compounds from media compositions and maps them to knowledge graph entities with standardized chemical properties.
111+
112+
**Features:**
113+
- Processes 23,181 chemical entries from 1,807 microbial growth media
114+
- 78% ChEBI coverage (18,088 compounds mapped)
115+
- Multi-database mapping to ChEBI, PubChem, and CAS-RN identifiers
116+
- Intelligent hydrate parsing and molecular weight calculation
117+
- Solution expansion for DSMZ solution references
118+
- 99.99% chemical mapping accuracy
119+
120+
### Specialized Research Pipelines
121+
122+
#### CMM-AI: Lanthanide Bioprocessing Data Pipeline
123+
**[GitHub Repository](https://github.com/CultureBotAI/CMM-AI)** | Python
124+
125+
Automated data pipeline for lanthanide bioprocessing research, focusing on rare earth element-dependent biological processes in microorganisms. Integrates multiple biological databases to create comprehensive research datasets.
126+
127+
**Scientific Focus:**
128+
- XoxF methanol dehydrogenase systems (lanthanide-dependent enzymes)
129+
- Methylotrophic bacteria (Methylobacterium, Methylorubrum, Paracoccus)
130+
- Environmental metal cycling and biogeochemistry
131+
- Siderophore/lanthanophore transport mechanisms
132+
- PQQ-dependent enzyme complexes
133+
134+
---
135+
136+
#### PFAS-AI: Machine Learning-Enabled PFAS Biodegradation Pipeline
137+
**[GitHub Repository](https://github.com/CultureBotAI/PFAS-AI)** | Python
138+
139+
ML-enabled data pipeline for PFAS biodegradation research, focusing on identification and characterization of microorganisms capable of degrading per- and polyfluoroalkyl substances (PFAS).
140+
141+
**Research Objectives:**
142+
- **ML-Powered Database** - Semantically-aware database using KG-Microbe platform to identify putative PFAS biodegradation genes, pathways, taxa, and environments
143+
- **Intelligent Consortia Design** - Graph learning and LLMs to design optimized microbial consortia for PFAS remediation
144+
145+
**Scientific Focus:**
146+
- C-F bond cleavage mechanisms (dehalogenases and defluorinases)
147+
- Fluoride resistance systems
148+
- Hydrocarbon degradation pathways
149+
- Environmental context (AFFF-contaminated sites, groundwater, wastewater)
150+
151+
### Data Processing & Analysis
152+
153+
#### assay-metadata: BacDive API Assay Metadata Extractor
154+
**[GitHub Repository](https://github.com/CultureBotAI/assay-metadata)** | Python
155+
156+
Extracts API assay metadata from BacDive JSON data with comprehensive identifier mappings to CHEBI, EC, RHEA, and PubChem databases.
157+
158+
**Capabilities:**
159+
- Parses 99,392 bacterial strain records from BacDive
160+
- Extracts 17 unique API kit types (API zym, API 50CHac, etc.)
161+
- Maps substrate codes to CHEBI and PubChem identifiers
162+
- Maps enzyme EC numbers to RHEA reaction databases
163+
- Generates consolidated JSON metadata files
164+
165+
---
166+
167+
#### microbe-rules: Machine Learning Models for Microbial Data
168+
**[GitHub Repository](https://github.com/CultureBotAI/microbe-rules)** | Python
169+
170+
Research code repository containing machine learning models and analysis pipelines for binary classification and comparative modeling of microbial datasets.
72171

73172
**Features:**
74-
- Machine learning models trained on kg-microbe data
75-
- Growth condition recommendations
76-
- Media composition suggestions
77-
- Confidence scoring for predictions
173+
- Binary classification models for microbial data
174+
- Model comparison and evaluation frameworks
175+
- Automated data preparation pipelines
176+
- Reproducible research workflows
78177

79178

80179
## 📊 Datasets

0 commit comments

Comments
 (0)