You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Add 7 new CultureBotAI repositories to resources page
Update resources page with detailed descriptions of:
- MicroGrowLink: Knowledge graph-based media prediction framework
- MicroGrowAgents: Multi-agent system for AI-driven cultivation
- MicroMediaParam: Chemical compound mapping pipeline
- CMM-AI: Lanthanide bioprocessing data pipeline
- PFAS-AI: ML-enabled PFAS biodegradation pipeline
- assay-metadata: BacDive API metadata extractor
- microbe-rules: ML models for microbial data
Repositories organized into three categories: Growth Media Prediction & Design, Specialized Research Pipelines, and Data Processing & Analysis.
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Knowledge graph-based framework for predicting microbial growth media using advanced graph and transformer models. Integrates microbial, chemical, and environmental data into a heterogeneous knowledge graph and applies link prediction to forecast which media enable growth of given taxa.
76
+
77
+
**Supported Models:**
78
+
- RGT (Relational Graph Transformer)
79
+
- HGT (Heterogeneous Graph Transformer)
80
+
- NBFNet (Neural Bellman-Ford Network)
81
+
82
+
**Key Features:**
83
+
- Heterogeneous knowledge graph integration
84
+
- Advanced transformer-based link prediction
85
+
- Multi-modal data integration (microbial, chemical, environmental)
Agent-based system for AI-driven microbial cultivation and growth media design. Bridges the microbial cultivation gap through AI-powered multi-agent systems that integrate knowledge graphs, machine learning, and experimental automation.
93
+
94
+
**Specialized Agents:**
95
+
-**LiteratureAgent** - Mining 245+ papers for cultivation protocols
96
+
-**AnalogyReasoningAgent** - Cross-organism comparison and reasoning
97
+
-**GenomeFunctionAgent** - Auxotrophy detection from 57 Bakta-annotated genomes (667K features)
98
+
-**MediaFormulationAgent** - Schema-driven media recommendation with evidence-based ingredient suggestions
99
+
100
+
**Key Achievements:**
101
+
- 864,363 validated species across bacteria, archaea, fungi, and protozoa (GTDB + LPSN + NCBI)
102
+
- Multi-modal reasoning combining literature mining, metabolic modeling (FBA/gap-filling), chemical similarity (208K+ embeddings)
103
+
- Genome-guided design for organism-specific media formulation
Comprehensive chemical compound knowledge graph mapping pipeline for microbial growth media analysis. Extracts chemical compounds from media compositions and maps them to knowledge graph entities with standardized chemical properties.
111
+
112
+
**Features:**
113
+
- Processes 23,181 chemical entries from 1,807 microbial growth media
114
+
- 78% ChEBI coverage (18,088 compounds mapped)
115
+
- Multi-database mapping to ChEBI, PubChem, and CAS-RN identifiers
116
+
- Intelligent hydrate parsing and molecular weight calculation
117
+
- Solution expansion for DSMZ solution references
118
+
- 99.99% chemical mapping accuracy
119
+
120
+
### Specialized Research Pipelines
121
+
122
+
#### CMM-AI: Lanthanide Bioprocessing Data Pipeline
Automated data pipeline for lanthanide bioprocessing research, focusing on rare earth element-dependent biological processes in microorganisms. Integrates multiple biological databases to create comprehensive research datasets.
126
+
127
+
**Scientific Focus:**
128
+
- XoxF methanol dehydrogenase systems (lanthanide-dependent enzymes)
ML-enabled data pipeline for PFAS biodegradation research, focusing on identification and characterization of microorganisms capable of degrading per- and polyfluoroalkyl substances (PFAS).
140
+
141
+
**Research Objectives:**
142
+
-**ML-Powered Database** - Semantically-aware database using KG-Microbe platform to identify putative PFAS biodegradation genes, pathways, taxa, and environments
143
+
-**Intelligent Consortia Design** - Graph learning and LLMs to design optimized microbial consortia for PFAS remediation
144
+
145
+
**Scientific Focus:**
146
+
- C-F bond cleavage mechanisms (dehalogenases and defluorinases)
Research code repository containing machine learning models and analysis pipelines for binary classification and comparative modeling of microbial datasets.
72
171
73
172
**Features:**
74
-
-Machine learning models trained on kg-microbe data
0 commit comments