You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
- Add min_occurrences config parameter (default 5) to filter peptides by PSM count
- Add min_matched_peaks to test config (4 fragments minimum per PSM)
- Fix DIA-NN feature generator memory leak: caches grew unbounded across peptidoforms
because entries were never reused (unique fragment hashes per peptidoform)
- Add enable_ms1_features toggle (default False) to skip slow MS1-based DIA-NN features
(feature_ms1_accuracy_correlations took ~20ms/item due to iterating all MS1 scans)
- Add prepare_ms1_dict() for 3.4x speedup when MS1 features are enabled
(pre-converts mz arrays to sorted numpy, avoids np.asarray + sort check per scan)
- Switch from ThreadPoolExecutor to sequential processing (GIL made threading 3-6x
slower than single-threaded for CPU-bound numpy/pandas work: 2 it/s vs 13 it/s)
- Use singleton DIA-NN generator to avoid 11k constructor/import calls
- Fix quantification: derive stripped_peptide and proteins columns when missing
- Fix create_model() to accept meta parameter from scikit-keras for dynamic input_dim
- Remove unused CorrelationResults fields (8 commented-out matrix variants)
- Add adaptive RT margin support in fragment intensity filtering
- Reduce per-peptidoform logging noise (info -> debug)
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
0 commit comments