v0.10.3

@abhinadduri abhinadduri released this 21 Feb 16:13

Adds a consecutive data loading option for training on huge datasets. It packs cell sets so that, within a condition, their cells are consecutive on disk. This leads to around a 3x improvement for HVG training (e.g. output space = gene) and closer to a 12-15x improvement for full transcriptome training (output space = all). The code will raise an error if the underlying data is not sorted by condition.
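
To illustrate the idea, here is a minimal sketch of checking that rows are grouped by condition and then packing each condition's rows into consecutive cell sets. It is not the project's implementation: the function names `contiguous_condition_ranges` and `pack_cell_sets`, the `cell_set_size` parameter, and the reading of "sorted by condition" as "each condition occupies one contiguous block of rows" are all assumptions made for this example.

```python
from itertools import groupby


def contiguous_condition_ranges(conditions):
    """Return {condition: (start, stop)} row ranges, one block per condition.

    Hypothetical helper: raises ValueError if a condition shows up in more
    than one block, i.e. the rows are not grouped by condition.
    """
    ranges = {}
    start = 0
    for label, run in groupby(conditions):
        length = sum(1 for _ in run)
        if label in ranges:
            raise ValueError(f"condition {label!r} is not contiguous")
        ranges[label] = (start, start + length)
        start += length
    return ranges


def pack_cell_sets(ranges, cell_set_size):
    """Split each condition's row range into consecutive (label, start, stop) sets.

    Hypothetical helper: every cell set covers one contiguous slice of rows,
    so it could be read with a single sequential access.
    """
    cell_sets = []
    for label, (start, stop) in ranges.items():
        for s in range(start, stop, cell_set_size):
            cell_sets.append((label, s, min(s + cell_set_size, stop)))
    return cell_sets
```

With per-row condition labels `["a", "a", "a", "b", "b"]` and `cell_set_size=2`, every cell set is a single contiguous slice of rows, while labels such as `["a", "b", "a"]` are rejected because condition `a` is split across two blocks.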