scale for sparse matrixes and mask #2941

Intron7 · 2024-03-22T09:22:20Z

What kind of feature would you like to request?

Additional function parameters / changed functionality / changed defaults?

Please describe your wishes

currently pp.scale with a mask_obs with a sparse matrix and with zero_center== False takes a really long time to update the sparse matrix. This also takes up a lot of memory because of the parity calculations. I would suggest a numba kernel that just swaps out the data. This works really well for rapids-singlecell and greatly improves performance and reduces the memory overhead.
I would open a PR with this kernel.

Performance for 90k cells and 25k genes:
without mask:
CPU 645 ms | GPU 37 ms | 20x
with mask:
CPU 22 s | GPU 50 ms | 460x

The text was updated successfully, but these errors were encountered:

Intron7 added the Enhancement ✨ label Mar 22, 2024

Intron7 self-assigned this Mar 22, 2024

Intron7 mentioned this issue Mar 22, 2024

updates sparse scale #2942

Merged

ivirshup added the Area – Performance 🐌 label Mar 22, 2024

ivirshup modified the milestones: 1.11.0, 1.10.1 Mar 22, 2024

ivirshup closed this as completed in #2942 Apr 8, 2024

flying-sheep removed the Enhancement ✨ label Dec 16, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

scale for sparse matrixes and mask #2941

scale for sparse matrixes and mask #2941

Intron7 commented Mar 22, 2024 •

edited

Loading

scale for sparse matrixes and mask #2941

scale for sparse matrixes and mask #2941

Comments

Intron7 commented Mar 22, 2024 • edited Loading

What kind of feature would you like to request?

Please describe your wishes

Intron7 commented Mar 22, 2024 •

edited

Loading