Skip to content

Pull requests: NVIDIA/Fuser

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

DID loop split for matmul + allreduce
#3742 opened Jan 21, 2025 by Priya2698 Loading…
Transformer benchmark
#3741 opened Jan 21, 2025 by nsarka Loading…
add 1D TMA UBLKCP
#3739 opened Jan 21, 2025 by liqiangxl Loading…
Prefer Array class over register arrays.
#3737 opened Jan 20, 2025 by csarofeen Loading…
Int based RNG
#3733 opened Jan 19, 2025 by csarofeen Draft
Reapply #3621
#3714 opened Jan 16, 2025 by wujingyue Draft
[DO NOT REVIEW] Testing main
#3698 opened Jan 12, 2025 by csarofeen Loading…
[WIP] Resize scheduler update
#3657 opened Dec 31, 2024 by naoyam Draft
Split Hopper MMA by warp-tile before instruction tile on hold This issue should be revisited in the future
#3642 opened Dec 24, 2024 by jacobhinkle Loading…
Support outer reduction scheduler with SOL autotuning Autotune Generate heuristics through machine learning models.
#3618 opened Dec 19, 2024 by rdspring1 Loading…
Support TMA with 64-bit indexing
#3599 opened Dec 16, 2024 by jacobhinkle Loading…
ProTip! What’s not been updated in a month: updated:<2024-12-22.