-
Notifications
You must be signed in to change notification settings - Fork 54
Pull requests: NVIDIA/Fuser
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Move RNG in runtime/random_numbers.cu to use Array instead of uint4/uint2.
#3738
opened Jan 20, 2025 by
csarofeen
Loading…
Lower stream-parallelized
LinearOp
into Host IR AG+GEMM overlap algo
#3736
opened Jan 20, 2025 by
samnordmann
Loading…
[Do not merge] Overlap benchmark: AG+GEMM distributed matmul with
HostIr
and ParallelType::Stream
#3719
opened Jan 16, 2025 by
samnordmann
Loading…
In the permissive bfs traversal, don't allow reverse traversal
#3717
opened Jan 16, 2025 by
naoyam
Loading…
A FusionDefinition wrapper that takes/produces DTensors.
#3703
opened Jan 14, 2025 by
wujingyue
Loading…
Split Hopper MMA by warp-tile before instruction tile
on hold
This issue should be revisited in the future
#3642
opened Dec 24, 2024 by
jacobhinkle
Loading…
Support outer reduction scheduler with SOL autotuning
Autotune
Generate heuristics through machine learning models.
#3618
opened Dec 19, 2024 by
rdspring1
Loading…
[wgmma] Insert commit_group and wait_group after mma_async
Matmuls
#3573
opened Dec 11, 2024 by
jacobhinkle
•
Draft
Previous Next
ProTip!
What’s not been updated in a month: updated:<2024-12-22.