High Performance Architecture Lab at GT
We focus on research which enables high-performance and energy-efficient computing from microarchitectures to compilers.
Popular repositories Loading
-
CuPBoP-AMD
CuPBoP-AMD PublicCuPBoP-AMD is a CUDA translator that translates CUDA programs at NVVM IR level to HIP-compatible IR that can run on AMD GPUs.
-
-
Repositories
Showing 10 of 24 repositories
- MacSim-User-Guide Public
- CuPBoP-AMD Public
CuPBoP-AMD is a CUDA translator that translates CUDA programs at NVVM IR level to HIP-compatible IR that can run on AMD GPUs.
- llm-on-cpu Public
- onnxruntime Public Forked from AlexanderPuckhaber/onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
- NVPTX-SPIRV-Translator Public
- etri-quant Public
- SPIRV-LLVM-Translator Public Forked from KhronosGroup/SPIRV-LLVM-Translator
A tool and a library for bi-directional translation between SPIR-V and LLVM IR