This repository has been archived by the owner on Sep 12, 2024. It is now read-only.
feature: implemented parallel inference for llama-rs, implemented naive sequential async inference for llama-cpp and rwkv-cpp#52
Merged
hlhr202 merged 5 commits intomainfrom feature/asyncMay 9, 2023
+558-1,415