SuDIS-ZJU/llm-inference-all-in-one

12 Commits
Feb 18, 2025

🚀 LLM Inference All-in-One 🌟

Your ultimate guide to resources, papers, and blogs on Large Language Model (LLM) inference techniques! 📚✨


🏆 Awesome Lists

Overview

  • 🔗 Awesome-LLM-Inference
    A curated collection of papers and code on LLM inference, covering topics such as FlashAttention, PagedAttention, and parallelism.

  • 🔗 Awesome LLM Systems Papers
    A curated list of academic papers, articles, tutorials, slides, and projects related to Large Language Model systems.


🌀 Speculative Decoding


Long-Context Modeling

🔗 Large Language Model Based Long Context Modeling Papers and Blogs
Dive deep into papers and blogs on extending LLM context length, efficient transformers, and retrieval-augmented generation (RAG). 🧠✨


💭 Reasoning


🧩 Mixture of Experts (MoE)

🔗 Awesome MoE LLM Inference System and Algorithm
A comprehensive list of resources for optimizing MoE-based LLM inference. Perfect for tackling sparse expert models! 🌟


🗂️ KV Cache Management

Efficient management of KV caches for LLM acceleration! ⚡


Resources

Explore insightful blogs and courses on cutting-edge LLM inference techniques!

Courses

🔗 Beginner essentials - Andrej Karpathy: Building GPT from Scratch series

🔗 MIT 6.5940: TinyML and Efficient Deep Learning Computing

🔗 UCSD CSE 234: Data Systems for Machine Learning

🔗 CMU Large Language Model System Course

Blogs

🔗 Learning notes for ML System

🔗 A batch of noteworthy MLSys bloggers

Stay tuned for more updates! 🎉
