-
Zhejiang University
- China Mainland
Highlights
- Pro
-
zotero-arxiv-daily Public
Forked from TideDra/zotero-arxiv-dailyRecommend new arxiv papers of your interest daily according to your Zotero libarary.
Python GNU Affero General Public License v3.0 UpdatedMar 3, 2025 -
sglang Public
Forked from Zhuohao-Li/sglangSGLang is a fast serving framework for large language models and vision language models.
Python Apache License 2.0 UpdatedMar 2, 2025 -
cuda-mode-lectures Public
Forked from gpu-mode/lecturesMaterial for cuda-mode lectures
Jupyter Notebook Apache License 2.0 UpdatedMar 1, 2025 -
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedFeb 1, 2025 -
AI_analysis Public
Forked from ifromeast/AI_analysisanalyse problems of AI with Math and Code
Jupyter Notebook UpdatedDec 19, 2024 -
Awesome-LLM-Inference Public
Forked from DefTruth/Awesome-LLM-Inference📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.
GNU General Public License v3.0 UpdatedDec 1, 2024 -
-
CUDA-by-example Public
Forked from CodedK/CUDA-by-Example-source-code-for-the-book-s-examples-CUDA by Example, written by two senior members of the CUDA software platform team, shows programmers how to employ this new technology. The authors introduce each area of CUDA development through w…
C MIT License UpdatedOct 30, 2024 -
-
efficient-large-model-papers Public
A Curated Paper List for Efficient Large Models
1 UpdatedSep 18, 2024 -
cuda-samples Public
Forked from NVIDIA/cuda-samplesSamples for CUDA Developers which demonstrates features in CUDA Toolkit
C Other UpdatedSep 12, 2024 -
Awesome-LLM-Long-Context-Modeling Public
Forked from Xnhyacinth/Awesome-LLM-Long-Context-Modeling📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
MIT License UpdatedAug 5, 2024 -
FlashRAG Public
Forked from RUC-NLPIR/FlashRAG⚡FlashRAG: A Python Toolkit for Efficient RAG Research
Python MIT License UpdatedAug 2, 2024 -
Awesome-Efficient-LLM Public
Forked from horseee/Awesome-Efficient-LLMA curated list for Efficient Large Language Models
Python UpdatedJul 12, 2024 -
knowhere Public
Forked from zilliztech/knowhereKnowhere is an open-source vector search engine, integrating FAISS, HNSW, etc.
C++ Apache License 2.0 UpdatedJun 4, 2024 -
milvus Public
Forked from milvus-io/milvusA cloud-native vector database, storage for next generation AI applications
Go Apache License 2.0 UpdatedMay 21, 2024 -
cmu-llm-class Public
Forked from cmu-llms-class/cmu-llm-class-website-2023The course website for Large Language Models Methods and Applications
Jupyter Notebook MIT License UpdatedMay 21, 2024 -
sigmod-2024-contest Public
🏆 The winner code for ACM SIGMOD 2024 Programming Contest. Efficient and Accurate Hybrid Vector Search
-
llm.c-cuda-cpp Public
Forked from gevtushenko/llm.cLLM training in simple, raw C/CUDA
Cuda MIT License UpdatedMay 1, 2024 -
RangeFilteredANN Public
Forked from JoshEngels/RangeFilteredANNAlgorithms for approximate nearest neighbor search with window filters
C++ UpdatedMar 11, 2024 -
pybind-debug Public
Debug pybind11-mixed Python and C++ program in VSCode
Python UpdatedFeb 21, 2024 -
-
DiskANN Public
Forked from microsoft/DiskANNGraph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search
C++ Other UpdatedDec 5, 2023 -
cpp32 Public
Forked from adah1972/geek_time_cppC++ code examples for Modern C++ (32 chapters) in Geek Time
C++ The Unlicense UpdatedNov 30, 2023 -
-
miniob Public
Forked from oceanbase/miniobMiniOB is a compact database that assists developers in understanding the fundamental workings of a database.
C++ Mulan Permissive Software License, Version 2 UpdatedOct 10, 2023 -
15445-bootcamp Public
Forked from cmu-db/15445-bootcampA basic introduction to coding in modern C++.
C++ Apache License 2.0 UpdatedSep 26, 2023 -
MLC_notebooks Public
Forked from mlc-ai/notebooksMachine Learning Compilation Notebooks
Jupyter Notebook Apache License 2.0 UpdatedJul 31, 2023 -
AI-System Public
Forked from microsoft/AI-SystemSystem for AI Education Resource.
Python Creative Commons Attribution 4.0 International UpdatedJul 23, 2023 -
VBASE-artifacts Public
Forked from Catoverflow/VBASE-artifactsArtifacts for VBASE: Unifying Online Vector Similarity Search and Relational Queries via Relaxed Monotonicity
C UpdatedJul 10, 2023