Skip to content
View zmwang03's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report zmwang03

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

AIInfra(AI 基础设施)指AI系统从底层芯片等硬件,到上层软件栈支持AI大模型训练和推理。

Python 1,705 216 Updated Mar 9, 2025

Making large AI models cheaper, faster and more accessible

Python 40,576 4,478 Updated Mar 11, 2025

An elegant PyTorch deep reinforcement learning library.

Python 8,272 1,136 Updated Mar 11, 2025

Tile primitives for speedy kernels

Cuda 2,135 123 Updated Mar 11, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 35,909 6,090 Updated Mar 11, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 4,603 435 Updated Mar 11, 2025

NanoGPT (124M) in 3 minutes

Python 2,368 261 Updated Mar 11, 2025

提取微信聊天记录,将其导出成HTML、Word、Excel文档永久保存,对聊天记录进行分析生成年度聊天报告,用聊天数据训练专属于个人的AI聊天助手

Python 37,952 3,913 Updated Mar 11, 2025
Python 24 38 Updated Mar 10, 2025

Community fork of PlayCover

Swift 9,172 787 Updated Feb 3, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

6,723 198 Updated Mar 4, 2025

📚200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).

Cuda 2,788 289 Updated Mar 4, 2025

A library for advanced large language model reasoning

Python 2,034 180 Updated Feb 21, 2025
Python 34 1 Updated Feb 26, 2025

解决Cursor在免费订阅期间出现以下提示的问题: You've reached your trial request limit. / Too many free trial accounts used on this machine. Please upgrade to pro. We have this limit in place to prevent abuse. Please l…

Shell 12,981 1,722 Updated Mar 10, 2025

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Python 5,936 350 Updated Jul 21, 2024

[ICLR 2025] DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads

Python 431 26 Updated Feb 10, 2025

Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning

Python 161 13 Updated Feb 6, 2024

Textbook on reinforcement learning from human feedback

TeX 474 34 Updated Mar 6, 2025

Modeling, training, eval, and inference code for OLMo

Python 5,323 570 Updated Mar 11, 2025

Scalable RL solution for advanced reasoning of language models

Python 1,381 83 Updated Feb 19, 2025

BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.

Python 542 40 Updated Feb 14, 2025

用于解决白嫖Cursor被封设备,适用windows、macos

Python 133 25 Updated Dec 10, 2024

A unified evaluation framework for large language models

Python 2,558 189 Updated Feb 11, 2025
Python 2,427 217 Updated Feb 28, 2025

A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.

Python 160 10 Updated Mar 11, 2025

Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym

Jupyter Notebook 390 24 Updated Mar 10, 2025

DeepSeek LLM: Let there be answers

Makefile 6,162 952 Updated Feb 4, 2024

lightweight, standalone C++ inference engine for Google's Gemma models.

C++ 6,147 524 Updated Mar 11, 2025
Next
Showing results