- I'm currently working at Meituan, focusing on algorithms and engineering for LLM inference acceleration; I'm familiar with model quantization.
- I'm lucky to contribute to several open-source projects: SGLang, vLLM, TorchAO, Megatron-DeepSpeed, and LightSeq.
- I'm proud to have built some projects from scratch (see the sketch after this list):
  - AutoSmoothQuant: An easy-to-use package for implementing SmoothQuant for LLMs.
  - QQQ: An innovative, hardware-optimized W4A8 quantization solution for LLMs.
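For readers unfamiliar with SmoothQuant, the core idea is to migrate quantization difficulty from activations to weights with per-input-channel scales before quantizing. Below is a minimal NumPy sketch of that rescaling step, not AutoSmoothQuant's or QQQ's actual API; the function names, the `alpha=0.5` default, and the calibration input are illustrative assumptions.

```python
import numpy as np

def smoothquant_scales(act_absmax, weight, alpha=0.5, eps=1e-8):
    """Per-input-channel scales that migrate quantization difficulty
    from activations to weights; alpha balances the two sides."""
    # act_absmax: [in_features], calibrated max |activation| per channel
    # weight:     [out_features, in_features]
    w_absmax = np.abs(weight).max(axis=0)                          # [in_features]
    scales = act_absmax ** alpha / np.maximum(w_absmax ** (1.0 - alpha), eps)
    return np.maximum(scales, eps)

def apply_smoothing(x, weight, scales):
    """Rescale activations and weights so that
    (x / scales) @ (weight * scales).T == x @ weight.T exactly,
    but both rescaled tensors are easier to quantize."""
    return x / scales, weight * scales
```

Because the rescaling cancels out in the matmul, smoothing itself is lossless; the payoff only shows up once both rescaled tensors are quantized (e.g., to W4A8 as in QQQ).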
- Contact: [email protected]
- Google Scholar: https://scholar.google.com/citations?hl=zh-CN&user=MBR97ZIAAAAJ
- Beijing (UTC +08:00)
Pinned
- vllm-project/vllm: A high-throughput and memory-efficient inference and serving engine for LLMs.
- bytedance/lightseq: A high-performance library for sequence processing and generation.
- deepspeedai/Megatron-DeepSpeed (forked from NVIDIA/Megatron-LM): Ongoing research training transformer language models at scale, including BERT & GPT-2.
- AniZpZ/AutoSmoothQuant: An easy-to-use package for implementing SmoothQuant for LLMs.
- sgl-project/sglang (fork): A fast serving framework for large language models and vision language models.