LightLLM


LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance. LightLLM harnesses the strengths of numerous well-regarded open-source implementations, including but not limited to FasterTransformer, TGI, vLLM, and FlashAttention.

English Docs | 中文文档 | Blogs

News

  • [2025/02] 🔥 LightLLM v1.0.0 released, achieving the fastest DeepSeek-R1 serving performance on a single H200 machine.

Get started
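LightLLM serves models over an HTTP API. As a minimal sketch of how a client request is shaped (assuming a locally running server exposing a `/generate` endpoint; the prompt and sampling values below are purely illustrative), the JSON payload can be built like this:

```python
import json

def build_generate_request(prompt: str, max_new_tokens: int = 64) -> dict:
    # Build a JSON body with the prompt under "inputs" and sampling
    # options under "parameters", the shape expected by the server.
    return {
        "inputs": prompt,
        "parameters": {"max_new_tokens": max_new_tokens, "temperature": 0.7},
    }

# Serialize the payload that would be POSTed to the server, e.g.
# http://localhost:8080/generate (host and port are assumptions here).
payload = build_generate_request("What is AI?")
print(json.dumps(payload))
```

Refer to the English Docs linked below for the authoritative launch commands and request schema.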

Performance

Learn more in the release blogs: v1.0.0 blog.

FAQ

Please refer to the FAQ for more information.

Projects using lightllm

We welcome any cooperation and contribution. If a project requires lightllm's support, please contact us via email or create a pull request.

  1. LazyLLM: The easiest and laziest way to build multi-agent LLM applications.

    Once you have installed lightllm and lazyllm, you can use the following code to build your own chatbot:

    from lazyllm import TrainableModule, deploy, WebModule
    # The model will be downloaded automatically if you have an internet connection
    m = TrainableModule('internlm2-chat-7b').deploy_method(deploy.lightllm)
    WebModule(m).start().wait()

    Documents: https://lazyllm.readthedocs.io/

Community

For further information and discussion, join our Discord server. We welcome you to become a member and look forward to your contributions!

License

This repository is released under the Apache-2.0 license.

Acknowledgement

We learned a lot from the following projects when developing LightLLM.
