InternLM / lmdeploy Public

Pull requests: InternLM/lmdeploy

73 Open 2,200 Closed

Pull requests list

fix model loading on windows

#4626 opened May 27, 2026 by irexyc Collaborator

Include spec stats in metrics

#4625 opened May 27, 2026 by RunningLeon Collaborator

modify save model in lite module improvement

#4624 opened May 26, 2026 by 43758726 Collaborator

Add MixtralForCausalLM in Turbomind Bug:P1

#4623 opened May 26, 2026 by 43758726 Collaborator

Recheck response

#4621 opened May 26, 2026 by lvhan028 Collaborator

fix cp inference Bug:P0

#4619 opened May 25, 2026 by irexyc Collaborator

Refactor prefix caching

#4618 opened May 24, 2026 by grimoire Collaborator

Improve health endpoint improvement

#4615 opened May 23, 2026 by lvhan028 Collaborator

feat(turbomind): support priority schedule policy

#4614 opened May 22, 2026 by 4mengy

3 of 4 tasks

[WIP]: Support mtp + dp

#4611 opened May 21, 2026 by RunningLeon Collaborator

perf: optimize guided decoding with xgrammar upgrade, batched API, and async D2H overlap

#4605 opened May 21, 2026 by windreamer Collaborator

1 of 4 tasks

Remove state init improvement

#4604 opened May 20, 2026 by grimoire Collaborator

support qwen3.5(vit) inference in turbomind backend enhancement

#4602 opened May 20, 2026 by irexyc Collaborator

Intern s2 preview lite awq fix bug

#4600 opened May 19, 2026 by 43758726 Collaborator

[WIP]: Support reuse routed experts on eviction

#4599 opened May 19, 2026 by RunningLeon Collaborator

Extend v1/messages by introducing token-in/out and returning routed experts improvement

#4597 opened May 19, 2026 by lvhan028 Collaborator

Refactor proxy server improvement

#4596 opened May 18, 2026 by lvhan028 Collaborator • Draft

update anthropic endpoint test

#4594 opened May 18, 2026 by littlegy Contributor

fix(pytorch): offload guided decoding CPU ops to thread pool to prevent event loop blocking improvement

#4590 opened May 18, 2026 by windreamer Collaborator

3 of 4 tasks

docs(advance): add Add a New Speculative Decoding Method guide documentation

#4589 opened May 17, 2026 by SuperMarioYL

4 tasks done

refactor ascend multinode

#4588 opened May 15, 2026 by yao-fengchen Collaborator • Draft

Add OpenAI Responses-compatible endpoint enhancement

#4582 opened May 13, 2026 by CUHKSZzxy Collaborator

[security] fix(proxy): require auth for node management

#4579 opened May 11, 2026 by Hinotoi-agent

5 of 9 tasks

feat: configure cudagraph capture batch sizes

#4573 opened May 8, 2026 by CUHKSZzxy Collaborator

Fix health latency under concurrent VL request preparation Bug:P0

#4570 opened May 7, 2026 by CUHKSZzxy Collaborator
