Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[serve] [llm] Known issues in Ray Serve LLM #50931

Closed
2 of 3 tasks
akshay-anyscale opened this issue Feb 27, 2025 · 4 comments
Closed
2 of 3 tasks

[serve] [llm] Known issues in Ray Serve LLM #50931

akshay-anyscale opened this issue Feb 27, 2025 · 4 comments

Comments

@akshay-anyscale
Copy link
Contributor

akshay-anyscale commented Feb 27, 2025

@GeneDer
Copy link
Contributor

GeneDer commented Feb 27, 2025

Common pitfalls:

  • /home/ray/.cache/torch_extensions/py311_cu124/xgrammar/xgrammar.so: cannot open shared object file: No such file or directory' when request json responses
    • Follow recommended requirements to pip install xgrammar==0.1.11 in your environment/ image
  • RuntimeError: Failed to infer device type on vllm 0.7.2
    • Follow recommended requirements to pip install pynvml==12.0.0 in your environment/ image
  • ModuleNotFoundError: No module named 'ray.serve.llm...
    • vllm 0.7.3 pins Ray to 2.40.0 and is incompatible with the latest Ray version which containing llm serving code. Please use vllm 0.7.2 for now until the incompatibility is resolve in a later vllm release

@SongGuyang
Copy link
Contributor

Will we support kv-cache aware load balance in LLMRouter?

@akshay-anyscale
Copy link
Contributor Author

Will we support kv-cache aware load balance in LLMRouter?

yes, we will publish a github issue with roadmap soon. KV cache aware LB is in design

@kouroshHakha
Copy link
Contributor

all the issues here are closed and the roadmap is published at #51313

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants