Pinned Loading
-
xorbitsai/inference
xorbitsai/inference PublicReplace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with …
-
Vahe1994/AQLM
Vahe1994/AQLM PublicOfficial Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.pdf and PV-Tuning: Beyond Straight-Through Estimation for Ext…
-
QwenLM/Qwen2.5
QwenLM/Qwen2.5 PublicQwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
-
QwenLM/Qwen2.5-VL
QwenLM/Qwen2.5-VL PublicQwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
-
ictnlp/LLaMA-Omni
ictnlp/LLaMA-Omni PublicLLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
-
erniebot-openai-api
erniebot-openai-api Publicerniebot兼容openai的API调用方式,支持流式,非流式调用 ,支持system提示词
If the problem persists, check the GitHub status page or contact support.