Popular repositories
-
vllm (Public)
Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
-
kernels (Public)
Forked from huggingface/kernels
Build compute kernels and load them from the Hub.
Python 1
-
neural-compressor (Public)
Forked from intel/neural-compressor
Intel® Neural Compressor (formerly known as Intel® Low Precision Optimization Tool), targeting to provide unified APIs for network compression technologies, such as low precision quantization, spar…
Python
-
accelerate (Public)
Forked from huggingface/accelerate
🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
Python
-
optimum-intel (Public)
Forked from huggingface/optimum-intel
Accelerate inference of 🤗 Transformers with Intel optimization tools
Jupyter Notebook 1