feat: GGUF transformer embedder shows “use in llama.cpp” button for `create_chat_completion`, not `create_embeddings` #892

davidberenstein1957 · 2024-09-04T06:21:36Z

Additionally, it might be interesting to consider adding GGUFed versions of more models?

davidberenstein1957 · 2024-09-04T06:21:59Z

@Vaibhavs10

julien-c · 2024-09-04T08:54:57Z

> Additionally, it might be interesting to consider adding GGUFed versions of more models?

wdym?

davidberenstein1957 · 2024-09-04T09:41:27Z

> > Additionally, it might be interesting to consider adding GGUFed versions of more models?
>
> wdym?

Most sentence-transformers models don’t have a GGUFed version, but we could create a bot that loops over them and opens a PR on each one to add the GGUFed version.

The code is already there, so we could either schedule it as a cron job or run it once to set a standard for the sentence-transformers models currently on the Hub.
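To make the "loop over them" step concrete, here is a rough sketch of how such a bot could pick its candidates. It only covers the selection step (finding repos with no `.gguf` file); the conversion and the PR are left out. The helper names `has_gguf`, `repos_missing_gguf` and `hub_candidates` are made up for illustration, and filtering on the `sentence-transformers` tag is an assumption about how the models would be selected.

```python
from typing import Callable, Iterable, Iterator


def has_gguf(filenames: Iterable[str]) -> bool:
    """Return True if any file in a repo listing is a GGUF file."""
    return any(name.lower().endswith(".gguf") for name in filenames)


def repos_missing_gguf(
    repo_ids: Iterable[str],
    list_files: Callable[[str], Iterable[str]],
) -> Iterator[str]:
    """Yield the repo ids whose file listing contains no GGUF file."""
    for repo_id in repo_ids:
        if not has_gguf(list_files(repo_id)):
            yield repo_id


def hub_candidates(limit: int = 20) -> list[str]:
    """Hypothetical entry point: query the Hub for candidate repos.

    Needs network access and the huggingface_hub package, so the import
    is done lazily and nothing here runs at import time.
    """
    from huggingface_hub import HfApi

    api = HfApi()
    # Assumption: the "sentence-transformers" tag is the right filter.
    models = api.list_models(filter="sentence-transformers", limit=limit)
    return list(repos_missing_gguf((m.id for m in models), api.list_repo_files))
```

Calling `hub_candidates(limit=5)` would return up to five repo ids that a one-off run or a scheduled job could then hand to the existing conversion code.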

julien-c · 2024-09-04T10:03:36Z

hmm, automated PRs tend to not work as well as user-initiated conversion, so far at least

Maybe start by promoting to users that they can very easily convert those models to GGUF into a new repo?
