Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: GGUF transformer embedder shows “use in llama.cpp” button for create_chat_completion, not create_embeddings #892

Open
davidberenstein1957 opened this issue Sep 4, 2024 · 4 comments

Comments

@davidberenstein1957
Copy link
Member

Additionally, it might be interesting to consider adding GGUFed version of more models?

image

@davidberenstein1957
Copy link
Member Author

@Vaibhavs10

@julien-c
Copy link
Member

julien-c commented Sep 4, 2024

Additionally, it might be interesting to consider adding GGUFed version of more models?

wdym?

@davidberenstein1957
Copy link
Member Author

Additionally, it might be interesting to consider adding GGUFed version of more models?

wdym?

Most sentence-transformer models don’t have a GGUFed version but we could create a bot that loops over them and creates a PR to add the GGUFed version.

The code is already there so we could potentially schedule through a CRON or do this once to set a standard for the current sentence-transformers on the Hub.

@julien-c
Copy link
Member

julien-c commented Sep 4, 2024

hmm, automated PRs tend to not work as well as user-initiated conversion, so far at least

Maybe start by promoting to users that they can very easily convert those models to GGUF into a new repo?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants