KEP-2170: Create LLM training runtime for Llama 3.1 8B #2212

Open
andreyvelich opened this issue Aug 14, 2024 · 9 comments

@andreyvelich
Member

andreyvelich commented Aug 14, 2024

Related: #2170

Once we implement storage initializers, trainers, and controllers, we should add the LLM training runtimes.
We can start with a runtime for Llama 3.1 8B.

https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct

/area runtime

@andreyvelich changed the title from "KEP-2170: Create LLM training runtime for Llama 2 7b" to "KEP-2170: Create LLM training runtime for Llama 3.1 8B" on Oct 26, 2024
@Electronic-Waste
Member

/assign

I can help with this. Please let me know if you have different plans, @kubeflow/wg-training-leads.

@andreyvelich
Member Author

Thank you, Shao!
However, we need to work on the LLM Trainer before we add the post-training runtimes: #2321

@Electronic-Waste
Member

Thanks for pointing this out, Andrey!

Shall I unassign myself, since this issue depends on #2321?

@andreyvelich
Member Author

If you could also help us with #2321 that would be great!
We have a few ideas with @saileshd1402, but we are still investigating how to build that Trainer so that it supports different LLMs and datasets.

@Electronic-Waste
Member

Sure, I'm glad to hear that I can help with #2321!

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

@Electronic-Waste
Member

/remove-label lifecycle/stale

@Electronic-Waste: The label(s) /remove-label lifecycle/stale cannot be applied. These labels are supported: tide/merge-method-merge, tide/merge-method-rebase, tide/merge-method-squash, lifecycle/needs-triage. Is this label configured under labels -> additional_labels or labels -> restricted_labels in plugin.yaml?

In response to this:

/remove-label lifecycle/stale

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@Electronic-Waste
Member

/remove-lifecycle stale
