-
Notifications
You must be signed in to change notification settings - Fork 61
Issues: huggingface/optimum-neuron
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
AttributeError: can't set attribute 'deepspeed_plugin'
bug
Something isn't working
#735
opened Nov 14, 2024 by
anushka0415
2 of 4 tasks
Size mismatch while loading consolidated checkpoints trained with Tensor parallelism for custom LLama Model
bug
Something isn't working
#734
opened Nov 9, 2024 by
unography
3 of 4 tasks
Size Mismatch Error During LoRA Adapter Merge in Supervised Fine-Tuning of Llama 3.2 1B on AWS Trainium Instance
bug
Something isn't working
#733
opened Nov 4, 2024 by
Kelv1nYu
3 of 4 tasks
Could not find a matching NEFF for your HLO in this directory
bug
Something isn't working
#730
opened Oct 30, 2024 by
SteliosGian
4 tasks
Unable to compile and export Stable Diffusion 2.1
bug
Something isn't working
#723
opened Oct 24, 2024 by
pinak-p
1 of 4 tasks
SPECULATE option error
enhancement
New feature or request
#722
opened Oct 23, 2024 by
SteliosGian
1 of 4 tasks
training loss while fine-tuning llama 3.1 with lora is very high compared to rtx 3090
bug
Something isn't working
#721
opened Oct 18, 2024 by
anilozlu
4 tasks done
running packages when running the "Supervised Fine-Tuning of Llama 3 8B on one AWS Trainium instance" sample
bug
Something isn't working
#720
opened Oct 17, 2024 by
yahavb
2 of 4 tasks
can't compile llama-3-8B or llama-3.1-8B with lora if batch size is more than 1
bug
Something isn't working
#709
opened Oct 5, 2024 by
anilozlu
3 of 4 tasks
Codellama generates wierd tokens with TGI 0.0.24
bug
Something isn't working
#704
opened Sep 25, 2024 by
pinak-p
1 of 4 tasks
Unable to resume training after saving checkpoint, while using Zero-1 Optimization
bug
Something isn't working
#694
opened Sep 9, 2024 by
unography
2 of 4 tasks
ValueError: The NeuronTrainer only accept NeuronTrainingArguments, but <class 'optimum.neuron.training_args.Seq2SeqNeuronTrainingArguments'> was provided.
bug
Something isn't working
#693
opened Sep 6, 2024 by
industrialeaf
2 of 4 tasks
Cannot host Llama-3-8B exported by optimum-neuron with TGI contianer using optimum-neuron(0.0.24) and neuron-sdk(2.19.1)
bug
Something isn't working
#684
opened Aug 25, 2024 by
cszhz
2 of 4 tasks
Training output reports incorrect num examples when using DDP
bug
Something isn't working
Stale
#683
opened Aug 24, 2024 by
syl-taylor-aws
2 of 4 tasks
text-generation-inference docker builds are not reproducible due to missing Cargo.lock causing builds to fail on previous versions
bug
Something isn't working
#677
opened Aug 8, 2024 by
charlesmelby
1 of 4 tasks
MPMD errors when enabling pipeline parallel for fine-tuning llama 3 8B model
bug
Something isn't working
Stale
#674
opened Jul 31, 2024 by
bingchen-liu
2 of 4 tasks
Underloaded Neuron Cores with Llama3
bug
Something isn't working
Stale
#672
opened Jul 30, 2024 by
dlptv
2 of 4 tasks
Llama 3 8B fine tuning shows nan value as loss
bug
Something isn't working
Stale
#660
opened Jul 20, 2024 by
BaiqingL
2 of 4 tasks
Llama3-8B finetuning shows runtime error of TDRV:v2_cc_execute
bug
Something isn't working
Stale
#658
opened Jul 17, 2024 by
jianyinglangaws
Rope_scaling not implemented. Issue using
deepseek-ai/deepseek-coder-6.7b-instruct
Stale
#439
opened Jan 24, 2024 by
michaelfeil
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.