huggingface / optimum-neuron Public

Notifications You must be signed in to change notification settings
Fork 61
Star 207

Code
Issues 54
Pull requests 8
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security
Insights

Issues: huggingface/optimum-neuron

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

54 Open 230 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Issues list

AttributeError: can't set attribute 'deepspeed_plugin' bug

Something isn't working

#735 opened Nov 14, 2024 by anushka0415

2 of 4 tasks

Size mismatch while loading consolidated checkpoints trained with Tensor parallelism for custom LLama Model bug

Something isn't working

#734 opened Nov 9, 2024 by unography

3 of 4 tasks

Size Mismatch Error During LoRA Adapter Merge in Supervised Fine-Tuning of Llama 3.2 1B on AWS Trainium Instance bug

Something isn't working

#733 opened Nov 4, 2024 by Kelv1nYu

3 of 4 tasks

Could not find a matching NEFF for your HLO in this directory bug

Something isn't working

#730 opened Oct 30, 2024 by SteliosGian

4 tasks

Unable to compile and export Stable Diffusion 2.1 bug

Something isn't working

#723 opened Oct 24, 2024 by pinak-p

1 of 4 tasks

SPECULATE option error enhancement

New feature or request

#722 opened Oct 23, 2024 by SteliosGian

1 of 4 tasks

training loss while fine-tuning llama 3.1 with lora is very high compared to rtx 3090 bug

Something isn't working

#721 opened Oct 18, 2024 by anilozlu

4 tasks done

running packages when running the "Supervised Fine-Tuning of Llama 3 8B on one AWS Trainium instance" sample bug

Something isn't working

#720 opened Oct 17, 2024 by yahavb

2 of 4 tasks

stablediffusion (sdxl) ip-adapter support

#718 opened Oct 16, 2024 by Suprhimp

can't compile llama-3-8B or llama-3.1-8B with lora if batch size is more than 1 bug

Something isn't working

#709 opened Oct 5, 2024 by anilozlu

3 of 4 tasks

Move neuron_parallel_compile outside of bash script

#706 opened Sep 26, 2024 by jgray-aws

Codellama generates wierd tokens with TGI 0.0.24 bug

Something isn't working

#704 opened Sep 25, 2024 by pinak-p

1 of 4 tasks

Unable to resume training after saving checkpoint, while using Zero-1 Optimization bug

Something isn't working

#694 opened Sep 9, 2024 by unography

2 of 4 tasks

ValueError: The NeuronTrainer only accept NeuronTrainingArguments, but <class 'optimum.neuron.training_args.Seq2SeqNeuronTrainingArguments'> was provided. bug

Something isn't working

#693 opened Sep 6, 2024 by industrialeaf

2 of 4 tasks

Cannot host Llama-3-8B exported by optimum-neuron with TGI contianer using optimum-neuron(0.0.24) and neuron-sdk(2.19.1) bug

Something isn't working

#684 opened Aug 25, 2024 by cszhz

2 of 4 tasks

Training output reports incorrect num examples when using DDP bug

Something isn't working

Stale

#683 opened Aug 24, 2024 by syl-taylor-aws

2 of 4 tasks

Enable use of IterableDataset when training with DDP

#681 opened Aug 23, 2024 by syl-taylor-aws

text-generation-inference docker builds are not reproducible due to missing Cargo.lock causing builds to fail on previous versions bug

Something isn't working

#677 opened Aug 8, 2024 by charlesmelby

1 of 4 tasks

Add support for new Black Forest's model (Flux)

#676 opened Aug 6, 2024 by mrrfr

MPMD errors when enabling pipeline parallel for fine-tuning llama 3 8B model bug

Something isn't working

Stale

#674 opened Jul 31, 2024 by bingchen-liu

2 of 4 tasks

Underloaded Neuron Cores with Llama3 bug

Something isn't working

Stale

#672 opened Jul 30, 2024 by dlptv

2 of 4 tasks

Add support for Llama3.1

#664 opened Jul 24, 2024 by dacorvo

Llama 3 8B fine tuning shows nan value as loss bug

Something isn't working

Stale

#660 opened Jul 20, 2024 by BaiqingL

2 of 4 tasks

Llama3-8B finetuning shows runtime error of TDRV:v2_cc_execute bug

Something isn't working

Stale

#658 opened Jul 17, 2024 by jianyinglangaws

Rope_scaling not implemented. Issue using deepseek-ai/deepseek-coder-6.7b-instruct Stale

#439 opened Jan 24, 2024 by michaelfeil

Previous 1 2 3 Next

Previous Next

ProTip! Add no:assignee to see everything that’s not assigned.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly