fix(transformers) Fix training issue in TrainOneStepWrapper #1408

chenyingshu · 2025-10-31T08:29:41Z

What does this PR do?

Fixes issue #1402 derived from PR #1337
Tested with

Llama model training with either graph mode or pynative mode, with "static"/"dynamic"/"none" scaler type, in ms2.7.0 + transformers 4.54
OmniGen2 training

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline?
Did you make sure to update the documentation with your changes? E.g. record bug fixes or new features in What's New. Here are the
documentation guidelines
Did you build and run the code without any errors?
Did you report the running environment (NPU type/MS version) and performance in the doc? (better record it for data loading, model inference, or training tasks)
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@xxx

…aining in llama

fix mindspore-lab#1402 issue. support both pynative and graph mode tr…

8d6b84d

…aining in llama

chenyingshu added the bug Something isn't working label Oct 31, 2025

chenyingshu self-assigned this Oct 31, 2025

linting

82b91cd

zhtmike approved these changes Nov 3, 2025

View reviewed changes

chenyingshu marked this pull request as ready for review November 7, 2025 02:47

chenyingshu requested a review from vigo999 as a code owner November 7, 2025 02:47

chenyingshu requested a review from hadipash November 10, 2025 09:43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(transformers) Fix training issue in TrainOneStepWrapper #1408

fix(transformers) Fix training issue in TrainOneStepWrapper #1408

Uh oh!

chenyingshu commented Oct 31, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

fix(transformers) Fix training issue in TrainOneStepWrapper #1408

Are you sure you want to change the base?

fix(transformers) Fix training issue in TrainOneStepWrapper #1408

Uh oh!

Conversation

chenyingshu commented Oct 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Before submitting

Who can review?

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

chenyingshu commented Oct 31, 2025 •

edited

Loading