Skip to content

Conversation

@chenyingshu
Copy link
Collaborator

@chenyingshu chenyingshu commented Oct 31, 2025

What does this PR do?

Fixes issue #1402 derived from PR #1337
Tested with

  • Llama model training with either graph mode or pynative mode, with "static"/"dynamic"/"none" scaler type, in ms2.7.0 + transformers 4.54
  • OmniGen2 training

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline?
  • Did you make sure to update the documentation with your changes? E.g. record bug fixes or new features in What's New. Here are the
    documentation guidelines
  • Did you build and run the code without any errors?
  • Did you report the running environment (NPU type/MS version) and performance in the doc? (better record it for data loading, model inference, or training tasks)
  • Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@xxx

@chenyingshu chenyingshu added the bug Something isn't working label Oct 31, 2025
@chenyingshu chenyingshu self-assigned this Oct 31, 2025
@chenyingshu chenyingshu marked this pull request as ready for review November 7, 2025 02:47
@chenyingshu chenyingshu requested a review from vigo999 as a code owner November 7, 2025 02:47
@chenyingshu chenyingshu requested a review from hadipash November 10, 2025 09:43
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

bug Something isn't working

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants