Refactor: Separate GPU AOT tests from the main AOT DAG #968
Description
This PR refactors the maxtext_configs_aot DAG by splitting it into two more focused DAGs:
maxtext_configs_aot: Now exclusively handles all TPU configuration tests.
maxtext_configs_aot_gpu: A new DAG dedicated to running GPU AOT tests.
Why is this change being made?
Validation Summary
This refactor was tested in the ml-automation-solutions-dev environment. The TPU tests have been validated, while the GPU tests will be addressed later according to team priorities.
Test Results
TPU DAG (maxtext_configs_aot): Success ✅
GPU DAG (maxtext_configs_aot_gpu): Failed (Deprioritized) ⚠️
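For reviewers who want a picture of the split, here is a minimal sketch of routing AOT tests to one of the two DAG IDs by accelerator type. It is an illustration only, not the code in this PR: the AotTest class, the split_by_accelerator helper, and the test names are all hypothetical, and only the two DAG IDs come from the description above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AotTest:
    """Hypothetical stand-in for one AOT test entry (not the repository's real type)."""
    name: str
    accelerator: str  # "tpu" or "gpu"


def split_by_accelerator(tests):
    """Group test names under the DAG ID that should own them."""
    # The DAG IDs are taken from the PR description; the mapping itself is an assumption.
    dag_ids = {"tpu": "maxtext_configs_aot", "gpu": "maxtext_configs_aot_gpu"}
    groups = {dag_id: [] for dag_id in dag_ids.values()}
    for test in tests:
        groups[dag_ids[test.accelerator]].append(test.name)
    return groups


# Placeholder test names, used only to exercise the helper.
tests = [
    AotTest("tpu-config-a", "tpu"),
    AotTest("gpu-config-a", "gpu"),
    AotTest("tpu-config-b", "tpu"),
]
groups = split_by_accelerator(tests)
```

Keeping the two groups under separate DAG IDs means a failing GPU run no longer marks the TPU DAG as failed, which matches the results reported above.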
Checklist
Before submitting this PR, please make sure (put X in square brackets):