Skip to content

Pull requests: AI-Hypercomputer/maxtext

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Use threadpool when mixing multiple datasets in Grain
#2703 opened Nov 17, 2025 by aireenmei Loading…
4 tasks done
Introduce Multislice RL
#2702 opened Nov 17, 2025 by xuefgu Loading…
4 tasks done
adding perf members to the owners
#2700 opened Nov 16, 2025 by notabee Loading…
4 tasks done
Add Qwen3 Omni Vision Encoder
#2698 opened Nov 16, 2025 by hengtaoguo Loading…
4 tasks done
Support updating grain data mixture during training
#2697 opened Nov 15, 2025 by aireenmei Draft
4 tasks done
fix tp+tokamax_gmm draft Draft PR
#2696 opened Nov 14, 2025 by NuojCheng Draft
4 tasks
[tools/setup/setup.sh] Nightly cleanup
#2694 opened Nov 14, 2025 by SamuelMarks Loading…
4 tasks done
Add GSPO and small fixes draft Draft PR
#2693 opened Nov 14, 2025 by mydatascience Loading…
4 tasks done
quantized ragged dot maxtext integration
#2691 opened Nov 14, 2025 by copybara-service bot Loading…
[WIP] Bump dependency version
#2686 opened Nov 14, 2025 by RissyRan Loading…
4 tasks done
add scheduler config
#2685 opened Nov 14, 2025 by suexu1025 Loading…
4 tasks done
Add audio encoder support
#2683 opened Nov 13, 2025 by eitanporat Loading…
4 tasks done
Add checkpoint conversion parameter mapping for Qwen3 Omni
#2678 opened Nov 13, 2025 by eitanporat Loading…
4 tasks done
[DECOUPLED-MODE] Adding necessary files gemini-review
#2673 opened Nov 12, 2025 by gulsumgudukbay Loading…
4 tasks done
Qwen3 deepstack [WIP]
#2660 opened Nov 11, 2025 by eitanporat Loading…
4 tasks
Add MRoPE support for Qwen3-Omni [WIP]
#2659 opened Nov 11, 2025 by eitanporat Loading…
4 tasks
feat: migrate deepseek models to nnx
#2658 opened Nov 11, 2025 by mesakhcienet Loading…
4 tasks done
Fix deepseek tp sharding error draft Draft PR
#2657 opened Nov 11, 2025 by NuojCheng Draft
4 tasks
ProTip! Mix and match filters to narrow down what you’re looking for.