Refactor/cleanup model loaders #58

mikepapadim · 2025-10-28T10:04:33Z

No description provided.

Copilot

Pull Request Overview

This PR refactors model loaders and state field allocators to eliminate code duplication and improve maintainability using Template Method and Abstract Factory design patterns.

Key Changes:

Introduced AbstractModelLoader base class implementing Template Method pattern, reducing model loader code by ~60%
Created StateFieldAllocator abstract factory hierarchy to centralize state field allocation logic, reducing state class code by ~60-66%
Removed AOT (Ahead-of-Time) compilation support and related code
Added comprehensive architecture documentation (1,229 lines) covering project structure, patterns, and design principles

Reviewed Changes

Copilot reviewed 20 out of 20 changed files in this pull request and generated 1 comment.

Show a summary per file

File	Description
AbstractModelLoader.java	New abstract base class implementing Template Method pattern for model loading workflow
StateFieldAllocator.java	New abstract factory for state field allocation with model-specific dimension calculations
Qwen3ModelLoader.java	Refactored to extend AbstractModelLoader, implementing model-specific methods
Qwen2ModelLoader.java	Refactored to extend AbstractModelLoader, implementing model-specific methods
Phi3ModelLoader.java	Refactored to extend AbstractModelLoader, implementing model-specific methods
LlamaModelLoader.java	Refactored to extend AbstractModelLoader, implementing model-specific methods
MistralModelLoader.java	Refactored to extend AbstractModelLoader, implementing model-specific methods
ModelLoader.java	Removed AOT-related code
ModelLoadException.java	New exception class for model loading failures
Qwen3State.java	Simplified to delegate allocation to Qwen3StateFieldAllocator
Qwen2State.java	Simplified to delegate allocation to Qwen2StateFieldAllocator
Phi3State.java	Simplified to delegate allocation to Phi3StateFieldAllocator
LlamaState.java	Simplified to delegate allocation to LlamaStateFieldAllocator
Qwen3StateFieldAllocator.java	New allocator implementing Qwen3-specific dimension calculations
Qwen2StateFieldAllocator.java	New allocator implementing Qwen2-specific dimension calculations
Phi3StateFieldAllocator.java	New allocator implementing Phi3-specific dimension calculations
LlamaStateFieldAllocator.java	New allocator implementing Llama/Mistral dimension calculations
AOT.java	Removed AOT compilation support
LlamaApp.java	Removed AOT import
ARCHITECTURE.md	New comprehensive architecture documentation

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2025-10-28T10:05:15Z

src/main/java/org/beehive/gpullama3/inference/state/Qwen3State.java

+    protected StateFields createStateFields(Configuration config) {
+        StateFieldAllocator allocator = new Qwen3StateFieldAllocator(config, localSize);
+        return allocator.allocateFields();


The StateFieldAllocator is created on every call to createStateFields(). Since State objects are reused during inference and createStateFields() is called during initialization, consider caching the allocator instance if createStateFields() could be called multiple times, or document that this is a one-time initialization method.

src/main/java/org/beehive/gpullama3/model/loader/AbstractModelLoader.java

docs/ARCHITECTURE.md

src/main/java/org/beehive/gpullama3/inference/state/StateFieldAllocator.java

src/main/java/org/beehive/gpullama3/model/loader/LlamaModelLoader.java

src/main/java/org/beehive/gpullama3/model/loader/Qwen2ModelLoader.java

src/main/java/org/beehive/gpullama3/model/loader/Qwen3ModelLoader.java

…ity and configuration handling. # Conflicts: # src/main/java/org/beehive/gpullama3/model/loader/Phi3ModelLoader.java # src/main/java/org/beehive/gpullama3/model/loader/Qwen2ModelLoader.java # src/main/java/org/beehive/gpullama3/model/loader/Qwen3ModelLoader.java

Copilot

Copilot encountered an error and was unable to review this pull request. You can try again by re-requesting a review.

# Conflicts: # src/main/java/org/beehive/gpullama3/model/loader/Qwen3ModelLoader.java

…ity and configuration handling.

…to corresponding subpackages (fp16 & q8)

Copilot AI review requested due to automatic review settings October 28, 2025 10:04

Copilot AI reviewed Oct 28, 2025

View reviewed changes

mikepapadim requested review from mairooni and orionpapadakis October 29, 2025 07:38

orionpapadakis requested changes Oct 30, 2025

View reviewed changes

Copilot AI review requested due to automatic review settings November 5, 2025 13:28

orionpapadakis force-pushed the refactor/cleanup_model_loaders branch from 27a764b to 8a08dd3 Compare November 5, 2025 13:28

Copilot AI reviewed Nov 5, 2025

View reviewed changes

mairooni and others added 6 commits November 6, 2025 16:17

Support Q8_0 models for Qwen2 and Deepseek

8b06ffb

Support Q8_0 for Qwen3

fd0b6c6

# Conflicts: # src/main/java/org/beehive/gpullama3/model/loader/Qwen3ModelLoader.java

[WIP] Support Q8_0 for Phi3 - testing pending

ea06ee5

Refactor: remove AOT.java and update model loaders to enhance modular…

320850c

…ity and configuration handling.

Refine model loader refactoring and converge with Q8 support

fb9939e

Move Qwen2Q8_0TornadoVMLayerPlanner to tornadovm package and weights …

eb13ea7

…to corresponding subpackages (fp16 & q8)

orionpapadakis force-pushed the refactor/cleanup_model_loaders branch from 8a08dd3 to eb13ea7 Compare November 6, 2025 14:27

mikepapadim merged commit 6367a00 into beehive-lab:main Nov 6, 2025
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Refactor/cleanup model loaders #58

Refactor/cleanup model loaders #58

Uh oh!

mikepapadim commented Oct 28, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Oct 28, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Refactor/cleanup model loaders #58

Refactor/cleanup model loaders #58

Uh oh!

Conversation

mikepapadim commented Oct 28, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Copilot AI Oct 28, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants