Refactor: Extract duplicated code from test and infer verbs into shared utilities by Copilot · Pull Request #646 · lincc-frameworks/hyrax

Copilot · 2026-01-28T19:52:06Z

Change Description

Two verb modules (test.py and infer.py) contained significant code duplication for model weight loading and batch result saving.

Solution Description

Extracted shared logic into reusable utilities:

load_model_weights() → models/model_utils.py
Consolidates ~96 lines of duplicated code from test.py and infer.py. Handles explicit path or auto-detection from recent training run. Parameterized by verb name.
create_save_batch_callback() → pytorch_ignite.py
Consolidates ~84 lines of duplicated code from test.py and infer.py. Factory function returns closure that saves model outputs batch-by-batch during inference/testing. Includes validation and error handling to ensure batches contain required object_id field, with clear error messages to guide dataset implementation.

Scope: Only test.py and infer.py were refactored. The train.py verb remains unchanged and retains its own _log_params method. Similarly, test.py has its own _log_params method (not shared with train).

Impact: Reduced code duplication in test and infer verbs while maintaining clear separation of concerns between verbs. Removed unused variables (batch_len, write_index) from the callback implementation for cleaner code.

Breaking changes: None. All verb interfaces unchanged. The callback now requires datasets to provide object_id in batch dictionaries, which was already the expected behavior in the original implementation.

Code Quality

I have read the Contribution Guide and agree to the Code of Conduct
My code follows the code style of this project
My code builds (or compiles) cleanly without any errors or warnings
My code contains relevant comments and necessary documentation

Original prompt

@copilot, will you take a moment to address this refactor?

💬 We'd love your input! Share your thoughts on Copilot coding agent in our 2 minute survey.

codecov · 2026-01-28T21:41:42Z

Codecov Report

❌ Patch coverage is 82.45614% with 10 lines in your changes missing coverage. Please review.
✅ Project coverage is 62.96%. Comparing base (808dbfa) to head (d94afb1).
⚠️ Report is 1 commits behind head on main.

Files with missing lines	Patch %	Lines
src/hyrax/models/model_utils.py	76.00%	6 Missing ⚠️
src/hyrax/pytorch_ignite.py	73.33%	4 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #646      +/-   ##
==========================================
+ Coverage   62.95%   62.96%   +0.01%     
==========================================
  Files          58       59       +1     
  Lines        5782     5730      -52     
==========================================
- Hits         3640     3608      -32     
+ Misses       2142     2122      -20

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

src/hyrax/pytorch_ignite.py

github-actions · 2026-02-04T18:35:42Z

Before [`808dbfa`]	After [`3b5ebea`]	Ratio	Benchmark (Parameter)
3.74G	3.88G	1.04	vector_db_benchmarks.VectorDBInsertBenchmarks.peakmem_load_vector_db(16384, 'qdrant')
1.91±0.01s	1.96±0.03s	1.03	benchmarks.time_rebuild_manifest_help
16.8962717254681	17.46608516124805	1.03	data_cache_benchmarks.DataCacheBenchmarks.track_cache_hsc1k_hyrax_size_undercount
113±1μs	116±2μs	1.03	data_request_benchmarks.DatasetRequestBenchmarks.time_request_all_data
9.17±0.1ms	9.34±0.1ms	1.02	vector_db_benchmarks.VectorDBSearchBenchmarks.time_search_by_vector_many_shards(128, 'chromadb')
1.92±0.02s	1.94±0.02s	1.01	benchmarks.time_lookup_help
36.7±0.4ms	37.0±0.1ms	1.01	benchmarks.time_nb_obj_construct
1.91±0.01s	1.94±0.03s	1.01	benchmarks.time_prepare_help
1.91±0.01s	1.93±0.01s	1.01	benchmarks.time_train_help
1.27G	1.29G	1.01	vector_db_benchmarks.VectorDBInsertBenchmarks.peakmem_load_vector_db(2048, 'qdrant')

Click here to view all benchmarks.

…that `batch["object_id"]` will be used for the ids.

drewoldag

@copilot, please address the two comments where you are explicitly called out.

src/hyrax/config_utils.py

src/hyrax/verbs/train.py

src/hyrax/pytorch_ignite.py

- Move load_model_weights from config_utils.py to models/model_utils.py - Remove log_mlflow_params from config_utils.py - Restore _log_params method in test.py (no longer using shared function) - Revert all changes to train.py (back to original state) - Update imports in test.py and infer.py to use models.model_utils - Update tests to import from new location Co-authored-by: drewoldag <47493171+drewoldag@users.noreply.github.com>

Copilot

Pull request overview

This PR refactors duplicated code from the test.py and infer.py verbs by extracting shared functionality into reusable utilities. The refactoring consolidates model weight loading and batch result saving logic, reducing code duplication and improving maintainability.

Changes:

Extracted load_model_weights() function to models/model_utils.py for shared model weight loading logic
Extracted create_save_batch_callback() factory function to pytorch_ignite.py for creating batch result saving callbacks
Updated test.py and infer.py to use the new shared utilities, removing ~180 lines of duplicated code
Updated test imports and function calls to reference the new utility functions

Reviewed changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 5 comments.

Show a summary per file

File	Description
src/hyrax/models/model_utils.py	New file containing `load_model_weights()` utility function that consolidates model weight loading logic from test and infer verbs
src/hyrax/pytorch_ignite.py	Added `create_save_batch_callback()` factory function that creates closures for saving batch results during inference/testing
src/hyrax/verbs/test.py	Removed `load_model_weights()` static method and `_save_batch` nested function, replaced with calls to new shared utilities
src/hyrax/verbs/infer.py	Removed `load_model_weights()` static method and `_save_batch` nested function, replaced with calls to new shared utilities
tests/hyrax/test_infer.py	Updated test imports to reference new `load_model_weights()` location in `models/model_utils.py`

src/hyrax/pytorch_ignite.py

src/hyrax/models/model_utils.py

Copilot · 2026-02-05T00:03:50Z

src/hyrax/pytorch_ignite.py

+    data_loader_indexes : array-like
+        Indices mapping data loader order to dataset order


The data_loader_indexes parameter is documented and accepted but never used in the function body. The old implementations in test.py and infer.py used this parameter as a fallback when batch didn't contain object_id. Consider either removing this parameter if it's truly unnecessary, or documenting why it's passed but unused.

src/hyrax/pytorch_ignite.py

mtauraso

This looks good to me

drewoldag

@copilot please address PR comments.

src/hyrax/pytorch_ignite.py

…allback - Remove unused batch_len variable - Remove unused write_index variable and nonlocal reference - Add error handling to log and raise RuntimeError if batch doesn't contain object_id - Update docstring to remove mention of "indexes" parameter Co-authored-by: drewoldag <47493171+drewoldag@users.noreply.github.com>

src/hyrax/pytorch_ignite.py

drewoldag

Looking good now

Initial plan

3d054b2

Copilot AI assigned Copilot and drewoldag Jan 28, 2026

Copilot started work on behalf of drewoldag January 28, 2026 19:52 View session

Refactor: Extract shared code from verb modules into utilities

2976aca

Copilot AI changed the title ~~[WIP] Refactor code structure for better maintainability~~ Refactor: Extract duplicated code from verb modules into shared utilities Jan 28, 2026

Copilot AI requested a review from drewoldag January 28, 2026 20:15

Copilot finished work on behalf of drewoldag January 28, 2026 20:15

mtauraso reviewed Jan 28, 2026

View reviewed changes

src/hyrax/pytorch_ignite.py Show resolved Hide resolved

Merge branch 'main' into copilot/refactor-code-structure

a6dd9a4

Simplifying the logic in _save_batch since we know with confidence …

1190e65

…that `batch["object_id"]` will be used for the ids.

drewoldag requested changes Feb 4, 2026

View reviewed changes

src/hyrax/config_utils.py Show resolved Hide resolved

src/hyrax/verbs/train.py Show resolved Hide resolved

src/hyrax/pytorch_ignite.py Show resolved Hide resolved

Copilot started work on behalf of drewoldag February 4, 2026 22:52 View session

Copilot AI changed the title ~~Refactor: Extract duplicated code from verb modules into shared utilities~~ Refactor: Extract duplicated code from test and infer verbs into shared utilities Feb 4, 2026

Copilot AI requested a review from drewoldag February 4, 2026 23:00

Copilot finished work on behalf of drewoldag February 4, 2026 23:00

drewoldag marked this pull request as ready for review February 4, 2026 23:58

Copilot AI review requested due to automatic review settings February 4, 2026 23:58

Copilot started reviewing on behalf of drewoldag February 4, 2026 23:58 View session

Merge branch 'main' into copilot/refactor-code-structure

43af581

drewoldag requested a review from mtauraso February 5, 2026 00:02

Copilot AI reviewed Feb 5, 2026

View reviewed changes

mtauraso approved these changes Feb 5, 2026

View reviewed changes

Removing some unusued parameters from engine creation functions.

a2d44cc

drewoldag requested changes Feb 5, 2026

View reviewed changes

src/hyrax/pytorch_ignite.py Outdated Show resolved Hide resolved

src/hyrax/pytorch_ignite.py Outdated Show resolved Hide resolved

src/hyrax/pytorch_ignite.py Show resolved Hide resolved

Copilot started work on behalf of drewoldag February 5, 2026 00:14 View session

Copilot AI requested a review from drewoldag February 5, 2026 00:20

Copilot finished work on behalf of drewoldag February 5, 2026 00:20

drewoldag reviewed Feb 5, 2026

View reviewed changes

src/hyrax/pytorch_ignite.py Outdated Show resolved Hide resolved

Apply suggestion from @drewoldag

d94afb1

drewoldag approved these changes Feb 5, 2026

View reviewed changes

drewoldag enabled auto-merge (squash) February 5, 2026 00:31

drewoldag merged commit a800e9c into main Feb 5, 2026
8 of 9 checks passed

drewoldag deleted the copilot/refactor-code-structure branch February 5, 2026 00:32

drewoldag linked an issue Feb 5, 2026 that may be closed by this pull request

Pull _save_batch out of infer and test into single location #645

Closed

		data_loader_indexes : array-like
		Indices mapping data loader order to dataset order

Conversation

Copilot AI commented Jan 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Change Description

Solution Description

Code Quality

Uh oh!

codecov bot commented Jan 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Uh oh!

github-actions bot commented Feb 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

drewoldag left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Copilot AI Feb 5, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

mtauraso left a comment

Choose a reason for hiding this comment

Uh oh!

drewoldag left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

drewoldag left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Copilot AI commented Jan 28, 2026 •

edited

Loading

codecov bot commented Jan 28, 2026 •

edited

Loading

github-actions bot commented Feb 4, 2026 •

edited

Loading