Ad-hoc testing of Kerchunk engine compatibility with xCDAT #61

Open
tomvothecoder wants to merge 25 commits into main from kerchunk

Conversation

@tomvothecoder (Collaborator) commented Jan 12, 2026

Related to xCDAT/xcdat#812

Perform ad-hoc manual testing to confirm that xCDAT works seamlessly with Kerchunk without modifying any code or adding a formal test suite. The goal is to confirm functional parity between Kerchunk-backed datasets and traditional NetCDF I/O through exploratory testing and to document any issues found.

Key areas to check manually:

  1. Basic Open Behavior

    • Can open single-file and multi-file Kerchunk JSONs (see the first sketch after this list).
  2. Performance and Stability

    • Compare performance (random sample of n=40 datasets)
      • I/O aggregate metrics (median and mean)
      • I/O per-dataset metrics -- in progress
      • Investigate specific 3hr datasets where Kerchunk is slower
      • Investigate single-file-only datasets -- done
      • Add a test for .load() behavior on a subset in a new notebook -- in progress
    • Lazy loading works as expected (no data read on open).
    • No Dask graph errors or performance regressions.
  3. Metadata and CF Handling

    • Dataset contents match the same data opened via NetCDF.
    • CF axes (time, lat, lon, lev) are detected correctly.
    • Time decoding, bounds variables, and attributes are preserved.
  4. xCDAT Functionalities -- identical results? (see the second sketch after this list)

    • Temporal
    • Spatial
    • Horizontal
    • Vertical regridding
  5. xCDAT Functionalities -- performance differences and .load() behavior? (also covered by the second sketch)

    • Temporal
    • Spatial
    • Horizontal
    • Vertical regridding
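
A minimal sketch of the open-behavior and metadata checks (items 1 through 3), assuming a combined Kerchunk reference JSON and the matching NetCDF files. The paths and the `tas` variable name are placeholders, and the fsspec "reference" filesystem is used to expose the JSON as a Zarr-like store:

```python
# Sketch only: paths, variable name ("tas"), and chunking choices are assumptions.
import fsspec
import xarray as xr
import xcdat as xc

KERCHUNK_JSON = "refs/tas_Amon_combined.json"  # hypothetical combined reference
NETCDF_GLOB = "data/tas_Amon_*.nc"             # hypothetical source files

# Kerchunk-backed open: expose the reference JSON as a Zarr-like mapping.
fs = fsspec.filesystem("reference", fo=KERCHUNK_JSON)
ds_kerchunk = xc.open_dataset(
    fs.get_mapper(""),
    engine="zarr",
    backend_kwargs={"consolidated": False},
    chunks={},
)

# Baseline: traditional NetCDF I/O through xCDAT.
ds_netcdf = xc.open_mfdataset(NETCDF_GLOB, chunks={})

# Lazy loading: the variable should still be Dask-backed right after open.
assert ds_kerchunk["tas"].chunks is not None

# CF axis detection: the same coordinate should resolve for each axis.
for axis in ("T", "X", "Y"):
    assert xc.get_dim_coords(ds_kerchunk, axis).name == xc.get_dim_coords(ds_netcdf, axis).name

# Content parity: time decoding, bounds, and attributes preserved.
xr.testing.assert_identical(ds_kerchunk["tas"], ds_netcdf["tas"])
```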
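A second sketch for the functionality and timing checks (items 4 and 5), reusing `ds_kerchunk` and `ds_netcdf` from the sketch above. The operations shown (temporal group average, spatial average) stand in for the full list, and the subset size is arbitrary:

```python
# Sketch only: reuses ds_kerchunk / ds_netcdf from the previous snippet.
import time

import xarray as xr


def timed(label, func):
    """Run func(), print elapsed wall-clock seconds, and return the result."""
    start = time.perf_counter()
    result = func()
    print(f"{label}: {time.perf_counter() - start:.2f} s")
    return result


# Temporal averaging: results should be numerically identical.
t_kc = timed("temporal (kerchunk)", lambda: ds_kerchunk.temporal.group_average("tas", freq="year")["tas"].load())
t_nc = timed("temporal (netcdf)", lambda: ds_netcdf.temporal.group_average("tas", freq="year")["tas"].load())
xr.testing.assert_allclose(t_kc, t_nc)

# Spatial averaging over X/Y: same check.
s_kc = timed("spatial (kerchunk)", lambda: ds_kerchunk.spatial.average("tas", axis=["X", "Y"])["tas"].load())
s_nc = timed("spatial (netcdf)", lambda: ds_netcdf.spatial.average("tas", axis=["X", "Y"])["tas"].load())
xr.testing.assert_allclose(s_kc, s_nc)

# .load() on a subset: compare eager read performance through both backends.
timed("load subset (kerchunk)", lambda: ds_kerchunk["tas"].isel(time=slice(0, 12)).load())
timed("load subset (netcdf)", lambda: ds_netcdf["tas"].isel(time=slice(0, 12)).load())
```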

@tomvothecoder self-assigned this Jan 12, 2026
@tomvothecoder (Collaborator, Author) commented Jan 12, 2026

Notes from 01/12/26 meeting

Debug

  • Loop over CFsubhr datasets and pin down individual timings; capture outliers or issues (extends the first task into the script extensions below)

Script Extensions

  • Use pandas to store timings (see the sketch after this list)
  • Capture timing for each JSON/NetCDF pairing
  • Add the number of timesteps for each pairing
  • Add the number of netCDF files for each pairing
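
A sketch of what such a timing script could look like, assuming a list of (reference JSON, NetCDF glob) pairings. The paths, the `time` dimension name, and the CSV output name are placeholders:

```python
# Sketch only: pairings, paths, and the "time" dimension name are assumptions.
import glob
import time

import fsspec
import pandas as pd
import xcdat as xc

PAIRINGS = [
    ("refs/tas_Amon_combined.json", "data/tas_Amon_*.nc"),
    ("refs/pr_3hr_combined.json", "data/pr_3hr_*.nc"),
]

rows = []
for json_path, nc_glob in PAIRINGS:
    nc_files = sorted(glob.glob(nc_glob))

    # Time the traditional multi-file NetCDF open.
    start = time.perf_counter()
    ds_nc = xc.open_mfdataset(nc_files, chunks={})
    netcdf_open_s = time.perf_counter() - start

    # Time the Kerchunk-backed open through the fsspec reference filesystem.
    start = time.perf_counter()
    fs = fsspec.filesystem("reference", fo=json_path)
    ds_kc = xc.open_dataset(
        fs.get_mapper(""),
        engine="zarr",
        backend_kwargs={"consolidated": False},
        chunks={},
    )
    kerchunk_open_s = time.perf_counter() - start

    rows.append(
        {
            "json": json_path,
            "n_netcdf_files": len(nc_files),
            "n_timesteps": ds_nc.sizes.get("time"),
            "netcdf_open_s": netcdf_open_s,
            "kerchunk_open_s": kerchunk_open_s,
        }
    )

df = pd.DataFrame(rows)
df.to_csv("kerchunk_open_timings.csv", index=False)
print(df.sort_values("kerchunk_open_s", ascending=False))
```

Sorting the resulting table by the Kerchunk timings should make outliers (e.g., the slower 3hr or CFsubhr cases) easy to pin down.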

Notes

  • Steve suggests that frequency might not be the biggest factor in the speed difference; other factors or issues may be at play. Capturing more granular information will allow us to extract more insight as needed.
