
Added PatchTSMixer model#404

Open
Jasmine-Yuting-Zhang wants to merge 34 commits into main from PatchTSMixer

Conversation

@Jasmine-Yuting-Zhang
Collaborator

This PR introduces support for the PatchTSMixer model in the Plato federated learning framework for time series forecasting tasks.

Description

Specifically, this PR:

  • Added ETT.py to support the Electricity Transformer Temperature (ETT) dataset, including data loading, preprocessing, and federated partitioning logic.
  • Integrated the PatchTSMixer model architecture from HuggingFace Transformers for time series forecasting within Plato.
  • Added TOML configuration files for PatchTSMixer experiments under configs/TimeSeries/.
  • Added mean squared error (MSE)–based evaluation for PatchTSMixer experiments.
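The MSE-based evaluation mentioned above can be sketched in plain Python. This is a minimal illustration, not the implementation in this PR; the `mse` helper is hypothetical:

```python
def mse(predictions, targets):
    """Mean squared error between two equal-length sequences of floats."""
    if len(predictions) != len(targets):
        raise ValueError("predictions and targets must have the same length")
    return sum((p - t) ** 2 for p, t in zip(predictions, targets)) / len(predictions)

# Perfect predictions give an MSE of 0.0.
print(mse([1.0, 2.0, 3.0], [1.0, 2.0, 3.0]))  # 0.0
# Errors of 1 and 2 give ((1)^2 + (2)^2) / 2 = 2.5.
print(mse([1.0, 2.0], [2.0, 4.0]))  # 2.5
```

In the federated setting, each client would typically compute this over its local test split and the server would aggregate the per-client values.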

How has this been tested?

Quick check evaluation:

uv run python plato.py --config configs/TimeSeries/patchtsmixer_custom.toml

This configuration runs only 3 rounds, which is useful for quick functional tests and CORE-style checks. The run completed successfully without runtime errors.

Longer training run:

uv run python plato.py --config configs/TimeSeries/patchtsmixer_large.toml

This configuration uses more rounds. After 400 rounds, the MSE dropped from 7.14 to around 1.30, indicating that the model and data pipeline are working as expected.

Types of changes

  • Bug fix (non-breaking change which fixes an issue) Fixes #
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)

Checklist:

  • My code has been formatted using the Ruff formatter (ruff format) and checked using the Ruff linter (ruff check --fix).
  • My change requires a change to the documentation.
  • I have updated the documentation accordingly.

baochunli and others added 30 commits October 28, 2025 08:51
- Resolved a RuntimeError caused by non-contiguous tensors during view operations (in nanochat's gpt.py):
"view size is not compatible with input tensor's size and stride...". Replaced .view() with .reshape()
- Resolved an issue where the configuration requested 'train_loss' in the results, but the server's get_logged_items() did not include it.
- Avoided a vocabulary size mismatch between the model and the tokenizer during CORE evaluation.
- Updated log message from "global accuracy" to "Average Centered CORE benchmark metric"
- Used ruff to format code
- Added instructions for initializing submodules and resolving maturin build failure.
- Included configurations for both pre-trained and custom modes.
@netlify

netlify bot commented Dec 1, 2025

Deploy Preview for platodocs canceled.

Name Link
🔨 Latest commit 20ab574
🔍 Latest deploy log https://app.netlify.com/projects/platodocs/deploys/692f507818245f0008203aed

- Used Open-Meteo Archive API for hourly inputs.
- Interpolated to 5-min resolution with a linear method.
- Added TOML config files (tunable for better results).
- Formatted code with ruff.
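The hourly-to-5-minute linear interpolation mentioned in the commits above can be sketched in pure Python (the actual pipeline presumably uses pandas-style resampling; the function name here is illustrative):

```python
def interpolate_linear(hourly_values, steps_per_hour=12):
    """Linearly interpolate between consecutive hourly samples.

    With steps_per_hour=12, each hour is split into 5-minute steps.
    Returns a list spanning the input range, inclusive of the last sample.
    """
    out = []
    for a, b in zip(hourly_values, hourly_values[1:]):
        for k in range(steps_per_hour):
            # Fraction k/steps_per_hour of the way from a to b.
            out.append(a + (b - a) * k / steps_per_hour)
    out.append(hourly_values[-1])
    return out

# Two hourly samples become 13 evenly spaced 5-minute points.
values = interpolate_linear([10.0, 22.0], steps_per_hour=12)
```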
@netlify

netlify bot commented Feb 3, 2026

Deploy Preview for platodocs ready!

Name Link
🔨 Latest commit e76f09a
🔍 Latest deploy log https://app.netlify.com/projects/platodocs/deploys/6981786b87c6af00084cf859
😎 Deploy Preview https://deploy-preview-404--platodocs.netlify.app

)
logging.info(
    "Location: lat=%.2f, lon=%.2f, historical_days=%d",
    latitude,

Check failure

Code scanning / CodeQL

Clear-text logging of sensitive information High

This expression logs sensitive data (private) as clear text.

Copilot Autofix

AI 4 days ago

In general, the fix is to avoid logging sensitive data such as raw geographic coordinates. Instead, log a non‑sensitive label or a redacted/generalized form that still provides observability without exposing private information.

Concretely for plato/datasources/openmeteo.py, we should change the logging.info call that currently logs lat=%.2f, lon=%.2f, historical_days=%d with latitude, longitude, and historical_days. The simplest safe approach that preserves intent is to stop logging the numeric coordinates and keep only non‑sensitive context such as location_name (already logged in the previous logging.info) and historical_days. For example, we can log "Location configuration: historical_days=%d" or "Location configuration: name=%s, historical_days=%d" using location_name instead of coordinates. This keeps functionality identical; only the log message changes.

No new imports or helper methods are required; we just modify the existing log statement in that file/region.
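The suggested fix can be sketched as follows, assuming `location_name` and `historical_days` are in scope (both names come from the autofix text above; this snippet builds the message separately only so it can be inspected):

```python
import logging

def location_log_message(location_name, historical_days):
    """Build the redacted log line: no raw latitude/longitude appears."""
    return "Location configuration: name=%s, historical_days=%d" % (
        location_name,
        historical_days,
    )

# The coordinates are still used elsewhere for the API request;
# only the log output changes.
message = location_log_message("Toronto", 365)
logging.getLogger(__name__).info(message)
```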

Suggested changeset 1
plato/datasources/openmeteo.py

Autofix patch

Autofix patch
Run the following command in your local git repository to apply this patch
cat << 'EOF' | git apply
diff --git a/plato/datasources/openmeteo.py b/plato/datasources/openmeteo.py
--- a/plato/datasources/openmeteo.py
+++ b/plato/datasources/openmeteo.py
@@ -142,9 +142,8 @@
             task_config["description"],
         )
         logging.info(
-            "Location: lat=%.2f, lon=%.2f, historical_days=%d",
-            latitude,
-            longitude,
+            "Location configuration: name=%s, historical_days=%d",
+            location_name,
             historical_days,
         )
         logging.info("Variables: %s", ", ".join(variables))
EOF
Copilot is powered by AI and may make mistakes. Always verify output.
logging.info(
    "Location: lat=%.2f, lon=%.2f, historical_days=%d",
    latitude,
    longitude,

Check failure

Code scanning / CodeQL

Clear-text logging of sensitive information High

This expression logs sensitive data (private) as clear text.

Copilot Autofix

AI 4 days ago

In general, to fix clear-text logging of sensitive information, either stop logging the sensitive fields entirely, or sanitize them so that only non-sensitive/less sensitive derivatives (e.g., coarse-grained, masked, or redacted values) are logged. The rest of the functionality (in this case, fetching weather data based on actual coordinates) should continue to use the full-precision values; only the log output should change.

Here, the best minimal fix is to avoid logging the raw latitude and longitude in clear text while preserving useful diagnostic context. We can do this by:

  • Removing latitude and longitude from the formatted log line, and instead
  • Logging only non-sensitive, high-level information, such as location_name, historical_days, and the selected task_type/description; or
  • If coordinates are still desired for debugging, logging a coarse/rounded or redacted version (e.g., to the nearest whole degree or replacing them with [REDACTED]).

To keep changes minimal and avoid assumptions about what is sensitive, I will treat the numeric coordinates as sensitive and remove them from the log message, while still logging historical_days. Concretely, in plato/datasources/openmeteo.py:

  • Locate the logging.info call around lines 144–149 that logs "Location: lat=%.2f, lon=%.2f, historical_days=%d" with latitude, longitude, historical_days.
  • Replace it with a log line that does not include latitude or longitude in clear text, for example: "Location configured: historical_days=%d" or "Location configured for %s: historical_days=%d" using location_name and historical_days.
  • No new imports or helper functions are needed; we only change the string and arguments of the existing log call.

This change ensures that the tainted longitude (and latitude) no longer flow into the logging sink, addressing all alert variants referencing that call, while leaving how the coordinates are used elsewhere untouched.

Suggested changeset 1
plato/datasources/openmeteo.py

Autofix patch

Autofix patch
Run the following command in your local git repository to apply this patch
cat << 'EOF' | git apply
diff --git a/plato/datasources/openmeteo.py b/plato/datasources/openmeteo.py
--- a/plato/datasources/openmeteo.py
+++ b/plato/datasources/openmeteo.py
@@ -142,9 +142,8 @@
             task_config["description"],
         )
         logging.info(
-            "Location: lat=%.2f, lon=%.2f, historical_days=%d",
-            latitude,
-            longitude,
+            "Location configured for %s: historical_days=%d",
+            location_name,
             historical_days,
         )
         logging.info("Variables: %s", ", ".join(variables))
EOF
) -> str:
    """Generate a unique cache key based on request parameters."""
    key_string = f"{latitude}_{longitude}_{start_date}_{end_date}_{'_'.join(sorted(variables))}_{target_freq}"
    return hashlib.md5(key_string.encode()).hexdigest()

Check failure

Code scanning / CodeQL

Use of a broken or weak cryptographic hashing algorithm on sensitive data High

Sensitive data (private) is used in a hashing algorithm (MD5) that is insecure.

Copilot Autofix

AI 4 days ago

In general, to fix this kind of issue you should avoid MD5 (and other broken hashes like SHA‑1) when hashing potentially sensitive data, even if only for identifiers. Instead, use a modern, collision-resistant hash function such as SHA‑256 (for general hashing) or a dedicated password hashing scheme for credentials. For non-security uses like cache keys, SHA‑256 is a drop‑in replacement for MD5.

The single best fix here is to change _generate_cache_key in plato/utils/openmeteo_api.py to use hashlib.sha256 instead of hashlib.md5. This preserves the behavior (a deterministic hex string derived from the same input) but uses a strong hash. No other logic needs to change, and all callers will continue to work since the function still returns a hex string. We should also keep the hashlib import, since we are still using it.

Concretely:

  • In plato/utils/openmeteo_api.py, update line 29:
    • From: return hashlib.md5(key_string.encode()).hexdigest()
    • To: return hashlib.sha256(key_string.encode()).hexdigest()
  • No changes are required in plato/datasources/openmeteo.py or elsewhere.
  • No new imports or helper methods are needed; hashlib.sha256 is part of the standard library and already available via the existing import hashlib.
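The swap really is a one-line change: both functions return deterministic hex strings, so existing cache-lookup logic keeps working. This standalone snippet just contrasts the two digests (the `key_string` value is made up for illustration):

```python
import hashlib

key_string = "43.65_-79.38_2024-01-01_2024-12-31_temperature_5min"

# MD5: 128-bit digest, 32 hex characters -- considered broken.
md5_key = hashlib.md5(key_string.encode()).hexdigest()
# SHA-256: 256-bit digest, 64 hex characters -- a strong drop-in replacement.
sha256_key = hashlib.sha256(key_string.encode()).hexdigest()

# Both are deterministic: hashing the same string twice yields the same key,
# so callers that treat the key as an opaque string are unaffected.
```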
Suggested changeset 1
plato/utils/openmeteo_api.py

Autofix patch

Autofix patch
Run the following command in your local git repository to apply this patch
cat << 'EOF' | git apply
diff --git a/plato/utils/openmeteo_api.py b/plato/utils/openmeteo_api.py
--- a/plato/utils/openmeteo_api.py
+++ b/plato/utils/openmeteo_api.py
@@ -26,7 +26,7 @@
 ) -> str:
     """Generate a unique cache key based on request parameters."""
     key_string = f"{latitude}_{longitude}_{start_date}_{end_date}_{'_'.join(sorted(variables))}_{target_freq}"
-    return hashlib.md5(key_string.encode()).hexdigest()
+    return hashlib.sha256(key_string.encode()).hexdigest()
 
 
 def _get_cache_path(cache_dir: Path, cache_key: str) -> Path:
EOF
