How to use the pre-trained or fine-tuned model for high-frequency and long-term data? #110
Replies: 5 comments
-
Well, you have a context length of 512 and a prediction length of 64, and you shift by 64 each round, so the original data that produced the first 64 predictions gets shifted out of the context. Could it be that you are running inference on data the model has been fine-tuned on, and that the model starts performing worse as this data is shifted away?
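A quick back-of-the-envelope check of the point above: with a context length of 512 and a stride of 64, the observed data is fully shifted out of the window after 512 / 64 = 8 rounds, which roughly lines up with the degradation reported around round 7.

```python
# Count how many observed (non-predicted) values remain in the context
# after each autoregressive round, for context_len = 512, pred_len = 64.
context_len, pred_len = 512, 64

for round_idx in range(1, 10):
    observed = max(context_len - round_idx * pred_len, 0)
    print(f"round {round_idx}: {observed} observed values left in context")
# From round 8 onward the context consists entirely of model predictions.
```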
-
I was fine-tuning on this data, but for up to 1e6 steps. I am new to this model. May I ask whether the fine-tuning loops over all the context windows or just the first window?
-
It depends on what you put into your .arrow file that you referenced here |
Beta Was this translation helpful? Give feedback.
-
My input
-
@XiaoqZhang The training script will sample random cuts from your time series during training, so you don't really need to worry about the lengths of the time series. They can be of any length. Regarding the plot above, I am not exactly sure what is going on. I have a couple of questions:
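The random-cut sampling described above can be sketched roughly like this (illustrative only, not the repo's actual implementation): each training step slices a random window of context_len + pred_len points from a randomly chosen series, so every part of every series can be seen regardless of length.

```python
import numpy as np

rng = np.random.default_rng(0)

def sample_training_window(series_list, context_len=512, pred_len=64):
    """Draw one (context, target) pair by cutting a random window from a
    randomly chosen series. For brevity, series shorter than the window
    are skipped here; in practice they would be padded."""
    window = context_len + pred_len
    candidates = [s for s in series_list if len(s) >= window]
    series = candidates[rng.integers(len(candidates))]
    start = rng.integers(0, len(series) - window + 1)
    cut = series[start:start + window]
    return cut[:context_len], cut[context_len:]

# Usage: series of different lengths both contribute windows.
data = [np.sin(np.arange(2000) / 7.0), np.cos(np.arange(800) / 5.0)]
ctx, tgt = sample_training_window(data)
print(ctx.shape, tgt.shape)  # (512,) (64,)
```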
-
Hello, I am interested in using this model to predict high-frequency (1 s) and long-horizon (1e6 to 5e7 s) data. I fine-tuned the chronos-t5-mini model with the configuration below.

Since the model is suggested to predict at most 64 timesteps at a time, I had it predict 64 steps, then used those predictions as context and asked for the next 64 predictions, and so on. The predictions performed quite well for the first 6 rounds. From the 7th round onward, the amplitude of the predictions dropped sharply and they converged to 0, as shown in the plot below. Even though the model performs quite well up to around 1000 steps, that is far short of the horizon I need. Have you tested a case like this, or do you have any suggestions? I have considered fine-tuning the model with a larger context and prediction length, but that cannot solve the fundamental problem, due to GPU memory limits.