-
Notifications
You must be signed in to change notification settings - Fork 30
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
forecast about autoregression #9
Comments
Hello, in the inference phase of Timer's autoregression, only the lookback window part is needed. During inference, Timer will provide the scrolling result of each segment in the lookback window in parallel, and we only need the result of the last segment as the final result. This process may differ from the autoregressive rolling in language models, but it is consistent in overall logic. |
Sorry, I'm a bit confused. Why do autoregressive predictions need to use data from other batches in the mentioned scenario? |
I contend that the source of confusion lies in the fact that pred_len and patch_len essentially serve as redundant parameters. In the model's implementation, each forward pass produces predictions for the subsequent patch_len timestamps. Conversely, within the dataloader, batch_x and batch_y are separated by a span of pred_len timestamps. A discrepancy in the sizes of pred_len and patch_len introduces logical inconsistencies. It should be noted that in scripts related to forecasting, both parameters are set to the same value. |
We released a new codebase OpenLTM, which contains a more detailed pipeline to pre-train and inference with large time-series models. A refined autoregression pipeline is also provided here. |
What I do: I have trained a data file and get a timer model for MS task. I wanted to use another file to test the result by this model so I change the dataset to read two file and change the index for test file,then set the is_finetuning to zeros.
What I think: However, I set all target to zeros.(I think autoregression by mask will be used in the test prediction). I found that the prediction still depending on labels. Is there a mistake in my understanding of the decoding of transformer? How can I make auto regression in Timer without using label?
The text was updated successfully, but these errors were encountered: