Description
What's needed?
For forecasting we have been using average load and production profiles.
Given a time series, we want a module that is able to calculate the item wise average of repeating time windows of the same size and the same sample-rate, possibly all having the same distance.
Example:
Consider the time series
Hence we get the sub-series
Note: A position could likewise be a timestamp in a usual time-series representation.
Proposed solution
We want to design such a module in a way such that it
- uses batches of data (most likely cached in memory)
- is designed to work together with the data structure defined here
- it should be able to run calculations on demand and memoize the result
Use cases
It's been used for calculating average load and production profiles certain days.
For example it could just calculate the average load of some consume of the last seven days as well as the average PV production of the last four Tuesdays.
Alternatives and workarounds
It's possible to run the calculation and update the mean on a data stream whenever a new sample arrives. However we plan on storing batches of data in any case, as this can be used as a feature for ml models, and thus calculating the averages from the batches seems obvious.
Additionally this option hasn't been further explored so far.
Additional context
We have had a use-case where we where looking at a full days averaged data plus a 15 minute margin. Thus we will need to add a margin for a window size in order to re-engineer this feature.
Metadata
Metadata
Assignees
Labels
Type
Projects
Status