Cross-year Aggregations #198

rmartz · 2016-12-27T15:22:06Z

For #169, we would like to be able to aggregate across years. This is not a simple task, but it should be workable with two modifications performed in conjunction.

Single-stage aggregation keys

Within our indicator system, filtering and aggregating by time takes two steps. First we filter by year, and then within each year we annotate a sub-key that groups the results into a common time aggregation.

We should streamline this and aggregate only by the annotated sharding key, which would be defined as the key used in the final result. That way, database results could look like this:

{'key': '2056-Q4', 'model': 17, 'value': 297}

Time aggregation would only have to be handled when deciding how to construct the key; once database results are returned, we can treat the key as a black box.
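As a rough illustration of the idea, key construction might look like the sketch below. The names `quarterly_key` and `aggregate_by_key` and the sample rows are hypothetical and not taken from the existing indicator code; the point is only that the time logic lives in building the key, and everything afterward groups by an opaque string.

```python
from collections import defaultdict
from datetime import date


def quarterly_key(d: date) -> str:
    """Build a final-result key such as '2056-Q4' (hypothetical helper)."""
    quarter = (d.month - 1) // 3 + 1
    return f"{d.year}-Q{quarter}"


def aggregate_by_key(rows):
    """Sum values per (key, model), treating the key as an opaque string."""
    totals = defaultdict(int)
    for row in rows:
        totals[(row["key"], row["model"])] += row["value"]
    return [
        {"key": key, "model": model, "value": value}
        for (key, model), value in sorted(totals.items())
    ]


# Made-up sample data, annotated with the key up front.
rows = [
    {"key": quarterly_key(date(2056, 10, 3)), "model": 17, "value": 100},
    {"key": quarterly_key(date(2056, 12, 30)), "model": 17, "value": 197},
    {"key": quarterly_key(date(2056, 2, 1)), "model": 17, "value": 5},
]
results = aggregate_by_key(rows)
```

Swapping in a different key builder (monthly, yearly, or a custom range) would not require any change to the grouping step, which is the appeal of the single-stage approach.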

Filter by year using aggregation range

This one is going to be trickier. If we want yearly aggregations that don't track the calendar year, we need a way to filter data points within a common range by whether the range fits the criteria, not by whether each individual data point does.

For instance, assuming we want to accumulate ranges by the year they start (rather than the year they conclude), if a user requests data for 2045-2055 using an aggregation spanning 6/1 to 5/31, we would want to include 4/17/2056 because it falls in the range starting in 2055, but not 1/1/2045 because it falls in the range starting in 2044, which is outside the year filter.
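The example can be checked with a small sketch. It assumes ranges are keyed by the year they start, and the helper names `aggregation_year` and `in_year_filter` are made up for illustration.

```python
from datetime import date


def aggregation_year(d: date, start_month: int, start_day: int) -> int:
    """Return the year in which the aggregation range containing d starts."""
    if (d.month, d.day) >= (start_month, start_day):
        return d.year
    # Dates before the range's start day belong to the range begun last year.
    return d.year - 1


def in_year_filter(d, first_year, last_year, start_month=6, start_day=1):
    """True if d's range (not necessarily d itself) falls in the year filter."""
    year = aggregation_year(d, start_month, start_day)
    return first_year <= year <= last_year


included = in_year_filter(date(2056, 4, 17), 2045, 2055)  # range starts 2055
excluded = in_year_filter(date(2045, 1, 1), 2045, 2055)   # range starts 2044
```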

To do this, we would need to:

1. Detect if a range spans the new year
2. If it does, split it into two ranges, one stopping at 12/31, the other starting on 1/1
3. Apply an offset to the year for points in the orphaned-year range (for instance, the latter, post-1/1 range if we want to key aggregations by the year they begin)
4. Filter data points by the calculated year
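The steps above could be sketched in plain Python as follows. This is only an outline under stated assumptions: ranges are given as `(month, day)` pairs, aggregations are keyed by the year they begin, and the function names are hypothetical. A real implementation would presumably express the same logic as database filters and annotations rather than in-memory loops.

```python
from datetime import date


def split_range(start, end):
    """Split a (month, day) range at the new year if it spans it.

    Returns (start, end, year_offset) pieces. Keying by the year the range
    begins, the post-1/1 piece gets an offset of -1.
    """
    if start <= end:
        return [(start, end, 0)]  # does not span the new year
    return [(start, (12, 31), 0), ((1, 1), end, -1)]


def calculated_year(d, pieces):
    """Return d's year adjusted by its piece's offset, or None if outside."""
    month_day = (d.month, d.day)
    for piece_start, piece_end, offset in pieces:
        if piece_start <= month_day <= piece_end:
            return d.year + offset
    return None


def filter_points(points, pieces, first_year, last_year):
    """Keep points whose calculated year falls within the year filter."""
    kept = []
    for d in points:
        year = calculated_year(d, pieces)
        if year is not None and first_year <= year <= last_year:
            kept.append(d)
    return kept


pieces = split_range((6, 1), (5, 31))
points = [date(2045, 1, 1), date(2045, 6, 1), date(2056, 4, 17), date(2056, 6, 1)]
kept = filter_points(points, pieces, 2045, 2055)
```

With the 6/1 to 5/31 range and a 2045-2055 filter, `kept` holds 6/1/2045 and 4/17/2056, while 1/1/2045 and 6/1/2056 are dropped.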

CloudNiner added the carded label Jan 12, 2017

CloudNiner modified the milestone: Sprint Ending: 1/26/2017 Jan 13, 2017

CloudNiner added the ready label Jan 18, 2017

CloudNiner added the to-do label Feb 1, 2017

CloudNiner removed this from the Sprint Ending: 1/26/2017 milestone Feb 1, 2017

rmartz added in progress and removed to-do labels Feb 6, 2017

fungjj92 added in progress and removed to-do labels Feb 8, 2017

CloudNiner assigned rmartz Feb 10, 2017

rmartz mentioned this issue Feb 10, 2017

Cross Year Aggregations #274

Merged

2 tasks

rmartz added in review and removed in progress labels Feb 15, 2017

rmartz added done and removed in review labels Feb 23, 2017

sharph closed this as completed Feb 23, 2017

sharph removed the done label Feb 23, 2017

hectcastro unassigned rmartz Jul 15, 2019
