`CStringStore` Unit tests write way too many logs #302

benwtrent · 2018-11-02T19:49:56Z

CStringStore unit tests write thousands of logs like the following:

2018-11-02 15:49:24,988 UTC [3922] ERROR CBucketGatherer.cc@451 Sample start time 166605000 is not bucket aligned
2018-11-02 15:49:24,988 UTC [3922] ERROR CBucketGatherer.cc@451 Sample start time 166605000 is not bucket aligned
2018-11-02 15:49:24,988 UTC [3922] ERROR CBucketGatherer.cc@451 Sample start time 166605000 is not bucket aligned
2018-11-02 15:49:24,988 UTC [3922] ERROR CBucketGatherer.cc@451 Sample start time 166615000 is not bucket aligned
2018-11-02 15:49:24,988 UTC [3922] ERROR CBucketGatherer.cc@451 Sample start time 166615000 is not bucket aligned

We need to either fix the test or mute these particular logs when they are run from a unit test.


benwtrent · 2018-11-09T18:01:52Z

Found the path that is causing this:

In all of the tests a non-zero bucket delay is added:

ml-cpp/lib/api/unittest/CStringStoreTest.cc

Line 131 in 0328bdb

modelConfig.bucketResultsDelay(2);

This trips this conditional:

ml-cpp/lib/model/CAnomalyDetector.cc

Lines 370 to 372 in 0328bdb

    
    if (m_ModelConfig.bucketResultsDelay()) {
        bucketLength /= 2;
    }

This cuts bucketLength in half, so the sample start/end times that follow are off by half a bucket length. Then, when:

ml-cpp/lib/model/CAnomalyDetector.cc

Lines 388 to 390 in 0328bdb

    
    void CAnomalyDetector::sample(core_t::TTime startTime,
                                  core_t::TTime endTime,
                                  CResourceMonitor& resourceMonitor) {

ml-cpp/lib/model/CAnomalyDetector.cc

Lines 398 to 400 in 0328bdb

    
    for (core_t::TTime time = startTime; time < endTime; time += bucketLength) {
        m_Model->sample(time, time + bucketLength, resourceMonitor);
    }

is called, certain models validate the sample times

e.g. CCountingModel::sample:

ml-cpp/lib/model/CCountingModel.cc

Lines 210 to 219 in 4dd90fa

    
    void CCountingModel::sample(core_t::TTime startTime,
                                core_t::TTime endTime,
                                CResourceMonitor& resourceMonitor) {
        CDataGatherer& gatherer = this->dataGatherer();
        m_ScheduledEventDescriptions.clear();
        if (!gatherer.validateSampleTimes(startTime, endTime)) {
            return;
        }

Which calls:

ml-cpp/lib/model/CBucketGatherer.cc

Lines 450 to 453 in 0328bdb

    
    if (!maths::CIntegerTools::aligned(startTime - m_BucketStart, this->bucketLength())) {
        LOG_ERROR(<< "Sample start time " << startTime << " is not bucket aligned");
        return false;
    }
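To make the chain above concrete, here is a minimal standalone sketch of the interaction. It is not the ml-cpp code: `TTime`, `aligned` and `misalignedStarts` are hypothetical stand-ins for `core_t::TTime`, `maths::CIntegerTools::aligned` and the detector/gatherer pair, and it assumes a fixed bucket start rather than one that advances as the gatherer moves on. It only shows that stepping by half a bucket while validating against the full bucket length makes every other start time fail the check.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Stand-in for core_t::TTime (assumption: a signed integer time in seconds).
using TTime = std::int64_t;

// Stand-in for maths::CIntegerTools::aligned: true if value is a
// multiple of alignment.
bool aligned(TTime value, TTime alignment) {
    assert(alignment > 0);
    return value % alignment == 0;
}

// Mimics the detector loop: when a results delay is configured the loop
// steps by half a bucket, but the alignment check still uses the full
// bucket length. Returns the start times that would be rejected.
std::vector<TTime> misalignedStarts(TTime bucketStart,
                                    TTime startTime,
                                    TTime endTime,
                                    TTime bucketLength,
                                    bool hasResultsDelay) {
    TTime step = hasResultsDelay ? bucketLength / 2 : bucketLength;
    std::vector<TTime> rejected;
    for (TTime time = startTime; time < endTime; time += step) {
        if (!aligned(time - bucketStart, bucketLength)) {
            rejected.push_back(time); // the real code logs an error here
        }
    }
    return rejected;
}
```

With a bucket length of 10 over the range [0, 40), the half-bucket loop visits 0, 5, 10, ..., 35 and the starts 5, 15, 25 and 35 are rejected. With no delay the loop steps by the full bucket and nothing is rejected.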

I can make this stop by having CStringStoreTest not set the bucketDelay and let it use the default of 0.

What say y'all @tveasey @droberts195 ?

droberts195 · 2018-11-09T18:16:54Z

bucketDelay corresponds to the config option result_finalization_window, and we still have that in our Java side AnalysisConfig objects. What happens in the test makes me think that if anyone ever set that to a non-zero value then we could get this problem in production.

We need to check the code in more detail, so don't make any changes today, but since we do little/no testing with non-zero result_finalization_windows I'm wondering if we should remove all trace of it from the product.

droberts195 · 2018-11-12T17:42:33Z

result_finalization_window is undocumented in our public analysis config documentation. Now we have our multi-bucket feature it's unlikely we'd ever want to resurrect the old Prelert attempt at multiple bucket lengths, and, if I remember correctly, result_finalization_window was designed to work with that.

Based on this issue there's clearly a bug that's triggered if this option is used. Since it's undocumented my preference would be to change the code to completely remove it in 6.6.

But before this is done, can you think of any reason to keep result_finalization_window aka bucketResultsDelay @tveasey and @dimitris-athanasiou?

tveasey · 2018-11-15T14:17:07Z

This was also for implementing out-of-phase analysis to mitigate bucketing partially sampling important features in the time series.

However, I don't think that this is something we are going to prioritise reviving. I'm not sure it is worth the complexity and there is overlap with the new multi-bucket functionality.

Removing this also means we don't need result serialisation and various other code. I'd be +1 for cleaning this up now, especially since it appears to not be working correctly.

droberts195 · 2018-11-15T14:27:28Z

Good point. If we get rid of result_finalization_window then we should get rid of overlapping_buckets at the same time.

I agree that they're less likely to be needed (and probably too complex to understand) now we have multi-bucket functionality.

benwtrent added >non-issue >test labels Nov 2, 2018

benwtrent self-assigned this Nov 2, 2018

benwtrent mentioned this issue Nov 16, 2018

Removes two unused AnalysisConfig options elastic/elasticsearch#35645

Merged

dimitris-athanasiou mentioned this issue Nov 20, 2018

[ML] Remove out-of-phase buckets feature #318

Merged

dimitris-athanasiou closed this as completed in #318 Nov 22, 2018
