-
Notifications
You must be signed in to change notification settings - Fork 4.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
geoip provider of maxmind flaky #35829
Comments
If fixing this require some time, could we disable this test first? @nezdolik |
There is another type of flake in asan that needs fixing:
|
@nezdolik not sure if its the same issue but im seeing this fail asan quite a bit (in other testing repo) - and its failed postsubmit here possibly unrelated but ive also seen this fail tsan a few times |
@phlax yup asan still needs addressing. Did not notice that issue got closed. Did you see tsan failures after 27th August? |
yeah saw tsan issues yesterday, altho seemed to go away after rerunning - probably relevant is that the workers it was running on are pretty resource-constrained |
so, im doing a load of testing that are non-cached, and im seeing this quite a bit in compile-time-options also envoy_reloadable_features_mmdb_files_reload_enabled to: true
[ OK ] TestName/MmdbReloadImplTest.MmdbNotReloadedRuntimeFeatureDisabled/2 (398 ms)
[----------] 9 tests from TestName/MmdbReloadImplTest (3787 ms total)
[----------] Global test environment tear-down
[==========] 23 tests from 4 test suites ran. (14742 ms total)
[ PASSED ] 23 tests.
Have memory regions w/o callers: might report false leaks
Leak check _main_ detected leaks of 144 bytes in 3 objects
The 2 largest leaks:
*** WARNING: Cannot convert addresses to symbols in output below.
*** Reason: Cannot find 'pprof' (is PPROF_PATH set correctly?)
*** If you cannot fix this, try running pprof directly.
Leak of 96 bytes in 2 objects allocated from:
@ 1ecfe15
@ 1ecfd7d |
which i think doesnt fail anything - so may be expected/related to test - but also wondering if its related to issues seen elsewhere |
contrary to what i said above it does fail the CI also on uncached runs it seems ~50/50 whether this passes @nezdolik i think this issue is pretty high priority given the frequency this is failing stuff, and the nature of what is failing |
this is a tail of the tsan error Thread T10 'mmdb_reload_rou' (tid=14608, running) created by main thread at:
#0 pthread_create ??:? (geoip_provider_test+0x246892d)
#1 Envoy::Thread::PosixThreadFactory::createPthread(Envoy::Thread::ThreadHandle*) ??:? (geoip_provider_test+0x6cb4752)
#2 Envoy::Thread::PosixThreadFactory::createThread(std::__1::function<void ()>, std::__1::optional<Envoy::Thread::Options> const&, bool) ??:? (geoip_provider_test+0x6cb48c7)
#3 Envoy::Thread::PosixThreadFactory::createThread(std::__1::function<void ()>, std::__1::optional<Envoy::Thread::Options> const&) ??:? (geoip_provider_test+0x6cb4631)
#4 Envoy::Extensions::GeoipProviders::Maxmind::GeoipProvider::GeoipProvider(Envoy::Event::Dispatcher&, Envoy::Api::Api&, std::__1::shared_ptr<Envoy::Singleton::Instance>, std::__1::shared_ptr<Envoy::Extensions::GeoipProviders::Maxmind::GeoipProviderConfig>) ??:? (geoip_provider_test+0x268781d)
#5 Envoy::Extensions::GeoipProviders::Maxmind::GeoipProvider* std::__1::construct_at<Envoy::Extensions::GeoipProviders::Maxmind::GeoipProvider, Envoy::Event::Dispatcher&, Envoy::Api::Api&, std::__1::shared_ptr<Envoy::Extensions::GeoipProviders::Maxmind::DriverSingleton>&, std::__1::shared_ptr<Envoy::Extensions::GeoipProviders::Maxmind::GeoipProviderConfig> const&, Envoy::Extensions::GeoipProviders::Maxmind::GeoipProvider*>(Envoy::Extensions::GeoipProviders::Maxmind::GeoipProvider*, Envoy::Event::Dispatcher&, Envoy::Api::Api&, std::__1::shared_ptr<Envoy::Extensions::GeoipProviders::Maxmind::DriverSingleton>&, std::__1::shared_ptr<Envoy::Extensions::GeoipProviders::Maxmind::GeoipProviderConfig> const&) config.cc:? (geoip_provider_test+0x26580c4)
#6 void std::__1::allocator_traits<std::__1::allocator<Envoy::Extensions::GeoipProviders::Maxmind::GeoipProvider> >::construct<Envoy::Extensions::GeoipProviders::Maxmind::GeoipProvider, Envoy::Event::Dispatcher&, Envoy::Api::Api&, std::__1::shared_ptr<Envoy::Extensions::GeoipProviders::Maxmind::DriverSingleton>&, std::__1::shared_ptr<Envoy::Extensions::GeoipProviders::Maxmind::GeoipProviderConfig> const&, void, void>(std::__1::allocator<Envoy::Extensions::GeoipProviders::Maxmind::GeoipProvider>&, Envoy::Extensions::GeoipProviders::Maxmind::GeoipProvider*, Envoy::Event::Dispatcher&, Envoy::Api::Api&, std::__1::shared_ptr<Envoy::Extensions::GeoipProviders::Maxmind::DriverSingleton>&, std::__1::shared_ptr<Envoy::Extensions::GeoipProviders::Maxmind::GeoipProviderConfig> const&) ??:? (geoip_provider_test+0x2657d5b)
#7 std::__1::__shared_ptr_emplace<Envoy::Extensions::GeoipProviders::Maxmind::GeoipProvider, std::__1::allocator<Envoy::Extensions::GeoipProviders::Maxmind::GeoipProvider> >::__shared_ptr_emplace<Envoy::Event::Dispatcher&, Envoy::Api::Api&, std::__1::shared_ptr<Envoy::Extensions::GeoipProviders::Maxmind::DriverSingleton>&, std::__1::shared_ptr<Envoy::Extensions::GeoipProviders::Maxmind::GeoipProviderConfig> const&>(std::__1::allocator<Envoy::Extensions::GeoipProviders::Maxmind::GeoipProvider>, Envoy::Event::Dispatcher&, Envoy::Api::Api&, std::__1::shared_ptr<Envoy::Extensions::GeoipProviders::Maxmind::DriverSingleton>&, std::__1::shared_ptr<Envoy::Extensions::GeoipProviders::Maxmind::GeoipProviderConfig> const&) ??:? (geoip_provider_test+0x265781f)
#8 std::__1::shared_ptr<Envoy::Extensions::GeoipProviders::Maxmind::GeoipProvider> std::__1::allocate_shared<Envoy::Extensions::GeoipProviders::Maxmind::GeoipProvider, std::__1::allocator<Envoy::Extensions::GeoipProviders::Maxmind::GeoipProvider>, Envoy::Event::Dispatcher&, Envoy::Api::Api&, std::__1::shared_ptr<Envoy::Extensions::GeoipProviders::Maxmind::DriverSingleton>&, std::__1::shared_ptr<Envoy::Extensions::GeoipProviders::Maxmind::GeoipProviderConfig> const&, void>(std::__1::allocator<Envoy::Extensions::GeoipProviders::Maxmind::GeoipProvider> const&, Envoy::Event::Dispatcher&, Envoy::Api::Api&, std::__1::shared_ptr<Envoy::Extensions::GeoipProviders::Maxmind::DriverSingleton>&, std::__1::shared_ptr<Envoy::Extensions::GeoipProviders::Maxmind::GeoipProviderConfig> const&) config.cc:? (geoip_provider_test+0x26574a2)
#9 std::__1::shared_ptr<Envoy::Extensions::GeoipProviders::Maxmind::GeoipProvider> std::__1::make_shared<Envoy::Extensions::GeoipProviders::Maxmind::GeoipProvider, Envoy::Event::Dispatcher&, Envoy::Api::Api&, std::__1::shared_ptr<Envoy::Extensions::GeoipProviders::Maxmind::DriverSingleton>&, std::__1::shared_ptr<Envoy::Extensions::GeoipProviders::Maxmind::GeoipProviderConfig> const&, void>(Envoy::Event::Dispatcher&, Envoy::Api::Api&, std::__1::shared_ptr<Envoy::Extensions::GeoipProviders::Maxmind::DriverSingleton>&, std::__1::shared_ptr<Envoy::Extensions::GeoipProviders::Maxmind::GeoipProviderConfig> const&) config.cc:? (geoip_provider_test+0x26532fc)
#10 Envoy::Extensions::GeoipProviders::Maxmind::DriverSingleton::get(std::__1::shared_ptr<Envoy::Extensions::GeoipProviders::Maxmind::DriverSingleton>, envoy::extensions::geoip_providers::maxmind::v3::MaxMindConfig const&, std::__1::basic_string<char, std::__1::char_traits<char>, std::__1::allocator<char> > const&, Envoy::Server::Configuration::FactoryContext&) ??:? (geoip_provider_test+0x2651a72)
#11 Envoy::Extensions::GeoipProviders::Maxmind::MaxmindProviderFactory::createGeoipProviderDriverTyped(envoy::extensions::geoip_providers::maxmind::v3::MaxMindConfig const&, std::__1::basic_string<char, std::__1::char_traits<char>, std::__1::allocator<char> > const&, Envoy::Server::Configuration::FactoryContext&) ??:? (geoip_provider_test+0x264eb02)
#12 Envoy::Extensions::GeoipProviders::Common::FactoryBase<envoy::extensions::geoip_providers::maxmind::v3::MaxMindConfig>::createGeoipProviderDriver(google::protobuf::Message const&, std::__1::basic_string<char, std::__1::char_traits<char>, std::__1::allocator<char> > const&, Envoy::Server::Configuration::FactoryContext&) ??:? (geoip_provider_test+0x2652c79)
#13 Envoy::Extensions::GeoipProviders::Maxmind::GeoipProviderTestBase::initializeProvider(std::__1::basic_string<char, std::__1::char_traits<char>, std::__1::allocator<char> > const&, std::__1::optional<Envoy::ConditionalInitializer>&) ??:? (geoip_provider_test+0x2503f2f)
#14 Envoy::Extensions::GeoipProviders::Maxmind::GeoipProviderTest_ValidConfigEmptyLookupResult_Test::TestBody() ??:? (geoip_provider_test+0x24eb627)
#15 void testing::internal::HandleSehExceptionsInMethodIfSupported<testing::Test, void>(testing::Test*, void (testing::Test::*)(), char const*) ??:? (geoip_provider_test+0x7aea187)
#16 void testing::internal::HandleExceptionsInMethodIfSupported<testing::Test, void>(testing::Test*, void (testing::Test::*)(), char const*) ??:? (geoip_provider_test+0x7aceb3a)
#17 testing::Test::Run() ??:? (geoip_provider_test+0x7ab1d18)
#18 testing::TestInfo::Run() ??:? (geoip_provider_test+0x7ab2b51)
#19 testing::TestSuite::Run() ??:? (geoip_provider_test+0x7ab36f0)
#20 testing::internal::UnitTestImpl::RunAllTests() ??:? (geoip_provider_test+0x7ac372a)
#21 bool testing::internal::HandleSehExceptionsInMethodIfSupported<testing::internal::UnitTestImpl, bool>(testing::internal::UnitTestImpl*, bool (testing::internal::UnitTestImpl::*)(), char const*) ??:? (geoip_provider_test+0x7af0127)
#22 bool testing::internal::HandleExceptionsInMethodIfSupported<testing::internal::UnitTestImpl, bool>(testing::internal::UnitTestImpl*, bool (testing::internal::UnitTestImpl::*)(), char const*) ??:? (geoip_provider_test+0x7ad252a)
#23 testing::UnitTest::Run() ??:? (geoip_provider_test+0x7ac2fed)
#24 RUN_ALL_TESTS() ??:? (geoip_provider_test+0x5186af7)
#25 Envoy::TestRunner::runTests(int, char**) ??:? (geoip_provider_test+0x5185890)
#26 main ??:? (geoip_provider_test+0x5182ac6)
SUMMARY: ThreadSanitizer: data race geoip_provider_test.cc:? in _ZNSt3__14swapIPNS_8functionIFN4absl12lts_202308026StatusEjEEEEENS_9enable_ifIXaasr21is_move_constructibleIT_EE5valuesr18is_move_assignableIS9_EE5valueEvE4typeERS9_SC_ |
@phlax looking currently into asan failure |
I'm seeing this failure today in #35698: https://dev.azure.com/cncf/envoy/_build/results?buildId=179725&view=logs&j=76424b50-4542-51d2-ddfa-1b282263c744&t=ee414bb6-1c71-528f-760c-893708054276 -- are we sure this is fixed? |
should have probably created 3 separate issues per each ci failure. So far tsan and asan checks have been fixed. Now there is one more on release tests (not related to tsan or asan):
|
i believe the flakiness has been addressed. closing this one |
https://dev.azure.com/cncf/envoy/_build/results?buildId=178428&view=logs&jobId=76424b50-4542-51d2-ddfa-1b282263c744&j=76424b50-4542-51d2-ddfa-1b282263c744&t=ee414bb6-1c71-528f-760c-893708054276
The text was updated successfully, but these errors were encountered: