Fix hang for --discover-log-location flag #592

msakrejda · 2024-08-29T22:44:39Z

This was missed in #384.

msakrejda · 2024-08-29T22:46:07Z

main.go

@@ -225,6 +225,8 @@ func run(ctx context.Context, wg *sync.WaitGroup, globalCollectionOpts state.Col

 	if globalCollectionOpts.DiscoverLogLocation {
 		selfhosted.DiscoverLogLocation(ctx, servers, globalCollectionOpts, logger)
+		testRunSuccess = make(chan bool, 1)
+		testRunSuccess <- true


I don't know if this is a good fix (this isn't really a "testRun"), but it works, and I don't have any better ideas. Thoughts?

It looks like this is happening because the main function selects from this channel, and selecting from a nil channel will block forever. It seems Go's recommended solution is to add a timeout to that select, but we wouldn't want to confuse users with a timeout error when the function is returning normally.

So since select statements apparently can't check if the channel is nil first, this is the only way for the main function to handle it:

DoneOrSignal: for { if testRunSuccess != nil { select { case success := <-testRunSuccess: if reloadRun { if success { Reload(logger) } else { logger.PrintError("Error: Reload requested, but ignoring since configuration errors are present") exitCode = 1 } } else if !success { exitCode = 1 } break DoneOrSignal case s := <-sigs: if s == syscall.SIGINT || s == syscall.SIGTERM { logger.PrintError("Interrupt") break DoneOrSignal } } } else { select { case s := <-sigs: if s == syscall.SIGINT || s == syscall.SIGTERM { logger.PrintError("Interrupt") break DoneOrSignal } } } }

I don't think either option is easy to understand, so your approach is probably better since it results in fewer lines of code.

Right, I think the underlying bug here is that we expect anything that exits quickly, instead of continuing to run, to be a test run and depend on reading something from the testRun channel (or a signal). I think a proper fix involves rethinking that, but the run method is kind of daunting, and I'd rather fix this first and refactor that some other time.

seanlinsley · 2024-08-29T23:21:22Z

main.go

@@ -225,6 +225,8 @@ func run(ctx context.Context, wg *sync.WaitGroup, globalCollectionOpts state.Col

 	if globalCollectionOpts.DiscoverLogLocation {
 		selfhosted.DiscoverLogLocation(ctx, servers, globalCollectionOpts, logger)
+		testRunSuccess = make(chan bool, 1)
+		testRunSuccess <- true


It looks like this is happening because the main function selects from this channel, and selecting from a nil channel will block forever. It seems Go's recommended solution is to add a timeout to that select, but we wouldn't want to confuse users with a timeout error when the function is returning normally.

So since select statements apparently can't check if the channel is nil first, this is the only way for the main function to handle it:

DoneOrSignal: for { if testRunSuccess != nil { select { case success := <-testRunSuccess: if reloadRun { if success { Reload(logger) } else { logger.PrintError("Error: Reload requested, but ignoring since configuration errors are present") exitCode = 1 } } else if !success { exitCode = 1 } break DoneOrSignal case s := <-sigs: if s == syscall.SIGINT || s == syscall.SIGTERM { logger.PrintError("Interrupt") break DoneOrSignal } } } else { select { case s := <-sigs: if s == syscall.SIGINT || s == syscall.SIGTERM { logger.PrintError("Interrupt") break DoneOrSignal } } } }

I don't think either option is easy to understand, so your approach is probably better since it results in fewer lines of code.

Fix hang for --discover-log-location flag

85d5702

This was missed in #384.

msakrejda requested a review from a team August 29, 2024 22:44

msakrejda mentioned this pull request Aug 29, 2024

Release 0.58.0 #590

Merged

msakrejda commented Aug 29, 2024

View reviewed changes

seanlinsley approved these changes Aug 29, 2024

View reviewed changes

msakrejda merged commit 677eac2 into main Aug 30, 2024
9 checks passed

msakrejda deleted the fix-discover-log-location-hang branch August 30, 2024 15:35

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix hang for --discover-log-location flag #592

Fix hang for --discover-log-location flag #592

msakrejda commented Aug 29, 2024

msakrejda Aug 29, 2024

seanlinsley Aug 29, 2024

msakrejda Aug 30, 2024

seanlinsley Aug 29, 2024

Fix hang for --discover-log-location flag #592

Fix hang for --discover-log-location flag #592

Conversation

msakrejda commented Aug 29, 2024

msakrejda Aug 29, 2024

Choose a reason for hiding this comment

seanlinsley Aug 29, 2024

Choose a reason for hiding this comment

msakrejda Aug 30, 2024

Choose a reason for hiding this comment

seanlinsley Aug 29, 2024

Choose a reason for hiding this comment