FSharp.Data.Npgsql retry functionality and related enhancements. #102

Open · symboliq wants to merge 42 commits into master

Conversation

symboliq

This PR contains the following enhancements, courtesy of FutureLab Software Company (a rough usage sketch follows the list):

  • Add optional type parameters to the NpgsqlConnection<> type provider to configure automatic retrying of queries in the event of NpgsqlException errors, up to N retries, with X milliseconds' delay between retries
  • Add the same type parameters to the NpgsqlConnection.CreateCommand<> type provider to allow users to enable or disable automatic retrying for individual queries
  • Retries should not abort open ambient transactions, if any. If possible: retries should check if there are open ambient transactions, and if they have been aborted, they should skip any further retries and fail immediately.
  • If all retries fail, an AggregateException should be thrown, wrapping all the individual errors encountered during the retries.
  • When a query is retried, the NpgsqlConnection<> type provider should emit an event that clients can subscribe to in order to log the retry attempt
  • Add an optional boolean type parameter to the NpgsqlConnection<> type provider. If enabled, any exceptions will be caught and the returned type 'T will be wrapped as a Result<'T, exn>
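
As a rough illustration (not taken from the PR itself), this is how the proposed parameters might look at a call site. The static parameter names Retries and RetryDelayMs are hypothetical placeholders, while the NpgsqlConnection / CreateCommand shape follows the library's existing usage.

```fsharp
open FSharp.Data.Npgsql

[<Literal>]
let connectionString = "Host=localhost;Username=postgres;Database=dvdrental"

// Hypothetical: library-wide retry configuration on the connection-level provider
// (retry NpgsqlException up to 3 times, waiting 200 ms between attempts).
type DvdRental = NpgsqlConnection<connectionString, Retries = 3, RetryDelayMs = 200>

let getTitles () =
    // Hypothetical: per-command override that disables retrying for this one query.
    use cmd = DvdRental.CreateCommand<"SELECT title FROM film", Retries = 0>(connectionString)
    cmd.Execute()
```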

symboliq added 30 commits July 15, 2021 17:19
Moved setup connection to Utils.
Applied config to retry functions.
symboliq changed the title from "Working async choice" to "FSharp.Data.Npgsql retry functionality and related enhancements." on Jul 28, 2021
kerams (Contributor) commented on Jul 28, 2021

Why would you want to have this baked in the library rather than using something like Polly on your end? True, you have to wrap every command in a policy, but the configuration options are so much more flexible.
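
For context, this is roughly what the per-command wrapping looks like: a sketch assuming Polly's standard Handle/WaitAndRetryAsync API and the provider's AsyncExecute method, with illustrative retry counts and delays.

```fsharp
open System
open Npgsql
open Polly

// Retry transient connection failures (NpgsqlException) up to 3 times, 200 ms apart.
let retryPolicy =
    Policy.Handle<NpgsqlException>()
          .WaitAndRetryAsync(3, fun _ -> TimeSpan.FromMilliseconds 200.)

// Every command has to be wrapped explicitly at its call site, e.g.
//     executeWithRetry (cmd.AsyncExecute())
let executeWithRetry (work: Async<'T>) : Async<'T> =
    retryPolicy.ExecuteAsync<'T>(fun () -> Async.StartAsTask work) |> Async.AwaitTask
```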

symboliq (Author)

Here's the explanation I was given when I received this task:

Connection resiliency.

We use managed Azure PostgreSQL and sometimes, due to either network connection or performance / misconfiguration problems (connection pool exhaustion, etc.), our SQL queries run into various kinds of transient errors, often SocketExceptions.

All but a few of our queries are idempotent (we use UUID primary keys and ON CONFLICT clauses), so we would like to be able to set a global option to retry those queries in the event of transient errors, instead of having to wait for the entire job to be retried.

Fortunately, Npgsql already distinguishes between errors in the query execution (PostgresException), errors in the type mapping (usually InvalidOperationException) and transient connection errors, which fall under NpgsqlException:

Wrap connection exceptions with NpgsqlException · Issue #2495 · npgsql/npgsql (github.com)

EF Core, for example, uses an optional EnableRetryOnFailure flag that automatically retries queries in the event of transient exceptions. Our old product used raw ADO.NET and was also able to implement automatic retries at a lower level:

Getting sporadic SocketExceptions/ExtendedSocketExceptions / Connection Refused using Docker · Issue #619 · npgsql/efcore.pg (github.com)
Connection Resiliency - EF Core | Microsoft Docs
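
For reference, a minimal sketch of the EnableRetryOnFailure flag in an EF Core + Npgsql setup; the JobsContext class and connection handling here are illustrative and not part of this PR.

```fsharp
open Microsoft.EntityFrameworkCore

// Illustrative DbContext only: EnableRetryOnFailure makes EF Core transparently
// retry commands that fail with transient exceptions.
type JobsContext(connectionString: string) =
    inherit DbContext()
    override _.OnConfiguring(options: DbContextOptionsBuilder) =
        options.UseNpgsql(connectionString, fun o -> o.EnableRetryOnFailure() |> ignore)
        |> ignore
```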

Unfortunately, the library FSharp.Data.Npgsql, being a type provider, generates different CreateCommand types for each query, with different AsyncExecute() methods, so as far as I can tell it's not possible to create e.g. an extension member AsyncExecuteWithRetry() that applies to all queries.

Right now, we have a generic Async.retry wrapper that we manually sprinkle on critical queries, but that's not optimal since connection errors can happen anywhere.
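
A minimal sketch of the kind of generic wrapper meant here, assuming it only retries NpgsqlException (transient connection errors); the actual wrapper in our codebase may differ.

```fsharp
open Npgsql

module Async =
    // Sketch only: retry the (cold) computation when it fails with NpgsqlException,
    // up to maxAttempts times with a fixed delay between attempts.
    let retry (maxAttempts: int) (delayMs: int) (work: Async<'T>) : Async<'T> =
        let rec attempt n = async {
            try
                return! work
            with :? NpgsqlException when n < maxAttempts ->
                do! Async.Sleep delayMs
                return! attempt (n + 1)
        }
        attempt 1

// Manually sprinkled on critical queries, e.g.
//     cmd.AsyncExecute() |> Async.retry 3 200
```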

If further explanation would be helpful, I'll ping the gentleman who gave me the requirements (he has a link to this PR, though it's late where he is so I don't think he's seen it yet).

piaste commented on Jul 29, 2021

Why would you want to have this baked in the library rather than using something like Polly on your end? True, you have to wrap every command in a policy, but the configuration options are so much more flexible.

This was indeed our initial approach to the problem. We found it to be suboptimal and frustrating pretty quickly because it was not possible to automatically ensure usage of the wrapper, especially for non-query commands where the result is thrown away.

While looking for alternative solutions, I noticed that having a built-in retry policy for transient errors was a pretty standard feature among .NET database access libraries (e.g. EF Core's EnableRetryOnFailure, mentioned above).

We decided that contributing the feature to the library would be less work, more reliable, and result in less (and better) code than wrapping several hundred database connections and forever keeping an eye out for unsafe queries in code reviews. It would also be a non-breaking change that could be useful to other users of the library, should you choose to accept the PR.

Granted, you make a strong point that manual wrapping with a dedicated library offers more flexibility.

However, at the low level of individual database queries, I think it's OK to stick to only handling simple transient errors (as the above libraries do). Strategies such as delayed retrying, backpressure, caching, etc. IMO benefit from being aware of the higher-level context (e.g. sending user notifications, coordinating between multiple clients, etc.). In our case we use Hangfire to define such strategies and monitor them via its dashboard, because, for example, some ETL jobs are more stressful than others and should not be queued carelessly, but it's not individual database queries that risk exhausting the DB's resources.

piaste commented on Jan 19, 2022

@kerams, since your fork is now effectively the master branch, what do you think of this PR? If I updated it so that it merges cleanly into the current master without breaking changes, would you be willing to merge it?

We've been using it in prod for several months now without issue, but I'd like to keep up with future library updates without maintaining a fork if possible.

kerams (Contributor) commented on Jan 20, 2022

your fork is now effectively the master branch

Not quite. It's true that my fork (or, to be precise, the cloned repository) was merged here in its entirety a couple of months ago, but we have since diverged a bit; there's even a separate NuGet package. I think I would not mind having this feature over there (if you were to switch), but after a cursory look at the PR, I do have a couple of issues with the implementation.

piaste commented on Jan 20, 2022

I'd be OK with switching. There's a feature I'd been working on (NodaTime integration) that might be easier as a breaking change in a private fork, but it's not critical by any means and I'd rather get rid of the fork instead.

Let me know which issues you would like to see addressed, thanks!

daz10000 (Contributor)

Hey - sorry we haven't been very chatty for a while here. Philosophically, I'd like to take in more contributions. The world has too many slightly undermaintained repos and I believe in 'less, better software' - a few well-maintained things. If you want to address any thoughts @kerams has on improvements, I think we'd be happy to take the change (though we don't use that functionality).

@kerams - I'd also like to say thanks for all the improvements, and we'd be open to any suggested improvements you have (we obviously liked your fork enough to take the improvements back).

We tend to have time in waves to improve this - we did a bunch of things for the .NET 6 push recently. My only ask for contributions is to try to have decent test examples and, where possible, extend the doc file so there's at least one example of the feature for future generations.

That all said, I'm happy for the code to be useful in any way, including forks. If we have to put in a bit more work to help concentrate development in fewer places though, that's a good investment for us (IMO).
