API Proposal: SocketsHttpConnectionContext.ConnectAsynchronously #44876

stephentoub · 2020-11-18T15:31:13Z

Background and Motivation

In .NET 5, we added SocketsHttpHandler.ConnectCallback, which enables a developer to supply a delegate that'll be used to establish connections. In .NET 5, the callback is only usable for asynchronous operations, but we also added sync support to SocketsHttpHandler (and HttpClient), and if you try to both use the ConnectCallback and sync APIs, you get an exception. We can trivially remove this restriction simply by telling the ConnectCallback in what mode it should operate.

Proposed API

namespace System.Net.Http
{
    public sealed class SocketsHttpConnectionContext
    {
+        public bool ConnectAsynchronously { get; } // true for SendAsync, false for Send
         ...
     }
}

Usage Examples

socketsHttpHandler.ConnectCallback = async (ctx, cancellationToken) =>
{
    var socket = new Socket(SocketType.Stream, ProtocolType.Tcp) { NoDelay = true };
    try
    {
        if (context.ConnectAsynchronously)
        {
            await socket.ConnectAsync(context.DnsEndPoint, cancellationToken).ConfigureAwait(false);
        }
        else
        {
            using (cancellationToken.UnsafeRegister(static s => ((Socket)s!).Dispose(), socket))
                socket.Connect(context.DnsEndPoint);
        }
    }
    catch
    {
        socket.Dispose();
        throw;
    }

    return new NetworkStream(socket, ownsSocket: true);
};

ConnectAsynchronously will be true for all existing uses that work today, because today the synchronous Send will throw an exception if a callback is provided. If once this API exists the ConnectCallback hasn't been updated to respect a false ConnectAsynchronously and a synchronous Send is issued, the caller will simply block waiting for the operation to complete (alternatives are also possible, like throwing an exception).

This also helps to consolidate/simplify logic in SocketsHttpHandler, which no longer needs to maintain two separate connection code paths, to special-case ConnectCallback, etc.

Example implementation:
stephentoub@3674c8d

The text was updated successfully, but these errors were encountered:

ghost · 2020-11-18T15:31:16Z

Tagging subscribers to this area: @dotnet/ncl
See info in area-owners.md if you want to be subscribed.

Issue Details

Description:	Background and Motivation In .NET 5, we added SocketsHttpHandler.ConnectCallback, which enables a developer to supply a delegate that'll be used to establish connections. In .NET 5, the callback is only usable for asynchronous operations, but we also added sync support to SocketsHttpHandler (and HttpClient), and if you try to both use the ConnectCallback and sync APIs, you get an exception. We can trivially remove this restriction simply by telling the ConnectCallback in what mode it should operate. Proposed API namespace System.Net.Http { public sealed class SocketsHttpConnectionContext { + public bool ConnectAsynchronously { get; } // true for SendAsync, false for Send ... } } Usage Examples socketsHttpHandler.ConnectCallback = async (ctx, cancellationToken) => { var socket = new Socket(SocketType.Stream, ProtocolType.Tcp) { NoDelay = true }; try { if (context.ConnectAsynchronously) { await socket.ConnectAsync(context.DnsEndPoint, cancellationToken).ConfigureAwait(false); } else { using (cancellationToken.UnsafeRegister(static s => ((Socket)s!).Dispose(), socket)) socket.Connect(context.DnsEndPoint); } } catch { socket.Dispose(); throw; } return new NetworkStream(socket, ownsSocket: true); }; ConnectAsynchronously will be true for all existing uses that work today, because today the synchronous Send will throw an exception if a callback is provided. If once this API exists the ConnectCallback hasn't been updated to respect a false ConnectAsynchronously and a synchronous Send is issued, the caller will simply block waiting for the operation to complete (alternatives are also possible, like throwing an exception). This also helps to consolidate/simplify logic in SocketsHttpHandler, which no longer needs to maintain two separate connection code paths, to special-case ConnectCallback, etc. Example implementation: stephentoub@`3674c8d`
Author:	stephentoub
Assignees:	-
Labels:	`api-suggestion`, `area-System.Net.Http`
Milestone:	[object Object]

geoffkizer · 2020-11-18T19:11:25Z

Another option would be to simply always do the connect async and do sync-over-async for the sync path.

We are presumably already blocking the calling thread in other cases here, e.g. when the connection limit is hit.

stephentoub · 2020-11-18T20:01:40Z

We could. But there is no connection limit by default. And on Linux, that would put the sockets into a state that makes all sync operations more expensive for the lifetime of the connection.

geoffkizer · 2020-11-18T20:06:31Z

And on Linux, that would put the sockets into a state that makes all sync operations more expensive for the lifetime of the connection.

Have we actually measured this?

Operations that can complete immediately will still complete immediately. The handling is basically the same here regardless of path -- there may be a little bit more overhead in going through the SocketAsyncContext path, but it's probably dwarfed by kernel call overhead.

Operations that can't complete immediately will still block. The blocking happens differently, and is certainly more involved in the SocketAsyncContext path -- but once you block, it's not clear to me that any of that matters all that much.

geoffkizer · 2020-11-18T20:07:14Z

I also wonder what our plan here is for HTTP2, HTTP3, etc where there is no good way to avoid sync-over-async.

stephentoub · 2020-11-18T21:44:58Z

Have we actually measured this?

Yes.

using BenchmarkDotNet.Attributes;
using BenchmarkDotNet.Diagnosers;
using BenchmarkDotNet.Running;
using System;
using System.Net;
using System.Net.Sockets;
using System.Threading.Tasks;

[MemoryDiagnoser]
public class Program
{
    static void Main(string[] args) => BenchmarkSwitcher.FromAssemblies(new[] { typeof(Program).Assembly }).Run(args);

    private NetworkStream _client, _server;

    [Params(false, true)]
    public bool EmulatedSync { get; set; }

    [GlobalSetup]
    public void Setup()
    {
        var listener = new Socket(AddressFamily.InterNetwork, SocketType.Stream, ProtocolType.Tcp);
        var client = new Socket(AddressFamily.InterNetwork, SocketType.Stream, ProtocolType.Tcp);

        listener.Bind(new IPEndPoint(IPAddress.Loopback, 0));
        listener.Listen();

        client.Connect(listener.LocalEndPoint);
        Socket server = listener.Accept();

        _client = new NetworkStream(client, ownsSocket: true);
        _server = new NetworkStream(server, ownsSocket: true);

        if (EmulatedSync)
        {
            var t1 = _client.ReadAsync(new byte[1]).AsTask();
            var t2 = _server.ReadAsync(new byte[1]).AsTask();
            _client.WriteAsync(new byte[1]).AsTask().Wait();
            _server.WriteAsync(new byte[1]).AsTask().Wait();
            t1.Wait();
            t2.Wait();
        }

        Task.Run(() =>
        {
            while (true)
            {
                _client.ReadByte();
                _client.WriteByte(42);
            }
        });
    }

    [Benchmark]
    public void ReadWrite()
    {
        for (int i = 0; i < 1000; i++)
        {
            _server.WriteByte(42);
            _server.ReadByte();
        }
    }
}

Method	EmulatedSync	Mean	Error	StdDev	Gen 0	Gen 1	Gen 2	Allocated
ReadWrite	False	57.17 ms	1.029 ms	0.963 ms	-	-	-	-
ReadWrite	True	105.53 ms	2.063 ms	1.930 ms	-	-	-	368792 B

geoffkizer · 2020-11-25T07:46:16Z

Nevermind, I see now.

geoffkizer · 2020-11-25T07:51:19Z

That said, I'm still skeptical.

Your data shows that the existing sync-over-async logic in SocketAsyncContext etc is more expensive than a direct sync call. But I don't think we have focused on the perf of this path in any meaningful way. If we cared about it, I expect we could improve this in non-trivial ways.

stephentoub · 2020-11-25T19:28:03Z

But I don't think we have focused on the perf of this path in any meaningful way. If we cared about it, I expect we could improve this in non-trivial ways.

If we can without negatively impacting true sync or true async, we should.

This is still a red herring, though. They main issue is sync-over-async on a path we otherwise wouldn't have it. If you're doing HTTP/1.1, and you're not explicitly setting MaxConnectionsPerServer, this should be able to be sync all the way.

geoffkizer · 2020-11-26T21:48:21Z

If you're doing HTTP/1.1, and you're not explicitly setting MaxConnectionsPerServer, this should be able to be sync all the way.

Unless/until we implement #44818.

stephentoub · 2020-11-26T23:31:38Z

Unless/until we implement #44818.

Even if we implement that, it's very dialable: there's no reason sync requests would need to participate, and could continue waiting for any connection they initiate, which is also the easiest thing to do since they'd be synchronously opening.

stephentoub · 2020-11-26T23:34:58Z

For now, I suggest we simply change the code to allow making sync requests even if a connect callback is provided. Someone who knows all requests on their HttpClient will be sync can create a sync callback, and if the callback otherwise didn't complete synchronously, well, we block. But at least we've made it possible. The next step is then exposing this one additional Boolean property on an advanced type, should we desire.

geoffkizer · 2020-11-26T23:36:41Z

For now, I suggest we simply change the code to allow making sync requests even if a connect callback is provided. Someone who knows all requests on their HttpClient will be sync can create a sync callback, and if the callback otherwise didn't complete synchronously, well, we block.

Seems reasonable to me.

scalablecory · 2020-12-02T17:58:01Z

@stephentoub do we have a customer who is doing both sync and async with one handler?

I would generally expect the user to just do the right thing in the callback based on their usage patterns here, and never need to check this API.

(Unless they are doing both sync and async)

stephentoub · 2020-12-02T18:17:31Z

do we have a customer who is doing both sync and async with one handler?

Ourselves, ideally: we can use this to address HttpWebRequest.ReadWriteTimeout, with multiple HWRs sharing a single handier instance. The workaround now is hacky and more expensive (and requires all consumers to have direct knowledge of the callback), putting a known property into all requests' options bags for the callback to inspect.

In all other cases where custom logic can be plugged in (e.g. handler.Send/SendAsync), the provider of the custom implementation is told whether it's sync or async. We dropped the ball on this one (the two callbacks), and we can fix it with a single bool property on each context type.

karelz · 2020-12-17T16:56:52Z

Triage: We should discuss it with @stephentoub in the room.

karelz · 2021-01-28T17:26:11Z

Triage: The only/biggest value is to make HttpWebRequest.ReadWriteTimeout work properly - that is tracked in #43520.

stephentoub added api-suggestion Early API idea and discussion, it is NOT ready for implementation area-System.Net.Http labels Nov 18, 2020

stephentoub added this to the 6.0.0 milestone Nov 18, 2020

Dotnet-GitSync-Bot added the untriaged New issue has not been triaged by the area owner label Nov 18, 2020

stephentoub mentioned this issue Nov 28, 2020

Allow SocketsHttpHandler.ConnectCallback for sync requests #45300

Merged

karelz mentioned this issue Jan 28, 2021

HttpWebRequest timeout broken with proxy/ssl, repro attached #43520

Closed

karelz closed this as completed Jan 28, 2021

stephentoub mentioned this issue Jan 29, 2021

Add support for HttpWebRequest.ReadWriteTimeout #47648

Merged

ghost locked as resolved and limited conversation to collaborators Feb 27, 2021

karelz removed the untriaged New issue has not been triaged by the area owner label Oct 20, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

API Proposal: SocketsHttpConnectionContext.ConnectAsynchronously #44876

API Proposal: SocketsHttpConnectionContext.ConnectAsynchronously #44876

stephentoub commented Nov 18, 2020

ghost commented Nov 18, 2020

Background and Motivation

Proposed API

Usage Examples

geoffkizer commented Nov 18, 2020

stephentoub commented Nov 18, 2020

geoffkizer commented Nov 18, 2020

geoffkizer commented Nov 18, 2020

stephentoub commented Nov 18, 2020

geoffkizer commented Nov 25, 2020 •

edited

Loading

geoffkizer commented Nov 25, 2020

stephentoub commented Nov 25, 2020

geoffkizer commented Nov 26, 2020

stephentoub commented Nov 26, 2020

stephentoub commented Nov 26, 2020

geoffkizer commented Nov 26, 2020

scalablecory commented Dec 2, 2020

stephentoub commented Dec 2, 2020

karelz commented Dec 17, 2020

karelz commented Jan 28, 2021

API Proposal: SocketsHttpConnectionContext.ConnectAsynchronously #44876

API Proposal: SocketsHttpConnectionContext.ConnectAsynchronously #44876

Comments

stephentoub commented Nov 18, 2020

Background and Motivation

Proposed API

Usage Examples

ghost commented Nov 18, 2020

Background and Motivation

Proposed API

Usage Examples

geoffkizer commented Nov 18, 2020

stephentoub commented Nov 18, 2020

geoffkizer commented Nov 18, 2020

geoffkizer commented Nov 18, 2020

stephentoub commented Nov 18, 2020

geoffkizer commented Nov 25, 2020 • edited Loading

geoffkizer commented Nov 25, 2020

stephentoub commented Nov 25, 2020

geoffkizer commented Nov 26, 2020

stephentoub commented Nov 26, 2020

stephentoub commented Nov 26, 2020

geoffkizer commented Nov 26, 2020

scalablecory commented Dec 2, 2020

stephentoub commented Dec 2, 2020

karelz commented Dec 17, 2020

karelz commented Jan 28, 2021

geoffkizer commented Nov 25, 2020 •

edited

Loading