
Server-side caching of viewmodels #704

Merged - 20 commits merged into master on Nov 27, 2019

Conversation

tomasherceg (Member)

Disclaimer: This is an experimental feature - it should not be merged into master any time soon without extensive testing.

I have implemented a relatively simple mechanism that can dramatically decrease the amount of data transferred between the client and the server on postbacks. Basically, the viewmodel is cached on the server and its hash is used as the cache key (that should keep the cache small for static pages or for pages with identical initial state).

  1. On HTTP GET, the viewmodel JSON (without the CSRF token and encrypted values) is cached on the server.
  2. When the HTML is generated, the viewModelCacheId field is included in the serialized viewmodel.
  3. DotVVM stores a copy of the viewmodel in dotvvm.viewModels['root'].viewModelCache when the page is loaded.
  4. When a postback is made, the client sends viewModelDiff and viewModelCacheId instead of the full viewmodel.
  5. If the viewmodel is found in the server cache, the diff is applied to it and the request processing works as usual. The response viewmodel is cached in the same way as in the first step.
  6. If the viewmodel is not in the cache, the server returns a special response with a viewModelNotCached notice and the client repeats the postback with the full viewmodel.
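The steps above can be sketched as follows. This is a minimal, hypothetical TypeScript illustration of the protocol, not the actual DotVVM implementation (which is C# on the server); the class and method names are invented, and a 32-bit FNV-1a hash stands in for a real cryptographic hash to keep the sketch self-contained.

```typescript
type Json = { [key: string]: unknown };

// Stand-in hash; a real implementation would use something like SHA-256.
function hashJson(json: string): string {
    let h = 0x811c9dc5;                  // FNV-1a 32-bit offset basis
    for (let i = 0; i < json.length; i++) {
        h ^= json.charCodeAt(i);
        h = Math.imul(h, 0x01000193);    // FNV prime
    }
    return (h >>> 0).toString(16);
}

class ViewModelServerCache {
    private entries = new Map<string, string>();

    // Step 1: the hash of the serialized viewmodel is the cache key, so
    // identical initial states share a single entry.
    storeViewModel(viewModelJson: string): string {
        const cacheId = hashJson(viewModelJson);
        this.entries.set(cacheId, viewModelJson);
        return cacheId;
    }

    // Steps 5-6: reconstruct the full viewmodel from the cached copy and
    // the client's diff, or signal a cache miss.
    tryApplyDiff(cacheId: string, diff: Json): Json | null {
        const cached = this.entries.get(cacheId);
        if (cached === undefined) {
            return null;  // caller replies with a viewModelNotCached notice
        }
        // shallow merge for illustration; the real diff is applied recursively
        return { ...(JSON.parse(cached) as Json), ...diff };
    }
}
```

A cache miss (step 6) surfaces as a `null` here; the client then retries the postback with the full viewmodel, which keeps the scheme backwards-compatible.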

Notes

The solution is backwards-compatible - the client may decide to always send the full viewmodel, and the server must support it.

We should make this feature optional and allow turning it on only for individual pages.

Also, a security review of this feature should be made. The CSRF token and encrypted values are not part of the cache mechanism and are sent (and verified) on all requests as before.

The viewmodel serialization and deserialization use a synchronous API right now. The cache may require an async API (e.g. if the user wants to use a distributed cache), and maybe the serialization itself could benefit from async as well.

And finally, I haven't done any performance comparisons yet - I assume that hashing and caching the viewmodels should be way faster than transferring unnecessary data, but we should measure the impact on a real-world application.

@quigamdev quigamdev requested a review from exyi June 28, 2019 08:17
@exyi (Member) left a comment

A few comments about the server side; I'll go through the JS part later.

In general, I like the idea of this mechanism - it may improve the bandwidth requirements quite significantly. However, especially on a poor connection, the chaining of requests when the VM is not on the server may cause issues (combined with #705 there may be 3 requests). If there is a fixed timeout of a few minutes, I think we could also tell the client about it, so it does not even try to send the diff.

{
private readonly IDotvvmCacheAdapter cacheAdapter;

public TimeSpan CacheLifetime { get; set; } = TimeSpan.FromMinutes(5);
Member


This is quite short IMHO and it's not trivial to reconfigure.

Member


It would, however, make sense to somehow remove entries that were used by a client that has since received a new viewmodel from the server. For that, we'd have to pass some client identifier to the cache.

Member Author


The lifetime management will definitely change - I need to do some measurements first. There should also be some limits per route. The problem with removing cache entries is that you don't know how many people are using them. We'd need to add some reference counting - or maybe the short lifetime will work well enough.
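One possible shape for this lifetime handling - a minimal sketch, assuming a sliding expiration where each cache hit renews the entry's lifetime, as an alternative to explicit reference counting. The class name and the injected clock are illustrative only, not DotVVM API.

```typescript
// Sliding-expiration cache: an entry stays alive as long as clients keep
// hitting it, and falls out after `lifetimeMs` of inactivity.
class ExpiringCache<V> {
    private entries = new Map<string, { value: V; expiresAt: number }>();

    // `now` is injectable for testing; defaults to the wall clock.
    constructor(private lifetimeMs: number,
                private now: () => number = Date.now) {}

    set(key: string, value: V): void {
        this.entries.set(key, { value, expiresAt: this.now() + this.lifetimeMs });
    }

    get(key: string): V | undefined {
        const entry = this.entries.get(key);
        if (entry === undefined) return undefined;
        if (entry.expiresAt <= this.now()) {
            this.entries.delete(key);     // lazy eviction on access
            return undefined;
        }
        // sliding expiration: each hit renews the lifetime
        entry.expiresAt = this.now() + this.lifetimeMs;
        return entry.value;
    }
}
```

With this shape, a short default lifetime only hurts clients that stay idle longer than the TTL between postbacks; active clients keep their entry alive without any per-client bookkeeping.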

@exyi
Member

exyi commented Jul 19, 2019 via email (four comments)

@tomasherceg (Member Author)

What would be the benefit of ReadOnlyMemory over byte[]? I found that the lowest version of Newtonsoft.Json.Bson supports Newtonsoft.Json 10.0.3, which is the minimum version required by DotVVM, so it should not be an issue.
I have added an extensibility point in DefaultViewModelSerializer so someone can add compression or use any other method of getting bytes from the viewmodel JToken and vice versa.

I'll continue working on this later.
I'd like to add code that emits some metrics, and enable this feature on a few websites to gather some data and usage patterns. I am thinking of collecting:

  • the number of cache entries created by a particular route
  • how many times these entries were used for a particular route

If the first number is big and the second is low, caching is not useful on that route. If the first number is small and the second is big, the cache can help a lot.

I don't know yet whether we want some auto-tuning in the framework that would decide if the cache is worthwhile, or whether we just give the user the tools to decide on their own. I would definitely start with the latter, but I am not sure anyone will use it if it requires some work to set up.
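The proposed metrics could be collected along these lines - a hypothetical sketch; RouteCacheMetrics and its method names are invented, not DotVVM API.

```typescript
// Per-route counters for the two numbers described above: entries created
// vs. entries reused. The ratio of the two tells whether caching pays off.
class RouteCacheMetrics {
    private created = new Map<string, number>();
    private hits = new Map<string, number>();

    recordEntryCreated(route: string): void {
        this.created.set(route, (this.created.get(route) ?? 0) + 1);
    }

    recordHit(route: string): void {
        this.hits.set(route, (this.hits.get(route) ?? 0) + 1);
    }

    // High ratio => few distinct entries serve many postbacks => cache helps.
    // Low ratio => mostly unique viewmodels => caching wastes memory here.
    reuseRatio(route: string): number {
        const created = this.created.get(route) ?? 0;
        return created === 0 ? 0 : (this.hits.get(route) ?? 0) / created;
    }
}
```

Either the auto-tuning or the manual path could consume the same counters: auto-tuning would disable the cache on routes whose reuse ratio stays below some threshold, while the manual path would just expose the numbers to the user.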

string viewModelCacheId = null;
if (context.Configuration.ExperimentalFeatures.ServerSideViewModelCache.IsEnabledForRoute(context.Route.RouteName))
{
viewModelCacheId = viewModelServerCache.StoreViewModel(context, (JObject)viewModelToken);
Member


One more thing: we should only store properties that are sent to the client and also back to the server. The way it is now, we are wasting quite a bit of space. Fortunately, the serializer should silently ignore properties that should not be sent to the server, so there is no change in behavior.

If you use Direction.ClientToServerNotInPostbackPath, however, the serializer will take the property into account even though it should not be sent at all (assuming it was not in the path). Unfortunately, these properties can't simply be dropped, as they might also be needed when they are in the path. And on the server side we have basically no way of knowing which objects are in the path during the serialization phase, so I don't see a simple fix to that :/ Maybe we could figure out the JSON path on the client and send it to the server.
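That last idea - sending the JSON path from the client - could look roughly like this. A hedged sketch only; the function names and the slash-joined path encoding are assumptions, not the actual DotVVM postback protocol.

```typescript
// Collect every JSON path prefix along the postback path, e.g.
// ["Orders", "2", "Detail"] -> ["Orders", "Orders/2", "Orders/2/Detail"].
// The client would compute this for the control that triggered the
// postback and send it alongside the diff.
function postbackPathPrefixes(path: string[]): string[] {
    const prefixes: string[] = [];
    for (let i = 0; i < path.length; i++) {
        prefixes.push(path.slice(0, i + 1).join("/"));
    }
    return prefixes;
}

// The server could then keep a path-dependent property only when its own
// JSON path appears among the prefixes the client sent.
function isInPostbackPath(propertyPath: string, prefixes: string[]): boolean {
    return prefixes.includes(propertyPath);
}
```

This would let the server drop properties that are outside the postback path during serialization, at the cost of trusting (or re-validating) a piece of client-supplied routing information.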

@exyi exyi mentioned this pull request Aug 14, 2019
@tomasherceg (Member Author)

TBD:

  • make the lifetime of cache entries configurable
  • describe the behavior of Bind.ClientToServerIfInPostbackPath - everything not in the postback path is undefined and can be present in the viewmodel even if it should not be

Then we can merge.

@tomasherceg tomasherceg added this to the Version 2.4 milestone Nov 10, 2019
@@ -399,6 +399,10 @@ public WriterDelegate CreateWriterFactory()
{
options["pathOnly"] = true;
}
if (!property.TransferAfterPostback)
{
options["firstRequest"] = true;
Member


I'm not a fan of transmitting all those options to the client. The payload is already annoyingly large.

Member Author


I know, and we should definitely de-duplicate them and put them in the $type definitions together with the validation rules. But I'd rather solve this in a separate PR.

@tomasherceg tomasherceg merged commit 756d2e0 into master Nov 27, 2019
@tomasherceg tomasherceg deleted the feature/viewmodel-server-cache branch November 27, 2019 19:21