Spec should discourage abuse of initializationOptions and didChangeConfiguration #567

mickaelistria · 2018-09-10T10:10:03Z

I'm working on having the Eclipse IDE adopt some language servers.
I see a possibly bad trend of language servers relying heavily on initializationOptions and didChangeConfiguration to enable/disable features. The issue is that these are unspecified placeholders, so whatever use is made of them requires every client to write code specific to that language server to support those options.
The main example I have in mind right now is RLS which, by abusing those settings, is progressively (and most likely unintentionally) breaking rich compatibility with other IDEs: rust-lang/rls#1047
Abuse of those properties should be deprecated in the spec, with a disclaimer explaining how relying on them makes the LS integration less likely to be trivially portable from one editor to another.

rcjsuen · 2018-09-10T10:56:08Z

Interesting. I use workspace/didChangeConfiguration as that's what was there but did not consider this case.

At what point would you consider the use of this "abuse" though?

mickaelistria · 2018-09-10T10:58:07Z

As an adopter of language servers I don't develop, I consider any usage abuse as long as it requires creating a specific UI or workflow in the client to interact with it ;) But I hope this discussion can lead to a more flexible definition.

LaurentTreguier · 2018-09-10T11:05:28Z

As I see it, a server initialized without initializationOptions should just work in any editor. If the editor allows servers to be started by custom extensions with possible deeper integration (like VSCode or Atom for example), then initializationOptions can be used to enable specific features that can't be used in other editors.

mickaelistria · 2018-09-10T11:44:36Z

@LaurentTreguier: so your proposal is that the settings should be limited to only client-specific configuration? I think it'd be fair.

LaurentTreguier · 2018-09-10T11:56:39Z

This is how I understood it. The spec describes it as "User provided initialization options", which can be misleading; it makes it sound like the user should change it themselves when it should be up to a specific client to do this.

dbaeumer · 2018-09-12T10:04:14Z

IMO the spec should encourage all server providers to spec the following two things:

- the initialization options it supports. I agree with @LaurentTreguier that a server should work with an empty literal as well and simply assume a set of defaults.
- which workspace/configuration requests it sends.

Having a push model for configurations was a mistake and got basically replaced by the pull model where the server sends workspace/configuration requests.

The client should still send workspace/didChangeConfiguration notification so that the server can clear caches if it caches configurations. But it should not send any values since values can differ based on the scope used in the workspace/configuration request.
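
A minimal sketch of that pull-and-invalidate pattern, assuming a hypothetical `fetch` callable that stands in for the workspace/configuration round trip. The class and method names are illustrative and not part of the protocol.

```python
from typing import Any, Callable, Dict, Optional, Tuple

# Hypothetical stand-in for a workspace/configuration request:
# takes (scope URI or None, section) and returns the settings value.
Fetch = Callable[[Optional[str], str], Any]


class ConfigCache:
    """Caches pulled settings per (scope URI, section) pair."""

    def __init__(self, fetch: Fetch) -> None:
        self._fetch = fetch
        self._values: Dict[Tuple[Optional[str], str], Any] = {}

    def get(self, scope_uri: Optional[str], section: str) -> Any:
        key = (scope_uri, section)
        if key not in self._values:
            # First access for this scope: pull the value from the client.
            self._values[key] = self._fetch(scope_uri, section)
        return self._values[key]

    def on_did_change_configuration(self, _params: Any = None) -> None:
        # The notification is treated purely as a signal. Any payload is
        # ignored because values can differ per scope.
        self._values.clear()
```

Keying the cache by scope is the point of the design: the same section can resolve to different values for different workspace folders, which a single pushed settings object cannot express.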

mickaelistria · 2018-09-12T16:10:29Z

> the initialization options it supports. I agree with @LaurentTreguier that a server should work with an empty literal as well and simply assume a set of defaults.

Just to give examples and food for thought, the VSCode CSS language server requires explicit enablement for sass, scss and so on; and the VSCode JSON language server doesn't pre-load a typical list of JSON schemas. Both assume clients discover these settings by reverse engineering VSCode and repeat the same settings. The question is what drove the developers of those LS to rely on those options instead of making them default? It'd be interesting to get their POV on this question.

> Having a push model for configurations was a mistake and got basically replaced by the pull model where the server sends workspace/configuration requests.

Still, the expected type is `any[]`, which means that these are LS-specific settings that require specific integration. I don't think the flow of the operation was the issue here (while it's still good to know it was improved); it's really that any `any` leads to an unspecified world and a specific integration effort between client and LS, which is the opposite of LSP's goal. I believe instead of specifying some operations with `any`, it's better to leave these as extensions. For clients, it's a similar effort to support one or the other, and it's not reusable between LS, so the protocol should remain strictly made of specified, portable, reusable operations, and whenever there is an `any`, consider deprecating the operation, basically because it's not specified enough to be useful to most tools.

dbaeumer · 2018-09-13T08:22:34Z

I disagree here. The fact that initializationOptions is in the spec says that this property should be used, and not some other random property.

I still think that it is fair for a server to depend on some initializationOptions. But I do fully agree that these need to be speced by the server (and not be reverse engineered from code) and that servers should work with a reasonable default set if they are not provided. Same is true for settings.

I do fully agree that the spec needs to spec these assumptions.

@aeschli any comments on why the CSS language server can't work with a reasonable default set ?

felixfbecker · 2018-11-16T09:39:50Z

One pattern I saw emerge is that many language servers interpret initializationOptions as containing user configuration, i.e. the same that is sent in workspace/didChangeConfiguration or in response to workspace/configuration. But LSP doesn't actually really say this, it is very vague on what initializationOptions actually means:

> User provided initialization options.

It doesn't use the term "configuration", but it does say "user provided" (not client provided).

The reason why language servers use it for configuration is because there is otherwise no way to read configuration in initialize. workspace/didChangeConfiguration is only sent after initialize returned (if at all), and workspace/configuration is not among the whitelisted requests allowed during initialize.
The problem is that since initializationOptions is not clearly defined, most clients do not send configuration in it.
Could LSP just be clearer about what initializationOptions is intended to be used for (maybe with an example in the spec)? And could workspace/configuration be whitelisted to be used during initialize?

dbaeumer · 2018-12-18T10:51:13Z

I will clarify the spec in a way that the initializationOptions is typically something that could be passed on the command line when starting the server. It shouldn't be user configurations.

I am actually against whitelisting workspace/configuration. If a server needs the configuration to register providers it should use dynamic registration instead of static registration which allows to mix workspace/configuration with registration calls.
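
A rough sketch of what mixing workspace/configuration with registration calls could look like once initialize has returned. `FakeClient`, the `example` section, and the `formatting` setting are all assumptions made up for illustration; only the method names and the shape of the `items` and `registrations` parameters come from the protocol.

```python
import asyncio
from typing import Any, Dict, List, Tuple


class FakeClient:
    """Illustrative stand-in for the client side; records what it is asked."""

    def __init__(self, settings: Dict[str, Any]) -> None:
        self.settings = settings
        self.requests: List[Tuple[str, Any]] = []

    async def request(self, method: str, params: Any) -> Any:
        self.requests.append((method, params))
        if method == "workspace/configuration":
            # One result per requested item, in the same order.
            return [self.settings.get(item.get("section")) for item in params["items"]]
        return None  # client/registerCapability has no result payload


async def on_initialized(client: FakeClient) -> None:
    # Pull the settings first, then decide what to register.
    [settings] = await client.request(
        "workspace/configuration", {"items": [{"section": "example"}]}
    )
    settings = settings or {}
    if settings.get("formatting", True):  # hypothetical setting, defaults to on
        await client.request(
            "client/registerCapability",
            {
                "registrations": [
                    {
                        "id": "example-formatting",
                        "method": "textDocument/formatting",
                        "registerOptions": {"documentSelector": [{"language": "example"}]},
                    }
                ]
            },
        )
```

With this shape the server announces no formatting provider in its static capabilities and only registers one after it has seen the user's configuration.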

dbaeumer · 2018-12-18T10:57:14Z

Please ping if you think dynamic registration is not the right path to go.

mickaelistria · 2018-12-18T10:59:16Z

> I will clarify the spec in a way that the initializationOptions is typically something that could be passed on the command line when starting the server. It shouldn't be user configurations.

I disagree with that. The initializationOptions can be a good way to guarantee that user settings are passed to the LS before it starts up. I think that basically, the initializationOptions have to be a superset of the didChangeConfiguration as there are cases where we want the configuration immediately. From a client perspective, didChangeConfiguration is over-used, hard to maintain and often used with the wrong semantics, since several LS use it even to retrieve a default configuration. Eclipse Corrosion had an important discussion with RLS on that matter, and the resolution that initializationOptions can contain a mirror of didChangeConfiguration improved things a lot: rust-lang/rls#1026

felixfbecker · 2018-12-18T11:03:12Z

> I am actually against whitelisting workspace/configuration. If a server needs the configuration to register providers it should use dynamic registration instead of static registration which allows to mix workspace/configuration with registration calls.

@dbaeumer I agree for determining whether a provider should be registered or not, but a server might need configuration during initialisation for a lot of reasons. For example, a loglevel or logfile, whether file watchers should be set up with polling or OS events, the HTTP endpoint of a service that needs to be contacted, whether dependencies should be installed, if yes an access token for that, the path of an external tool that needs to be shelled into, something like JAVA_HOME or GOPATH, ...
Requiring to delay all of these after initialize complicates a lot of things for no apparent reason. Also not every client supports dynamic registration, and these clients should gracefully degrade in functionality, i.e. they should work with a static set of capabilities from server initialize and work with restarting the server instead of not working at all.
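
As a sketch of that graceful degradation, a server could check the client's capabilities and only fall back to announcing a provider statically when dynamic registration is not offered. The helper names and the `formatting_enabled` flag are hypothetical; the `dynamicRegistration` field and `documentFormattingProvider` are the only parts taken from the protocol.

```python
from typing import Any, Dict


def supports_dynamic(client_capabilities: Dict[str, Any], feature: str) -> bool:
    """True if the client announced dynamicRegistration for a textDocument feature."""
    text_document = client_capabilities.get("textDocument") or {}
    return bool((text_document.get(feature) or {}).get("dynamicRegistration"))


def static_capabilities(
    client_capabilities: Dict[str, Any], formatting_enabled: bool
) -> Dict[str, Any]:
    """Server capabilities to return from initialize.

    `formatting_enabled` is a hypothetical startup value (a default, or
    something taken from initializationOptions). Features the client can
    register dynamically are left out here and registered later instead.
    """
    caps: Dict[str, Any] = {}
    if formatting_enabled and not supports_dynamic(client_capabilities, "formatting"):
        caps["documentFormattingProvider"] = True
    return caps
```

A client without dynamic registration still gets a working formatter this way; changing the setting then requires restarting the server rather than re-registering.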

dbaeumer · 2018-12-18T11:08:06Z

@felixfbecker the list you provided doesn't seem to be user configuration, which is something that should come through workspace/configuration anyways. For example a logLevel or an access token is something that you usually would pass as a command line option and therefore I have no problem if something like this comes through the initialize options. I want to make sure that the initialize options are static.

felixfbecker · 2018-12-18T11:11:54Z

Could you explain why a user-defined access token should come through a command line option and not user settings? Lots of VS Code extensions configure access tokens through settings, or JAVA_HOME/GOPATH, ... It seems like an arbitrary distinction if the extension had to filter out specific settings to pass them through initializationOptions (and prevent the server from reacting to their changes) while other settings go through didChangeConfiguration?

mickaelistria · 2018-12-18T11:15:56Z

> that should come through workspace/configuration anyways.

Multiplying the entry-points for configuration/options is making it harder and harder to adopt the protocol.
For example, see how the current VSCode CSS language server requires enablement for SCSS, SASS and so on in initializationOptions. Are those user config or not? (a user could decide to disable SASS)
The line between user config and initializationOptions is IMO way too blurry to be specified and pragmatically, initializationOptions are also very good to pass some user settings and should be spec'd as being deprecated for this case.

LaurentTreguier · 2018-12-18T11:18:43Z

@dbaeumer dynamic registration is not always available, as clients aren't forced to implement it. Atom, with its official atom-languageclient for example, doesn't support any dynamic registration; so right now initializationOptions is actually the only reliable way to get any kind of configuration at server startup.

dbaeumer · 2018-12-18T11:20:29Z

This is exactly what I want to avoid by making it more clear. The initializationOptions should be static in the sense of a server run (e.g. like command line parameters). User settings should come through workspace/configuration.

@felixfbecker if the user token is a user setting then it should come through workspace/configuration. If we have more such configuration I am open to rethink to white list it in the initialize call. But that would be something clients need to opt in.

dbaeumer · 2018-12-18T11:23:49Z

@LaurentTreguier I am not against having initializationOptions. But as others pointed out this should not be used to pass user configuration since this is something a server needs to handle dynamically anyways.

felixfbecker · 2018-12-18T11:34:48Z

> @felixfbecker if the user token is a user setting then it should come through workspace/configuration. If we have more such configuration I am open to rethink to white list it in the initialize call.

I think me and others gave a lot of examples of this kind of config in this thread - it really is a very common use case.

> But that would be something clients need to opt in.

The server could just make the request and if it returns an error fall back to a different mechanism. But the whole whitelist could also be announced in ClientCapabilities.
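
A minimal sketch of the "try the request, fall back on error" idea. `RequestError`, `load_settings`, the `example` section, and the duck-typed `client.request` are all hypothetical names chosen for illustration, and the choice of initializationOptions as the fallback source is an assumption.

```python
import asyncio
from typing import Any, Dict, Optional


class RequestError(Exception):
    """Hypothetical error raised when the client answers with a JSON-RPC error."""

    def __init__(self, code: int, message: str) -> None:
        super().__init__(message)
        self.code = code


async def load_settings(
    client: Any,
    initialization_options: Optional[Dict[str, Any]],
    defaults: Dict[str, Any],
) -> Dict[str, Any]:
    try:
        # Preferred path: pull the settings from the client.
        [pulled] = await client.request(
            "workspace/configuration", {"items": [{"section": "example"}]}
        )
        return {**defaults, **(pulled or {})}
    except RequestError:
        # Client rejected the request (for example "method not found"):
        # fall back to whatever was passed at startup.
        return {**defaults, **(initialization_options or {})}
```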

mickaelistria · 2018-12-18T11:52:06Z

> But as others pointed out this should not be used to pass user configuration since this is something a server needs to handle dynamically anyways.

static vs dynamic is not absolute and depends a lot on the client and the server. Some clients tend to allow/encourage a lot of user configuration, some others using similar LS would prefer providing a pre-defined config and not let user tweak it. As such, I don't think using the static vs dynamic distinction is reliable here.

dbaeumer · 2018-12-18T14:20:36Z

@felixfbecker and @mickaelistria can you both please write what you would like to see written in the spec. I have to say I am puzzled here since it is not clear to me what kind of settings you want to allow and what kind of settings you want to forbid. The original idea of the initializationOptions option was to pass parameters that are usually passed on the command line, to ease that.

I will revert the changes and wait for you to make concrete wording proposals.

mickaelistria · 2018-12-18T14:31:13Z

I would like to see specified something like

> It's recommended that initializationOptions can read a superset of didChangeConfiguration parameters, and interpret them as default/initial values.

and/or the other way round

> parameters of didChangeConfiguration should also be valid as input to initialization, where they would be interpreted as default/initial values.

mickaelistria · 2018-12-18T14:38:40Z

and also something like

> initializationOptions and didChangeConfiguration can receive any values. This flexibility increases the difficulty for clients to adopt Language Servers, as they need to learn about the Language Server specific values that can be set there and implement support for them. It's recommended to rely on those as little as possible to ease integration.

dbaeumer · 2018-12-19T10:01:48Z

> It's recommended that initializationOptions can read a superset of didChangeConfiguration parameters, and interpret them as default/initial values.

I thought this is exactly what we don't want. We shouldn't push server developers toward updating a configuration using didChangeConfiguration. Configuration should always be pulled from the client and didChangeConfiguration should only be used to signal a change. So maybe in a first step I will clarify this.

In my opinion to make this easier for clients to handle we should say:

- no configuration / setting in initializationOptions
- settings should be fetched using workspace/configuration and a server needs to spec which configuration settings it expects and can handle.
- didChangeConfiguration should only be used to signal a change in configuration, not to push data.

felixfbecker · 2018-12-19T11:10:11Z

I agree that they shouldn’t be in initialisationOptions, and the docs should clarify that. The term used for user-defined settings is “configuration”, not “options”.
I am proposing to whitelist workspace/configuration during initialize.

This seems like the simplest solution to me to solve the use case and without any drawbacks.

dbaeumer · 2021-10-28T11:53:26Z

It is possible to send getConfiguration requests in the initialized notification.

I will close the issue since I am really not a fan of having another property during initialization. Please ping if you think otherwise.

michaelpj · 2022-03-07T10:21:10Z

> It is possible to send getConfiguration requests in the initialized notification.

The current spec says

> In addition the server is not allowed to send any requests or notifications to the client until it has responded with an InitializeResult, with the exception that during the initialize request the server is allowed to send the notifications window/showMessage, window/logMessage and telemetry/event as well as the window/showMessageRequest request to the client.

Which seems to contradict what you said, @dbaeumer ?

rwols · 2022-03-07T11:51:12Z

No, this contradicts nothing.

- initialize is a request sent from client to server.
- initialized is a notification from the client. Clients are expected to send this notification after receiving a response to the initialize request.

A server can therefore do a workspace/configuration request in its handler for initialized notifications.
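
The distinction can be sketched as a small phase tracker that applies the rule quoted from the spec above. The `Lifecycle` class and its phase names are made up for illustration; the four allowed methods are the ones listed in the spec text.

```python
# Messages the spec allows the server to send while the initialize
# request is still being handled.
ALLOWED_DURING_INITIALIZE = frozenset(
    {
        "window/showMessage",
        "window/logMessage",
        "telemetry/event",
        "window/showMessageRequest",
    }
)


class Lifecycle:
    """Tracks which phase the server is in (phase names are illustrative)."""

    def __init__(self) -> None:
        self.phase = "not_started"

    def initialize_received(self) -> None:
        # The client's initialize request arrived and is being processed.
        self.phase = "initializing"

    def initialize_responded(self) -> None:
        # InitializeResult was sent; the initialized notification follows.
        self.phase = "ready"

    def may_send(self, method: str) -> bool:
        """Whether the server may send `method` to the client right now."""
        if self.phase == "initializing":
            return method in ALLOWED_DURING_INITIALIZE
        return self.phase == "ready"
```

Under this model a handler for the initialized notification always runs in the `ready` phase, so a workspace/configuration request issued from it is permitted.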

michaelpj · 2022-03-07T12:21:32Z

Gotcha, thanks!

nayeemrmn · 2024-04-30T11:58:15Z

> I will look into white listing workspace/configuration.

> It is possible to send getConfiguration requests in the initialized notification.

@dbaeumer The latest solution doesn't seem sufficient and I think workspace/configuration should be allow-listed during initialize.

Good servers should pull relevant workspace settings by namespace, for each workspace folder if interested, and incorporate that before they start processing normal document notifications/requests. Currently you can only do that in the initialized handler. This means every server must have a boilerplate initializer lock which blocks out other message handling until that part of the initialized handler is completed.
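
For illustration, such an initializer lock might look like the sketch below, assuming an asyncio-based server. `Server`, `on_hover`, the `example` section, and the `indent` setting are hypothetical names; the duck-typed `client.request` stands in for whatever transport the server uses.

```python
import asyncio
from typing import Any, Dict, Optional


class Server:
    """Sketch of a server that holds back requests until config is pulled.

    Construct it inside a running event loop so the asyncio.Event is
    created on the loop that will use it.
    """

    def __init__(self, client: Any) -> None:
        self.client = client
        self.settings: Optional[Dict[str, Any]] = None
        self._ready = asyncio.Event()  # the "initializer lock"

    async def on_initialized(self) -> None:
        # Pull the settings, then release every handler waiting on the gate.
        [self.settings] = await self.client.request(
            "workspace/configuration", {"items": [{"section": "example"}]}
        )
        self._ready.set()

    async def on_hover(self, _params: Any) -> Dict[str, str]:
        # Every ordinary handler needs this line (or a wrapper that adds it).
        await self._ready.wait()
        assert self.settings is not None
        return {"contents": f"indent={self.settings['indent']}"}
```

The boilerplate is the `await self._ready.wait()` line, which has to guard every handler that reads settings.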

Is that the basic recommended approach? Or are servers supposed to initialize once with default settings, initialize again with pulled config slightly later while possibly receiving doc info in between? I don't see any other interpretation.

Well, extension authors have avoided this and are still heavily relying on initializationOptions for startup workspace settings despite using pull-based config when it comes to receiving workspace/didChangeConfiguration. IMO that should only be a fallback for clients that don't support pull-based config, if they're worth the effort. My problem is that initializationOptions has no convention for per-workspace-folder settings.

Is there a good reason to not allow-list workspace/configuration?

michaelpj · 2024-04-30T13:46:18Z

> Is that the basic recommended approach? Or are servers supposed to initialize once with default settings, initialize again with pulled config slightly later while possibly receiving doc info in between?

This is what we're doing in HLS. In fact we do all of what you said:

- Start with default config
- Take the config from initializationOptions if it's there
- Fire off workspace/configuration in the initialized handler

I agree that this is not very good, and it certainly seems that if you do the recommended thing you can start getting sent requests before you have your config set up.
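
Those three layers can be sketched as follows. This is not HLS code; `DEFAULTS`, `Settings`, and the setting names are invented for illustration, and the merge is deliberately shallow.

```python
from typing import Any, Dict

# Hypothetical built-in defaults.
DEFAULTS: Dict[str, Any] = {"formatting": True, "maxProblems": 100}


class Settings:
    """Layered settings: defaults, then initializationOptions, then pulled config."""

    def __init__(self) -> None:
        # Layer 1: start with the defaults.
        self.current: Dict[str, Any] = dict(DEFAULTS)

    def apply_initialization_options(self, options: Any) -> None:
        # Layer 2: initializationOptions may be absent or of any shape,
        # so anything that is not a dict is ignored.
        if isinstance(options, dict):
            self.current.update(options)

    def apply_pulled(self, pulled: Any) -> None:
        # Layer 3: the workspace/configuration result, applied once the
        # response arrives. Requests handled before that see layers 1 and 2.
        if isinstance(pulled, dict):
            self.current.update(pulled)
```

Anything handled between the second and third step runs against settings that may be replaced moments later, which is the window described above.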

aon012345 · 2024-05-01T04:45:55Z

good

dbaeumer · 2024-05-06T13:26:32Z

> Is there a good reason to not allow-list workspace/configuration?

In general I try to keep the set of messages that can be sent to the client in initialized as small as possible to make the initialization phase on the client simple.

> Currently you can only do that in the initialized handler

I usually do that when actually processing a request and then cache the result of the workspace/configuration. When I receive a workspace/didChangeConfiguration I simply clear the cache.

nayeemrmn · 2024-05-06T14:20:05Z

> I usually do that when actually processing a request and then cache the result of the workspace/configuration.

This is not a good candidate for lazy init because:

- workspace/configuration generally is behind an async binding, and accessing config shouldn't need to be.
- It spreads to other structures that depend on config. Now everything must be computed lazily and asynchronously.
- Lazy computation is used to make things more responsive. In this case it's less responsive compared to acting on workspace/didChangeConfiguration.

> In general I try to keep the message that can be sent to the client in initialized as small as possible to make the initialization phase on the client simple.

I think that would mean just storing the initialize params and returning server info and capabilities, since technically everything else can be done lazily. But server authors won't use it that way.

A more meaningful differentiation is that initialize is the request that all other requests are blocked on, hinting that it should get up-to-date everything required for handling textDocument/* notifications. That makes more sense so server authors are using it that way (but relying on initializationOptions).

Please reconsider allow-listing workspace/configuration during initialize.

dbaeumer added the discussion label Sep 12, 2018

dbaeumer closed this as completed in 545cd69 Dec 18, 2018

dbaeumer reopened this Dec 18, 2018

nevill mentioned this issue Jun 27, 2020

add parameterizable duration to retrieve data from prometheus prometheus-community/promql-langserver#177

Merged

radeksimko mentioned this issue Jul 1, 2020

Allow user to override discovery of root modules hashicorp/terraform-ls#189

Closed

rcjsuen mentioned this issue Jul 28, 2020

Add formatting option for ignoring multiline statements rcjsuen/dockerfile-utils#62

Closed

lukel97 mentioned this issue Aug 28, 2020

Remove configuration aspect from onInitialConfiguration? haskell/lsp#253

Closed

dbaeumer added the initialization label Nov 12, 2020

mfussenegger mentioned this issue Dec 31, 2020

LSP: Move workspace/configuration from nvim-lspconfig to core neovim/neovim#13649

Merged

dbaeumer closed this as completed Oct 28, 2021

dbaeumer removed this from the Backlog milestone Nov 2, 2021

michaelpj mentioned this issue Mar 7, 2022

HLS does not send workspace/configuration on startup haskell/haskell-language-server#2762

Closed

fwcd mentioned this issue Sep 13, 2022

Investigate improving configuration flow by sending workspace/configuration at startup fwcd/kotlin-language-server#392

Open

kLabz mentioned this issue Dec 23, 2022

haxe-language-server doesn't work until the first onDidChangeConfiguration event vshaxe/vshaxe#359

Open

asok mentioned this issue May 20, 2023

Auto detect formatter when client did not specify it Shopify/ruby-lsp#723

Merged

rgrinberg mentioned this issue Jul 3, 2023

Support lsp server settings ocamllabs/vscode-ocaml-platform#1157

Merged

Khady mentioned this issue Jul 4, 2023

ocamllsp shouldn't rely on workspace/didChangeConfiguration to get its settings ocaml/ocaml-lsp#1160

Open

michaelpj mentioned this issue Aug 6, 2023

Support workspace/configuration request for every server initilize haskell/lsp#510

Closed

lukaszsamson mentioned this issue Aug 13, 2023

The server should not depend on client sending workspace/didChangeConfiguration elixir-lsp/elixir-ls#961

Closed

krassowski mentioned this issue Oct 12, 2023

Support initializationOptions to configure the server python-lsp/python-lsp-server#459

Merged

tkrabel-db mentioned this issue Oct 13, 2023

Maybe use initializationOptions as additional source of settings python-lsp/python-lsp-server#195

Closed

JohnnyMorganz mentioned this issue Jan 20, 2024

Split Roblox-specific functionality into dedicated interface JohnnyMorganz/luau-lsp#505

Merged
nayeemrmn mentioned this issue May 27, 2024

perf(lsp): lock out requests until init is complete denoland/deno#23998

Merged

nayeemrmn mentioned this issue Sep 14, 2024

LSP: textDocument/completion hangs starting in version 1.43 denoland/deno#25610

Closed