[WIP] Collaborative editing using Yjs #1

dmonad · 2021-01-20T13:55:40Z

This PR implements collaborative editing in JupyterLab using the Yjs shared editing framework. Yjs is an open-source framework to build collaborative applications using data structures (CRDTs) that sync automatically.

Try it out now:

Scope

The aim of this PR is to switch to Yjs as the data model for notebooks and text files. The ability to share content between users will be provided by separate plugins that connect the Yjs data model with other peers. Currently, this PR also implements a basic shared-editing server that synchronizes clients that open the same file. We will move this shared-editing server into a separate plugin.

Status

There are a couple of known UI regressions and we probably have to do some refactoring. We are working on it. But this PR is already usable.

Known bugs:

It is currently not possible to rename files.
Since outputs are shared, a user can cause HTML script tags to be rendered in the browsers of other users.

Quick start

# optional: use conda
conda create -n yjupyter
conda activate yjupyter
conda install jupyterlab

# clone our branch
git clone git@github.com:QuantStack/jupyterlab.git --branch yjupyter
cd jupyterlab

# installation steps as usual for Jupyter development
pip install -e .
jlpm
jlpm build
jupyter lab --dev-mode

Technical details

Existing work

Ticket to track Real Time Collaboration: jupyterlab/jupyterlab#5382
Using Lumino datastore by @saulshanabrook: Add datastore support jupyterlab/jupyterlab#6871
Jupyter Real Time Collaboration monorepo: https://github.com/jupyterlab/rtc

Currently, Jupyter Notebooks and several other components use ModelDB to model the notebooks' internal representation. It provides observable data structures that fire events when data is added or removed.

This PR replaces the IModel data model with Yjs' shared types that provide the same functionality. Yjs is meant for building collaborative applications and provides many helpful, well-tested abstractions that reduce the complexity of this codebase significantly. For example, we removed several hundred lines of code that keep ModelDB in sync with the CodeMirror editor. Instead, we use the y-codemirror editor binding that keeps the editor in sync with Yjs' data structures.

We are aware that this will break existing plugins that rely on the IModelDB interface. We want to make this upgrade as easy as possible and keep existing APIs (e.g. the event emitters) whenever possible.

We also restructured how the internal data is represented using the observable data structures. Before, we had a complex mixture of key-value stores and observable arrays based on ModelDB. With Yjs, we produced a nearly one-to-one mapping from a Yjs document to the .ipynb JSON format. ydoc.toJSON() is an existing method that converts a Yjs document to a JSON representation that is very similar to the .ipynb JSON format (some keys are missing). Developers who are familiar with the JSON format will easily know how to work with the Yjs data model.

We also want to make it easier for plugins to provide additional features based on Yjs as a data model. A separate plugin could provide commenting features based on annotations on the Yjs document. "Relative Positions" is another Yjs concept that makes it possible to assign information to a range of text while automatically adjusting for position changes.
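To illustrate the idea behind "Relative Positions", here is a minimal sketch in plain Python. It is not the Yjs API or its implementation. The class `AnchoredText` and its methods are hypothetical names made up for this example. The sketch only shows the principle: an annotation is attached to a character's permanent id instead of a numeric index, so it survives edits elsewhere in the text.

```python
import itertools
from typing import List, Optional, Tuple


class AnchoredText:
    """Toy text buffer in which every character keeps a permanent unique id.

    Hypothetical illustration only; Yjs uses its own internal item ids.
    """

    def __init__(self, text: str = "") -> None:
        self._ids = itertools.count()
        self._chars: List[Tuple[int, str]] = [(next(self._ids), c) for c in text]

    def insert(self, index: int, text: str) -> None:
        """Insert text at index; existing characters keep their ids."""
        self._chars[index:index] = [(next(self._ids), c) for c in text]

    def anchor_at(self, index: int) -> int:
        """Return an index-independent anchor for the character at index."""
        return self._chars[index][0]

    def resolve(self, anchor: int) -> Optional[int]:
        """Return the current index of the anchored character, or None if unknown."""
        for i, (char_id, _) in enumerate(self._chars):
            if char_id == anchor:
                return i
        return None

    def __str__(self) -> str:
        return "".join(c for _, c in self._chars)


text = AnchoredText("x = compute()")
# Anchor a comment to the word "compute" (indices 4 through 10).
start, end = text.anchor_at(4), text.anchor_at(10)

# An edit before the annotated range shifts every index by seven...
text.insert(0, "result_")

# ...but the anchors still resolve to the same characters.
annotated = str(text)[text.resolve(start):text.resolve(end) + 1]
```

A commenting plugin could store such anchors alongside the comment text and resolve them to concrete offsets only when rendering.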

Another complex problem that Yjs solves is selective undo/redo. We replaced the existing undo manager with a powerful Yjs-based alternative that allows you to selectively decide which changes you want to be able to undo. Text modifications to the editor models and cell insertions are tracked as "undoable", while other changes to the Yjs data model are not tracked (e.g. modifications to metadata and the computed output). The use case for the selective undo manager is that you want to prevent users from undoing remote changes created by other users.
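The selection rule described above can be sketched in a few lines of plain Python. This is a toy model, not the Yjs undo manager. `SelectiveUndoManager`, the `kind` and `origin` labels, and the notebook dictionary are all assumptions made for illustration. A change is pushed onto the undo stack only when both its kind (e.g. cell insertion) and its origin (local user) are tracked.

```python
from typing import Callable, List, Set


class SelectiveUndoManager:
    """Toy undo stack that records a change only if its kind and origin are tracked."""

    def __init__(self, tracked_kinds: Set[str], tracked_origins: Set[str]) -> None:
        self._tracked_kinds = tracked_kinds
        self._tracked_origins = tracked_origins
        self._undo_stack: List[Callable[[], None]] = []

    def apply(self, kind: str, origin: str,
              do: Callable[[], None], undo: Callable[[], None]) -> None:
        """Always perform the change; remember how to revert it only if tracked."""
        do()
        if kind in self._tracked_kinds and origin in self._tracked_origins:
            self._undo_stack.append(undo)

    def undo(self) -> bool:
        """Revert the most recent tracked change; return False if there is none."""
        if not self._undo_stack:
            return False
        self._undo_stack.pop()()
        return True


# Hypothetical notebook state; only local source and cell changes are undoable.
notebook = {"cells": [], "metadata": {}}
manager = SelectiveUndoManager({"source", "cells"}, {"local"})


def insert_cell(cell: dict, origin: str) -> None:
    # Reverting removes the first equal cell, so this sketch assumes distinct cells.
    manager.apply("cells", origin,
                  do=lambda: notebook["cells"].append(cell),
                  undo=lambda: notebook["cells"].remove(cell))


local_cell = {"cell_type": "code", "source": "print('mine')"}
remote_cell = {"cell_type": "code", "source": "print('theirs')"}

insert_cell(local_cell, origin="local")     # tracked
insert_cell(remote_cell, origin="remote")   # applied, but not tracked
manager.apply("metadata", "local",          # metadata is never tracked
              do=lambda: notebook["metadata"].update(trusted=True),
              undo=lambda: notebook["metadata"].pop("trusted"))

undone = manager.undo()        # reverts only the local cell insertion
undone_again = manager.undo()  # nothing left to undo
```

After the first undo, the remote user's cell and the metadata change are still present, which is the behavior the selective undo manager is meant to guarantee.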

Currently, this PR also implements a WebSocket server (in Python) to sync connected clients, and a hook to connect the Yjs data model to the server. We still use HTTP requests to save the notebook content to the server. Concurrent access is prevented using a locking implementation that is similar to "redlock".
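The locking code itself is not shown in this description, so the following is only a sketch of what a redlock-style lock can look like in a single process. It borrows two ideas from Redlock: every holder gets a random token, and every lock expires after a time-to-live so a crashed client cannot block saving forever. The multi-node quorum part of Redlock is left out. `LeaseLock` and the resource name are hypothetical.

```python
import secrets
import time
from typing import Callable, Dict, Optional, Tuple


class LeaseLock:
    """Single-process lease lock with random tokens and expiry (illustrative only)."""

    def __init__(self, clock: Callable[[], float] = time.monotonic) -> None:
        self._clock = clock
        # resource name -> (holder token, expiry timestamp)
        self._leases: Dict[str, Tuple[str, float]] = {}

    def acquire(self, resource: str, ttl: float) -> Optional[str]:
        """Return a fresh token if the resource is free or its lease expired, else None."""
        now = self._clock()
        lease = self._leases.get(resource)
        if lease is not None and lease[1] > now:
            return None  # another client holds a live lease
        token = secrets.token_hex(16)
        self._leases[resource] = (token, now + ttl)
        return token

    def release(self, resource: str, token: str) -> bool:
        """Release only if the token matches, so a stale holder cannot unlock."""
        lease = self._leases.get(resource)
        if lease is None or lease[0] != token:
            return False
        del self._leases[resource]
        return True


# A fake clock keeps the example deterministic.
now = [0.0]
lock = LeaseLock(clock=lambda: now[0])

first = lock.acquire("notebook.ipynb", ttl=5.0)    # client A gets the lock
blocked = lock.acquire("notebook.ipynb", ttl=5.0)  # client B is refused

now[0] = 6.0                                       # client A's lease has expired
second = lock.acquire("notebook.ipynb", ttl=5.0)   # client B takes over

stale_release = lock.release("notebook.ipynb", first)  # A's old token is rejected
clean_release = lock.release("notebook.ipynb", second)
```

The token check on release matters because a slow client may try to release a lock that has already expired and been handed to someone else.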

We will move the server implementation and the hook into a separate plugin so that third parties can implement their own custom server. Applications that use Jupyter notebooks, like JupyterHub, will be able to add custom authorization and access control to the server.

Yjs is network agnostic and doesn't need a server to perform conflict resolution. The implemented WebSocket server (79 lines of code) only forwards messages to other clients and implements a little custom logic (e.g. room management and locking). This implementation is fully functional and adds little overhead. However, we want the Yjs data model to be accessible in Python as well. Next, we will work on a Rust port of Yjs, including Python bindings, that will allow the server to parse the shared document and perform modifications.
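Because the server never needs to understand the messages, its core can be very small. The sketch below shows the forwarding and room-management idea without any networking. `RelayHub`, the client ids, and the `send` callbacks are assumptions made for this example and are not taken from the 79-line server. In a real server, `send` would be the write method of a WebSocket connection.

```python
from collections import defaultdict
from typing import Callable, DefaultDict, Dict

Send = Callable[[bytes], None]


class RelayHub:
    """Forwards opaque messages between clients that joined the same room."""

    def __init__(self) -> None:
        # room name -> {client id -> callback that delivers bytes to that client}
        self._rooms: DefaultDict[str, Dict[str, Send]] = defaultdict(dict)

    def join(self, room: str, client_id: str, send: Send) -> None:
        self._rooms[room][client_id] = send

    def leave(self, room: str, client_id: str) -> None:
        members = self._rooms.get(room)
        if members is None:
            return
        members.pop(client_id, None)
        if not members:
            del self._rooms[room]  # drop empty rooms

    def forward(self, room: str, sender_id: str, message: bytes) -> int:
        """Send message unchanged to everyone in the room except the sender."""
        delivered = 0
        for client_id, send in list(self._rooms.get(room, {}).items()):
            if client_id != sender_id:
                send(message)
                delivered += 1
        return delivered


# Lists stand in for network connections; one room per opened file.
inbox = {"alice": [], "bob": [], "carol": []}
hub = RelayHub()
hub.join("notebook.ipynb", "alice", inbox["alice"].append)
hub.join("notebook.ipynb", "bob", inbox["bob"].append)
hub.join("other.py", "carol", inbox["carol"].append)

# The payload is treated as opaque bytes and is never parsed by the hub.
delivered = hub.forward("notebook.ipynb", "alice", b"\x00\x01update")
```

The hub does no conflict resolution at all. That work happens in the clients, which is why the relay can stay this simple.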

Next steps

Fix the remaining bugs
Move the server into a separate plugin and provide extension points for custom shared-editing servers.
Discuss problems that arise when implementing shared editing in Jupyter (e.g. do we want to share output between users? Should there be a shared kernel for a shared notebook?)


jtpio · 2021-01-20T14:20:39Z

Thanks for starting this!

Posting the link to the Binder dev mode here so it's easily accessible:

jtpio · 2021-01-22T13:16:51Z

FYI, I added the jupyterlab-link-share extension in f916cad, so it's easier to share the link to a running Binder instance.


bollwyvl added 9 commits December 28, 2020 10:13

add license-webpack-plugin

49dfb0c

also add license plugin to staging template

9935639

simplify license file name

db48f37

also add licenses to dev mode

3ecfd1d

merge upstream

970f3d0

only generate license file for production builds

53c65fa

Merge remote-tracking branch 'upstream/master' into add-license-webpa…

14e5dcb

…ck-plugin

linting

52802fe

more moving licenses to prod

952d91c

dmonad force-pushed the yjupyter branch from 5c6680d to 3f7e574 Compare January 20, 2021 15:59

tonyfast added 4 commits January 20, 2021 14:56

change extension manager header to div.header for a11y

7cad6dd

change header to div.header in running panel for a11y

be94d36

change header to div.header in toc for a11y

3830c4f

change header to div.header in debugger for a11y

8a7d5c2

jtpio and others added 14 commits January 23, 2021 09:58

Make file browser plugins more flexible

d01a194

Lint

531da1e

Rename browser widget plugin

55712b6

Tweak treePathUpdater logic

8374c20

Move the launcher button to its own plugin

23b8039

Rename plugin to launcherToolbarButton

497eb50

Move settings handling to the browser plugin

24b23ba

Group activate functions with plugin definitions

1c21b1f

change header class to jp-stack-panel-header for debugger

d89123d

change header class to jp-stack-panel-header for the extension manager

3ddeffa

change header to jp-stack-panel-header in running

b6d836d

update header to jp-stack-panel-header in toc

b7a97f8

Update notebook toolbar example docs

ec1eae4

merge master

583ee3f

dmonad and others added 27 commits February 12, 2021 18:38

fix a few issues

3fe6265

fix a few bugs and make it work (with some issues)

1d76e62

add websocket provider

25de677

Add jupyterlab-link-share to the dev Binder

e3f0373

rework some changes

a9ac658

synchronize outputs

bfbd15d

add yawareness

5ddd39c

make sure awareness is defined

d30b1be

type error

1dd65b2

fix saving of content

7d9e2fc

fix loading issue of files with output

b0c8ab2

fix cell-move issue

c9cef68

integrate Y.UndoManager

2a4982b

selective undo tracking

8acc057

YJS echo WebSocket

60d0ff8

remove prints

0aef2ac

Remove unused y-webrtc package

90be06f

fix unhandled message issue

33603b8

fix initial duplication issue and implement lock for saving to disk

5a295dd

prevent mutex lock of y-websocket by sending messages async

61a6c1f

upgrade yjs

b325081

fix sync issue in binder

a50b12b

fix cell moving issue

477dd91

fix initialization flow (no automatic cell insertion at the beginning…

f6047d7

… of a collaborative session)

fix empty notebook init

722af57

Update to jupyterlab-link-share=0.2

b3f11c6

rebase and update deps

3b3cc12

dmonad force-pushed the yjupyter branch from 7caabb5 to 3b3cc12 Compare February 12, 2021 17:51

fix loading issue that occured since last rebase

71e192a

dmonad closed this Feb 12, 2021
