
Install JupyterHub in the cern-vre cluster via tf helm provider #21

Closed
17 tasks done
goseind opened this issue Dec 1, 2022 · 20 comments
Assignees
Labels
cern-vre-infra (Things only related to and dependent on our team component), component/images (Container images for jhub profiles), component/jhub, enhancement (New feature or request), priority/critical (Needs to be done very soon)

Comments

@goseind

goseind commented Dec 1, 2022

@goseind goseind self-assigned this Dec 1, 2022
@goseind goseind removed the terraform label Jan 20, 2023
@goseind goseind removed their assignment Jan 20, 2023
@goseind goseind added the enhancement, component/jhub and priority/critical labels Jan 20, 2023
@goseind

goseind commented Jan 30, 2023

Why did the ESCAPE team decide to go with singleuser.storage.static instead of dynamically provisioned PVCs through a storage class and how does Jhub provide individual storage with a static PVC?

IMG_20230130_172300.jpg

tag @garciagenrique

@garciagenrique

  • JupyterHub was installed following the zero2jupyterhub documentation, using helm charts.

    • The config.yaml file is empty so that z2jh starts with the default values.
    • However, the deployment won't succeed because no storage is assigned to the cluster. See below.
  • Currently the hub pod is stuck in Pending with the following error:

Type     Reason            Age                  From               Message
----     ------            ----                 ----               -------
Warning  FailedScheduling  19m (x309 over 26h)  default-scheduler  0/3 nodes are available: 3 pod has unbound immediate PersistentVolumeClaims. preemption: 0/3 nodes are available: 3 Preemption is not helpful for scheduling.

This error is connected with the problem @goseind raised above: why static vs. dynamic storage for the cluster.

@garciagenrique

The following commands were used for the deployment (helm needs to be installed, of course):

$ helm repo add jupyterhub https://jupyterhub.github.io/helm-chart/
$ helm repo update

To install or update, just modify the config.yaml file, save it, and run:

$ helm upgrade --cleanup-on-fail \
  --install z2jh jupyterhub/jupyterhub \
  --namespace jupyterhub \
  --create-namespace \
  --version=2.0.0 \
  --values config.yaml

Uninstalling the helm chart does not work reliably. The way it's currently done is by logging into k9s, selecting the jupyterhub release (or whatever it was called), and deleting it completely.
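Uninstalling through helm directly might be a cleaner alternative to deleting things in k9s. A sketch, assuming the release name (z2jh) and namespace (jupyterhub) from the install command above:

```shell
# Remove the release installed above (release name and namespace assumed
# from the helm upgrade command).
helm uninstall z2jh --namespace jupyterhub

# helm does not delete the per-user PVCs created by the spawner; remove
# them separately if a full cleanup is wanted.
kubectl --namespace jupyterhub delete pvc --all
```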

@goseind goseind self-assigned this Feb 6, 2023
@goseind

goseind commented Feb 6, 2023

I have requested a quota change for the number of shares, so we can use dynamically provisioned storage.

@goseind

goseind commented Feb 6, 2023

Done, quota updated to 200 shares.

Image

goseind added a commit that referenced this issue Feb 6, 2023
goseind added a commit that referenced this issue Feb 6, 2023
goseind added a commit that referenced this issue Feb 6, 2023
goseind added a commit that referenced this issue Feb 6, 2023
goseind added a commit that referenced this issue Feb 6, 2023
@goseind

goseind commented Feb 6, 2023

JupyterHub is running under http://137.138.226.35 (IP subject to change). Current status:

tf commands:

terraform apply -target=kubernetes_namespace_v1.ns_jupyterhub
terraform apply -target=kubernetes_storage_class_v1.sc_manila-meyrin-cephfs # sc with Delete policy did not exist yet
terraform destroy -target=helm_release.jupyterhub-chart

@garciagenrique see if you can follow my changes; the extra values I set are described in the customization docs. As I have no idea why the value does not get set, I'll have to wait for an answer in the forum.
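For reference, the storage class targeted above could look roughly like this in Terraform. This is a hypothetical sketch: the provisioner name and parameters are assumptions based on the upstream Manila CSI driver, not the verified CERN values.

```terraform
# Hypothetical sketch: storage class with a Delete reclaim policy for
# dynamically provisioned Manila CephFS shares. Provisioner and parameters
# are assumptions, not the exact CERN values.
resource "kubernetes_storage_class_v1" "sc_manila-meyrin-cephfs" {
  metadata {
    name = "manila-meyrin-cephfs-delete"
  }
  storage_provisioner = "cephfs.manila.csi.openstack.org"
  reclaim_policy      = "Delete"
  parameters = {
    type = "meyrin-cephfs" # Manila share type (assumed)
  }
}
```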

@goseind

goseind commented Feb 7, 2023

My workaround of setting fsGroup to 0 in order to run with root access didn't work, as the following error then occurs: Running as root is not recommended. Use --allow-root to bypass. This has been a problem for other users too, see: jupyterhub/zero-to-jupyterhub-k8s#562 or jupyterhub/zero-to-jupyterhub-k8s#2177

Also asking the SWAN team how they worked around this, ref.: https://github.com/swan-cern/swan-charts/blob/master/swan/values.yaml#L22

Another idea would be to use extraConfig to directly modify KubeSpawner.
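That idea could look roughly like this in the helm values. A hypothetical sketch, not our tested config: the snippet key and the group id 100 are placeholders, and it assumes KubeSpawner's extra_pod_config trait gets merged into the pod spec.

```yaml
# Hypothetical sketch: set the pod securityContext through KubeSpawner
# directly via hub.extraConfig instead of singleuser.extraPodConfig.
# The snippet name and fsGroup value are placeholders.
hub:
  extraConfig:
    pod-security-context: |
      c.KubeSpawner.extra_pod_config = {
          "securityContext": {
              "fsGroup": 100,
              "fsGroupChangePolicy": "OnRootMismatch",
          },
      }
```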

  • Delete PVCs used for testing

@goseind

goseind commented Feb 10, 2023

To solve the issue with the extraPodConfig not being set, I created a bug report in the JupyterHub repo: jupyterhub/zero-to-jupyterhub-k8s#3021

@garciagenrique

@goseind could we just not set up a podSecurityContext for the moment, finish configuring JHub, and implement this later?

goseind added a commit that referenced this issue Feb 13, 2023
goseind added a commit that referenced this issue Feb 13, 2023
goseind added a commit that referenced this issue Feb 13, 2023
@goseind

goseind commented Feb 13, 2023

Set up a meeting with Diogo from SWAN for next Monday to check their storage configuration with eosxd: https://gitlab.cern.ch/kubernetes/automation/charts/cern/-/tree/master/eosxd

goseind added a commit that referenced this issue Feb 13, 2023
goseind added a commit that referenced this issue Feb 13, 2023
goseind added a commit that referenced this issue Feb 13, 2023
@goseind goseind reopened this Feb 13, 2023
@goseind

goseind commented Feb 13, 2023

For an update see #34, the service is now reachable from within CERN but needs further configuration, as listed in the issue description.
@garciagenrique can you add a PR with the cvmfs configuration?

goseind added a commit that referenced this issue Feb 14, 2023
goseind added a commit that referenced this issue Feb 14, 2023
@goseind goseind added the cern-vre-infra and component/images labels Feb 14, 2023
@garciagenrique

The base image of the ESCAPE-VRE is this one: https://gitlab.cern.ch/escape-wp2/docker-images/-/tree/master/datalake-singleuser
There are some features that could be improved (maybe the Python version and the latest Rucio client versions?), but all the configuration for adding the rucio-jupyterlab plugin is linked there.

I suggest either starting with this image or basing a new one on it.

@wiegerthefarmer

> @goseind could we just not set up a podSecurityContext for the moment, finish configuring JHub, and later implement this ?

Is there a solution to this? I've tried everything to get fsGroupChangePolicy: "OnRootMismatch" set, but nothing gets passed to the pod. Only setting the values in a YAML used to start the pod directly works; nothing set through the spawner does.

@goseind

goseind commented Apr 17, 2023

> @goseind could we just not set up a podSecurityContext for the moment, finish configuring JHub, and later implement this ?
>
> is there a solution to this? I've tried everything to get fsGroupChangePolicy: "OnRootMismatch" set. Nothing gets passed to the pod. Only setting the values in a yaml that is used to start the pod works. But nothing in the spawner works.

So far we haven't found a solution either: some values get set through the YAML values, but not the ones we need, and setting them with the extra config script doesn't work either. We need to debug this further.

@goseind

goseind commented Jun 5, 2023

  • Adjust the K8s network policy of JHub for URLs to work (optional?)
  • Use a Let's Encrypt cert with cert-manager
  • Change single-user storage to a share once the fsGroup issue is solved
  • Rebuild the main single-user images on GH (see my example repo)
  • Move the deployment to ArgoCD once it is set up
  • Redo the EOS FUSE mount with the CERN-provided image or rebuild the old one here (use a private image as the keytab needs to stay secret), ref.: Enable EOS mount OpenStack #130

@goseind

goseind commented Jun 7, 2023

@garciagenrique I now merged the config, from my side this looks fine. Do you want to take charge of redoing the images, also the EOS image? References can be found in the config of the images I was using/creating.

@goseind

goseind commented Aug 21, 2023

I think this issue could be closed and the remaining tasks split up into separate tasks as they are not directly linked to the initial goal of this issue. What do you think @garciagenrique ?

@garciagenrique

I agree @goseind, although this issue became too big and touched on plenty of subjects...

Could we start an interaction in this thread to summarize the remaining tasks?

@goseind goseind removed their assignment Sep 22, 2023
@goseind

goseind commented Sep 22, 2023

In my opinion, this topic can now be closed as all the tasks have been completed.
