Linstor-Controller crashing #512

Open
dimm0 opened this issue Aug 1, 2023 · 7 comments
Labels: bug (Something isn't working), v2 (This affects only Operator v2)

Comments

dimm0 commented Aug 1, 2023

Error:

root@linstor-controller-5557d9ccb4-dffn8:/# linstor error-reports show 64C941EE-00000-000004
ERROR REPORT 64C941EE-00000-000004

============================================================

Application:                        LINBIT® LINSTOR
Module:                             Controller
Version:                            1.23.0
Build ID:                           28dbd33ced60d75a2a0562bf5e9bc6b800ae8361
Build time:                         2023-05-23T06:27:14+00:00
Error time:                         2023-08-01 17:35:50
Node:                               linstor-controller-5557d9ccb4-dffn8

============================================================

Reported error:
===============

Category:                           Error
Class name:                         ImplementationError
Class canonical name:               com.linbit.ImplementationError
Generated at:                       Method 'run', Source file 'SpaceTrackingTask.java', Line #300

Error message:                      Uncaught exception in k

Call backtrace:

    Method                                   Native Class:Line number
    run                                      N      com.linbit.linstor.spacetracking.k:300
    run                                      N      java.lang.Thread:829

Caused by:
==========

Category:                           RuntimeException
Class name:                         NullPointerException
Class canonical name:               java.lang.NullPointerException
Generated at:                       Method 'a', Source file 'SpaceTrackingApiCallHandler.java', Line #108


Call backtrace:

    Method                                   Native Class:Line number
    a                                        N      com.linbit.linstor.core.apicallhandler.controller.internal.a:108
    a                                        N      com.linbit.linstor.core.apicallhandler.controller.internal.a:80
    a                                        N      com.linbit.linstor.spacetracking.k:884
    c                                        N      com.linbit.linstor.spacetracking.k:548
    run                                      N      com.linbit.linstor.spacetracking.k:269
    run                                      N      java.lang.Thread:829


END OF ERROR REPORT.

Operator version 2.1.1

WanzenBug (Member) commented

Please open an issue over at https://github.com/linbit/linstor-server

Does the issue happen right at start up? If not, have you tried restarting the Pod?

WanzenBug added the bug and v2 labels Aug 2, 2023

dimm0 commented Aug 2, 2023

OK, will do.
It happens a couple of minutes after startup. Then the pod crashes and tries to start again.

dimm0 commented Aug 9, 2023

There's a fix in 1.24
How can I update the controller version in the operator? Add a patch for the controller deployment?

WanzenBug (Member) commented

You can edit the piraeus-operator-image-config ConfigMap, which holds the image information. You need to change the tags for linstor-satellite and linstor-controller.
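
As a rough illustration of that edit, the relevant part of the ConfigMap might look like the excerpt below. The `piraeus-datastore` namespace, the `0_piraeus_datastore_images.yaml` key, the `base`/`components` layout, and the `v1.24.x` tag are assumptions rather than details from this thread, so check them against the ConfigMap in your own cluster (for example with `kubectl get configmap piraeus-operator-image-config -o yaml`) and pick a tag that is actually published.

```yaml
# Excerpt only: a sketch of the assumed layout, not a complete ConfigMap.
apiVersion: v1
kind: ConfigMap
metadata:
  name: piraeus-operator-image-config
  namespace: piraeus-datastore          # assumption: default install namespace
data:
  0_piraeus_datastore_images.yaml: |    # assumption: key name used by Operator v2
    base: quay.io/piraeusdatastore
    components:
      linstor-controller:
        tag: v1.24.x                    # placeholder, replace with a real 1.24 tag
        image: piraeus-server
      linstor-satellite:
        tag: v1.24.x                    # keep in sync with the controller tag
        image: piraeus-server
```

Only the two `tag` values change; every other entry in the ConfigMap should stay as it is.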

WanzenBug (Member) commented

By the way, the original issue was only with the livenessProbe for the SpaceTracking service. You could go back to 1.23.0 and patch the deployment to remove the livenessProbe.

Something like this should work:

apiVersion: piraeus.io/v1
kind: LinstorCluster
metadata:
  name: linstorcluster
spec:
  patches:
    - target:
        kind: Deployment
        name: linstor-controller
      patch: |
        apiVersion: apps/v1
        kind: Deployment
        metadata:
          name: linstor-controller
        spec:
          template:
            spec:
              containers:
              - name: linstor-controller
                startupProbe:
                  $patch: delete
                livenessProbe:
                  $patch: delete

dimm0 commented Aug 9, 2023

I did that, but there's a second one...

LINBIT/linstor-server#364 (comment)

dimm0 commented Aug 11, 2023

Can't patch either:

manager 2023-08-11T00:13:58Z    ERROR    Reconciler error    {"controller": "linstorcluster", "controllerGroup": "piraeus.io", "controllerKind": "LinstorCluster", "LinstorCluster": {"name":"linstorcluster"}, "namespace": "", "name": "linstorcluster", "reconcileID": "7e38a128-3355-4a8c-b13d-e00b7d8e7e1c", "error": "Deployment.apps \"linstor-controller\" is invalid: spec.template.spec.containers[0].livenessProbe: Required value: must specify a handler type"}
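
The validation error suggests that the patched Deployment still contained a `livenessProbe` entry, just without a handler. One variant that might be worth trying is a JSON 6902 patch that removes the field outright. This is an untested sketch: it assumes the operator passes `spec.patches` through to kustomize (which accepts JSON 6902 patches alongside strategic merge patches) and that `linstor-controller` is the first container in the Pod template, as the `containers[0]` path in the error indicates. Nothing in this thread confirms that it resolves the error.

```yaml
apiVersion: piraeus.io/v1
kind: LinstorCluster
metadata:
  name: linstorcluster
spec:
  patches:
    - target:
        kind: Deployment
        name: linstor-controller
      # JSON 6902 patch: a list of operations instead of a partial Deployment.
      # Assumption: index 0 is the linstor-controller container.
      patch: |
        - op: remove
          path: /spec/template/spec/containers/0/livenessProbe
```

A `remove` operation fails if the path does not exist, which is why this sketch only touches `livenessProbe` (the field named in the error) and leaves `startupProbe` alone.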
