Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update telemetry adoption guide #290

Merged
merged 1 commit into from
Jul 29, 2024

Conversation

yadneshk
Copy link
Contributor

No description provided.

@yadneshk yadneshk force-pushed the telemetry_adoption branch 2 times, most recently from c6b5dae to e07187f Compare February 26, 2024 09:55
Copy link

Build failed (check pipeline). Post recheck (without leading slash)
to rerun all jobs. Make sure the failure cause has been resolved before
you rerun jobs.

https://review.rdoproject.org/zuul/buildset/39d465e1310d41a29d769feecc3376c9

data-plane-adoption-osp-17-to-extracted-crc FAILURE in 1h 17m 27s
✔️ adoption-docs-preview SUCCESS in 2m 24s

Copy link
Contributor

@jistr jistr left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks, i noticed three spots which i think need a tweak.

docs_user/modules/openstack-stop_openstack_services.adoc Outdated Show resolved Hide resolved
tests/roles/backend_services/tasks/main.yaml Outdated Show resolved Hide resolved
tests/roles/stop_openstack_services/tasks/main.yaml Outdated Show resolved Hide resolved
@yadneshk yadneshk changed the title [WIP] Update telemetry adoption guide Update telemetry adoption guide Feb 26, 2024
Copy link

Build failed (check pipeline). Post recheck (without leading slash)
to rerun all jobs. Make sure the failure cause has been resolved before
you rerun jobs.

https://review.rdoproject.org/zuul/buildset/8f7b45c630fc47c4a41a0c4f1a697d3e

data-plane-adoption-osp-17-to-extracted-crc FAILURE in 2h 06m 29s
✔️ adoption-docs-preview SUCCESS in 1m 31s

Copy link

Build failed (check pipeline). Post recheck (without leading slash)
to rerun all jobs. Make sure the failure cause has been resolved before
you rerun jobs.

https://review.rdoproject.org/zuul/buildset/f6c337ff84424033bcb1e356f072c82e

data-plane-adoption-osp-17-to-extracted-crc FAILURE in 1h 24m 44s
✔️ adoption-docs-preview SUCCESS in 2m 48s

jistr added a commit to jistr/data-plane-adoption that referenced this pull request Feb 27, 2024
In the spirit of PR 290 [1] which is now heading into main branch,
let's remove the image URLs from Autoscaling and Telemetry service
adoption. This should make the docs downstream-ready.

[1] openstack-k8s-operators#290
@yadneshk
Copy link
Contributor Author

Looking at the zuul, seems to have failed here

TASK [mariadb_copy : test connection to podified DBs (show databases)] *********
fatal: [localhost]: FAILED! => {"changed": true, "cmd": "#!/bin/bash\nset -euxo pipefail\n\n\nMARIADB_IMAGE=quay.io/podified-antelope-centos9/openstack-mariadb:current-podified\n# TODO: remove the default(external_...) when CI is transitioned to use 'source_...'\nSOURCE_MARIADB_IP=192.168.122.100\ndeclare -A SOURCE_GALERA_MEMBERS\nSOURCE_GALERA_MEMBERS=(\n[\"standalone.localdomain\"]=192.168.122.100\n# ...\n)\nSOURCE_DB_ROOT_PASSWORD=\"ZAUPKUiiDb\"\n\nPODIFIED_MARIADB_IP=10.217.4.248\nPODIFIED_CELL1_MARIADB_IP=10.217.4.198\nPODIFIED_DB_ROOT_PASSWORD=\"12345678\"\n\n# The CHARACTER_SET and collation should match the source DB\n# if the do not then it will break foreign key relationships\n# for any tables that are created in the future as part of db sync\nCHARACTER_SET=utf8\nCOLLATION=utf8_general_ci\n\n\n# Test connection to podified DBs (show databases)\noc run mariadb-client --image $MARIADB_IMAGE -i --rm --restart=Never -- \\\n    mysql -rsh \"$PODIFIED_MARIADB_IP\" -uroot \"-p$PODIFIED_DB_ROOT_PASSWORD\" -e 'SHOW databases;'\noc run mariadb-client --image $MARIADB_IMAGE -i --rm --restart=Never -- \\\n    mysql -rsh \"$PODIFIED_CELL1_MARIADB_IP\" -uroot \"-p$PODIFIED_DB_ROOT_PASSWORD\" -e 'SHOW databases;'", "delta": "0:01:00.175897", "end": "2024-02-27 07:48:16.876321", "msg": "non-zero return code", "rc": 1, "start": "2024-02-27 07:47:16.700424", "stderr": "+ MARIADB_IMAGE=quay.io/podified-antelope-centos9/openstack-mariadb:current-podified\n+ SOURCE_MARIADB_IP=192.168.122.100\n+ declare -A SOURCE_GALERA_MEMBERS\n+ SOURCE_GALERA_MEMBERS=([\"standalone.localdomain\"]=192.168.122.100)\n+ SOURCE_DB_ROOT_PASSWORD=ZAUPKUiiDb\n+ PODIFIED_MARIADB_IP=10.217.4.248\n+ PODIFIED_CELL1_MARIADB_IP=10.217.4.198\n+ PODIFIED_DB_ROOT_PASSWORD=12345678\n+ CHARACTER_SET=utf8\n+ COLLATION=utf8_general_ci\n+ oc run mariadb-client --image quay.io/podified-antelope-centos9/openstack-mariadb:current-podified -i --rm --restart=Never -- mysql -rsh 10.217.4.248 -uroot -p12345678 -e 'SHOW databases;'\nWarning: would violate PodSecurity \"restricted:v1.24\": allowPrivilegeEscalation != false (container \"mariadb-client\" must set securityContext.allowPrivilegeEscalation=false), unrestricted capabilities (container \"mariadb-client\" must set securityContext.capabilities.drop=[\"ALL\"]), runAsNonRoot != true (pod or container \"mariadb-client\" must set securityContext.runAsNonRoot=true), seccompProfile (pod or container \"mariadb-client\" must set securityContext.seccompProfile.type to \"RuntimeDefault\" or \"Localhost\")\nerror: timed out waiting for the condition", "stderr_lines": ["+ MARIADB_IMAGE=quay.io/podified-antelope-centos9/openstack-mariadb:current-podified", "+ SOURCE_MARIADB_IP=192.168.122.100", "+ declare -A SOURCE_GALERA_MEMBERS", "+ SOURCE_GALERA_MEMBERS=([\"standalone.localdomain\"]=192.168.122.100)", "+ SOURCE_DB_ROOT_PASSWORD=ZAUPKUiiDb", "+ PODIFIED_MARIADB_IP=10.217.4.248", "+ PODIFIED_CELL1_MARIADB_IP=10.217.4.198", "+ PODIFIED_DB_ROOT_PASSWORD=12345678", "+ CHARACTER_SET=utf8", "+ COLLATION=utf8_general_ci", "+ oc run mariadb-client --image quay.io/podified-antelope-centos9/openstack-mariadb:current-podified -i --rm --restart=Never -- mysql -rsh 10.217.4.248 -uroot -p12345678 -e 'SHOW databases;'", "Warning: would violate PodSecurity \"restricted:v1.24\": allowPrivilegeEscalation != false (container \"mariadb-client\" must set securityContext.allowPrivilegeEscalation=false), unrestricted capabilities (container \"mariadb-client\" must set securityContext.capabilities.drop=[\"ALL\"]), runAsNonRoot != true (pod or container \"mariadb-client\" must set securityContext.runAsNonRoot=true), seccompProfile (pod or container \"mariadb-client\" must set securityContext.seccompProfile.type to \"RuntimeDefault\" or \"Localhost\")", "error: timed out waiting for the condition"], "stdout": "pod \"mariadb-client\" deleted", "stdout_lines": ["pod \"mariadb-client\" deleted"]}

Copy link
Contributor

@jistr jistr left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks 👍

Copy link

Build failed (check pipeline). Post recheck (without leading slash)
to rerun all jobs. Make sure the failure cause has been resolved before
you rerun jobs.

https://review.rdoproject.org/zuul/buildset/5a520efff68044b29ddff9bf76907fa9

data-plane-adoption-osp-17-to-extracted-crc FAILURE in 2h 16m 55s
✔️ adoption-docs-preview SUCCESS in 2m 13s

@fao89
Copy link
Contributor

fao89 commented Feb 28, 2024

recheck

Copy link

Build failed (check pipeline). Post recheck (without leading slash)
to rerun all jobs. Make sure the failure cause has been resolved before
you rerun jobs.

https://review.rdoproject.org/zuul/buildset/0abf656b052549008376e98e4b33396c

data-plane-adoption-osp-17-to-extracted-crc FAILURE in 2h 15m 46s
✔️ adoption-docs-preview SUCCESS in 2m 05s

@fao89
Copy link
Contributor

fao89 commented Feb 28, 2024

it is failing to start the telemetry service:

2024-02-28T12:28:52Z	INFO	Controllers.OpenStackDataPlaneDeployment	Deploying service	{"controller": "openstackdataplanedeployment", "controllerGroup": "dataplane.openstack.org", "controllerKind": "OpenStackDataPlaneDeployment", "OpenStackDataPlaneDeployment": {"name":"openstack","namespace":"openstack"}, "namespace": "openstack", "name": "openstack", "reconcileID": "04db0b63-4c08-44c0-b2b3-74594b6de117", "service": "telemetry"}
2024-02-28T12:28:52Z	ERROR	Controllers.OpenStackDataPlaneDeployment	OpenStackDeployment error for NodeSet openstack	{"controller": "openstackdataplanedeployment", "controllerGroup": "dataplane.openstack.org", "controllerKind": "OpenStackDataPlaneDeployment", "OpenStackDataPlaneDeployment": {"name":"openstack","namespace":"openstack"}, "namespace": "openstack", "name": "openstack", "reconcileID": "04db0b63-4c08-44c0-b2b3-74594b6de117", "ObjectType": "*v1beta1.OpenStackDataPlaneDeployment", "ObjectNamespace": "openstack", "ObjectName": "openstack", "error": "Secret \"ceilometer-compute-config-data\" not found"}
github.com/openstack-k8s-operators/lib-common/modules/common/util.LogErrorForObject
	/opt/app-root/src/go/pkg/mod/github.com/openstack-k8s-operators/lib-common/modules/[email protected]/util/log.go:65
github.com/openstack-k8s-operators/dataplane-operator/controllers.(*OpenStackDataPlaneDeploymentReconciler).Reconcile
	/remote-source/controllers/openstackdataplanedeployment_controller.go:250
sigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller).Reconcile
	/opt/app-root/src/go/pkg/mod/sigs.k8s.io/[email protected]/pkg/internal/controller/controller.go:119
sigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller).reconcileHandler
	/opt/app-root/src/go/pkg/mod/sigs.k8s.io/[email protected]/pkg/internal/controller/controller.go:316
sigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller).processNextWorkItem
	/opt/app-root/src/go/pkg/mod/sigs.k8s.io/[email protected]/pkg/internal/controller/controller.go:266
sigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller).Start.func2.2
	/opt/app-root/src/go/pkg/mod/sigs.k8s.io/[email protected]/pkg/internal/controller/controller.go:227
2024-02-28T12:28:52Z	ERROR	Reconciler error	{"controller": "openstackdataplanedeployment", "controllerGroup": "dataplane.openstack.org", "controllerKind": "OpenStackDataPlaneDeployment", "OpenStackDataPlaneDeployment": {"name":"openstack","namespace":"openstack"}, "namespace": "openstack", "name": "openstack", "reconcileID": "04db0b63-4c08-44c0-b2b3-74594b6de117", "error": "Secret \"ceilometer-compute-config-data\" not found"}
sigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller).reconcileHandler
	/opt/app-root/src/go/pkg/mod/sigs.k8s.io/[email protected]/pkg/internal/controller/controller.go:329
sigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller).processNextWorkItem
	/opt/app-root/src/go/pkg/mod/sigs.k8s.io/[email protected]/pkg/internal/controller/controller.go:266
sigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller).Start.func2.2
	/opt/app-root/src/go/pkg/mod/sigs.k8s.io/[email protected]/pkg/internal/controller/controller.go:227

https://logserver.rdoproject.org/90/290/3e605b69055ff26859fc02972fb4bf28322dc055/github-check/data-plane-adoption-osp-17-to-extracted-crc/526e820/controller/ci-framework-data/logs/openstack-k8s-operators-openstack-must-gather/namespaces/openstack-operators/pods/dataplane-operator-controller-manager-7797bdf97b-grjq9/logs/manager.log

@yadneshk yadneshk force-pushed the telemetry_adoption branch 2 times, most recently from ddaa397 to f8189c8 Compare March 1, 2024 05:46
@yadneshk
Copy link
Contributor Author

yadneshk commented Mar 1, 2024

Depends-On: openstack-k8s-operators/install_yamls#723

Copy link

Build failed (check pipeline). Post recheck (without leading slash)
to rerun all jobs. Make sure the failure cause has been resolved before
you rerun jobs.

https://review.rdoproject.org/zuul/buildset/ad4d7e9611a141ddb161b8ce30a5974b

data-plane-adoption-osp-17-to-extracted-crc FAILURE in 2h 09m 08s
✔️ adoption-docs-preview SUCCESS in 1m 46s

@yadneshk yadneshk force-pushed the telemetry_adoption branch from f8189c8 to 7b4482f Compare March 27, 2024 06:50
Copy link

Build failed (check pipeline). Post recheck (without leading slash)
to rerun all jobs. Make sure the failure cause has been resolved before
you rerun jobs.

https://review.rdoproject.org/zuul/buildset/9e64ee89c514497592a880151bb427a7

data-plane-adoption-osp-17-to-extracted-crc FAILURE in 2h 02m 08s
data-plane-adoption-osp-17-to-extracted-crc-minimal-no-ceph FAILURE in 2h 12m 40s
✔️ adoption-docs-preview SUCCESS in 1m 18s

@yadneshk yadneshk force-pushed the telemetry_adoption branch from 03fbe62 to 782225d Compare July 18, 2024 06:33
Copy link

Build failed (check pipeline). Post recheck (without leading slash)
to rerun all jobs. Make sure the failure cause has been resolved before
you rerun jobs.

https://review.rdoproject.org/zuul/buildset/7615f8d6aaaf42339a0304fc4f32ea93

data-plane-adoption-osp-17-to-extracted-crc TIMED_OUT in 3h 16m 45s
data-plane-adoption-osp-17-to-extracted-crc-minimal-no-ceph TIMED_OUT in 3h 15m 40s
✔️ adoption-docs-preview SUCCESS in 1m 12s

@yadneshk
Copy link
Contributor Author

recheck

Copy link

Build failed (check pipeline). Post recheck (without leading slash)
to rerun all jobs. Make sure the failure cause has been resolved before
you rerun jobs.

https://review.rdoproject.org/zuul/buildset/f5ce87e3fa6e4e009db4abb891154f44

✔️ data-plane-adoption-osp-17-to-extracted-crc SUCCESS in 2h 43m 51s
data-plane-adoption-osp-17-to-extracted-crc-minimal-no-ceph FAILURE in 2h 12m 23s
✔️ adoption-docs-preview SUCCESS in 1m 16s

@yadneshk
Copy link
Contributor Author

recheck

Copy link

Build failed (check pipeline). Post recheck (without leading slash)
to rerun all jobs. Make sure the failure cause has been resolved before
you rerun jobs.

https://review.rdoproject.org/zuul/buildset/9f81d455583b4a53aa4180b0a3b62955

data-plane-adoption-osp-17-to-extracted-crc RETRY_LIMIT in 45m 47s
data-plane-adoption-osp-17-to-extracted-crc-minimal-no-ceph RETRY_LIMIT in 45m 55s
✔️ adoption-docs-preview SUCCESS in 1m 17s

@yadneshk
Copy link
Contributor Author

recheck

Copy link

Build failed (check pipeline). Post recheck (without leading slash)
to rerun all jobs. Make sure the failure cause has been resolved before
you rerun jobs.

https://review.rdoproject.org/zuul/buildset/517cedf63b0a4bb69a7ca45028aa411c

data-plane-adoption-osp-17-to-extracted-crc TIMED_OUT in 3h 15m 11s
data-plane-adoption-osp-17-to-extracted-crc-minimal-no-ceph FAILURE in 2h 33m 29s
✔️ adoption-docs-preview SUCCESS in 1m 16s

@yadneshk yadneshk force-pushed the telemetry_adoption branch from 782225d to 78fc67d Compare July 22, 2024 04:58
Copy link

Merge Failed.

This change or one of its cross-repo dependencies was unable to be automatically merged with the current state of its repository. Please rebase the change and upload a new patchset.
Warning:
Error merging github.com/openstack-k8s-operators/data-plane-adoption for 290,78fc67d9fa50636c8c3b83283fb248deac37989f

@yadneshk yadneshk force-pushed the telemetry_adoption branch from 78fc67d to 6dd08f5 Compare July 22, 2024 05:13
Copy link

Build failed (check pipeline). Post recheck (without leading slash)
to rerun all jobs. Make sure the failure cause has been resolved before
you rerun jobs.

https://review.rdoproject.org/zuul/buildset/f212155abcf64ac1ace792bc9aca616f

data-plane-adoption-osp-17-to-extracted-crc TIMED_OUT in 3h 16m 00s
data-plane-adoption-osp-17-to-extracted-crc-minimal-no-ceph TIMED_OUT in 3h 15m 25s
✔️ adoption-docs-preview SUCCESS in 1m 16s

@yadneshk
Copy link
Contributor Author

recheck

openstack-k8s-operators/telemetry-operator#433 merged

Copy link

Build failed (check pipeline). Post recheck (without leading slash)
to rerun all jobs. Make sure the failure cause has been resolved before
you rerun jobs.

https://review.rdoproject.org/zuul/buildset/157888b4e6f84c86ad4cd8e9bdad7ff7

data-plane-adoption-osp-17-to-extracted-crc FAILURE in 2h 16m 39s
data-plane-adoption-osp-17-to-extracted-crc-minimal-no-ceph TIMED_OUT in 3h 15m 50s
✔️ adoption-docs-preview SUCCESS in 1m 17s

@yadneshk
Copy link
Contributor Author

recheck

seems openstack-k8s-operators/openstack-operator#959 was needed too

template:
logging:
enabled: false
network: internalapi
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just realized now due to an bug that this field does not exist anymore. It needs to be removed from the guide.

@yadneshk yadneshk force-pushed the telemetry_adoption branch from 6dd08f5 to e7c59e2 Compare July 23, 2024 09:14
@jlarriba
Copy link
Contributor

/lgtm

@yadneshk yadneshk force-pushed the telemetry_adoption branch from 3909121 to b93bf29 Compare July 25, 2024 05:11
@jistr jistr merged commit 9507910 into openstack-k8s-operators:main Jul 29, 2024
3 checks passed
@yadneshk yadneshk deleted the telemetry_adoption branch August 12, 2024 04:51
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants