Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Added labels for all the SSV Mainnet nodes, updated gitlab-ci file #1793

Open
wants to merge 34 commits into
base: main
Choose a base branch
from

Conversation

systemblox
Copy link
Contributor

No description provided.

Lior Rutenberg and others added 30 commits December 6, 2023 09:27
v1.2.1: Permission Fork Schedule for Mainnet
* fix: stale operator ID in `p2pNetwork`

---------

Co-authored-by: Lior Rutenberg <[email protected]>
* fix: stale operator ID in `p2pNetwork` (#1229)

* fix: stale operator ID in `p2pNetwork`

---------

Co-authored-by: Lior Rutenberg <[email protected]>

* fix: sufficient timeout for initial duty fetch (#1214)

* fix: sufficient timeout for initial duty fetch (#1214)
---------

Co-authored-by: Matus Kysel <[email protected]>

* deployment: change bootnode for holesky-stage (#1233)

* deploy new bootnode ENR for holesky stage

* Voluntary exit (#1200)

* Voluntary exit

---------

Co-authored-by: moshe-blox <[email protected]>
Co-authored-by: olegshmuelov <[email protected]>
Co-authored-by: MatheusFranco99 <[email protected]>

* feat: include 'connected' in peer scores log (#1224)

* feat: include 'connected' in peer scores log

* P2P metrics  (#1163)

* metrics: added p4 score (invalidMessageDeliveries)

* metrics: added total msgs accepted, msgs accepted from peer with Id, signatures verification

* deploy to the stage

# Conflicts:
#	.gitlab-ci.yml

* metrics: added counters flush once in 8h

* metrics: added counters for duties

* Added auto delete disconnected peers label metrics. Added RSA verifications metric

* disabled ci

* Updated p4 score metric update

* metrics: added p4 score (invalidMessageDeliveries)

* metrics: added total msgs accepted, msgs accepted from peer with Id, signatures verification

* deploy to the stage

# Conflicts:
#	.gitlab-ci.yml

* metrics: added counters flush once in 8h

* metrics: added counters for duties

* Added auto delete disconnected peers label metrics. Added RSA verifications metric

* disabled ci

* Updated p4 score metric update

* review comments fixes

* deploy

* deploy

* testing metrics differ

* added duties created and finalized

* removed extra monitoring.metricsreporter usage

* removed topic from p4score. made it a sum of squares

* deploy to all nodes

* deploy p2p_metrics

* pr review fixes

* trigger ci

* deploy 9-20

* Fix msg validation rsa verification counter metric

* allocate p2p_metrics

* inspect scores more frequently, but don't log every time

* renamed duties created/finalized metrics and refactored to record role

* deploy to 5--8

* deploy to 5--8

* rename metric

* blank space

* approve spec change (just a metric)

* refactors

* reset scores

* metric help msg

* revert gitlab

* deploy

* revert deploy

* Add metrics for signature verifications

* Update differ.config.yaml with approved changes

---------

Co-authored-by: Anton Korpusenko <[email protected]>
Co-authored-by: Gal Rogozinski <[email protected]>
Co-authored-by: Matus Kysel <[email protected]>
Co-authored-by: MatheusFranco99 <[email protected]>
Co-authored-by: moshe-blox <[email protected]>

* Node health (#1203)

* chore: link to SSV API docs in configs & README

* initial commit

* add node health route to ssv API

* update health route

* update health route

* update health route

* update health route

* deploy to stage

* add plaintext response

* lint

* lint

* change to good/bad

* lint

* lint

* refactor

* Revert "deploy to stage"

This reverts commit 2f54f4e.

* lint

* add inbound/outbound count for health + deploy to stage

* change ports back

* update count

* lint

* update conns

* lint

* remove connected peer count

* test blocked ports

* Revert "remove connected peer count"

This reverts commit 79e2b94.

* leave only active peers count

* Revert "test blocked ports"

This reverts commit 6fc9282.

* ci to stage

* add mutex to nodes access

* refactor: node health API (#1222)

* refactor: node health API

* added cpu_cores to healthcheck output

* fix inbound/outbound stats

* Remove CPU core reporting

---------

Co-authored-by: moshe-blox <[email protected]>
Co-authored-by: Matus Kysel <[email protected]>
Co-authored-by: moshe-blox <[email protected]>

* feat: subscribe to a random subnet with 0 validators (#1245)

* feat: subscribe to a random subnet with 0 validators

* Fix: health route host addres (#1246)

* node health api route advertises the host addresses from the config.

* set up listenaddress directly

* feat: rate limit inbound connections by IP (#1226)

* feat: rate limit inbound connections by IP

* activate conngater

* deploy to 5--8

* fix

* fix

* Refactor connection gating in p2p setup

* Update ipLimiter parameters

* revert gitlab

* Revert "revert gitlab"

This reverts commit fcc7902.

* Revert "Revert "revert gitlab""

This reverts commit feb9e4e.

---------

Co-authored-by: Gal Rogozinski <[email protected]>

---------

Co-authored-by: moshe-blox <[email protected]>
Co-authored-by: Lior Rutenberg <[email protected]>
Co-authored-by: Matus Kysel <[email protected]>
Co-authored-by: Nikita Kryuchkov <[email protected]>
Co-authored-by: moshe-blox <[email protected]>
Co-authored-by: olegshmuelov <[email protected]>
Co-authored-by: MatheusFranco99 <[email protected]>
Co-authored-by: Anton Korpusenko <[email protected]>
Co-authored-by: Anton Korpusenko <[email protected]>
Co-authored-by: Gal Rogozinski <[email protected]>
Co-authored-by: Pavel Krolevets <[email protected]>
v1.2.2 (non-squash merge)
Include git commit/version in Docker build (#1231)
fix: (MessageValidation) ignore missing sync duties
fix: (Dockerfile) dnsutils installation
* try 1ms discovery interval on mainnet node 1

* deploy to mainnet 2

* deploy to mainnet 3

* deploy to mainnet 3

* Revert "deploy to mainnet 3"

This reverts commit 38a1746.

* Revert "deploy to mainnet 3"

This reverts commit 4d9e8dc.

* deploy different interval mechanism to node 1

* add domain type filtering (mainnet node 1)

* 100ms discovery interval

* comment

* deploy to node 2

* deploy to mainnet 3 & 4

* revert deployments

* deploy to holesky prod

* revert deployments
* fix: refetch duties after metdata update (`main`)
* new metadata refresh mechanism
* enable custom domain support

* update domain type while loading shares

* print self address

* new address format

* fix unit test

* parse trusted peers in p2p.New

---------

Co-authored-by: Lior Rutenberg <[email protected]>
Co-authored-by: rehs0y <[email protected]>
* Setup exporter-2 for mainnet on prod

* Fixed typo

---------

Co-authored-by: stoyan.peev <[email protected]>
* add option to supply custom graffiti through env file
* v3 proposals
Co-authored-by: rehs0y <[email protected]>

---------

Co-authored-by: guy muroch <[email protected]>
Co-authored-by: moshe-blox <[email protected]>
Co-authored-by: moshe-blox <[email protected]>
Co-authored-by: rehs0y <[email protected]>
* change bloxstaking image to ssv

* change bloxstaking ref to ssv

* change blox to ssv

* Update README.md

Co-authored-by: rehs0y <[email protected]>

* change docker back to bloxstaking

---------

Co-authored-by: guy muroch <[email protected]>
Co-authored-by: moshe-blox <[email protected]>
Co-authored-by: rehs0y <[email protected]>
* add full-node-exporter-3

* add gitlab pipeline changes
@@ -150,7 +69,7 @@ Deploy nodes to prod:
# +----------------------------+
# | 🔴 Deploy SSV Mainnet nodes |
# +----------------------------+
# - .k8/production/mainnet/scripts/deploy-cluster-1-4.sh $DOCKER_REPO_INFRA_PROD $CI_COMMIT_SHA ssv $APP_REPLICAS_INFRA_PROD blox-infra-prod kubernetes-admin@blox-infra-prod ssv.network $K8S_API_VERSION $PROD_HEALTH_CHECK_IMAGE $SSV_NODES_CPU_LIMIT_V3 $SSV_NODES_MEM_LIMIT_V3
- .k8/production/mainnet/scripts/deploy-cluster-1-4.sh $DOCKER_REPO_INFRA_PROD $CI_COMMIT_SHA ssv $APP_REPLICAS_INFRA_PROD blox-infra-prod kubernetes-admin@blox-infra-prod ssv.network $K8S_API_VERSION $PROD_HEALTH_CHECK_IMAGE $SSV_NODES_CPU_LIMIT_V3 $SSV_NODES_MEM_LIMIT_V3
Copy link
Contributor

@y0sher y0sher Oct 15, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

not sure we want to leave mainnet deployment by default. deploying all 4 operators at once for sure going to make us miss some duties.
@moshe-blox wdyt?

Copy link
Contributor

@moshe-blox moshe-blox Oct 15, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@y0sher @systemblox yeah we want to avoid any mainnet deployments by default (like today)
mainnet must be manual

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@y0sher @moshe-blox ok. I will update the GitLab-ci file to deploy one operator, and then we can update the remaining nodes one by one.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

8 participants