SB5.1 - OCI - Inconsistency between Core/CassandraBackup and DP/CassandraBackup #371
-
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 3 replies
-
Hi @heungheung In sunbird we run two separate cassandra clusters. One for Core module and the other for Data Pipeline module. In the installation docs we tell to use only one cassandra cluster (single node) to save cost. But that is not recommended for production use. In our case the Core cassandra data (600GB+ per node) and server size(7-14 nodes) is quite big. So we take a snapshot and directly upload the snapshot folder to storage account. This saves us time and additional disk required to gz it. Azcopy tool is quite fast to upload the entire directory to storage. During restore also its quite easy to restore this data back just by placing the folders into respective directories and running a few commands. This restore approach works only if the number of nodes are same (backup cluster nodes = restore cluster nodes) as we use the token ring approach to match the data ownership on each node. The restore steps are mentioned here - project-sunbird/sunbird-devops#2443 The DP cassandra is a single node cluster and also has a small data footprint. So we take snapshot and store it as a gz file. We can use the Core cassandra backup approach to the DP cassandra also if data or nodes grow in size. But we haven't felt the need for it till date. Hope this answers your query. |
Beta Was this translation helpful? Give feedback.
Hi @heungheung
In sunbird we run two separate cassandra clusters. One for Core module and the other for Data Pipeline module. In the installation docs we tell to use only one cassandra cluster (single node) to save cost. But that is not recommended for production use.
In our case the Core cassandra data (600GB+ per node) and server size(7-14 nodes) is quite big. So we take a snapshot and directly upload the snapshot folder to storage account. This saves us time and additional disk required to gz it. Azcopy tool is quite fast to upload the entire directory to storage.
During restore also its quite easy to restore this data back just by placing the folders into respective directories and ru…