You are looking at the documentation of a prior release. To read the documentation of the latest release, please
visit here.
How Stash Backs Up and Restores a MongoDB Database
Stash 0.9.0+ supports backup and restore operations for many databases. This guide gives an overview of how the MongoDB backup and restore process works in Stash.
How Backup Works
The following diagram shows how Stash takes backup of a MongoDB database. Open the image in a new tab to see the enlarged version.
The backup process consists of the following steps:
- At first, a user creates a secret with the access credentials of the backend where the backed up data will be stored.
- Then, she creates a Repository crd that specifies the backend information along with the secret that holds the credentials to access the backend.
- Then, she creates a BackupConfiguration crd targeting the AppBinding crd of the desired database. The BackupConfiguration object also specifies the Task to use to back up the database.
- Stash operator watches for BackupConfiguration crd.
- Once Stash operator finds a BackupConfiguration crd, it creates a CronJob with the schedule specified in the BackupConfiguration object to trigger backup periodically.
- On the next scheduled slot, the CronJob triggers a backup by creating a BackupSession crd.
- Stash operator also watches for BackupSession crd.
- When it finds a BackupSession object, it resolves the respective Task and Function and prepares a Job definition for the backup.
- Then, it creates the Job to back up the targeted database.
- The backup Job reads the information it needs to connect to the database from the AppBinding crd. It also reads backend information and access credentials from the Repository crd and the Storage Secret respectively.
- Then, the Job dumps the targeted database and uploads the output to the backend. Stash pipes the output of the dump command to the uploading process. Hence, the backup Job does not require a large volume to hold the entire dump output.
- Finally, when the backup is complete, the Job sends Prometheus metrics to the Pushgateway running inside the Stash operator pod. It also updates the BackupSession and Repository status to reflect the backup procedure.
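The piping described in the dump-and-upload step can be illustrated with a minimal Python sketch. It is not Stash's implementation: the producer and consumer below are stand-in Python one-liners (hypothetical placeholders for a dump command and an uploader), and the function name stream is an assumption made for illustration. The point is only that the data flows through a pipe in chunks and is never written to disk in full.

```python
import subprocess
import sys

# Stand-in for a dump command: writes 1,000,000 bytes to stdout.
PRODUCER = "import sys; sys.stdout.write('x' * 1000000)"

# Stand-in for an uploader: consumes stdin in 64 KiB chunks and reports the total.
CONSUMER = """
import sys
total = 0
while True:
    chunk = sys.stdin.buffer.read(65536)
    if not chunk:
        break
    total += len(chunk)
print(total)
"""


def stream(producer_cmd, consumer_cmd):
    """Connect producer stdout to consumer stdin and return the consumer's output."""
    producer = subprocess.Popen(producer_cmd, stdout=subprocess.PIPE)
    consumer = subprocess.Popen(
        consumer_cmd, stdin=producer.stdout, stdout=subprocess.PIPE
    )
    # Drop our copy of the pipe so the producer notices if the consumer exits early.
    producer.stdout.close()
    out, _ = consumer.communicate()
    if producer.wait() != 0 or consumer.returncode != 0:
        raise RuntimeError("dump or upload stand-in failed")
    return out


if __name__ == "__main__":
    result = stream([sys.executable, "-c", PRODUCER], [sys.executable, "-c", CONSUMER])
    print(result.decode().strip())
```

Because neither side buffers the whole stream, the peak storage needed is bounded by the pipe and chunk size rather than by the size of the dump.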
Backup Different MongoDB Configurations
This section will show you how backup works for different MongoDB configurations.
Standalone MongoDB
For a standalone MongoDB database, the backup job dumps the database directly using mongodump and pipes the output to the backup process.
MongoDB ReplicaSet Cluster
For a MongoDB ReplicaSet cluster, Stash takes the backup from one of the secondary replicas. The backup process consists of the following steps:
- Identify a secondary replica.
- Lock the secondary replica.
- Back up the secondary replica.
- Unlock the secondary replica.
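The four steps above can be sketched in Python as follows. This is an illustration under stated assumptions, not Stash's code: the member dictionaries mirror the name and stateStr fields that MongoDB's replSetGetStatus command reports, while lock, unlock and dump are hypothetical callables supplied by the caller (the source does not say which locking mechanism Stash uses; MongoDB's fsyncLock and fsyncUnlock are one possibility).

```python
from contextlib import contextmanager


def pick_secondary(members):
    """Return the name of the first member in SECONDARY state, or None."""
    for member in members:
        if member.get("stateStr") == "SECONDARY":
            return member["name"]
    return None


@contextmanager
def locked(host, lock, unlock):
    """Lock a replica for the duration of the block and always unlock it."""
    lock(host)
    try:
        yield host
    finally:
        unlock(host)


def backup_replica_set(members, lock, unlock, dump):
    """Identify a secondary, lock it, back it up, then unlock it."""
    host = pick_secondary(members)
    if host is None:
        raise RuntimeError("no secondary replica found")
    with locked(host, lock, unlock):
        dump(host)
    return host
```

Wrapping the lock in a context manager means the unlock step still runs if the dump fails, so a failed backup does not leave the secondary locked.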
MongoDB Sharded Cluster
For a MongoDB sharded cluster, Stash backs up the individual shards as well as the config server. It takes the backup from a secondary replica of each shard and of the config server; if there is no secondary replica, it takes the backup from the primary replica. The backup process consists of the following steps:
- Disable balancer.
- Lock config server.
- Identify a secondary replica for each shard.
- Lock the secondary replica.
- Run backup on the secondary replica.
- Unlock the secondary replica.
- Unlock config server.
- Enable balancer.
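The ordering of these steps, with each "undo" step mirroring its "do" step, can be sketched with nested context managers. The sketch only records step names in a list; it does not talk to MongoDB. It assumes shards are processed one at a time, which the list above does not specify, and it omits the config server dump because the listed steps do not say where it happens. The names step and backup_sharded_cluster are hypothetical.

```python
from contextlib import contextmanager


@contextmanager
def step(log, enter_name, exit_name):
    """Record an entry step now and its matching exit step when the block ends."""
    log.append(enter_name)
    try:
        yield
    finally:
        log.append(exit_name)


def backup_sharded_cluster(shards, log):
    """shards maps a shard name to the replica chosen for its backup."""
    with step(log, "disable balancer", "enable balancer"):
        with step(log, "lock config server", "unlock config server"):
            for shard, replica in shards.items():
                with step(log, f"lock {replica}", f"unlock {replica}"):
                    log.append(f"backup {shard} from {replica}")
    return log


if __name__ == "__main__":
    for entry in backup_sharded_cluster({"shard0": "shard0-1", "shard1": "shard1-2"}, []):
        print(entry)
```

Nesting guarantees that the config server is unlocked and the balancer re-enabled in reverse order, even when a shard backup raises an error.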
How Restore Process Works
The following diagram shows how Stash restores backed up data into a MongoDB database. Open the image in a new tab to see the enlarged version.
The restore process consists of the following steps:
- At first, a user creates a RestoreSession crd targeting the AppBinding of the desired database where the backed up data will be restored. It also specifies the Repository crd which holds the backend information and the Task to use to restore the target.
- Stash operator watches for RestoreSession objects.
- Once it finds a RestoreSession object, it resolves the respective Task and Function and prepares a Job definition for the restore.
- Then, it creates the Job to restore the target.
- The Job reads the information it needs to connect to the database from the respective AppBinding crd. It also reads backend information and access credentials from the Repository crd and the Storage Secret respectively.
- Then, the Job downloads the backed up data from the backend and injects it into the desired database. Stash pipes the downloaded data to the respective database tool, which injects it into the database. Hence, the restore Job does not require a large volume to hold the entire backup data.
- Finally, when the restore process is complete, the Job sends Prometheus metrics to the Pushgateway and updates the RestoreSession status to reflect restore completion.
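To make the first step concrete, here is a hedged Python sketch that builds a RestoreSession manifest as a dictionary and prints it as JSON. The field layout (task, repository, target.ref, rules) reflects the stash.appscode.com/v1beta1 API as best recalled and should be checked against the CRD reference for your release. Every object name in the example (sample-restore, demo, mongodb-restore-task, sample-repo, sample-mongodb) is a hypothetical placeholder, and build_restore_session is a helper invented for this illustration.

```python
import json


def build_restore_session(name, namespace, task, repository, app_binding):
    """Assemble a RestoreSession manifest that targets an AppBinding."""
    return {
        "apiVersion": "stash.appscode.com/v1beta1",  # assumed API version
        "kind": "RestoreSession",
        "metadata": {"name": name, "namespace": namespace},
        "spec": {
            # Task that Stash resolves into Functions and a restore Job.
            "task": {"name": task},
            # Repository crd holding the backend information.
            "repository": {"name": repository},
            # AppBinding of the database to restore into.
            "target": {
                "ref": {
                    "apiVersion": "appcatalog.appscode.com/v1alpha1",
                    "kind": "AppBinding",
                    "name": app_binding,
                }
            },
            # Assumed rule: restore the most recent snapshot.
            "rules": [{"snapshots": ["latest"]}],
        },
    }


if __name__ == "__main__":
    manifest = build_restore_session(
        "sample-restore", "demo", "mongodb-restore-task", "sample-repo", "sample-mongodb"
    )
    print(json.dumps(manifest, indent=2))
```

The printed JSON could be reviewed and then applied with your usual Kubernetes tooling once the field names are verified.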
Restoring Different MongoDB Configurations
This section shows how the restore process works for different MongoDB configurations.
Standalone MongoDB
For a standalone MongoDB database, the restore job downloads the backed up data from the backend and pipes it to the mongorestore command, which inserts the data into the desired MongoDB database.
MongoDB ReplicaSet Cluster
For a MongoDB ReplicaSet cluster, Stash identifies the primary replica and restores into it.
MongoDB Sharded Cluster
For a MongoDB sharded cluster, Stash identifies the primary replica of each shard as well as of the config server and restores the respective backed up data into them.
Next Steps
- Install MongoDB addon for Stash following the guide from here.