Do you have a plan for how to keep your business running as usual in case of a disaster strike? If not, it’s high time you had one. Data corruption, the growing number of cyberattacks, natural disasters and failed components are some of the reasons to lay down a business continuity strategy. One part of that plan will be managing the seamless disaster recovery failover and failback between your DR copy and your primary workload.
In this article, we will explain how you can use the cloud to store as your disaster recovery site, and how with the help of NetApp’s Cloud Volumes ONTAP you can manage a seamless disaster recovery failover and failback process for DR.
Establishing a disaster recovery (DR) site is a mandatory ask, but the next important question to answer is whether the DR site should be hosted in the cloud or on-premises. Additionally, what if you are running a multicloud environment, where some of the workloads are hosted on AWS and others on Azure? Where should the DR site be located in that scenario?
A cloud-based disaster recovery site built on AWS disaster recovery or Azure disaster recovery resources has multiple advantages compared to an on-premises DR setup. Let’s take a look at a couple of them below:
Economics: You need NOT invest upfront in constructing the building, buying storage, racks, servers, network appliances, HVAC, and human resources. Using a cloud-based infrastructure saves money by allowing you to leverage a pay-per-use model and get your DR copy running in much less time.
But for cloud-based DR, NetApp offers something even more effective for DR: Cloud Volumes ONTAP®, a virtual appliance running in the cloud, which provides a single pane of glass for your hybrid cloud management. It is a simple, secure, and enterprise-ready DR solution which offers multiple advantages, with seamless disaster recovery failover and failback being one of the most important.
Cloud Volumes ONTAP uses a NetApp feature called SnapMirror® for data replication. SnapMirror replicates the data between the primary site and DR site using ONTAP snapshot technology: the first snapshot is taken to synchronize the data between the two sites, and subsequent snapshots keep the DR site up to date incrementally, only syncing over the changes that happen at primary site. These syncs are cost effective thanks to Cloud Volumes ONTAP’s powerful storage efficiencies.
While it is possible to initiate a SnapMirror manually, it typically works in an automated way: you schedule the data synchronization through a user-friendly wizard, defining your preferred time and frequency for data replication, and rest is taken care of. It operates in an asynchronous mode, where snapshots are replicated to the DR site at a predefined interval.
In the event of a disaster, you can failover your operations to your DR systems assured that your data is ready and available on your NetApp Cloud Volumes ONTAP. SnapMirror relationships can be broken using the OnCommand Cloud Manager, System Manager, CLI, or API, immediately making the DR secondary copy read/write ready.
Once that disaster has been addressed, you can failback your operation to the primary site. SnapMirror makes the resynchronization process to failback to the primary or alternate site a simple process. Both failover and failback processes can be started with the help of a few clicks in the GUI, or automated through the use of the APIs.
There are multiple scenarios for initiating the failback process:
A DR failover scenario is always a dangerous situation for a company to be in. Cloud Volumes ONTAP offers you a way to keep your data protected in such events, and additional features such as storage tiering, compression, and data deduplication that can reduce the storage cost by more than 50%.
If you are already using ONTAP, this is the right time to move your disaster recovery site to Cloud Volumes ONTAP and make it an integral part of your Azure disaster recovery plan or AWS disaster recovery plan. Since the user interface is the same, there’s no need to train your admins to use a new set of tools and processes, which can save time and costs, specifically the cost of disaster recovery.