As more and more applications migrate to the cloud, latency becomes an important benchmark for application performance.
When it comes to highly transactional applications with stringent performance requirements, storage latency can have a negative impact that will be felt all the way down to the end users. In Google Cloud, NetApp has a new way to help you avoid that.
In this blog we’ll cover the new support in Cloud Volumes ONTAP for Google Cloud for NVMe caching through Flash Cache® intelligent caching. Coupled with the high write speed option in Cloud Volumes ONTAP, it gives Google Cloud users a powerful tool to reduce overall storage latency.
A storage system's response time to a read or write request is referred to as its latency. If the time it takes to complete the request, i.e., the latency, is higher than the expected response time, it’s going to have a negative impact on the people and applications that depend on that storage.
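To make the definition concrete, here is a minimal Python sketch that times individual read requests and compares them against an expected response time. The function names, the 4 KiB block size, and the 1 ms expectation are illustrative assumptions, not part of any NetApp or Google Cloud tooling. Reads of a freshly written local file are usually served from the operating system's page cache, so the numbers only illustrate the method and should not be treated as a storage benchmark.

```python
import os
import statistics
import tempfile
import time


def measure_read_latencies(path, block_size=4096, samples=200):
    """Time individual block reads from `path`; returns latencies in milliseconds."""
    latencies_ms = []
    blocks = max(os.path.getsize(path) // block_size, 1)
    # buffering=0 avoids Python-level buffering; the OS page cache still applies.
    with open(path, "rb", buffering=0) as f:
        for i in range(samples):
            offset = (i % blocks) * block_size
            start = time.perf_counter()
            f.seek(offset)
            f.read(block_size)
            latencies_ms.append((time.perf_counter() - start) * 1000.0)
    return latencies_ms


def summarize(latencies_ms, expected_ms):
    """Return the median, the 99th percentile, and the share of slow requests."""
    ordered = sorted(latencies_ms)
    p99_index = min(len(ordered) - 1, int(0.99 * len(ordered)))
    slow = sum(1 for value in ordered if value > expected_ms)
    return {
        "median_ms": statistics.median(ordered),
        "p99_ms": ordered[p99_index],
        "slow_fraction": slow / len(ordered),
    }


if __name__ == "__main__":
    # Hypothetical 1 MiB test file and a hypothetical 1 ms expected response time.
    with tempfile.NamedTemporaryFile(delete=False) as tmp:
        tmp.write(os.urandom(1024 * 1024))
    try:
        print(summarize(measure_read_latencies(tmp.name), expected_ms=1.0))
    finally:
        os.remove(tmp.name)
```

Reporting a high percentile alongside the median is a deliberate choice: a small share of slow requests is often what users notice first, and an average alone can hide it.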
There are several factors in cloud that could lead to increased latency:
Cloud providers offer managed storage options and take care of aspects such as networking, routing, and infrastructure, so the main lever customers have for reducing overall storage latency is the specific configuration they choose.
Depending on the choices made for compute, storage, and network settings, Google Cloud-specific configurations may increase or decrease latency. Let’s take a look at some of the considerations that customers should be aware of.
For Google Cloud users, there’s a new way to reduce the effects of latency in your operations: Cloud Volumes ONTAP now supports NVMe caching on Google Cloud.
Cloud Volumes ONTAP is an enterprise-grade solution from NetApp that delivers advanced storage management capabilities on all the leading cloud platforms: AWS, Azure, and Google Cloud. AWS and Azure users have benefited from NVMe caching for some time, and Cloud Volumes ONTAP on Google Cloud can now take advantage of this feature as well.
Flash Cache offers a simple and straightforward solution for Cloud Volumes ONTAP customers to reduce Google Cloud Platform latency, without compromising storage efficiency. Flash Cache uses a local NVMe storage module to improve storage latencies and deliver better performance. NVMe intelligent caching enables real-time caching of recently accessed data and metadata to improve read performance. Read-intensive workloads such as databases, file services, LOB applications, and email can greatly benefit from this latency reduction.
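The effect of caching recently accessed data on read latency can be illustrated with a small simulation. This is only a sketch: it uses a plain least-recently-used (LRU) policy as a stand-in, which is not a description of how Flash Cache actually decides what to keep, and the `ReadCache` class, its capacity, and the latency figures (0.1 ms for a cache hit, 2.0 ms for a read from the backing disk) are hypothetical values chosen for illustration.

```python
from collections import OrderedDict


class ReadCache:
    """A small LRU cache sitting in front of a slower backing store (simulation only)."""

    def __init__(self, capacity, cache_ms=0.1, disk_ms=2.0):
        self.capacity = capacity
        self.cache_ms = cache_ms  # assumed latency of a cache hit
        self.disk_ms = disk_ms    # assumed latency of a read from the backing disk
        self.blocks = OrderedDict()
        self.hits = 0
        self.misses = 0

    def read(self, block_id):
        """Return the simulated latency in milliseconds for reading one block."""
        if block_id in self.blocks:
            # Hit: mark the block as most recently used.
            self.blocks.move_to_end(block_id)
            self.hits += 1
            return self.cache_ms
        # Miss: fetch from the backing store and keep a copy in the cache.
        self.misses += 1
        self.blocks[block_id] = True
        if len(self.blocks) > self.capacity:
            # Evict the least recently used block.
            self.blocks.popitem(last=False)
        return self.disk_ms


def average_latency(cache, workload):
    """Average simulated read latency in milliseconds over a sequence of block IDs."""
    return sum(cache.read(block_id) for block_id in workload) / len(workload)


if __name__ == "__main__":
    # A read-heavy workload that keeps returning to the same 50 "hot" blocks.
    workload = [i % 50 for i in range(5000)]
    cached = ReadCache(capacity=100)
    uncached = ReadCache(capacity=0)  # capacity 0 behaves like having no cache
    print("with cache:   ", average_latency(cached, workload), "ms")
    print("without cache:", average_latency(uncached, workload), "ms")
    print("hit ratio:    ", cached.hits / (cached.hits + cached.misses))
```

The simulation shows why read-intensive workloads with a recurring working set benefit most: once the hot blocks are cached, nearly every read is served at cache latency instead of disk latency.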
In addition to NVMe caching, customers can accelerate application performance by enabling the high write speed setting for Cloud Volumes ONTAP.
You can choose either normal or high write speed when configuring Cloud Volumes ONTAP:
Starting with version 9.13.0, Cloud Volumes ONTAP read/write performance is accelerated by Flash Cache and the high write speed option. The following GCP instance types deployed as HA pairs can take advantage of Flash Cache and high write speed:
Flash Cache and high write speed will both be enabled on the supported instances in this release. An upcoming release will include the capability to activate each of these features independently.
Storage latency is a major pain point for cloud administrators. Because many factors can combine to increase latency, it can be difficult to pinpoint and eliminate specific bottlenecks. With Cloud Volumes ONTAP and Flash Cache NVMe intelligent caching, you can address latency concerns to a great extent.
For more, check out this customer case study about using NVMe intelligent caching to speed up the high-demand design process in the EDA vertical.
Latency in Google Cloud is the time taken for applications to provide a meaningful response to user requests. Choosing the right storage type and optimizing the overall application architecture will help reduce latency and improve performance.
Google Cloud provides a latency dashboard that tracks latency metrics for different traffic types. Using this dashboard, you can view the latency metrics between regions where the VMs are hosted and user devices accessing them from different geographies. In addition to Google Cloud latency between regions, it also provides visibility into Google Cloud latency between zones. There are also third-party tools that perform Google Cloud latency tests.
Both AWS and GCP have multiple data centers around the globe with low-latency connectivity between them, and both can be used for hosting applications with low-latency and high-performance requirements. The performance and latency on each platform depend on the architecture and component-specific configurations. Hence, it is difficult to say definitively whether one platform is faster than the other.