Google Cloud Database: The Right Service for Your Workloads

Written by Yifat Perry, Technical Content Manager | Feb 28, 2021 5:00:00 AM

Google Cloud Platform (GCP) provides a wide range of computing resources, including database services. GCP offers three types of reference architectures for global data distribution—hybrid, multicloud, and regional distribution. When choosing a Google database service, you should take these architectures into consideration.

In this post, we’ll explain data distribution in GCP, and provide an overview of popular Google cloud database services, including key considerations when assessing and choosing a service. We’ll also show how NetApp Cloud Volumes ONTAP can help centralize and simplify the management of Google cloud database resources.

This is part of an extensive series of guides about managed services.

In this article, you will learn:

Deploying Databases on Google Cloud: Single, Hybrid, and Multicloud
Top 7 Google cloud database services
How to choose a Google cloud database
Google cloud database management with Cloud Volumes ONTAP

Deploying Databases on Google Cloud: Single Cloud, Hybrid, and Multicloud Deployment

Google Cloud Platform (GCP) supports three primary deployment models: single cloud, hybrid, and multicloud.

Single Cloud Deployment

The simplest deployment model is to deploy databases on Google Cloud only, via:

Creating new cloud databases on Google Cloud
“Lift and shift” of existing workloads from on-premises systems to the cloud, followed by decommissioning the on-premises database resources

Hybrid Deployment: Google Cloud and On-Premises Resources

Hybrid deployments are useful when you have applications in the cloud that need to access on-premises databases or vice versa. For example, you might perform marketing analytics on-premises and need to access customer databases hosted in the cloud.

There are three primary considerations when deploying a database in a hybrid model, with some data on Google Cloud and some on-premises:

Master database: you need to decide whether your master database is stored on-premises or in the cloud. If you choose the cloud, GCP resources can act as a data hub for on-premises resources. If you choose on-premises, your in-house resources can sync data to the cloud for remote use or backup. Either way, you can maintain mirrored databases that provide failover in case of disaster.
Managed services: managed features such as scalability, redundancy, and automated backups are only available for resources in the cloud. If you run a hybrid application, you may not be able to use managed services for it. For example, a hybrid cloud database cannot fully benefit from these features because its on-premises resources aren’t managed. You can, however, use third-party managed services.
Portability: the type of data store you choose affects the portability of your data. To ensure that data can be transferred reliably, and that configuration and administration are consistent, consider a cross-platform store, such as MySQL. Using homogeneous databases on-premises and in the cloud means that you do not have to reformat data or redesign its schema, so you can transfer it easily as needed.
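The mirrored-database failover mentioned above can be sketched in a few lines. This is a minimal illustration, not a GCP API: the function connect_with_failover and the two connect_* stand-ins are hypothetical names, and a real deployment would plug in actual database driver calls.

```python
from typing import Callable, Sequence, Tuple, TypeVar

T = TypeVar("T")


def connect_with_failover(
    endpoints: Sequence[Tuple[str, Callable[[], T]]],
) -> Tuple[str, T]:
    """Try each (name, connect) pair in order and return the first that works.

    The first entry is treated as the master database; later entries are
    mirrors that are used only when everything before them is unreachable.
    """
    errors = []
    for name, connect in endpoints:
        try:
            return name, connect()
        except ConnectionError as exc:
            # Remember why this endpoint failed, then try the next one.
            errors.append(f"{name}: {exc}")
    raise ConnectionError("all endpoints failed: " + "; ".join(errors))


# Hypothetical stand-ins for real database drivers.
def connect_on_premises() -> str:
    raise ConnectionError("data center unreachable")


def connect_cloud_mirror() -> str:
    return "connection to cloud mirror"


active_name, active_connection = connect_with_failover(
    [
        ("on-premises master", connect_on_premises),
        ("cloud mirror", connect_cloud_mirror),
    ]
)
```

Here the on-premises master is tried first and the cloud mirror takes over when it is unreachable. Swapping the order of the list models the opposite choice, with the cloud as the master.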

The following diagram illustrates an example of a hybrid architecture with Google Cloud and on-premises systems.


Multicloud Deployment: Google Cloud and Other Cloud Providers

Multicloud deployments enable you to combine databases deployed on Google Cloud with database services from other cloud providers. This can help you create multiple fail-safes, more effectively distribute your database, or take advantage of a wider array of proprietary cloud features.

When considering a multicloud deployment you should be aware of the following:

Integration: it is important to ensure that client systems can smoothly access databases, regardless of the cloud they are deployed on. You can use open-source client libraries, such as jclouds (see the JDBC guide), to make databases seamlessly available across clouds.
Database migration: with multiple cloud providers, you may need to migrate data between clouds. To migrate databases into GCP, you will need to use database replication tools or export/import processes. Several Google Cloud migration tools, such as Storage Transfer Service, can help you move data into Google Cloud.
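To make the export/import idea concrete, here is a small sketch that uses Python's built-in sqlite3 module as a stand-in for both the source and target databases. It is an assumption-laden illustration: real migrations into GCP would use the dump and restore tools of the actual database engine, and the table and helper names below are made up for the example.

```python
import sqlite3


def export_database(source: sqlite3.Connection) -> str:
    """Serialize schema and rows as a SQL script (the "export" step)."""
    return "\n".join(source.iterdump())


def import_database(dump: str) -> sqlite3.Connection:
    """Replay an exported SQL script into a fresh database (the "import" step)."""
    target = sqlite3.connect(":memory:")
    target.executescript(dump)
    return target


# Stand-in for a database hosted with another cloud provider.
source_db = sqlite3.connect(":memory:")
source_db.execute("CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT)")
source_db.executemany(
    "INSERT INTO customers (name) VALUES (?)", [("Ada",), ("Grace",)]
)
source_db.commit()

# Export from the source, then import into the target.
migrated_db = import_database(export_database(source_db))
rows = migrated_db.execute(
    "SELECT id, name FROM customers ORDER BY id"
).fetchall()
```

Because both sides speak the same SQL dialect, the dump replays without any reshaping, which is the portability benefit of using homogeneous databases across environments.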

The following diagram illustrates a multicloud deployment involving GCP and another public cloud provider.


Google Cloud Database Services

GCP offers several Google Cloud database services you can choose from. Below is an introduction to each.

Cloud SQL

Cloud SQL is a fully managed, relational Google Cloud database service that is compatible with SQL Server, MySQL, and PostgreSQL. It includes features for automated backups, data replication, and disaster recovery to ensure high availability and resilience. You can integrate this service with Compute Engine, App Engine, BigQuery, and Kubernetes.

Common use cases for Cloud SQL include:

Lift and shift of on-premises SQL databases to the cloud
Large-scale SQL data analytics
Supporting content management system (CMS) data storage and scalability
Managing databases using Infrastructure as Code (IaC)
Development and deployment of containerized applications and microservices

Cloud Spanner

Cloud Spanner is another fully managed, relational Google Cloud database service. It differs from Cloud SQL by focusing on enabling you to combine the benefits of relational structure and non-relational scalability. It provides strong consistency across rows and high-performance operations. It includes features for automatic replication, built-in security, and multi-language support.

Use cases for Cloud Spanner include:

Supply chain management and manufacturing
Financial trading, analysis, and prediction
Logistics and transportation

BigQuery

BigQuery is a fully managed, serverless data warehouse. You can use it to perform data analyses via SQL and query streaming data. This service includes a built-in Data Transfer Service to help you migrate data from on-premises resources, including Teradata.

BigQuery includes features for machine learning, business intelligence, and geospatial analysis. These features are provided through BigQuery ML, BI Engine, and GIS.

Use cases for BigQuery include:

Process analytics and optimization
Big data processing and analytics
Machine learning-based behavioral analytics and predictions
Data warehouse modernization

Cloud Bigtable

Cloud Bigtable is a fully managed NoSQL Google Cloud database service. It is designed for large operational and analytics workloads. Cloud Bigtable includes features for high availability, zero-downtime configuration changes, and sub-10ms latency. You can integrate it with a variety of tools, including Apache Hadoop, TensorFlow, and Google Cloud services like BigQuery.

Use cases for Cloud Bigtable include:

Financial analysis and prediction
Internet of things (IoT) data ingestion, processing, and analytics
Marketing applications, including hyper personalization

Cloud Firestore

Cloud Firestore is a fully managed, serverless NoSQL Google Cloud database designed for the development of serverless apps. You can use it to store, sync, and query data for web, mobile, and IoT applications. It includes features for offline support, live synchronization, and built-in security. You can integrate Firestore with Firebase, GCP’s mobile development platform, for easier app creation and management.

Use cases for Cloud Firestore include:

Mobile and web applications with both online and offline capabilities
Multi-user, collaborative applications
Real-time analytics
Social media applications
Gaming forums and leaderboards
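The live synchronization described above follows a listener pattern: clients register interest in a document and are notified whenever it changes. The sketch below imitates that pattern with an in-memory class. It is not the Firestore client API; InMemoryDocumentStore and its methods are hypothetical names used only to show the flow.

```python
from typing import Callable, Dict, List

Listener = Callable[[str, dict], None]


class InMemoryDocumentStore:
    """Toy document store that pushes every write to registered listeners."""

    def __init__(self) -> None:
        self._documents: Dict[str, dict] = {}
        self._listeners: Dict[str, List[Listener]] = {}

    def listen(self, path: str, callback: Listener) -> None:
        """Register a callback for one document path."""
        self._listeners.setdefault(path, []).append(callback)
        if path in self._documents:
            # Deliver the current state right away, like an initial snapshot.
            callback(path, dict(self._documents[path]))

    def set(self, path: str, data: dict) -> None:
        """Write a document and notify every listener on that path."""
        self._documents[path] = dict(data)
        for callback in self._listeners.get(path, []):
            callback(path, dict(data))


store = InMemoryDocumentStore()
phone_view: List[dict] = []
laptop_view: List[dict] = []

# Two client devices watch the same document.
store.listen("rooms/lobby", lambda path, data: phone_view.append(data))
store.listen("rooms/lobby", lambda path, data: laptop_view.append(data))

# One write reaches both devices without either of them polling.
store.set("rooms/lobby", {"topic": "launch"})
```

In a multi-user, collaborative application, each device would render its view from the data delivered to its callback.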

Firebase Realtime Database

Realtime Database is a NoSQL Google Cloud database that is part of the Firebase platform. It enables you to store and sync data in real-time and includes caching capabilities for offline use. Realtime Database also enables you to implement declarative authentication, matching users by identity or pattern matching. It includes mobile and web software development kits (SDKs) for easier and faster app development.

Use cases for Firebase Realtime Database include:

Development of apps that work across devices
Ad optimization and personalization
Third-party payment processing
Integration of machine learning

Cloud Memorystore

Cloud Memorystore is a fully managed, in-memory Google Cloud data store. It is designed to be secure, highly available, and scalable. Cloud Memorystore enables you to create application caches with sub-millisecond latency for data access. It is compatible with Memcached and Redis protocols.

Use cases for Cloud Memorystore include:

Lift and shift migration of applications
Machine learning applications
Real-time analytics
Low latency data caching and retrieval
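A common way to use an in-memory store for low-latency caching is the cache-aside pattern: read from the cache first, fall back to the database on a miss, then populate the cache. The sketch below shows the pattern with a tiny in-process class standing in for a Redis or Memcached instance. TTLCache and get_with_cache are illustrative names, not part of any Memorystore client.

```python
import time
from typing import Callable, Dict, Optional, Tuple


class TTLCache:
    """Tiny in-process stand-in for a Redis or Memcached cache."""

    def __init__(
        self, ttl_seconds: float, clock: Callable[[], float] = time.monotonic
    ) -> None:
        self._ttl = ttl_seconds
        self._clock = clock
        self._entries: Dict[str, Tuple[float, str]] = {}

    def get(self, key: str) -> Optional[str]:
        entry = self._entries.get(key)
        if entry is None:
            return None
        expires_at, value = entry
        if self._clock() >= expires_at:
            del self._entries[key]  # expired entries behave like a miss
            return None
        return value

    def set(self, key: str, value: str) -> None:
        self._entries[key] = (self._clock() + self._ttl, value)


def get_with_cache(
    key: str, cache: TTLCache, load_from_database: Callable[[str], str]
) -> str:
    """Cache-aside read: try the cache, fall back to the database, then populate."""
    cached = cache.get(key)
    if cached is not None:
        return cached
    value = load_from_database(key)
    cache.set(key, value)
    return value
```

The time-to-live bounds how stale a cached value can become, which is one simple way to limit the cache-to-database consistency issues that come with any cache.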

How to Choose a Google Cloud Database Service

Even after you explore your database options in Google Cloud, deciding which are the right options for you can be a challenge. When considering your options, keep in mind that many organizations need and can benefit from using multiple services. This enables you to optimize your implementations according to database capabilities, rather than trying to adapt a database service to fit all needs.

Cloud SQL
Cloud SQL is a good option when you need relational database capabilities but don’t need more than 10TB of storage capacity or more than 4,000 concurrent connections. You also need to be skilled at on-premises database management.

Cloud Spanner
Cloud Spanner is a good option when you plan to use large amounts of data (more than 10TB) and need transactional consistency. It is also good if you want to use sharding for higher throughput and accessibility.

If you know or think that you might eventually need to horizontally scale your Google Cloud database, Cloud Spanner is a better option than Cloud SQL. If you start with Cloud SQL and eventually need to move to Cloud Spanner, be prepared to rewrite your application in addition to migrating your database.

Cloud Firestore/Datastore
Cloud Firestore or Datastore are good options when you plan to focus on app development and need live synchronization and offline support.

If you need to store unstructured data in JSON documents, Cloud Datastore is the recommended option. If you need to store structured data, Cloud Spanner is recommended instead.

An additional factor to consider is whether you need atomicity, consistency, isolation, durability (ACID) compliance. If so, you need to choose Cloud Spanner since Cloud Datastore only offers atomic and durable transactions.
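To illustrate what the atomicity part of ACID means in practice, the sketch below runs a two-step transfer inside a single transaction. It uses Python's built-in sqlite3 module purely as a stand-in, not Cloud Spanner or Cloud Datastore, and the accounts table is invented for the example.

```python
import sqlite3

db = sqlite3.connect(":memory:")
db.execute(
    "CREATE TABLE accounts ("
    "name TEXT PRIMARY KEY, balance INTEGER CHECK (balance >= 0))"
)
db.executemany("INSERT INTO accounts VALUES (?, ?)", [("alice", 100), ("bob", 50)])
db.commit()


def transfer(conn: sqlite3.Connection, sender: str, receiver: str, amount: int) -> bool:
    """Move funds in one transaction; True if committed, False if rolled back."""
    try:
        with conn:  # commits on success, rolls back if an exception escapes
            conn.execute(
                "UPDATE accounts SET balance = balance + ? WHERE name = ?",
                (amount, receiver),
            )
            # If this debit violates the CHECK constraint, the credit above
            # is undone as well, so no partial transfer is ever visible.
            conn.execute(
                "UPDATE accounts SET balance = balance - ? WHERE name = ?",
                (amount, sender),
            )
        return True
    except sqlite3.IntegrityError:
        return False


def balances(conn: sqlite3.Connection) -> dict:
    """Return the current balance of every account."""
    return dict(conn.execute("SELECT name, balance FROM accounts"))
```

A transfer that would overdraw the sender leaves both balances exactly as they were, even though the credit statement ran first.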

Cloud Bigtable
Cloud Bigtable is a good option if you are using large amounts of single key data. In particular, it is good for low-latency, high throughput workloads.

If you need to perform single-region analytics, Cloud Bigtable is preferred over Cloud Spanner. However, if you need multi-regional operations, Cloud Spanner is the recommended solution. For example, Cloud Bigtable is a good option for a time series app created for DevOps monitoring. Meanwhile, Cloud Spanner is the recommended option for an infrastructure monitoring platform designed for a software as a service (SaaS) offering.
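For the time series monitoring example, much of the design work in a single-key store goes into the row key, because rows are kept sorted by key and read by key or key prefix. The sketch below shows one possible key layout for monitoring data, with a reversed timestamp so the newest reading sorts first. The key format, the MAX_TS constant, and the prefix_scan helper are assumptions made for illustration, and a sorted Python list stands in for the table.

```python
from bisect import bisect_left
from typing import List, Tuple

MAX_TS = 10**10 - 1  # assumption: second-resolution timestamps fit in 10 digits


def row_key(host: str, metric: str, unix_seconds: int) -> str:
    """Build a key that sorts by host, then metric, then newest reading first."""
    reversed_ts = MAX_TS - unix_seconds
    return f"{host}#{metric}#{reversed_ts:010d}"


def prefix_scan(
    sorted_rows: List[Tuple[str, float]], prefix: str, limit: int
) -> List[Tuple[str, float]]:
    """Return up to `limit` rows whose key starts with `prefix`, in key order."""
    start = bisect_left(sorted_rows, (prefix,))
    matches: List[Tuple[str, float]] = []
    for key, value in sorted_rows[start:]:
        if not key.startswith(prefix) or len(matches) == limit:
            break
        matches.append((key, value))
    return matches


# A sorted list stands in for a table ordered by row key.
table = sorted(
    [
        (row_key("web-1", "cpu", 1000), 0.4),
        (row_key("web-1", "cpu", 2000), 0.9),
        (row_key("web-2", "cpu", 1500), 0.1),
    ]
)

# The most recent CPU reading for web-1 is the first row under its prefix.
latest = prefix_scan(table, "web-1#cpu#", limit=1)
```

With this layout, "latest reading for a host and metric" becomes a short prefix read rather than a scan over the whole data set.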

Cloud Memorystore
Cloud Memorystore is a good option if you are using key-value datasets and your primary concern is transaction latency.

If you do not need disk-based data persistence and are only using the service for caching, Cloud Memorystore should be your choice. However, if you are concerned about issues like cache to database consistency or stream processing, you should choose Cloud Bigtable. Likewise, any time that your volume of data is too big to fit into memory, Cloud Memorystore is not the best option for you.
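The rules of thumb in this section can be condensed into a small decision helper. This is a rough sketch that encodes only the thresholds and trade-offs stated above (10TB, 4,000 connections, ACID, offline sync, single-key data, caching). The Workload fields and the suggest_service function are hypothetical names, and real selection should weigh many more factors.

```python
from dataclasses import dataclass


@dataclass
class Workload:
    relational: bool = False
    data_tb: float = 0.0
    concurrent_connections: int = 0
    needs_acid: bool = False
    needs_horizontal_scaling: bool = False
    multi_regional: bool = False
    needs_offline_sync: bool = False
    single_key_lookups: bool = False
    cache_only: bool = False  # no disk persistence needed, data fits in memory


def suggest_service(w: Workload) -> str:
    """Map a workload description to the service this section points to."""
    if w.cache_only:
        return "Cloud Memorystore"
    if w.relational:
        outgrows_cloud_sql = w.data_tb > 10 or w.concurrent_connections > 4000
        if outgrows_cloud_sql or w.needs_horizontal_scaling or w.multi_regional:
            return "Cloud Spanner"
        return "Cloud SQL"
    if w.needs_acid:
        # Per the section, Datastore transactions are only atomic and durable.
        return "Cloud Spanner"
    if w.needs_offline_sync:
        return "Cloud Firestore/Datastore"
    if w.single_key_lookups:
        return "Cloud Bigtable"
    # Fallback: unstructured JSON documents.
    return "Cloud Firestore/Datastore"
```

For example, suggest_service(Workload(relational=True, data_tb=40)) points to Cloud Spanner because the data volume exceeds the 10TB guideline for Cloud SQL. Since many organizations benefit from multiple services, a helper like this is best run per workload rather than once per organization.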

Google Cloud Database Management with Cloud Volumes ONTAP

NetApp Cloud Volumes ONTAP, the leading enterprise-grade storage management solution, delivers secure, proven storage management services on AWS, Azure and Google Cloud. Cloud Volumes ONTAP supports a capacity of up to 368TB, and supports various use cases such as file services, databases, DevOps or any other enterprise workload, with a strong set of features including high availability, data protection, storage efficiencies, Kubernetes integration, and more.

In particular, Cloud Volumes ONTAP helps address database workload challenges in the cloud, and fills the gap between your cloud-based database capabilities and the public cloud resources they run on.

Learn more about Google Cloud Database

Cloud Firestore: An In-Depth Look

Cloud Firestore enables you to store web and mobile application data in Google Cloud Platform (GCP). You can use listeners in Cloud Firestore for real-time synchronization between client applications.

This article explains what Cloud Firestore is and how it works, notes the differences between Cloud Firestore and Realtime Database, and covers best practices for Cloud Firestore implementations.

8 Types of Google Cloud Analytics: How to Choose?

Google Cloud Analytics services provide various capabilities you can use to leverage data to improve customer experience and democratize the use of data across various collaborators. Learn how to build efficient architectures while using Google services.

Read more: 8 Types of Google Cloud Analytics: How to Choose?

Understanding Google Cloud High Availability

High availability provides a consistent level of uptime, ensuring workloads experience minimal failure. In GCP, this is achieved by leveraging 24 regions and 73 availability zones, along with Compute Engine.

Google Cloud MySQL: The Complete Guide

There are several ways to run MySQL on Google Cloud. You can use Google Cloud SQL, which is a managed Google Cloud service. Alternatively, you can use a Google Cloud Marketplace image to install MySQL on a Compute Engine instance. It is also possible to manually install MySQL on Compute Engine.

This article provides an in-depth look at these three deployment options.

Read more: Google Cloud MySQL: The Complete Guide

Google Cloud PostgreSQL: Managed or Self Managed?

Google Cloud PostgreSQL is a fully managed Google Cloud database service, which allows you to automatically provision and manage PostgreSQL database instances. Learn about the Google Cloud PostgreSQL managed service, and the pros and cons of managed vs. self-managed PostgreSQL on Google Cloud.

Read more: Google Cloud PostgreSQL: Managed or Self Managed?

Google Cloud Big Data: Building Your Big Data Architecture on GCP

The Google Cloud Platform provides multiple services that support big data storage and analysis. Possibly the most important is BigQuery, a high performance SQL-compatible engine that can perform analysis on very large data volumes in seconds.

Learn how Google Cloud Big Data services can help you build a robust big data infrastructure.

Read more: Google Cloud Big Data: Building Your Big Data Architecture on GCP.

Google Cloud NoSQL: Firestore, Datastore, and Bigtable

Google’s cloud platform (GCP) offers a wide variety of database services. Of these, its NoSQL database services are unique in their ability to rapidly process very large, dynamic datasets with no fixed schema. Learn about the big three Google Cloud NoSQL offerings, providing high performance data access for web applications, mobile applications, and huge scale datasets.

Read more: Google Cloud NoSQL: Firestore, Datastore, and Bigtable.

Google Cloud Data Lake: 4 Phases of the Data Lake Lifecycle

A data lake is a central repository designed to store, process, and protect large volumes of structured, semi-structured and unstructured data. You can store the data in its native format and use a variety of data without considering size limitations. Learn about the four phases in a Google Cloud data lake lifecycle, and the tools and services Google provides for implementing them.

Read more: Google Cloud Data Lake: 4 Phases of the Data Lake Lifecycle.

Google Cloud SQL: MySQL, Postgres and MS SQL on Google Cloud

Google Cloud SQL is a managed database service that allows you to run Microsoft SQL Server, MySQL, and PostgreSQL on Google Cloud. The service provides replication, automated backups, and failover to ensure high-availability and resilience. In addition, it provides an easy and fast way to deploy and operate an SQL database in your cloud.

This post introduces the Google Cloud SQL service, explains the features that Google provides for each type of database, the costs, and how to start your first database.

Read more: Google Cloud SQL: MySQL, Postgres and MS SQL on Google Cloud.

Google Cloud SQL Pricing, and Limits: A Cheatsheet for Cost Optimization

Google Cloud SQL is a database service that offers managed versions of SQL Server, MySQL, and PostgreSQL. This service can provide significant benefits over on-premises implementations. However, before signing up, you should consider both pricing and its limitations.

This article explains the various pricing breakdowns of SQL database services in Google Cloud, covers the limitations of Google Cloud SQL, and highlights how you can optimize costs with Cloud Volumes ONTAP.

Read more: Google Cloud SQL Pricing, and Limits: A Cheatsheet for Cost Optimization

Should You Still Be Using Google Cloud Datastore?

Google Cloud Datastore is a highly scalable, managed NoSQL database hosted on the Google Cloud Platform.

Google has released Firestore, a new version of Datastore with several improvements and additional features. In the future, existing Datastore databases will be automatically upgraded to Firestore.

Read more: Should You Still Be Using Google Cloud Datastore?

Google Cloud Dataflow: The Basics and 4 Critical Best Practices

Google Cloud Dataflow is a managed service used to execute data processing pipelines based on Apache Beam via the Google Cloud Platform (GCP).

Dataflow is a fully managed pipeline runner that does not require initial setup of underlying resources. Because it is fully integrated with the Google Cloud Platform (GCP), it can easily be combined with other Google Cloud big data services, such as Google BigQuery.

Read more: Google Cloud Dataflow: The Basics and 4 Critical Best Practices

See Additional Guides on Key Managed Services Topics

Together with our content partners, we have authored in-depth guides on several other topics that can also be useful as you explore the world of managed services.