
Cloud Monitoring is the Google Cloud service that collects, analyzes, and visualizes metrics from your resources so you can keep track of the health and performance of your applications and infrastructure. It provides real-time metrics and alerts, supports custom dashboards, and works alongside Cloud Logging as part of the Cloud Observability Suite. You may recognize it by its former name, Stackdriver Monitoring. For the Professional Cloud Database Engineer exam, the useful framing is that monitoring is how you find out whether a database is keeping up, and what you should be alerted on when it is not.
The core job of Cloud Monitoring is to gather metrics from your Google Cloud resources, let you analyze them, and present them visually so the state of a system is easy to read. The real-time side is what makes it useful operationally, because you can act quickly when something goes wrong rather than discovering a problem after the fact. Custom dashboards let you arrange the metrics you care about so the information you need is available at a glance. None of this is specific to databases on its own, but databases are exactly the kind of resource where slow degradation matters, so the same metrics and alerts apply directly to keeping one running smoothly.
There are two broad categories of metrics in Cloud Monitoring, and the distinction between them comes up often enough that it is worth being precise about. The first category is built-in metrics, which Google Cloud services provide automatically. These cover three main areas. Infrastructure metrics include CPU utilization, disk I/O, network traffic, and memory usage, which give you a view of the underlying hardware and resources. Application metrics include response times, error rates, request rates, and latency, which describe how a service is performing from the user's perspective. System metrics include details like system load and running processes, which speak to the overall health of a system.
For a database, the infrastructure metrics are usually the first place to look. CPU utilization, disk I/O, memory usage, and network traffic are the signals that tell you whether an instance is under strain, and they are available without any extra setup. The application-level metrics like latency and error rates matter too, because a database that is technically up but answering slowly is still a problem for whatever depends on it.
The second category is user-defined metrics, which you create yourself for cases the built-in metrics do not cover. Custom metrics let you instrument your own code to track application-specific values. Third-party application metrics let you bring data from external tools into Cloud Monitoring so it sits alongside your Google Cloud metrics. Log-based metrics are generated from entries in Cloud Logging, which is useful for turning log data, such as the count of a specific error message, into something you can graph and alert on. That last one is worth remembering for databases, because a particular error in the logs can be converted into a metric and then watched the same way you would watch CPU.
A practical point that the exam tends to reward is to use built-in metrics where possible. They are optimized for Google Cloud services and require no additional configuration. User-defined metrics are powerful, but the guidance is to reach for them when the built-in ones genuinely do not cover what you need, not as a default.
Real-time alerts can be configured on any metric you are tracking, which means you can be notified the moment something moves out of its normal range. Common examples include CPU utilization exceeding a threshold such as 90 percent, response times crossing a set limit, spikes in error rates, or a sudden drop in the number of users reaching a service. For a database, those translate cleanly into the things you would want to know about early, such as an instance running hot on CPU, queries getting slow, or errors climbing.
When an alert fires, notifications can go out through several channels. You can receive an email or a text message, or route the alert through third-party services like Slack or PagerDuty. The point of having those options is to get the right people informed at the right time so action can follow promptly. On the exam, the thing to hold onto is that alerting is metric-driven. You pick a metric, set a condition on it, and choose how the notification reaches someone.
Databases rarely live alone in a single project, and Cloud Monitoring can give you a unified view when work is spread across several. The pattern is to set up a Cloud Monitoring workspace in one of your projects, which serves as the central hub, and then link the other projects to that primary workspace through the Google Cloud Console. A common situation is a company with multiple departments, each using its own project but needing a centralized view of operations. Linking those projects to one workspace lets you monitor them together rather than checking each in isolation.
For the Professional Cloud Database Engineer exam, it helps to keep three things straight. Built-in metrics are the default and require no setup, while user-defined and log-based metrics fill the gaps the built-in ones leave. Alerts can be placed on any metric and reach you through email, text, or third-party tools. And a single Cloud Monitoring workspace, linked across projects, is how you get one view of databases that are scattered across more than one project.
Our Professional Cloud Database Engineer course covers Cloud Monitoring alongside Cloud Logging and database performance metrics, with practice questions that drill these distinctions.