Traffic Splitting in Cloud Run for the PCA Exam

Ben Makansi

April 6, 2026

Traffic splitting in Cloud Run shows up often enough on the Professional Cloud Architect exam that it deserves its own write-up. The feature itself is not complicated. What matters is recognizing it as the right answer when a scenario describes gradual rollouts, canary deployments, or A/B testing, because the distractors will push you toward external load balancers, DNS tricks, or running multiple services side by side. None of those are what Cloud Run wants you to do.

What Traffic Splitting Actually Does

A Cloud Run service can have multiple revisions. Every time you deploy a new container image or change configuration, Cloud Run creates a new revision and keeps the old ones around. The service has a routing layer in front of those revisions, and you control what percentage of incoming requests goes to each one.

The canonical setup looks like this. You have a service called my-app. Revision 1 is v1.0, your current stable version, getting 90 percent of traffic. Revision 2 is v1.1, the new version you just deployed, getting 10 percent. Incoming requests hit the routing layer, and Cloud Run distributes them according to the percentages you set.

Two things about this setup matter for the exam. First, this is one Cloud Run service, not two. The revisions are versions of the same service, not separate services. Second, the routing happens inside Cloud Run. There is no external load balancer in this picture, no DNS configuration, no API gateway making the decision.

Why the Exam Cares

The exam cares because traffic splitting is the supported pattern for safe deployments on Cloud Run, and Google wants you to use the platform's native capability rather than bolting on infrastructure that is not needed.

Three deployment patterns all reduce to traffic splitting between revisions. Gradual rollouts move traffic from the old version to the new version over time, often starting at a small percentage and increasing as you confirm nothing is broken. Canary deployments send a small slice of traffic to a new version specifically to catch problems before a full rollout. A/B testing sends different users to different versions to compare behavior or measure outcomes. All three are the same mechanic underneath: percentages assigned to revisions of the same service.

If a question describes any of those patterns on Cloud Run, the answer is traffic splitting between revisions. The percentages are configurable to whatever you want. You can adjust them at any time, and a rollback is just shifting traffic back to the previous revision.

The Distractors to Watch For

The wrong answers on these questions tend to fall into a few buckets, and they are all variations of doing more work than the platform requires.

External load balancer in front of two Cloud Run services. Wrong because Cloud Run does this internally for one service across revisions.
DNS-based routing or weighted DNS records. Wrong for the same reason, and DNS is a poor fit for fine-grained traffic percentages anyway.
Two separate Cloud Run services with traffic divided between them. Wrong because revisions of one service are the supported pattern, and creating separate services breaks the rollback story.
API gateway routing between versions. Adds infrastructure that is not needed when Cloud Run handles the split natively.

If you see any of those options paired with a correct-looking answer that says use Cloud Run's built-in traffic splitting between revisions, take the built-in option.

What to Remember Walking Into the Exam

The recognition pattern is short. Cloud Run plus gradual rollout, canary, or A/B testing equals traffic splitting between revisions. One service, multiple revisions, percentages assigned in the routing layer, no external infrastructure required. Rollback is shifting the percentage back. That is the entire framework.

The mental picture worth carrying into the exam is the diagram of one service, the routing layer, and two revisions splitting at a percentage. If you can sketch that from memory, any traffic-splitting question the Professional Cloud Architect exam puts in front of you collapses to the same answer.

My Professional Cloud Architect course covers Cloud Run traffic splitting alongside the rest of the advanced architecture material.