Vertex AI Model Registry and Versioning for the PCA Exam

GCP Study Hub
Ben Makansi
April 8, 2026

Vertex AI Model Registry is the service that holds the line between training and production for machine learning workloads on Google Cloud. When I work through the ML and AI section of the Professional Cloud Architect curriculum, the Model Registry comes up as the answer whenever a scenario describes a team that needs a single place to track model artifacts, manage versions, evaluate performance, and push the right model to an endpoint without juggling files across buckets and notebooks.

I want to walk through what the Model Registry actually is, how versioning works, what the evaluation and monitoring views give you, and the signals that tell me a Professional Cloud Architect question is pointing toward this service rather than ad hoc artifact storage or a custom MLOps pipeline.

What the Model Registry Is

Vertex AI Model Registry is a centralized repository for managing ML model artifacts, versions, and associated metadata throughout the model lifecycle. The phrase that matters on the exam is centralized repository. It rules out scenarios where the right answer would be a Cloud Storage bucket of pickle files with a spreadsheet tracking which version is in production. The Registry is the GCP-native control plane for models.

Inside the Registry, you get a unified list of every registered model in your project. That includes AutoML models, custom-trained models, and models imported from outside Vertex AI. Each model carries its type, such as Tabular, Object Detection, or Image Classification, along with versioning information and current deployment status. You can also label and organize models, attaching metadata like team ownership or application context.
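
To make that concrete, here is a minimal sketch of what registering a custom-trained model looks like with the Vertex AI Python SDK (google-cloud-aiplatform). The project, bucket path, serving container, and labels are placeholder assumptions, not values from any real environment.

    from google.cloud import aiplatform

    # Placeholder project and region; swap in real values.
    aiplatform.init(project="my-project", location="us-central1")

    # Register a trained artifact as a new model in the Registry.
    model = aiplatform.Model.upload(
        display_name="churn-classifier",
        artifact_uri="gs://my-bucket/models/churn/",  # assumed artifact location
        serving_container_image_uri=(
            "us-docker.pkg.dev/vertex-ai/prediction/sklearn-cpu.1-0:latest"
        ),
        labels={"team": "growth", "app": "retention"},  # ownership metadata
    )
    print(model.resource_name)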

The exam framing I keep coming back to is this. If a scenario describes a team that has multiple training pipelines producing models, possibly mixed between AutoML and custom-trained, and needs visibility into which model is active and how many versions exist, the Model Registry is the answer. It is the single view that makes the rest of the lifecycle manageable.

Model Versioning

Versioning is where the Registry earns its place on the Professional Cloud Architect exam. A single registered model is not just one artifact. It is a series of versions, each trained with different techniques, hyperparameters, or data. The Registry treats those versions as rows under a parent model entry rather than as separate models.

For a churn classifier, for example, one version might be a Random Forest, another a Gradient Boosting model, and another a Logistic Regression. All three live under the same model in the Registry. Each version carries its own state, with values like Ready indicating it is available for use. One version is marked as the default and can be tagged for production use. You can also assign aliases like experimental or staging, and attach labels for the creator or the owning team.
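
In SDK terms, a new version is another Model.upload call that points at the parent model. A sketch, with the model ID, artifact path, and alias all illustrative assumptions:

    from google.cloud import aiplatform

    aiplatform.init(project="my-project", location="us-central1")

    # Upload the Gradient Boosting artifact as a new version of the
    # existing churn-classifier entry rather than as a separate model.
    new_version = aiplatform.Model.upload(
        parent_model="projects/my-project/locations/us-central1/models/1234567890",
        display_name="churn-classifier",
        artifact_uri="gs://my-bucket/models/churn-gb/",
        serving_container_image_uri=(
            "us-docker.pkg.dev/vertex-ai/prediction/sklearn-cpu.1-0:latest"
        ),
        is_default_version=False,            # leave the current default serving
        version_aliases=["experimental"],    # tag the new version
        version_description="Gradient Boosting variant",
    )
    print(new_version.version_id)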

The architectural implication is reproducibility and governance. Because every version is preserved and labeled, the team can roll back to a prior version, run side-by-side comparisons, or hand off a specific version to another environment without losing track of which artifact corresponds to which training run. On the exam, when a scenario describes a need to track model evolution over time, ensure reproducibility, or control which exact model is serving production traffic, versioning inside the Model Registry is the mechanism the question is pointing at.
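
For rollback specifically, the mechanic I picture is alias reassignment rather than artifact redeployment. A hedged sketch, assuming the SDK's ModelRegistry helper and its alias methods behave as documented in recent releases; the version numbers are made up:

    from google.cloud import aiplatform

    aiplatform.init(project="my-project", location="us-central1")

    registry = aiplatform.models.ModelRegistry(model="1234567890")

    # One row per version under the parent model entry.
    for v in registry.list_versions():
        print(v.version_id, v.version_aliases)

    # Roll back: move the production alias from version 3 to version 2.
    registry.remove_version_aliases(target_aliases=["production"], version="3")
    registry.add_version_aliases(new_aliases=["production"], version="2")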

The Model Evaluation View

The Registry includes an evaluation view that helps you assess the performance of a trained model before deciding to deploy it. This is the part of the workflow that turns a registered version into a defensible production choice.

At the top of the evaluation page, a confidence threshold control lets you see how metrics like F1 score, precision, and recall respond to different settings. The values update as the threshold moves, which matters because choosing a threshold is part of the production decision, not just a model property. Below the controls, three plots give you the visual picture.

  • A precision-recall curve to show the trade-off between those two metrics across thresholds.
  • An ROC curve showing the relationship between the true positive rate and the false positive rate.
  • A precision and recall by threshold chart to help select a cutoff appropriate for the use case.
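
The same numbers are reachable programmatically. Here is a sketch of reading them through the SDK; the metric keys (auPrc, auRoc, logLoss, confidenceMetrics) are the names AutoML classification evaluations typically expose, and I am treating them as assumptions since they vary by model type:

    from google.cloud import aiplatform

    aiplatform.init(project="my-project", location="us-central1")

    model = aiplatform.Model("1234567890")          # assumed model ID
    evaluation = model.list_model_evaluations()[0]
    metrics = evaluation.metrics

    print("PR AUC:", metrics["auPrc"])
    print("ROC AUC:", metrics["auRoc"])
    print("Log loss:", metrics["logLoss"])

    # Each confidenceMetrics row backs one point on the
    # precision-and-recall-by-threshold chart.
    for row in list(metrics["confidenceMetrics"])[:3]:
        row = dict(row)
        print(row.get("confidenceThreshold", 0.0),
              row.get("precision"), row.get("recall"))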

For a Professional Cloud Architect question, the takeaway is that the Registry is not just storage. It is also the place where evaluation happens for classification models, which means the architectural pattern of train, register, evaluate, deploy lives entirely inside Vertex AI rather than spreading across multiple tools.

Deploying a Version Directly From the Registry

Once a version has been evaluated, the Registry allows direct deployment to a Vertex AI endpoint. From the options menu next to a version, you click Deploy to endpoint and the version moves into serving. There is no context switch out to a separate deployment tool, and no need to copy artifacts between locations.
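
The SDK equivalent of that click is a deploy call on the model, roughly as sketched below. The endpoint name and machine type are illustrative assumptions:

    from google.cloud import aiplatform

    aiplatform.init(project="my-project", location="us-central1")

    endpoint = aiplatform.Endpoint.create(display_name="churn-endpoint")

    # Deploys the model's default version unless a version is pinned,
    # e.g. aiplatform.Model("1234567890@2") for version 2.
    model = aiplatform.Model("1234567890")
    model.deploy(
        endpoint=endpoint,
        machine_type="n1-standard-4",
        traffic_percentage=100,
    )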

This is the pattern I look for on the exam. When a scenario describes a team that wants to move a tested model into production without rewiring its serving stack, deployment from the Model Registry is the answer. The same surface that holds the artifact and the evaluation also handles the promotion to an endpoint. That tight integration is the value proposition on the architecture side, because it shortens the distance between training and serving while keeping versioning intact.

The associated metadata travels with the deployment. The endpoint can serve a specific version, and if the team needs to roll forward or back, they update the endpoint to point at a different version in the Registry rather than redeploying artifacts from scratch.
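
A hedged sketch of that roll-forward with placeholder IDs: deploy the target version alongside the incumbent, shift traffic, then undeploy the old one.

    from google.cloud import aiplatform

    aiplatform.init(project="my-project", location="us-central1")

    endpoint = aiplatform.Endpoint("5555555555")     # assumed endpoint ID
    new_version = aiplatform.Model("1234567890@2")   # "@2" pins version 2

    # Send all traffic to the new version as it comes online.
    endpoint.deploy(
        model=new_version,
        machine_type="n1-standard-4",
        traffic_percentage=100,
    )

    # Retire whatever was serving version 1.
    for deployed in endpoint.list_models():
        if deployed.model_version_id == "1":
            endpoint.undeploy(deployed_model_id=deployed.id)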

Post-Deployment Evaluation and Monitoring

The Registry also supports post-deployment evaluation and monitoring. You can compare evaluations of a model across different conditions, for example a version against its retrained successor, or the same version evaluated on different datasets. Up to five evaluations can be compared side by side.

The comparison view exposes the same threshold control at the top, so you can see how precision, recall, and F1 score shift as you move the cutoff. Below that, the core metrics are laid out for each evaluation, including PR AUC, ROC AUC, log loss, F1 score, precision, and recall. Three visualizations sit alongside the metrics: the precision-recall curve, the ROC curve, and precision and recall across confidence thresholds.
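
A sketch of what that side-by-side looks like in code, pulling the headline metrics for up to five evaluations; the metric keys are the same assumed AutoML classification names as before:

    from google.cloud import aiplatform

    aiplatform.init(project="my-project", location="us-central1")

    model = aiplatform.Model("1234567890")
    evaluations = model.list_model_evaluations()[:5]  # console caps compare at five

    print(f"{'evaluation':<25}{'auPrc':>10}{'auRoc':>10}{'logLoss':>10}")
    for ev in evaluations:
        m = ev.metrics
        print(f"{ev.name:<25}"
              f"{m['auPrc']:>10.4f}{m['auRoc']:>10.4f}{m['logLoss']:>10.4f}")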

The architectural use cases the Professional Cloud Architect exam expects you to recognize from this view are model drift monitoring, evaluating retraining impact, and selecting which model version to promote to production. If a scenario describes a team that needs to detect drift over time, decide whether a retrained version actually outperforms the current production version, or compare a new version against the incumbent before promotion, the post-deployment evaluation and monitoring capability inside the Model Registry is the mechanism doing that work.

Exam Signals That Point to the Model Registry

A few patterns reliably indicate the Model Registry is the right choice on the Professional Cloud Architect exam.

  • The scenario describes a centralized place to manage ML models.
  • Multiple model types are involved, including AutoML, custom-trained, or imported models.
  • The team needs versioning with aliases or labels for production, staging, and experimental.
  • The scenario calls for evaluation metrics like precision, recall, ROC, or PR AUC tied to specific versions.
  • Deployment to a Vertex AI endpoint is part of the workflow.
  • The team wants to monitor model drift or compare retrained versions against the incumbent.

If you see those signals, the Model Registry is the GCP-native answer. It is the control plane for model lifecycle on Vertex AI, and the exam treats it as the default whenever a question describes the lifecycle problem rather than a single training or serving step in isolation.

If you want to go deeper on the Model Registry and how it fits with the rest of Vertex AI, I cover it in the Professional Cloud Architect course alongside the rest of the ML and AI material.
