Agent Platform Model Garden | Generative AI Leader Exam

December 6, 2025

I am Ben Makansi, and Agent Platform Model Garden (formerly Vertex AI Model Garden) is one of those topics on the Generative AI Leader exam where the wording matters more than the engineering. Google describes Model Garden as the place you go to find models, and the exam will test whether you can talk about it that way without getting tangled up in implementation details. This article walks through what Model Garden actually is, what is in it, and the one deployment wrinkle you need to recognize.

What Agent Platform Model Garden is

Agent Platform is the platform. Model Garden is where the models live inside that platform. Google's official phrasing is that Agent Platform Model Garden is a comprehensive AI/ML model library that allows users to discover, test, customize, and deploy models and assets from Google and its partners. Memorize that sentence. The four verbs (discover, test, customize, deploy) come up repeatedly in exam-style questions, and they map cleanly to the workflow the Generative AI Leader exam expects you to understand.

The point of Model Garden is that you do not have to train a model from scratch. You browse a catalog, pick something that is already most of the way to your use case, and go from there. That framing matters because the exam often pits Model Garden against custom training paths, and the correct answer is usually the one that avoids reinventing the wheel.

The three categories of models

Model Garden contains a diverse ecosystem, and Google groups the contents into three buckets. Knowing the buckets and at least one example from each is the kind of detail a question can hinge on.

First-party (Google): Gemini, Imagen, Codey, Chirp. These are Google's own proprietary models. Gemini for general-purpose generative AI, Imagen for image generation, Codey for code, Chirp for speech.
Open weight: Llama, Gemma, Falcon. These are open-weight models you can pull, customize, and deploy. Gemma is Google's own open-weight family, but Llama and Falcon being available in Model Garden is what makes the catalog feel like a true marketplace.
Partner: Claude, Mistral, Jamba. These are third-party proprietary models from companies that have partnered with Google to make their offerings available through Agent Platform.

If a question asks where you would go to access Claude or Llama on Google Cloud, the answer is Agent Platform Model Garden. If a question asks for the single interface that covers Google's own Gemini alongside an open-weight model and a partner model, same answer.

One-click deploy versus custom prediction routine

Most models in Model Garden can be deployed with one click directly from the Model Garden interface. That is the happy path. You pick a model, click deploy, and Agent Platform handles the serving infrastructure for you with autoscaling, monitoring, versioning, and load balancing all included.

The exam wrinkle is the case where one-click does not work. Some models require specialized preprocessing that standard Agent Platform containers cannot handle. Maybe you need specific data transformations, custom feature engineering, or framework-specific logic that is unique to your use case. The standard serving containers do not support this kind of customization.

The solution is a custom prediction routine packaged inside a Docker container. The flow goes like this:

From Model Garden, get the model artifacts. That means the weights, the architecture, and any other files that make up the model.
Build a custom Docker container that packages the model artifacts together with your custom prediction routine and any preprocessing logic.
Push that container to Artifact Registry, which is where Google Cloud stores container images.
Deploy the container to a Agent Platform Endpoint.

The key benefit is that you still get managed serving benefits while using custom processing logic. You do not give up the autoscaling, monitoring, health checks, or GPU support that Agent Platform provides. You only swap out the prediction code inside the container.

How this shows up on the exam

For the Generative AI Leader exam, the Model Garden questions tend to fall into three patterns. First, identification questions where you have to recognize Agent Platform Model Garden as the place to find a specific model like Gemini or Llama. Second, framing questions where you pick out the four verbs (discover, test, customize, deploy) or the diverse ecosystem language. Third, scenario questions where a team needs custom preprocessing that standard containers cannot provide, and the right answer is the custom Docker container plus Artifact Registry plus Agent Platform Endpoint flow.

None of this requires you to write the Dockerfile or know the gcloud commands. The Generative AI Leader exam is about knowing what each piece does and when each piece applies.

My Generative AI Leader course covers Agent Platform Model Garden alongside the rest of the foundational material you need for the exam.

Agent Platform Model Garden for the Generative AI Leader Exam

What Agent Platform Model Garden is

The three categories of models

One-click deploy versus custom prediction routine

How this shows up on the exam

Get tips and updates from GCP Study Hub