Google Vertex AI logo

Google Vertex AI Review

Visit

Build, deploy, and scale ML models on Google Cloud infrastructure

Google Vertex AI is a managed machine learning platform for building, deploying, and scaling AI models.

Google Cloud·Founded 2008·Usage-basedFree TrialMachine Learning PlatformsAI APIsAI Cloud

AI Panel Score

8.2/10

6 AI reviews

Reviewed

AI Editor Approved

About Google Vertex AI

Google Vertex AI is a fully managed, end-to-end machine learning platform hosted on Google Cloud. It consolidates what were previously separate Google Cloud ML services into a single environment, covering the full model lifecycle from data ingestion and labeling through training, evaluation, deployment, and monitoring. Users can work with structured data, images, text, and video across both AutoML and custom training approaches.

The platform targets data scientists, ML engineers, and developers building production-grade AI applications. It supports popular frameworks including TensorFlow, PyTorch, scikit-learn, and XGBoost, and provides managed notebook environments, pipelines, and experiment tracking to support collaborative and reproducible ML workflows.

Vertex AI includes a Model Garden, which provides access to first-party Google foundation models such as Gemini, as well as open-source and third-party models. Through the Generative AI Studio, users can prototype, fine-tune, and deploy large language models and multimodal models without needing to manage underlying infrastructure.

On the MLOps side, Vertex AI offers Feature Store for sharing and serving ML features, Model Registry for version control, and Model Monitoring to detect training-serving skew and data drift in production. These capabilities are intended to help teams move models from experimentation to production more consistently and at scale.

Vertex AI competes directly with AWS SageMaker and Azure Machine Learning in the cloud ML platform market. Pricing is usage-based, varying by compute resources consumed, model type, and API calls made, with no flat monthly subscription required. Google Cloud's free tier includes limited Vertex AI credits for new accounts.

Features

AI

  • Gemini Models Access

    Provides access to the latest Gemini multimodal models capable of understanding and combining text, images, video, and code inputs to generate outputs.

  • Model Garden

    A catalog of 200+ generative AI models including first-party (Gemini, Imagen, Chirp, Veo), third-party (Anthropic's Claude), and open models (Gemma, Llama 3.2).

  • Vertex AI Agent Builder

    A full-stack platform for building, scaling, and governing enterprise-grade AI agents grounded in enterprise data.

  • Vertex AI Studio

    A prompt and testing environment where developers can experiment with Gemini models using text, images, video, or code inputs.

Analytics

  • Gen AI Evaluation Service

    Enterprise-grade tools for objective, data-driven assessment and comparison of generative AI models.

  • Model Monitoring

    Continuously monitors deployed models for input skew and drift to detect degradation in model performance.

  • Vertex AI Evaluation

    A purpose-built MLOps tool for identifying and comparing the best-performing models for a given use case.

Automation

  • Vertex AI Pipelines

    Workflow orchestration tool that automates and standardizes ML project workflows across the development lifecycle.

Collaboration

  • Feature Store

    A managed service for serving, sharing, and reusing ML features across teams and models.

Core

  • Model Registry

    A centralized repository for managing, versioning, and tracking any ML model throughout its lifecycle.

  • Vertex AI Notebooks

    Integrated notebook environments (Colab Enterprise or Workbench) natively connected to BigQuery for unified data and AI workloads.

  • Vertex AI Training and Prediction

    Managed infrastructure for training ML models and deploying them to production using open source frameworks and optimized AI hardware.

Preview

Google Vertex AI desktop previewGoogle Vertex AI mobile preview

Pricing Plans

AutoML Image Data

Contact sales

AutoML model training and prediction for image classification and object detection

  • Training (classification): $3.465/hour
  • Training (object detection): $3.465/hour
  • Training Edge on-device model: $18.00/hour
  • Deployment & online prediction (classification): $1.375/hour
  • Deployment & online prediction (object detection): $2.002/hour
  • Batch prediction: $2.222/hour

AutoML Tabular Data

Contact sales

AutoML model training and inference for tabular classification/regression

  • Training: $21.252/node hour
  • Inference: same price as custom-trained models
  • Batch inference uses 40 n1-highmem-8 machines
  • Vertex Explainable AI at same inference rate
  • Forecasting priced separately under Vertex AI Forecast

Vertex AI Forecast (AutoML)

Contact sales

Time series forecasting with AutoML, tiered prediction pricing

  • Prediction 0–1M count: $0.20/1,000 count
  • Prediction 1M–50M count: $0.10/1,000 count
  • Prediction 50M+ count: $0.02/1,000 count
  • Training: $21.252/hour
  • Up to 5 prediction quantiles at no additional cost
  • Shapley-values explainability available

Vertex AI Forecast (ARIMA+)

Contact sales

ARIMA+ forecasting model training and prediction via BigQuery ML

  • Prediction: $5.00/1,000 count
  • Training: $250.00 per TB × candidate models × backtesting windows
  • Time series decomposition explainability at no additional cost
  • Each job incurs cost of 1 managed pipeline run
  • Additional BigQuery ML pricing applies

Custom Training - CPU Machine Types

Contact sales

Custom model training on CPU-based Compute Engine machine types

  • n1-standard-4: $0.2185/hour up to n1-standard-96: $5.244/hour
  • n2-standard-4: $0.2234/hour up to n2-standard-80: $4.467/hour
  • e2-standard-4: $0.1541/hour up to e2-standard-32: $1.233/hour
  • c2-standard-4: $0.2401/hour up to c2-standard-60: $3.602/hour
  • m1-ultramem up to $28.948/hour for memory-optimized workloads
  • Spot VMs supported; billed per Compute Engine Spot VM pricing

Custom Training - GPU Accelerators

Contact sales

Custom model training with GPU/TPU accelerators attached to machine types

  • NVIDIA T4: $0.4025/hour; NVIDIA V100: $2.852/hour
  • NVIDIA A100: $2.934/hour + $0.440 management fee/hour
  • NVIDIA A100 80GB: $3.928/hour + $0.589 management fee/hour
  • NVIDIA H100 80GB: $9.797/hour + $1.469 management fee/hour
  • NVIDIA H200 141GB: $10.709/hour
  • TPU v2 (8 cores): $5.175/hour; TPU v3 (8 cores): $9.20/hour

Custom Training - GPU-Integrated Machine Types

Contact sales

Machine types with fixed GPU counts (GPU price included)

  • a2-highgpu-1g (1x A100): $4.425/hour
  • a2-highgpu-8g (8x A100): $35.402/hour
  • a2-megagpu-16g (16x A100): $65.707/hour
  • a3-highgpu-8g (8x H100): $101.007/hour
  • a3-megagpu-8g: $106.046/hour
  • a4-highgpu-8g: $148.212/hour

Generative AI on Vertex AI

Contact sales

Generative AI models and foundation model APIs on Vertex AI — see separate pricing page

  • Pricing listed on dedicated Generative AI on Vertex AI pricing page
  • Includes foundation models (Gemini, etc.)
  • Usage-based pricing per token/request

AI Panel Reviews

The Decision Maker

The Decision Maker

Strategic bet, vendor viability, timing, adoption approval
8.6/10

Google's bet on unified ML is real, and 200+ models in one catalog proves it.

Vertex AI is a mature, full-stack ML platform backed by Google's infrastructure and model portfolio. The Model Garden alone — 200+ models including Claude and Llama 3.2 — changes the vendor conversation.

Google. Profitable. Not going anywhere. Vertex AI consolidates what used to be six separate Cloud ML services, and the changelog shows steady shipping. Vendor viability isn't the question here.

Two things to watch. One: deployed AutoML endpoints charge even with zero predictions — you must manually undeploy or you're burning money overnight. Two: H100 training at $9.797/hour plus management fees adds up faster than teams expect when experiments run long. SageMaker has the same problem, but that's not an excuse.

For teams already on Google Cloud with BigQuery workloads, the Vertex AI Notebooks integration and Feature Store make serious sense. The Generative AI Studio lets you prototype against Gemini without touching infrastructure. That's real speed to value. Teams not on GCP will pay an adoption tax that takes time to offset.

Competitive Positioning8.4

Exclusive Model Garden access to Gemini 3 Pro and Veo isn't available on SageMaker — that's a real differentiation for multimodal use cases.

Reputation Risk9.0

Telling the board you're on Vertex AI reads as responsible — it's the defensible choice next to AWS SageMaker and Azure ML.

Speed to Value7.5

Generative AI Studio cuts prototype time, but the always-on endpoint billing model creates budget surprises that slow adoption in cautious orgs.

Strategic Fit8.2

Model Garden with 200+ models including third-party options like Claude means this advances AI capability, not just cost reduction.

Vendor Viability9.8

Google Cloud is a $36B+ annual revenue business — Vertex AI isn't a product they abandon.

Pros

  • 200+ foundation models including Claude, Llama 3.2, and first-party Gemini in one catalog
  • Feature Store, Model Registry, and Model Monitoring cover the full MLOps lifecycle without bolting on third-party tools
  • Native BigQuery integration cuts pipeline complexity for teams already on GCP
  • Spot VM support on custom training keeps GPU costs controllable at scale

Cons

  • Deployed AutoML endpoints bill continuously — even zero-prediction endpoints drain budget until manually undeployed
  • H100 training at $9.797/hour plus management fees makes long experiments expensive fast
  • Teams not on GCP face a real migration cost that offsets early productivity gains
  • No flat monthly pricing — usage-based model makes budget forecasting harder for finance teams

Right for

Teams already on Google Cloud who need production-grade MLOps plus access to Gemini and multimodal foundation models.

Avoid if

Your stack is AWS-native and you're not willing to pay the cross-cloud data egress and tooling migration tax.

The Domain Strategist

The Domain Strategist

Craft and strategy in the product's domain — adapts identity per category, same lens
8.5/10

The most complete MLOps surface in cloud, if you're already living in GCP.

Vertex AI has closed most of the gap with SageMaker on MLOps depth and opened a real lead on foundation model access via Model Garden's 200+ models. If your data is in BigQuery and your team runs TensorFlow or PyTorch, this is the default choice.

Feature Store, Model Registry, Pipelines, Model Monitoring — that's the full MLOps stack, not a checklist of half-built services. The Gen AI Evaluation Service and Vertex AI Studio together give teams a credible path from prototype to production on LLMs without spinning up separate tooling. Someone at Google has shipped real ML infrastructure before; this isn't duct-taped together.

The tradeoff worth naming: endpoint billing doesn't stop when predictions stop. A deployed AutoML model accrues charges at rest — you must explicitly undeploy to stop costs. For teams running many experimental endpoints or doing burst-and-idle workloads, that's a real budget control problem that SageMaker handles more gracefully with serverless inference.

If we adopt this and stay on GCP, in 3 years we have deep BigQuery-Vertex integration, access to first-party Gemini and third-party Claude via a single API surface, and an MLOps workflow that's genuinely hard to replicate on-prem. If we're multi-cloud or AWS-primary, the integration story loses most of its force.

Category Positioning8.5

Competes directly with SageMaker and Azure ML, but Model Garden's first-party Gemini access plus Claude availability gives it a differentiated generative AI position neither competitor currently matches.

Domain Fit8.5

Supports TensorFlow, PyTorch, scikit-learn, XGBoost, managed notebooks via Colab Enterprise, and Pipelines for reproducibility — matches how production ML teams actually operate.

Integration Surface9.0

Native BigQuery connectivity, Colab Enterprise notebooks, and a unified API for both classical ML and Gemini-era generative workloads is the strongest integration story in the category.

Long-term Implications8.0

Deep BigQuery and GCP integration compounds positively over time, but creates meaningful switching cost if the org ever goes multi-cloud.

Strategic Depth9.0

Full lifecycle coverage from labeling through drift monitoring, plus Model Garden with 200+ models including Anthropic's Claude — library-grade depth, not feature-lite.

Pros

  • 200+ models in Model Garden including first-party Gemini and third-party Claude — single API surface for classical and generative workloads
  • Full MLOps stack: Feature Store, Model Registry, Pipelines, and Model Monitoring all production-grade
  • BigQuery-native integration removes a major friction point for teams already in GCP
  • H100 and H200 GPU availability for large-scale training without separate infrastructure contracts

Cons

  • Deployed AutoML endpoints bill continuously even with zero predictions — requires manual undeploy to stop charges
  • Usage-based pricing with no flat subscription makes budget forecasting difficult at scale
  • Generative AI pricing lives on a separate page, fragmenting cost visibility
  • Strong GCP lock-in; integration advantages evaporate in multi-cloud or AWS-primary environments

Right for

Data science teams building production ML and LLM applications on GCP with BigQuery as their data backbone.

Avoid if

Your organization is AWS-primary or actively multi-cloud and can't commit to GCP as the ML runtime.

The Finance Lead

The Finance Lead

Money, total cost of ownership, contracts, procurement math
7.8/10

200+ models, H100s at $9.79/hour, and an idle-endpoint billing trap to watch.

Vertex AI is usage-based with granular public pricing — no sales call required. The idle-endpoint charge is the budget risk most teams miss.

Pricing page is genuinely detailed. H100 80GB at $9.797/hour plus $1.469 management fee. AutoML tabular training at $21.252/node-hour. ARIMA+ at $250/TB × candidate models × backtesting windows — that last multiplier compounds fast. Three dimensions visible without a procurement conversation. Rare for this category. AWS SageMaker buries comparable detail.

The TCO trap: deployed AutoML endpoints bill continuously, zero predictions or not. Team runs 5 idle endpoints for a month — that's real spend, not a rounding error. Year-3 cost depends entirely on compute mix and how disciplined the team is about undeploying models. No flat cap, no ceiling. Forecasting budgets here requires engineering discipline, not just finance work.

Contract flexibility is straightforward — pay-as-you-go, no auto-renewal window, no termination-for-convenience clause to fight. Procurement friction is low. The tradeoff: no committed-use discount is visible on the pricing page, so large training budgets may negotiate offline with Google Cloud reps.

Billing & Procurement7.8

Standard GCP billing applies — invoiced monthly, no onboarding fee, integrates with existing Google Cloud procurement relationships.

Contract Flexibility8.2

Usage-based with no published auto-renewal terms or lock-in; cancel anytime by stopping usage and undeploying endpoints.

Pricing Transparency8.5

Granular per-resource rates publicly listed — CPU, GPU, TPU, AutoML, and GenAI tiers all visible without a sales call.

ROI Clarity7.2

Model Monitoring and Gen AI Evaluation Service provide measurable performance signals, but revenue impact math remains on the buyer.

Total Cost of Ownership6.8

Idle endpoint billing plus ARIMA+ multipliers make year-3 TCO genuinely hard to model without engineering input.

Pros

  • Full compute pricing public: H100 at $9.797/hour, T4 at $0.4025/hour — no sticker shock post-contract
  • 200+ models in Model Garden including Claude and Llama 3.2 — broad optionality without separate vendor contracts
  • Spot VM support reduces training costs materially for fault-tolerant workloads
  • No flat subscription — teams with bursty workloads won't pay for idle months

Cons

  • Idle endpoint billing accrues with zero prediction traffic — requires active ops hygiene to control
  • ARIMA+ training cost multiplies by candidate models and backtesting windows — budget math gets complex fast
  • No visible committed-use discount on the pricing page; volume negotiation likely requires a Google Cloud rep
  • GenAI pricing lives on a separate page — full all-in cost requires reading two pricing documents

Right for

ML teams already on Google Cloud who need a fully managed MLOps platform with broad foundation model access and public compute pricing.

Avoid if

Teams without ML ops discipline to manage endpoint lifecycle will bleed budget on idle deployments.

The Domain Practitioner

The Domain Practitioner

Daily hands-on reality in the product's domain — adapts identity per category, same lens
8.1/10

200+ models, real MLOps depth — but idle endpoints will quietly drain your budget

Vertex AI is a serious production ML platform with genuine depth across training, pipelines, and generative AI. The idle-endpoint billing model is a real operational trap that SageMaker doesn't punish you for as harshly.

Model Garden ships 200+ models including Gemini, Claude, and Llama 3.2. That's not a demo catalog — that's a working foundation for teams who need model optionality without managing inference infrastructure. Feature Store, Model Registry, and Model Monitoring cover the MLOps surface that most teams bolt together from five different tools. Pipelines plus Colab Enterprise notebooks means the experiment-to-production handoff has actual structure, not just vibes.

The billing model demands attention. Deployed AutoML endpoints charge continuously whether or not predictions are made — you must undeploy to stop the meter. At $2.002/hour for object detection endpoints, a forgotten staging deployment costs $1,441/month. That's not a bug, it's a policy. H100 training at $9.797/hour plus $1.469 management fee per hour adds up fast during long fine-tuning runs.

Docs indicate solid framework coverage — TensorFlow, PyTorch, scikit-learn, XGBoost. The changelog is absent from public evidence, which makes tracking breaking changes harder than it should be. Power users will find real depth in Vertex AI Evaluation and the Gen AI Evaluation Service. Day-three reality: this is a platform you can actually live in, but cost hygiene has to be part of your daily workflow.

Day-3 Reality7.8

Deep MLOps tooling holds up past the demo, but idle-endpoint billing and absent public changelog create ongoing operational vigilance requirements.

Documentation Practitioner-Fit7.6

Docs are present and API coverage is confirmed, but no public changelog makes it harder for ML engineers to track deprecations and runtime changes.

Friction Surface7.2

Idle endpoint charges, management fees on top of GPU costs, and ARIMA+ billing that also incurs a pipeline run fee — small friction points compound across a working week.

Power-User Depth8.5

Gen AI Evaluation Service, Feature Store, Model Registry, and 200+ Model Garden entries give serious practitioners genuine advanced surface area to work with.

Workflow Integration8.3

Colab Enterprise notebooks natively connected to BigQuery, plus Vertex AI Pipelines, means the training-to-deployment loop has real structural support.

Pros

  • Model Garden with 200+ models including Claude and Llama 3.2 — real optionality without managing inference infra
  • Feature Store plus Model Registry plus Model Monitoring covers the MLOps stack in one platform
  • Spot VM support for custom training jobs meaningfully cuts GPU costs on non-critical runs
  • PyTorch, TensorFlow, scikit-learn, XGBoost all supported — no framework lock-in

Cons

  • Idle deployed endpoints charge continuously — $2.002/hour for object detection whether or not a prediction is made
  • H100 training carries a $1.469/hour management fee on top of $9.797/hour compute — costs escalate fast
  • No public changelog in evidence, making it harder to track breaking changes across releases
  • AutoML Tabular training at $21.252/node hour is steep for iterative experimentation budgets

Right for

ML engineering teams already on Google Cloud who need a full MLOps stack with serious generative AI model access.

Avoid if

Your team runs many long-lived staging endpoints or does heavy iterative AutoML experimentation without strict cost controls.

The Power User

The Power User

Daily human experience, onboarding, polish, learning curve, reliability
8.1/10

The serious ML platform that makes you earn it before it pays off

Vertex AI is a genuinely complete ML platform — Model Garden, Feature Store, pipelines, monitoring, 200+ foundation models. But it was built for teams who already know what they're doing, not people finding their footing.

The feature list here is real. Model Garden with 200-plus models including Claude, Llama, Gemini, Imagen — that's not marketing fluff. Vertex AI Pipelines, Feature Store, Model Registry, drift monitoring. These aren't half-baked. Compared to AWS SageMaker, Vertex actually feels more consolidated, less like seventeen services pretending to be one. And the BigQuery integration for ARIMA+ forecasting is genuinely useful if your data already lives in Google Cloud.

The pricing will catch you though. Deployed AutoML endpoints charge continuously whether or not a prediction fires. Forget to undeploy? The meter runs. H100 training clusters hit $101/hour for an 8-GPU config. Nobody's calling this cheap. These are enterprise numbers, and the docs make that clear if you read carefully — but new users won't read carefully on day one.

Mobile parity is basically nonexistent — this is a web-only platform and that's fine, nobody's training models on their phone. The real learning curve is month one versus month three. Month one is homework. Month three, if you've committed, it clicks. Not a tool you casually evaluate.

Daily Polish7.2

Vertex AI Studio and the notebook environments feel considered, but the pricing gotcha around always-on endpoints suggests someone wasn't sweating the daily-user experience enough.

Learning Curve6.8

Month one is steep; the platform consolidates what were previously separate Google Cloud ML services, and that history shows in the navigation and mental model required.

Mobile Parity3.5

Web-only platform with no stated mobile experience — acceptable for ML workflows but worth naming honestly.

Onboarding Experience6.5

Free trial exists but no flat free plan, and usage-based pricing with management fees layered on GPU costs makes the first hour feel like budgeting homework, not exploration.

Reliability Feel8.5

Google Cloud infrastructure backing a fully managed platform — category norm suggests high uptime, and the managed notebook and pipeline environments are designed to remove operational overhead.

Pros

  • 200+ foundation models in Model Garden including Gemini, Claude, and Llama 3.2 — genuinely broad
  • End-to-end MLOps coverage: Feature Store, Model Registry, drift monitoring, pipelines all in one place
  • Spot VM support for custom training cuts GPU costs significantly
  • BigQuery integration for forecasting workflows is real, not a checkbox

Cons

  • Deployed endpoints bill continuously even with zero predictions — easy to bleed money passively
  • H100 8-GPU clusters at $101/hour plus management fees — this is enterprise pricing, full stop
  • Onboarding assumes you already speak MLOps; first-hour experience is more reference manual than welcome mat
  • No changelog surfaced publicly, which makes it harder to know what just changed on you

Right for

ML engineering teams already in Google Cloud who need a single platform from data prep through production monitoring.

Avoid if

You're an individual or small team still figuring out your ML workflow — the pricing model will punish experimentation.

The Skeptic

The Skeptic

Contrarian. Watch-outs, deal-breakers, broken promises, category patterns
7.8/10

200+ models, Google's infrastructure, one billing surprise waiting for you

Vertex AI is a real platform — not vapor. The Model Garden breadth and GCP infrastructure integration are genuine differentiators. But the endpoint billing model will catch teams off guard.

Three tells from the pricing page alone. One: AutoML endpoints charge even when idle — you must undeploy to stop the clock. Two: ARIMA+ training runs $250/TB multiplied by candidate models AND backtesting windows — that math compounds fast. Three: no changelog listed, which on a platform this complex is a yellow flag for operational visibility.

The Model Garden at 200+ models — including Claude, Llama 3.2, and Gemini natively — is the clearest gap vs. SageMaker. AWS doesn't match that first-party/third-party breadth in one catalog. Feature Store, Model Registry, and Model Monitoring form a credible MLOps spine. This isn't a demo product.

Exit portability is the real concern. TensorFlow and PyTorch models are portable. But if your pipelines, Feature Store, and Agent Builder all run on Vertex primitives, migration gets expensive fast. Google has sunsetted cloud products before. Not a reason to avoid — reason to architect defensively.

Competitive Differentiation8.2

Model Garden's 200+ models including Claude and Llama 3.2 alongside native Gemini is a real moat SageMaker and Azure ML don't replicate at this breadth.

Exit Portability5.5

Open frameworks (PyTorch, TensorFlow) are portable, but Vertex Pipelines, Feature Store, and Agent Builder create deep GCP lock-in that's costly to unpick.

Long-term Viability8.0

Google Cloud is a $33B+ revenue business and Vertex is clearly a strategic bet — no funding risk, but Google's history of sunsetting products keeps this from scoring higher.

Marketing Honesty7.5

Enterprise-ready claims are substantiated by actual MLOps tooling depth, but idle endpoint charges are buried — that's a real cost behavior the landing page doesn't surface.

Track Record Match8.5

Unified ML platform consolidating scattered services matches the pattern of durable category tools like SageMaker, not the pattern of things that got shut down.

Pros

  • 200+ models in Model Garden including Claude, Llama 3.2, and native Gemini — no competitor matches this catalog natively
  • Feature Store + Model Registry + Model Monitoring is a credible, full MLOps stack, not a checklist
  • Spot VM support for custom training meaningfully reduces GPU cost (H100 drops from $9.797/hour at full rate)
  • BigQuery-native notebooks reduce the data-to-model pipeline friction that kills MLOps velocity

Cons

  • Idle endpoint billing is a budget trap — deployed AutoML models charge continuously regardless of prediction volume
  • ARIMA+ training cost multiplies by candidate models × backtesting windows — easy to generate a $10K+ surprise
  • No changelog visible, which is a transparency gap for a platform with this operational surface area
  • Deep Vertex-native feature usage creates GCP lock-in that's genuinely hard to reverse

Right for

ML engineering teams already on GCP who need a full MLOps stack with broad foundation model access.

Avoid if

You're cost-sensitive, cloud-agnostic, or need predictable monthly spend — the usage-based model bites hard at scale.

Buyer Questions

Common questions answered by our AI research team

Pricing

Does Vertex AI charge for a deployed AutoML model even if no predictions are made?

Yes. According to the pricing page, you pay for each model deployed to an endpoint, even if no prediction is made. Charges continue to accrue as long as the model remains deployed.

Features

What foundation models are available in Vertex AI, and does it include access to Gemini 3 Pro?

Vertex AI provides access to 200+ foundation models, and the homepage explicitly mentions Gemini 3 Pro (referred to as 'Nano Banana Pro (Gemini 3 Pro Image)'), which is available via the Gemini API and can be tried in Vertex AI.

Pricing

Can I use Spot VMs for custom training jobs in Vertex AI to reduce costs, and how are they billed?

Yes, you can use Spot VMs with Vertex AI custom training. They are billed according to Compute Engine Spot VMs pricing, with additional Vertex AI custom training management fees on top of infrastructure usage costs.

Setup

How do I stop incurring charges for a deployed AutoML model endpoint when it's not in use?

To stop incurring charges for a deployed AutoML model endpoint, you must undeploy the model. The pricing page explicitly states: 'You must undeploy your model to stop incurring further charges.'

Integration

Does Vertex AI integrate with BigQuery for AutoML forecasting workflows like ARIMA+?

Yes. The ARIMA+ pricing section references the BigQuery ML pricing page for additional details, and each ARIMA+ training and prediction job also incurs the cost of one managed pipeline run as described in Vertex AI pricing.

Product Information

  • Founded

    2008
  • Pricing

    Usage-based
  • Free Trial

    Available

Platforms

web

About Google Cloud

Enterprise ready, fully-managed, unified AI development platform. Access and utilize Vertex AI Studio, Agent Builder, and 200+ foundation models.

Resources

Documentation
API
Blog

Also in Machine Learning Platforms