Google Cloud
AI Infrastructure: Deployment Types

Enjoy unlimited growth with a year of Coursera Plus for $199 (regularly $399). Save now.

Google Cloud

AI Infrastructure: Deployment Types

Included with Coursera Plus

Gain insight into a topic and learn the fundamentals.
Intermediate level
Some related experience required
5 hours to complete
Flexible schedule
Learn at your own pace
Gain insight into a topic and learn the fundamentals.
Intermediate level
Some related experience required
5 hours to complete
Flexible schedule
Learn at your own pace

What you'll learn

  • Describe the process of creating a GPU-accelerated cluster.

  • Identify how to provision a GPU-accelerated cluster on GCE.

  • Identify how to provision a GPU-accelerated cluster on GKE.

  • Identify how to deploy AI inference workloads on GKE.

Details to know

Shareable certificate

Add to your LinkedIn profile

Recently updated!

December 2025

Assessments

4 assignments

Taught in English

See how employees at top companies are mastering in-demand skills

 logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

There are 6 modules in this course

This module offers an overview of the course and outlines the learning objectives.

What's included

1 plugin

This module details the AI Hypercomputer cluster creation process. It covers the key decisions required, including choosing a machine type, consumption option, deployment option, orchestrator, and cluster image.

What's included

1 assignment6 plugins

This module identifies key configuration options and optimization techniques for deploying an AI Hypercomputer cluster on Google Compute Engine (GCE). It covers selecting machine types, accelerator OS images, deployment options, and strategies for optimizing network performance.

What's included

1 assignment4 plugins

This module identifies configuration options for deploying an AI Hypercomputer cluster on Google Kubernetes Engine (GKE). It covers containerization, GKE modes of operation, networking configurations, and workload optimization techniques like distributed training and GPU sharing.

What's included

1 assignment4 plugins

This module examines optimization techniques for architecting an inference workload on GKE. It covers the GKE inference workflow, key infrastructure and model-level optimizations.

What's included

1 assignment4 plugins

Student PDF links to all modules

What's included

1 reading

Instructor

Google Cloud Training
Google Cloud
2,020 Courses3,779,342 learners

Offered by

Google Cloud

Why people choose Coursera for their career

Felipe M.
Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
Jennifer J.
Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
Larry W.
Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
Chaitanya A.
"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."
Coursera Plus

Open new doors with Coursera Plus

Unlimited access to 10,000+ world-class courses, hands-on projects, and job-ready certificate programs - all included in your subscription

Advance your career with an online degree

Earn a degree from world-class universities - 100% online

Join over 3,400 global companies that choose Coursera for Business

Upskill your employees to excel in the digital economy

Frequently asked questions