Site Reliability / Platform Engineering

Keep production running at scale

Covers SLOs/SLIs, on-call, observability, incident response, capacity planning, and the platform tooling that makes other engineers productive. This is the senior progression from cloud engineering, with high pay and high impact.

Prometheus, Grafana, PagerDuty, OpenTelemetry, Kubernetes, Terraform, Python, Linux

Salary range

$105k – $180k

entry → mid US

Time to complete

10 wk

20 wk part-time

Lessons

12

4 phases

Capstone

Yes

real cloud account

Salary range is sourced from Levels.fyi 2026 SRE postings and tech-hub-adjusted Glassdoor data. Your specific outcome depends on location, experience, interview performance, and market conditions.

Roles this path prepares you for

  • Site Reliability Engineer (SRE)
  • Platform Engineer
  • Senior DevOps Engineer
  • Production Engineer

Curriculum

1

SRE Foundations

Free preview

What SRE is, SLOs/SLIs/error budgets, blameless postmortems.

3 lessons

2

Observability + On-Call

Locked

Prometheus + Grafana, OpenTelemetry tracing, alerting + on-call, distributed tracing.

5 lessons

3

Platform Engineering

Locked

Runbooks + toil, chaos engineering, capacity + cost, platform engineering practice.

3 lessons

4

Career Launch (SRE)

Locked

SRE portfolio, system-design + on-call interviews, resume tuning.

1 lesson

Capstone project + completion certificate

Run an SLO program for a multi-service stack

Deploy a 3-service stack, define SLOs/SLIs, build a dashboard, write runbooks, simulate an incident and run the response.
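
To give a feel for the arithmetic behind an SLO and its error budget, here is a minimal Python sketch. It assumes a request-based availability SLI (good requests divided by total requests). The function names, the 99.9% target, and the request counts are illustrative assumptions, not taken from the course material.

```python
def availability_sli(total_requests: int, failed_requests: int) -> float:
    """Fraction of requests that succeeded over the measurement window."""
    if total_requests <= 0:
        raise ValueError("total_requests must be positive")
    return 1.0 - failed_requests / total_requests


def error_budget_remaining(slo_target: float, total_requests: int,
                           failed_requests: int) -> float:
    """Fraction of the error budget still unspent.

    The budget is the number of failures the SLO tolerates:
    (1 - slo_target) * total_requests. A negative result means the
    budget is overspent.
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    if total_requests <= 0:
        raise ValueError("total_requests must be positive")
    allowed_failures = (1.0 - slo_target) * total_requests
    return 1.0 - failed_requests / allowed_failures


if __name__ == "__main__":
    # Hypothetical window: 1,000,000 requests, 250 failures, 99.9% SLO.
    sli = availability_sli(1_000_000, 250)
    remaining = error_budget_remaining(0.999, 1_000_000, 250)
    print(f"SLI: {sli:.5f}, error budget remaining: {remaining:.0%}")
```

In this hypothetical window the SLO allows roughly 1,000 failed requests, so 250 failures spend about a quarter of the budget. An error budget policy would typically state what happens as that remaining fraction approaches zero.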

Deliverables

  • SLO dashboard in Grafana with 3+ services
  • Runbooks for at least 2 alert types
  • Incident timeline document from a simulated outage
  • Error budget policy document

Completion certificate. Finish every lesson in this path and we auto-issue a CloudPath Site Reliability / Platform Engineering certificate to your account, with a shareable verification URL. Show it on LinkedIn, your portfolio, or in a recruiter conversation.

Before you start

  • Cloud Foundations + DevOps Engineering (or equivalent)
  • Comfortable with Kubernetes

What you walk out with

  • Run an SLO program
  • Lead production incidents
  • Build internal developer platforms
  • Interview for SRE / Platform Engineer roles

Preview — opening soon

Curriculum is built, content is being polished. Sign up now and you'll get notified the moment this path opens to enrollment.
