Jobiglo

No results.

DevOps Engineer – Cloud AI (Remote, W2 Contract)

Bayside Solutions · Cupertino

New Remote
Contract Remote 🇬🇧 English
Kubernetes Containerized infrastructure Distributed systems Grafana Prometheus Splunk Python Golang CI/CD pipelines Deployment automation Ray clusters Networking troubleshooting Infrastructure as Code Automation frameworks

Job description

About the role

We are seeking a motivated DevOps / Site Reliability Engineer to design, build, and operate large‑scale Kubernetes‑based platforms that support critical engineering services. This W2 contract position is remote, with the hiring company based in Cupertino, CA.

Key responsibilities

  • Design, build, automate, and support scalable Kubernetes platforms (EKS, GKE, AKS) and related services.
  • Operate and troubleshoot production environments running at scale.
  • Develop automation and tooling to improve operational efficiency and reliability.
  • Monitor platform health, performance, and availability using observability tools such as Grafana and Prometheus.
  • Troubleshoot infrastructure, application, and networking issues across distributed systems.
  • Collaborate with engineering teams to enhance deployment, reliability, and scalability practices.
  • Participate in incident response, root‑cause analysis, and operational support.
  • Improve CI/CD workflows and deployment automation.
  • Drive operational excellence through documentation, automation, and process improvements.

Required profile

  • Strong hands‑on experience with Kubernetes platforms (EKS, GKE, AKS) at scale.
  • Deep understanding of containerized infrastructure and distributed systems.
  • Experience with monitoring and observability tools, preferably Grafana and Prometheus.
  • Proficiency in CI/CD pipelines and deployment automation.
  • Experience with Splunk for logging, analysis, and troubleshooting.
  • Solid scripting/automation skills using Python and/or Golang.
  • Ability to troubleshoot production systems under pressure.
  • Excellent communication and collaboration skills.

Required skills

  • Kubernetes
  • Amazon EKS
  • Google GKE
  • Azure AKS
  • Containerized infrastructure
  • Distributed systems
  • Grafana
  • Prometheus
  • Splunk
  • Python
  • Golang
  • CI/CD pipelines
  • Deployment automation
  • Ray clusters/services
  • Networking troubleshooting
  • Infrastructure as Code
  • Automation frameworks

Questions fréquentes

Le salaire n'est pas communiqué publiquement par le recruteur. Vous pouvez postuler et négocier directement avec Bayside Solutions.
Cliquez sur "Postuler maintenant" en haut de la page. Vous pouvez importer votre CV en 1 clic — Jobiglo extrait automatiquement vos informations et postule pour vous.
Le contrat proposé est un Contract basé à Cupertino.

Why are you reporting this job?

Thank you for your report. We will review this job.

Apply in 30 seconds

Enter your email to apply. An account will be created automatically.

By continuing, you accept our terms of use.

Already have an account? Login

Published 1 week ago

Expires 1 month from now

18 views · 0 applications

Boost your chances

Upload your CV — we will match you with relevant openings.

Analyzing your CV...

Bayside Solutions

Cupertino