DevOps Engineer – Cloud AI (Remote, W2 Contract)
Bayside Solutions · Cupertino
Job description
About the role
We are seeking a motivated DevOps / Site Reliability Engineer to design, build, and operate large‑scale Kubernetes‑based platforms that support critical engineering services. This W2 contract position is remote, with the hiring company based in Cupertino, CA.
Key responsibilities
- Design, build, automate, and support scalable Kubernetes platforms (EKS, GKE, AKS) and related services.
- Operate and troubleshoot production environments running at scale.
- Develop automation and tooling to improve operational efficiency and reliability.
- Monitor platform health, performance, and availability using observability tools such as Grafana and Prometheus.
- Troubleshoot infrastructure, application, and networking issues across distributed systems.
- Collaborate with engineering teams to enhance deployment, reliability, and scalability practices.
- Participate in incident response, root‑cause analysis, and operational support.
- Improve CI/CD workflows and deployment automation.
- Drive operational excellence through documentation, automation, and process improvements.
Required profile
- Strong hands‑on experience with Kubernetes platforms (EKS, GKE, AKS) at scale.
- Deep understanding of containerized infrastructure and distributed systems.
- Experience with monitoring and observability tools, preferably Grafana and Prometheus.
- Proficiency in CI/CD pipelines and deployment automation.
- Experience with Splunk for logging, analysis, and troubleshooting.
- Solid scripting/automation skills using Python and/or Golang.
- Ability to troubleshoot production systems under pressure.
- Excellent communication and collaboration skills.
Required skills
- Kubernetes
- Amazon EKS
- Google GKE
- Azure AKS
- Containerized infrastructure
- Distributed systems
- Grafana
- Prometheus
- Splunk
- Python
- Golang
- CI/CD pipelines
- Deployment automation
- Ray clusters/services
- Networking troubleshooting
- Infrastructure as Code
- Automation frameworks
Published 1 week ago
Expires 1 month from now