Senior High Performance Computing Engineer

Texas A&M University · TX

Senior · 130,000 - 140,000 USD/year · English
High Performance Computing (HPC), Linux system administration, Computer networking, Kubernetes, Run:ai, Slurm, NVIDIA DGX, Virtualization technologies, Infrastructure as a Service (IaaS), DDN storage, Network-attached storage, Scalable supercomputing architectures, Interconnects, Storage systems, Python, Bash, Perl, MPI, OpenMP, CUDA, Ansible, Puppet

Job description

About the role

We are investing $45 million in an NVIDIA DGX SuperPOD to support cutting‑edge AI research across the Texas A&M System. As a Senior High Performance Computing Engineer you will provide technical expertise for designing, deploying and operating large‑scale HPC systems.

Key responsibilities

  • Manage large‑scale HPC cluster operations, including OS upgrades, firmware patching and performance tuning.
  • Oversee networking, security and infrastructure for HPC environments.
  • Lead development of specialized HPC computing clouds and scalable storage solutions.
  • Collaborate with stakeholders to design service‑based solutions and act as a strategic technical resource.
  • Lead enterprise‑wide HPC projects following established project‑management protocols.
  • Mentor junior system administrators and enforce performance standards.

Required profile

  • Bachelor’s degree in a relevant field or equivalent experience.
  • 12+ years of experience in high‑performance computing or related domains.
  • U.S. citizenship, permanent residency or approved asylum/refugee status.

Required skills

  • High‑Performance Computing (HPC) environments.
  • Advanced Linux system administration.
  • Computer networking concepts and protocols.
  • Container orchestration (Kubernetes) and Run:ai for AI workload management.
  • Slurm workload manager.
  • Experience with NVIDIA DGX systems.
  • Virtualization technologies and IaaS platforms.
  • DDN and network‑attached storage solutions.
  • Scalable supercomputing architectures, interconnects and storage systems.
  • Scripting languages: Python, Bash, Perl.
  • Scientific computing frameworks: MPI, OpenMP, CUDA.
  • Configuration management tools: Ansible, Puppet.

What we offer

  • Competitive salary of $130,000 to $140,000 annually.
  • Opportunity to work on a state‑of‑the‑art AI supercomputing platform.
  • Collaborative environment within Texas A&M University’s Technology Services.

Source: ats:workday

Apply directly at tamus.wd1.myworkdayjobs.com