Full-Time Reliability Engineer jobs in Santa Clara, CA

Now available 3 results are consistent

Sort by:relevance - date

SITE RELIABILITY ENGINEER, AI & HPC INFRASTRUCTURE

Continued development/automation of deployment, monitoring, self-healing and alerting processes is imperative to the success of our engineering groups. This includes managing/operating our HPC clusters, monitoring compute/GPU/netw...

CompanyTesla
AddressPalo Alto, CA
CategoryEngineering/Architecture/scientific
Job typeFull-time
Date Posted 3 days ago See detail

Site Reliability Engineer, AI & HPC Infrastructure

Tesla

Palo Alto, CA

Continued development/automation of deployment, monitoring, self-healing and alerting processes is imperative to the success of our engineering groups. This includes managing/operating our HPC clusters, monitoring compute/GPU/netw...

Senior Staff Site Reliability Engineer

Nvidia

Santa Clara, CA

$164,000 - $310,500 a year

Develop and implement automation frameworks to enhance efficiency for existing and future applications. Collaborate with stakeholders, vendors, architects, and business teams to ensure optimal operation and reliability of applicat...

Internship, Reliability Engineer, Cell Engineering (Fall 2024)

Tesla

Palo Alto, CA

$20 - $50 an hour

Leverage the vast amounts of data generated throughout the cell qualification process to help Tesla design and produce safer, lower-cost, and high-performance cells. Analyze and visualize data to identify trends and risks in cell...