Engineer Manager Reliability jobs in Cupertino, CA

Now available 26 results are consistent

Sort by:relevance - date

SENIOR STAFF SITE RELIABILITY ENGINEER

Develop and implement automation frameworks to enhance efficiency for existing and future applications. Collaborate with stakeholders, vendors, architects, and business teams to ensure optimal operation and reliability of applicat...

CompanyNvidia
AddressSanta Clara, CA
CategoryEngineering/Architecture/scientific
Salary$164,000 - $310,500 a year
Job typeFull-time
Date Posted yesterday See detail

Senior Staff Site Reliability Engineer

Nvidia

Santa Clara, CA

$164,000 - $310,500 a year

Develop and implement automation frameworks to enhance efficiency for existing and future applications. Collaborate with stakeholders, vendors, architects, and business teams to ensure optimal operation and reliability of applicat...

Cloud DevOps / Site Reliability Engineer, Applied Machine Learning

Software And Services

Sunnyvale, CA

Come join Apple's Applied Machine Learning Team, as a Senior Cloud DevOps/ Site Reliability Engineer, to help build & support innovative software applications. The ideal candidates should have a strong background in setting up and...

Site Reliability Engineer - Redis

Software And Services

Cupertino, CA

The ASE Redis SRE team develops applications and tooling that are safe, reliable, scalable, and fast. Success in this role requires expertise in several of the following:- Understanding of core SRE concepts - Monitoring, Alerting,...

Site Reliability Engineer - Solr

Software And Services

Cupertino, CA

The ASE Redis SRE team develops applications and tooling that are safe, reliable, scalable, and fast. Success in this role requires expertise in several of the following:- Understanding of core SRE concepts - Monitoring, Alerting,...

Hardware Reliability Lab Manager

Hardware

Cupertino, CA

- Manage engineering lab day-to-day operations working with external vendors as necessary and ensuring lab security, and safety protocols are upheld.- You will be analyzing and effectively communicating lab related issues and make...

Senior Site Reliability Engineer (SRE) - ASE / iCloud

Software And Services

Cupertino, CA

Operating at our scale, across multiple geographically dispersed data centers and servicing hundreds of millions of users presents unique challenges. SREs @ Apple own the full infrastructure stack; from device driver performance d...

DevOps & Site Reliability Engineer (SRE)

Hardware

Cupertino, CA

This is your opportunity to join a small and nimble team that has proven itself by delivering high-quality products in a timely manner. You excel working in a highly visible and collaborative team. You are a hands-on, proactive, s...

AIML - Sr Engineering Manager, Siri Performance and Reliability

Apple

Cupertino, CA

SummaryKey QualificationsExperience with performance engineering, analysis, and on-device optimization10+ years of professional software engineering experience5+ years managing engineering teams that design complex software engine...

AIML - Infrastructure Services - Site Reliability Engineer, Machine Learning Platform and Infrastructure

Machine Learning And Ai

Cupertino, CA

These services are key to the development and production process of the AIML team. A successful candidate will likely have experience in being a Systems Administrator that has moved on to development and automation in their career...

Sr Site Reliability Engineer - Cross Functional

Software And Services

Cupertino, CA

It requires an engineer with a broad set of technical skills and expertise in systems, organizations, and communication. They need to seamlessly move between difficult engineering problems, project deadlines, organizational commun...

Site Reliability Engineer (SRE) - ASE / iCloud

Software And Services

Cupertino, CA

Operating at our scale, across multiple geographically dispersed data centers and servicing hundreds of millions of users presents unique challenges. SREs @ Apple own the full infrastructure stack; from device driver performance d...

Reliability Engineering Manager (Mac SoC Package Integration)

Hardware

Cupertino, CA

- Guide the development of new reliability tests procedures and specifications- Statistical data analysis to provide risk assessments of a design- Researching a technology to determine possible failure modes- Providing design reco...

AIML - Sr Engineering Manager, Siri Performance and Reliability

Software And Services

Cupertino, CA

The Siri team is looking for an exceptional Engineering Manager to lead our performance engineering efforts in the Siri platform. As Apple continues to ship more devices, providing a consistent, responsive, and high-performance ex...

Service Reliability Engineer (SRE), Data Infrastructure

Apple

Cupertino, CA

Summary Key QualificationsAt least 5 years in a Service Reliability Engineering (SRE), DevOps or infrastructure focused role5+ years of running services in a large scale *nix environmentUnderstanding of SRE principles and goals al...

Internship, Reliability Engineer, Cell Engineering (Fall 2024)

Tesla

Palo Alto, CA

$20 - $50 an hour

Leverage the vast amounts of data generated throughout the cell qualification process to help Tesla design and produce safer, lower-cost, and high-performance cells. Analyze and visualize data to identify trends and risks in cell...

Principal Reliability Engineer

Johnson & Johnson

Santa Clara, CA

Lead implementation of the Design for Reliability principals for electrical and electronic components of a complex medical robotics system. Develop reliability strategy, plan and coordinate activities required to develop and deli...

ML Platform Reliability Engineer - L5

Netflix

Los Gatos, CA

$126K - $160K a year

Rapidly onboard and take ownership of ML Platform repositories, build processes and delivery integrations with critical partners who ship member-scale Netflix personalization systems daily. Identify, plan, design and execute impor...

Hardware Reliability Engineer, Google Cloud

Google

Sunnyvale, CA

Bachelor's degree in Electrical, Industrial, or Mechanical Engineering, a related field, or equivalent practical experience.. 10 years of experience as a Manufacturing, Quality, Reliability, or Product Engineer.. 7 years of experi...

Reliability Engineer / Failure Analysis

Comtech Telecommunications

Santa Clara, CA

Analyses performed at all levels of product to include: concept, design, fabrication, test, installation, operation, maintenance and disposal. Ensure the logical and systematic conversion of customer or product requirements into p...

Senior Site Reliability Engineer

Hireio, Inc.

San Jose, CA

Position Description. Location: Usa/Usa/California/Sf Bay Area, Seattle. Base Salary: 187K - 280K. Sponsor Visa? Yes. Language Requirements: English, Mandarin (Preferred)....

Site Reliability Engineer - Onboard Software

Wayve

Mountain View, CA

Elevate Operational Excellence: Guarantee the seamless operation of our autonomous vehicles on public roads, enhancing our ability to transform urban mobility. Innovate and Automate: Drive the development of cutting-edge tools and...

Principal Site Reliability Engineer - Product Reliability

Zscaler

San Jose, CA

With a complete intolerance to manual work you will drive Toil reduction through automation and tooling. Be a forcing factor within the organization as we transform to a world leading SRE organization. Take the lead in driving Inf...

Senior Staff Site Reliability Engineer (SRE)

Cribl

San Jose, CA

Engage with teams and improve service delivery and reliability across their entire lifecycle. Measure and monitor all production systems with an eye towards availability, latency and overall system health. Seek out the cause of er...

Site Reliability Engineer, Engineering Technology & Operations

Tesla

Palo Alto, CA

The Design Technology team develops and manages software platforms and tooling for product design (energy, solar, passenger vehicles and commercial vehicles). To support Teslas rapid pace of innovation, the team is looking to add...

Senior Site Reliability Engineer (SRE), Multimodal

Character.ai

Menlo Park, CA

Maintain production multimodal services operational. Instrument, monitor and optimize the performance and reliability of our service. Implement and maintain automation tools and processes to prevent and mitigate service disruption...

Reliability Excellence Manager (Engineering & Maintenance)

Blommer Chocolate Company Of California

Union City, CA

$165,000 - $180,000 a year

Reports to: General Manager. Location: 1515 Pacific St, Union City, CA. Manage/lead manufacturing engineering/maintenance-related projects from concept to completion; develop project specifications, quote(s) with external and inte...