Company

Software And Services

Address: Cupertino, CA
Category: Information Technology

Job description

We are looking for an SRE with experience building and supporting machine learning (ML) infrastructure. You will apply SRE best practices to ensure the availability, reliability, and performance of our ML systems and services. You will engage regularly with our development partners and product teams so that the ML services are well aligned with business needs. If you love designing and running systems and infrastructure that will delight millions of customers, this team is for you!

Responsibilities will include:

  • Support and maintain ML services by measuring and monitoring availability, latency, and overall system health
  • Deploy and support existing and new ML models and infrastructure
  • Provide insights to partner stakeholders through log and telemetry analysis
  • Maintain documentation and automate manual processes where possible
  • Be part of an on-call rotation providing hands-on technical expertise during service-impacting events
  • Collaborate with other engineers on code, infrastructure, and design reviews, and on process enhancements

Requirements

  • Experience with large scale distributed systems, especially ML infrastructure and services including LLMs, Generative AI, and transformers
  • Knowledge of core operating system principles, networking fundamentals, and systems management
  • Demonstrable fluency in at least one of Java, Python, Swift, Rust or GoLang
  • Awareness of key security principles, including encryption and keys (types and exchange protocols)
  • Understanding of SRE principles including monitoring, alerting, error budgets, fault analysis, and automation
  • Strong sense of ownership, with a desire to communicate and collaborate with other engineers and teams
  • Ability to succinctly identify and communicate technical and architectural problems, while working with partners and their teams to iteratively find solutions
Refer code: 9397197. Software And Services - The previous day - 2024-06-23 18:50

