Job description
As a member of this team, the successful candidate will:
- Build features for our on-device inference stack to support the most relevant accuracy-preserving, general-purpose techniques that empower model developers to compress and accelerate SoTA models (e.g., LLMs) in apps
- Convert models from a high-level ML framework to a target device (CPU, GPU, Neural Engine) for optimal functional accuracy and performance
- Write unit and system integration tests to ensure functional correctness and avoid performance regressions
- Diagnose performance bottlenecks and work with HW Arch teams to co-design solutions that further improve the latency, power, and memory footprint of neural network workloads
- Analyze the impact of model optimizations (compression, quantization, etc.) on model quality by partnering with modeling and adaptation teams across diverse product use cases.