Company

Machine Learning And Ai

Address: Cupertino, CA
Category: Information Technology

Job description

As a member of this team, the successful candidate will:

  • Build features for our on-device inference stack to support the most relevant accuracy-preserving, general-purpose techniques that empower model developers to compress and accelerate SoTA models (e.g., LLMs) in apps
  • Convert models from a high-level ML framework to a target device (CPU, GPU, Neural Engine) for optimal functional accuracy and performance
  • Write unit and system integration tests to ensure functional correctness and avoid performance regressions
  • Diagnose performance bottlenecks and work with HW Arch teams to co-design solutions that further improve latency, power, and memory footprint of neural network workloads
  • Analyze impact of model optimization (compression, quantization, etc.) on model quality by partnering with modeling and adaptation teams across diverse product use cases

Requirements

  • 10-15+ years of proven programming experience with standard ML languages and tools such as C/C++, Python, PyTorch, TensorFlow, and CUDA/Metal.
  • Solid understanding of state-of-the-art DNN optimization techniques and how they translate to hardware acceleration architectures, and a general ability to reason about system performance (compute/memory) tradeoffs
  • Hands-on experience working (training, fine-tuning, optimizing, deploying) with large models (e.g. LLMs).
  • Hands-on experience applying common machine learning optimization techniques, such as quantization and sparsity induction, to reduce resource consumption and/or latency
  • Experience building APIs and/or core components of ML frameworks
  • Capacity to iterate on ideas and work with a variety of partners from all parts of the stack, from Apps to Compilation, HW Arch, and Power/Performance analysis
  • Proven track record of analyzing sophisticated and ambiguous problems
  • Disciplined programming abilities with a strong attention to detail
  • Strong applied experience with compiler technology to work with CPU, GPU, and ML accelerators
  • Excellent problem-solving (e.g. via building forward-looking prototype systems), critical thinking, strong communication, and collaboration skills
Reference code: 8134424. Machine Learning And Ai - The previous day - 2024-02-06 16:57

