d-Matrix is looking for an AI System Solutions Architect to develop world-class products around d-Matrix inference accelerators. In this role you will engage with key customers, internal architects, and other key internal and external stakeholders to drive overall system solutions. This requires technical analysis, defining outside-in usage cases, and applying a broad spectrum of technologies to drive an AI server system solution spanning silicon, platform HW/SW, and usages to deliver the best customer experience with d-Matrix inference accelerators.
Responsibilities
- Design, develop, and deploy scalable GenAI inference solutions with d-Matrix accelerators
- Work closely with team members across architecture, engineering, product management, and business development to optimize d-Matrix system solutions for the best performance and power balance, feature set, and overall system cost.
- Work closely with datacenter, OEM, and ODM customers during the early product concept and planning phases to enable system design with partners and the industry ecosystem.
- Influence and shape future generations of products and solutions by contributing to system architecture and technology through early engagement cycles with customers and industry partners.
- Stay abreast of the latest advancements in GenAI hardware and software technologies and assess their suitability for integration into d-Matrix GenAI inference solutions.
- Establish credibility with both engineering and leadership counterparts at top technology companies, communicate technical results and positions clearly and accurately, and drive alignment on solutions.
Experience
- 5+ years of AI server system experience across multiple projects, from architecture through development and design (including memory, I/O, power delivery, power management, boot process, FW, and BMC/hardware management) to bring-up, validation, and support through release to production.
- 5+ years of experience in a customer-facing role interfacing with OEMs, ODMs and CSPs.
- Detailed understanding of industry-standard server buses and other high-speed I/O protocols, such as DDR, PCIe, and CXL, is required.
- Ability to work seamlessly across engineering disciplines and geographies to deliver excellent results.
- Deep understanding of datacenter AI infrastructure requirements and challenges
Preferred Experience
- Hands-on understanding of AI/ML infrastructure and hardware accelerators
- Experience with leading AI/ML frameworks such as PyTorch, TensorFlow, ONNX, etc. and container orchestration platforms such as Kubernetes
- Outstanding communication and presentation skills
Education
- Engineering degree in Electrical Engineering, Computer Engineering, or Computer Science, with 15+ years of industry experience.