Company

MetaSee more

addressAddressEagle Mountain, UT
type Form of workOther
CategoryInformation Technology

Job description

Meta is seeking a forward thinking, experienced Accelerator (including GPU) Product Platform Engineering Manager to join the Data Center Site Operations team. The Product Platform Engineering (PPE) team is responsible for the overall performance of Meta's production compute, storage, and accelerator platforms through their life-cycles in our data centers. This role will lead the subset of the PPE team that focuses on accelerator platform hardware.

Accelerators are an important priority for Meta that involves complex systems operating in shared computing clusters. The role scope is focused on maintaining and improving the health of the accelerator platforms from operational testing into mass production through end-of-life. Key responsibilities include identifying systemic hardware, firmware, and tooling issues; engaging in hands-on problem solving; and collaborating effectively with cross-functional engineering and tooling teams to improve performance of the fleet.

Our data centers, and the tens of thousands of servers installed in them, are the foundation upon which our rapidly scaling infrastructure efficiently operates and upon which our innovative services are delivered. Meta is at the leading edge of the global data center industry both in terms of how data centers are designed and operated. This person should enjoy working in a fast-paced environment where adaptability and flexibility will be key to their success.

We seek an individual who can quickly absorb and understand the technical challenges of subject matter experts and local site operations teams, create alignment between these globally distributed teams as well as partner organizations, and can set informed priorities and direction while getting buy-in and commitment from relevant stakeholders. SiteOps Global Product Platform Engineering Manager Responsibilities Manage other PPE team members through efforts that provide end-to-end lifecycle ownership (operational test through end of life decommissioning) of accelerator (including GPU) hardware platforms and associated new technologies in the data centers Serve as the central point of contact representing the accelerator hardware platforms and associated new technologies across SiteOps, and be the subject matter experts on hardware platform issues, for datacenter operations teams Drive complex accelerator technical investigations globally and spanning multiple disciplines such as Hardware, Software/Firmware, Networking and Power & Cooling Work closely with other PPE team members to share best practices and ensure appropriate feedback is given to cross-functional teams. Issue timely alerts and support fixes to operations teams, and assure a robust feedback pipeline to engineering teams Provide serviceability feedback on accelerator production hardware to engineering design teams Provide technical mentorship on large scale data center projects and initiatives to global, cross-functional teams Build strong relationships and collaboration with engineering and cross functional teams across the company.

Actively solicit feedback from teams, and use that feedback to improve operational effectiveness as infrastructure scales Own the cross-functional communication with other technical operations groups to help resolve incidents Collaborate with stakeholders, functional owners and subject matter experts to interpret and articulate business and operations needs Ability to travel up to 30% required Minimum Qualifications BS or BA in technical field (electrical, computer science, or mechanical engineering) or commensurate experience 10+ years experience in NPI (New Product Introduction) hardware development and/or validation, working with cross functional teams to deliver products to production. Experience working across a diverse global organization and building partnerships with cross functional teams inside and outside of the organization Experience triaging and debugging hardware platforms Experience in processing and analyzing large sets of data Proven knowledge of server and storage platforms, principles, technologies, protocols, and standards Experience with GPU and accelerator based platform hardware that operates in computing clusters. Experience managing multiple concurrent projects and managing tight timelines Experience working independently within a multi-disciplinary team of hardware and operations engineers Experience working with Linux or Unix Operating systems Proven technical drafting skills, experience to create documentation for users of all levels Experience mentoring others and leading technical teams Preferred Qualifications BS or BA in technical field (electrical, computer science, or mechanical engineering) Direct experience managing others Large-scale data center environment experience, including hardware deployments, deep system knowledge of Linux, Server Hardware, networking, network protocols, supply chain and Data Center automation Bash, PHP, Python, or Perl scripting experience Experience in data center system and process automation Leadership presence and presentation skills Locations About Meta Meta builds technologies that help people connect, find communities, and grow businesses.

When Facebook launched in 2004, it changed the way people connect. Apps like Messenger, Instagram and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to help build the next evolution in social technology.

People who choose to build their careers by building with us at Meta help shape a future that will take us beyond what digital connection makes possible today-beyond the constraints of screens, the limits of distance, and even the rules of physics. Meta is committed to providing reasonable support (called accommodations) in our recruiting processes for candidates with disabilities, long term conditions, mental health conditions or sincerely held religious beliefs, or who are neurodivergent or require pregnancy-related support. If you need support, please reach out to accommodations-ext@fb.com.

$163,000/year to $225,000/year + bonus + equity + benefits Individual pay is determined by skills, qualifications, experience, and location. Compensation details listed in this posting reflect the base salary only, and do not include bonus, equity or sales incentives, if applicable. In addition to base salary, Meta offers benefits.

Learn more about benefits at Meta.

Refer code: 9020298. Meta - The previous day - 2024-04-14 10:05

Meta

Eagle Mountain, UT
Jobs feed

Senior Director -Biostatistics

Alector

United States

Senior Compensation Analyst

Renaissance

United States

Virtual Customer Service Advisor

Medical Center Hospital

Odessa, TX

Executive Director

Impact Stream

United States

Nurse Practitioner - Bilingual

Lane County, Or

Eugene, OR

Staff Client Platform Engineer

Airbnb

United States

Family Nurse Practitioner / FNP

Np Now

Medford, OR

Sheetmetal Mechanic

Tradesmen International

VISTA, CA

Principal Product Manager - AI & ML

G-P

United States

Share jobs with friends

Related jobs

Siteops Global Product Platform Engineering Manager

Head of Global Equities and FX Trading Production Services

Royal Bank Of Canada

New York, NY

24 hours ago - seen

SiteOps Global Product Platform Engineering Manager

Facebook App

Prineville, OR

yesterday - seen

Global Banking & Markets - FICC Product Engineering - Backend Support Engineer

Jobs | Honorvet Technologies

Dallas, TX

2 days ago - seen

MANAGER, GLOBAL FUNCTION COMMISSIONING & STARTUP

Air Products And Chemicals, Inc.

Houston, TX

4 days ago - seen

Global Product Manager - Optical Connectivity

Te Connectivity

Middletown, PA

4 days ago - seen

Global Markets Risk Manager (VP) - Mortgages and Securitized Products

Bank Of America

New York, NY

5 days ago - seen

UX Product Designer - Global Navigate/Find

Hewlett Packard

$104K - $132K a year

Boise, ID

6 days ago - seen

Marathi Translator (Global)

Productive Playhouse

$20 an hour

Remote

a week ago - seen

Associate Production Engineer

Venture Global Lng

Arlington, VA

a week ago - seen

Director of Pre-Production

Rdg Global Llc

130000.00-150000.00 Per Year

New York, NY

a week ago - seen

Product Manager – Global Sales Reporting

General Motors

Warren, MI

a week ago - seen

Product Development Assistant

Rdg Global Llc

50000.00-550000.00 Per Year

New York, NY

a week ago - seen

Senior Director Global Product Experience (Medical Device)

Bausch Health

Bothell, WA

a week ago - seen

Global Product Director in Lenexa KS

Corbion

Lenexa, KS

2 weeks ago - seen

Senior Manager, Fleet Safety Tech Products, Global Fleet & Products

Amazon.com Services Llc

From $158,300 a year

Bellevue, WA

2 weeks ago - seen

Manager, Global Card Applications - Product

Amex

$90,000 - $165,000 a year

New York, NY

2 weeks ago - seen

Global Marketing Product Director

Sherwin Williams

Minneapolis, MN

2 weeks ago - seen