Company

Intel Corporation

Address: Albuquerque, NM
Form of work: Full-Time
Category: Information Technology

Job description

Job Details:
As AI reshapes not only computing but also business and society, Intel is making major bets on the future of AI, particularly in data center computing. Intel's Datacenter and AI Solutions (DAIS) organization is a critical part of Intel's broader AI efforts. DAIS's responsibility spans data center workloads from generative AI and deep learning to analytics, HPC, and graphics, all of which are intertwined in the future of computing.
CRT Labs runs Endeavour, the Intel High Performance Computing benchmarking cluster. Endeavour is our largest and best-known cluster; it showcases Intel Architecture and supports deals, development, performance optimization, and much more. We are system integrators of future platforms. We also host other clusters that support AI, HPC, cloud, and enterprise workloads, as well as clusters for technology, pathfinding, and innovation.
We partner closely with the Sales and Marketing team, as well as multiple software enabling and optimization organizations, to deliver performant clusters at scale with unreleased and sometimes unstable hardware. This includes Intel Xeon and discrete graphics products, both the latest generations and yet-to-be-released versions, high-performance storage systems, and the fastest fabric interconnects available.
We are seeking an HPC and AI Systems Administrator who has a passion for working on Intel's latest technology. The HPC and AI Systems Administrator has deep technical knowledge of the design and deployment of data centers and the associated subsystems. This can include expertise in data center layout, mechanical design systems, cooling (both air and liquid), power delivery, and other critical areas of data center design. The deliverables of the role may take the form of designs for Intel's data centers, support for customers designing their own data centers, or the development of new products and technologies based on data center design expertise.
The HPC and AI Systems Administrator will be responsible for, but not limited to:
  • Provide support and maintenance of large cluster hardware and software for optimized performance, security, consistency, and high availability.
  • Manage various Linux OS distributions.
  • Support hardware such as rack-mounted servers, network switches, and firewalls.
  • Support Intel HPC data center technologies, including servers, fabric, and storage.
  • Provide support in cluster debugging, Linux scripting, cluster validation tests, server expansion, file system tests, benchmarking, and job scheduling.
  • Serve as a consultant for all projects and customers of the CRT Datacenter, creating and improving methodologies used in the datacenter to enhance the performance, reliability, and manageability of the CRT clusters.
  • Research emerging capabilities in external HPC and AI clusters to help set direction on where the team needs to be internally.

Qualifications:
This position is not eligible for Intel immigration sponsorship.
Requirements listed would be obtained through a combination of industry-relevant job experience, internship experience, and/or schoolwork, classes, or research. Minimum qualifications are required to be initially considered for this position. Preferred qualifications are in addition to the minimum requirements and are considered a plus factor in identifying top candidates.
Minimum Education:
Bachelor's degree in Computer Science, Computer Engineering, or any other related field and 6+ years of experience, OR Master's degree in Computer Science, Computer Engineering, or any other related field and 4+ years of experience.
Minimum Qualifications
  • 6+ years of Linux experience supporting complex HPC clusters.
  • 6+ years of experience writing bash scripts, Python, and/or C programs.
  • 1+ year of experience with the technical concepts, architecture, systems, development methods, and disciplines associated with the defined program, and using that knowledge to accelerate project completion.

Preferred Qualifications
  • Experience managing cluster systems with 100+ nodes.
  • Experience managing HPC clusters with discrete GPUs.
  • Experience in data center layout, mechanical design systems, cooling (both air and liquid), power delivery and other critical data center design expertise.
  • Experience using and supporting job schedulers such as SLURM, PBS or other schedulers.
  • Experience with high performance interconnects, preferably Mellanox InfiniBand, Omni-Path, or Converged Ethernet.
  • Experience administering high performance cluster file systems (Lustre, GPFS, others).

Job Type:
Experienced Hire
Shift:
Shift 1 (United States of America)
Primary Location:
US, New Mexico, Albuquerque
Additional Locations:
Business group:
The Data Center & Artificial Intelligence Group (DCAI) is at the heart of Intel's transformation from a PC company to a company that runs the cloud and billions of smart, connected computing devices. The data center is the underpinning for every data-driven service, from artificial intelligence to 5G to high-performance computing, and DCAI delivers the products and technologies (spanning software, processors, storage, I/O, and networking solutions) that fuel cloud, communications, enterprise, and government data centers around the world.
Posting Statement:
All qualified applicants will receive consideration for employment without regard to race, color, religion, religious creed, sex, national origin, ancestry, age, physical or mental disability, medical condition, genetic information, military and veteran status, marital status, pregnancy, gender, gender expression, gender identity, sexual orientation, or any other characteristic protected by local law, regulation, or ordinance.
Position of Trust
N/A
Work Model for this Role
This role will be eligible for our hybrid work model which allows employees to split their time between working on-site at their assigned Intel site and off-site. In certain circumstances the work model may change to accommodate business needs.
Refer code: 9034537. Intel Corporation - The previous day - 2024-04-15 14:16
