Company: Vector Talent Resources, Inc.
Address: Virginia, United States
Form of work: Full-Time
Category: Information Technology

Job description

VECTOR TALENT RESOURCES JOB OPENING
Job ID: 562
Job Title: Linux Server GPU Engineer
Clearance: Active TS/SCI clearance required.
Type of Job: Direct Hire, Full-Time

Practice Area: System Engineering
Location: Bethesda, Maryland. Hybrid or compressed work schedule available.
Pay: Competitive, plus 15 leave days, 11 holidays, 7 sick days, 401(k) with match, etc.
Hours: 40 per week
Contact: juliann@vectortalent.com
Job Description: Linux Server / NVIDIA Admin / GPU Engineer - TS/SCI
Vector is seeking a Linux Server GPU Engineer to support the National Media Exploitation Center (NMEC). This role requires technical experience administering NVIDIA DGX-1 and A100 servers in physical and virtual environments. The engineer should be detail-oriented in order to capture customer inquiries accurately, and will interact with administrators to handle service inquiries and problems. Duties include examining customer problems and implementing corrective action to initiate a repair or return to service, analyzing recurring problems and initiating solutions to prevent recurrence, and analyzing existing infrastructure for tuning and performance enhancements. The engineer will provide systems and software operations and maintenance support in a large, multi-enclave enterprise environment, working as part of a team to ensure mission needs are met and customer capabilities remain functional. The engineer may be required to perform technical software configuration, rebooting, and other remedial actions on customer servers. The customer uses an Agile framework to plan and complete all initiatives. The work location is the Intelligence Community Campus in Bethesda.
Security Clearance: TS/SCI
Location: Bethesda, MD
Responsibilities:

  • GPU Architecture and Design: Collaborate with a multidisciplinary team to define, develop, and optimize GPU architectures, ensuring they meet stringent performance, power efficiency, and feature requirements. Leverage industry insights to drive design decisions. Ensure that GPU designs and integrations are not only optimized for Linux but are also adaptable to other operating systems.
  • Operating System Integration: Work closely with operating system developers to ensure smooth GPU integration with Linux-based systems. Optimize GPU drivers for compatibility, performance, and reliability in a Linux environment. Provide regular maintenance and updates to ensure continued compatibility.
  • Hardware Expertise: Contribute to the design and development of GPU hardware, providing insights into hardware architecture to ensure efficient interaction with software components. Maintain and update hardware designs as needed.
  • CUDA (Compute Unified Device Architecture) / OpenCL (Open Computing Language) Programming: Develop and optimize applications using CUDA or OpenCL, harnessing the full potential of GPU hardware for parallel processing, high-performance computing, and machine learning on Linux platforms. Maintain and update software for optimal performance.
  • Performance Analysis: Analyze GPU performance, identify bottlenecks, and develop strategies to enhance performance across various applications in Linux, addressing both hardware and software considerations. Regularly monitor and improve performance.
  • GPU Tooling: Create and maintain debugging tools, profiling utilities, and performance analysis software tailored for Linux systems to facilitate efficient GPU development and troubleshooting. Keep tools up-to-date and functional.
  • Power Efficiency: Work on power management techniques to optimize GPU power consumption, ensuring efficient operation on both mobile and desktop Linux platforms. Continuously assess and enhance power efficiency strategies.
  • Testing and Validation: Design and execute tests to validate GPU performance and functionality on Linux, including stress testing, benchmarking, and debugging to ensure robust operation. Maintain and expand the testing suite.
  • Documentation: Maintain comprehensive technical documentation, including architectural specifications, code documentation, and Linux-specific best practices for GPU development. Keep documentation up to date with changes and improvements.
  • Industry Insight: Stay updated on the latest trends, innovations, and competitive landscapes within the GPU industry, contributing to research efforts and proposing Linux-specific approaches to GPU design and optimization. Share regular updates and insights with the team.
Minimum Requirements:
  • Bachelor's or higher degree in Computer Science, Electrical Engineering, or a related field. Additional years of experience may be considered in lieu of a degree.
  • 10+ years of relevant systems engineering experience.
  • Proven experience in GPU architecture design and GPU performance optimization.
  • Expertise in operating system integration for Linux.
  • Strong understanding of computer hardware architecture, particularly as it relates to Linux systems.
  • Knowledge of parallel computing, graphics algorithms, and real-time rendering in Linux environments.
  • Familiarity with GPU debugging tools and profiling software for Linux.
  • Excellent problem-solving skills and the ability to collaborate within a team.
  • Effective communication skills for conveying technical information in a Linux context.
  • Proficiency with scripting languages such as Python or Bash.
  • Proficiency with automation tools such as Ansible, Puppet, Salt, Terraform, etc.
  • Candidate must, at a minimum, meet DoD 8570.01-M IAT Level II certification requirements (currently Security+ CE, CCNA-Security, GICSP, GSEC, or SSCP, along with an appropriate computing environment (CE) certification). An IAT Level III certification is also acceptable (CASP+, CCNP Security, CISA, CISSP, GCED, GCIH, CCSP).
Preferred Qualifications:
  • Published research or contributions in the GPU industry, especially related to Linux.
  • Experience with machine learning and neural network frameworks on GPUs in Linux.
  • Knowledge of GPU virtualization, cloud computing, and emerging Linux-based technologies in the field.
  • Proficiency in GPU-specific programming languages.
  • Experience with container technologies (Docker, Kubernetes).
  • Experience with Prometheus/Grafana for monitoring.
  • Knowledge of distributed resource scheduling systems (Slurm preferred, LSF, etc.).
  • Familiarity with CUDA and managing GPU-accelerated computing systems.
  • Basic knowledge of deep learning frameworks and algorithms.
Vector Talent Resources is an Equal Opportunity/Affirmative Action employer. All qualified candidates will receive consideration for employment without regard to disability, protected veteran status, race, color, religious creed, national origin, citizenship, marital status, sex, sexual orientation/gender identity, age or genetic information.
Refer code: 8478848. Vector Talent Resources, Inc. - The previous day - 2024-03-06 21:52