High Performance Computing (HPC) System Administrator III in GAC Savannah
Unique Skills:
**Hybrid work schedule available:**
Highly Desired Skills:
HPC System Support: Scheduler Management, Code compilation, Large-scale multi-node application support
Engineering Application Support: Applications consistent with development of mechanical and structural systems.
Additional Desired Skills:
Configuration and Provisioning Management: Ansible, Satellite, Foreman
Infrastructure Essentials: Apache, MySQL, DNS, DHCP, IPA, Monitoring, CIFS/Samba, NFS, iSCSI, FC
Storage Experience: NAS and SAN Storage systems, Lustre, GPFS, VAST
Virtualization: VMware, RHEV
Networking: Basic layer-2 network operations
Data Center Operations: Physical system management
- Assume responsibility for the day-to-day operations of Gulfstream's production HPC cluster.
- Assist end users running applications on the HPC cluster.
- Provide third-level support for end users who experience problems on engineering workstations and remote visualization systems.
- Manage, maintain, monitor, and control interactive and batch processes, both scheduled and unscheduled (including on-request processing).
- Ensure engineering-defined batch processing and backups are completed in the correct sequence and within the established time periods.
- Suggest improvements to processing capabilities and efficiencies through system tuning and other hardware and software optimizations and improvements.
- Perform regular monitoring of utilization needs and efficiencies, and report on tuning initiatives.
- Perform proactive failure trend analysis and root cause analysis for all system failures.
- Produce trend reports to highlight production issues and follow predetermined action and escalation procedures when issues are encountered.
- Monitor, verify, and make appropriate adjustments to support proper application executions.
- Provide technical solutions that meet performance and processing objectives of the business areas.
- Perform upgrades that comply with corporate policies and industry best practices.
- Provide leadership to HPC Administrators during system upgrades and outages.
- Create thorough upgrade plans that comply with corporate policies and industry best practices.
- Assist in the introduction of new technologies that can provide greater capabilities, improved productivity, and reduced total cost of ownership.
- Participate in the design of HPC technical solutions.
- Continuously evaluate the efficiency, effectiveness, and interoperability of existing technology, and suggest areas for improvement.
- Maintain technical relationships with multiple hardware and software vendors.
- Work multiple operational windows as required.
- Provide 24x7 on-call support.
- Assist in development and implementation of technical, hardware, and software standards.
- Experience managing InfiniBand-based Linux HPC clusters and high-performance parallel storage, and configuring and managing cluster scheduling software.
- Experience managing High Performance Computing low-latency, high-bandwidth interconnects.
- Experience supporting Linux-based scientific workstations running visualization applications.
Additional Information
Requisition Number: 217320
Category: Information Systems
Percentage of Travel: Up to 25%
Shift: First
Employment Type: Full-time
Posting End Date: 02/29/2024
Equal Opportunity Employer/Veterans/Disabled.
Gulfstream does not provide work visa sponsorship for this position, unless the applicant is a currently sponsored Gulfstream employee.
Copyright 2023 Gulfstream Aerospace Corporation. All Rights Reserved. A General Dynamics Company.
Gulfstream Aerospace Corporation, a wholly owned subsidiary of General Dynamics (NYSE: GD), designs, develops, manufactures, markets, services, and supports the world's most technologically advanced business jet aircraft.