Job Description
Job Location: Fremont CA- Onsite Need Locals
Job Duration: Long-Term
- Client is looking for Snr Network/Infiniband Engineer.
- Network engineering knowledge is the base requirement, however not only ask.
- Infiniband experience is a must.
- Candidate is expected to be very experienced working with High Performance Computing (HPC) environment and datacenters.
- Candidate shall have hands on experience with InfiniBand products (Mellanox switches, cards) and IB protocols (RDMA, storage protocols, etc.)
- Candidate will directly work with Client team.
- The main responsibility will be designing/configuring the AI datacenters equipment with InfiniBand technology.
- Candidate should be familiar of automation scripting and python.
- Designing and Deploying InfiniBand Network Architectures: This involves planning and implementing InfiniBand network configurations tailored to meet the organization's high-performance computing requirements.
- Configuration and Optimization: Configuring and fine-tuning InfiniBand switches, routers, adapters, and other hardware components to ensure maximum efficiency and reliability within the network infrastructure.
- Network Security Implementation: Implementing security measures and protocols to safeguard sensitive data transmitted over the InfiniBand network, ensuring compliance with industry standards and organizational security policies.
- Monitoring and Troubleshooting: Continuously monitoring network performance and addressing connectivity issues to maintain optimal uptime for the InfiniBand network. This involves troubleshooting technical problems efficiently and effectively.
- Collaboration and Integration: Working closely with cross-functional teams to seamlessly integrate InfiniBand technology into various projects and applications across the organization, providing technical expertise and guidance as needed.
- Vendor Collaboration and Evaluation: Collaborating with hardware and software vendors to evaluate new InfiniBand technologies and recommend enhancements to existing infrastructure based on industry trends and best practices.
- Documentation: Documenting network configurations, procedures, and troubleshooting steps for reference and training purposes, ensuring knowledge transfer and maintaining a comprehensive record of network operations.
- Additionally, the scope may include involvement in coding/scripting tasks and knowledge in code repository solutions, indicating a potential aspect of automation or software-defined networking within the InfiniBand environment.
- Overall, the scope involves comprehensive management and optimization of InfiniBand networks to support computing requirements while ensuring security, reliability, and scalability.