Job Description
- Understand how commodity servers, operating systems and networks function, perform and scale.
- Possess superb troubleshooting, project management and problem analysis skills.
- Drive technical innovation and efficiency in infrastructure operations via automation.
- Design server monitoring and management solutions using automation and self-repair.
- Create processes that enhance operational workflow and provide positive customer impact.
- Dive deep to resolve problems at their root, looking for failure patterns amenable to long-term solutions via simplification and automation.
- Avoid re-inventing the wheel and prefer appropriately simple, repeatable solutions over more complex and failure prone ones.
- Recognize and adopt best practices in documentation, testing, security, operational support at scale, and efficient use of resources.
- Develop appropriate metrics to demonstrate performance at improving operational efficiency.
- In depth knowledge of & experience deploying and operating Windows and Linux.
- Relentless passion for frugality and out-of-the-box engineering.
- Strong system troubleshooting skills.
- Proficiency and experience in automation via Perl/Python programming and shell scripting.
- Good understanding of standard internet protocols (Ethernet, ARP, IP, ICMP, UDP, TCP, SSL, DNS, HTTP, etc.) Demonstrable grasp of security best practices in server configuration, tool development, and access controls.
- Experience in building SQL queries.
- AWS Experience, preferably with Systems Manager or other DevOps systems.
- M-F 8-5/9-6 40hrs
- Yes
- Yes
- Knowledge of .NET, Javascript/Typescript and Powershell scripting
- Knowledge of Git
- Experience deploying or managing servers in large-scale, geographically diverse environments.
- Understanding & experience of managing and monitoring large scale disk sub-systems.
- Operational knowledge of common enterprise switching and routing platforms.