Job Description
- Support our VA clients in their Cloud Operations to Enhance, Optimize, and Maintain their Computing Capabilities across their Technical Landscape.
- Assist the team to implement Site Reliability Engineering best practices such as Ensuring reliability-getting systems back to steady state as soon as possible.
- Being pro-active - living and breathing SLOs to identify and remediate issues before SLAs are violated.
- Architecting for resiliency - informing architectural design decisions to build more reliable systems.
- Propose and Educate team on Reliability Engineering Principles
- Design, Develop, and Implement IT Solutions
- Creating and Maintaining Gold Images following CRISP Information Security Guidelines
- Assist the team to improve the following:
- Proactive monitoring and alerting
- Apply automation principles to replace manual approaches to mitigation.
- Establishment of consistent SLOs and SLIs for various services/applications
- Respond to production incidents using your knowledge and experience in systems engineering and software development.
- Engagement in root cause analysis activities to identify mitigation paths and eliminate issues from recurring.
- Create and maintain infrastructure diagrams, technical documentation, procedures, and Runbooks for Chaos testing where applicable.
Must Have...
- Bachelor of Science in Computer Science or 10+ years equivalent technical experience
- 4+ Years in a Site Reliability Engineer role of Direct AWS and/or Azure Cloud Implementation Experience
- Experience with monitoring of OS/application-level metrics and related visualizations
- 4+ years of experience in Bash shell, Powershell, as well as Java or Python
- Strong analytical skills with a logical mindset and problem-solving approach
- Effective communication skills both orally and written with various audiences.
- Experience and proven ability to work remotely.
Nice to Have...
- Azure Architect Expert Certification
- AWS Solution Architect - Associate
- AWS Certified SysOps Administrator – Associate
- AWS Certified DevOps Engineer – Professional
- Microsoft DevOps Engineer Expert
- ITIL certification or ITIL knowledge
- Department of Veterans Affairs Experience
- Public Trust Clearance
Clearance...
- Applicants selected will be subject to a Security Investigation and May need to Meet Requirements for Access to Protected and/or Classified Information
Location..
- This is a US-Based Remote position, all applicants need to be located within the United States be to considered for the position
Citizenship..
- Applicants need to be a US Citizen to be considered for the position and have physically resided in the United States for a minimum of 3 years due to background investigation requirements.
Benefits: Medical, Vision, Dental, PTO, Sick, and 401k w/employer match
Who we are...
Latitude is a growing Service-Disabled Veteran Owned (SDVOSB) Company within the Federal Sector. We have experience in working with various federal clients across multiple different agencies. We believe in adopting and growing a remote work-from-home culture wherever possible to ensure a healthy work-to-life balance. We believe in automating the mundane and investing in our employees helping them grow their careers over time. We strive for innovation and want to develop solutions that we can take pride in and help break through some of the barriers that exist in technology and the government sectors today.