Must Have:
- Active TS/SCI level security clearance
What You'll Do:
- Provide support for implementation, troubleshooting, and maintenance of cloud systems
- Isolate and resolve problems involving the applications, operating system, hardware, communications, other infrastructure, or any combination as needed
- Prepare problem reports for the appropriate leads, such as Amazon/C2S and NGA-ESC
- Provide support for the escalation and communication of status to agency management and internal customers
- Install and load operating system and application software
- Isolate and resolve hardware and software problems involving the applications, operating system, hardware, communications infrastructure, or any combination of these
- Troubleshoot and configure network components, maintain their integrity, and implement operating system enhancements to improve reliability and performance
- Integrate new technologies into new and existing systems, including the transition and migration of corporate systems
- Monitor and report on system health using dashboarding tools such as Elasticsearch/Kibana and AWS CloudWatch
- Report and work on-site daily (this role is not remote)
Preferred Skills and Experience (nice to have but not required, e.g., education):
- Demonstrated proficiency with the AWS C2S Suite and tools.
- Troubleshooting and providing AWS technical support to NGA employees.
- Creating SNS Topics
- Connecting to EC2 instances using an SSH client such as MobaXterm or PuTTY to check logs.
- Creating IAM users/roles/policies per user requests.
- Granting access to S3 buckets by updating bucket policies in JSON.
- Monitoring data conditioning and dissemination with Elastic Kibana and AWS CloudWatch.
- Experience using ServiceNow, Jira, Confluence, etc.
- Creating and periodically rotating IAM user access keys and secret keys.
- Using LDAP to administer account creation and management on various Linux servers.
- Administer our clusters and their lifecycle to ensure a high degree of reliability, security, scalability, and confidence at all times.
- Support, improve, and implement internal components and applications on top of multiple clusters.
- Troubleshoot and triage issues as they arise.
- Work closely with various teams across the organization, including the security team, development team, and operations team for any AWS infrastructure requests.
- Execute with minimal technical supervision, embrace reliability constraints, and be proactive in contributing improvements to the platform.
- Encourage best-practice policies. Adapt to various technologies and be willing to get involved in Terraform deployments and the AWS support community as needed.
- Assist in troubleshooting network, DNS, and storage issues, and make recommendations for cost savings and growth.
- Participate in a shared on-call rotation when necessary.
- Willing to work or backfill all shifts as needed (core hours 5 AM to 9 PM EST)
- Mentor others or serve as a team lead, guiding the day-to-day activities of team members.
- Other duties as assigned.
- Expert-level experience with container platform infrastructure in AWS (EKS, ECS)
- Clear understanding of container technologies and the tools/challenges around them.
- Ability to code/script in Python
- Senior-level experience managing and administering Linux- and Windows-based systems.
- Extensive AWS CloudFormation knowledge (YAML and/or JSON).
- Systems monitoring experience (Elasticsearch preferred).
- Experience working with GitLab and supporting CI/CD pipelines.
- Experience delivering projects via Agile methodologies
- Excellent verbal and written communication skills.
Job Type: Full-time
Pay: $160,000.00 per year
Benefits:
- 401(k)
- 401(k) matching
- Dental insurance
- Flexible schedule
- Health insurance
- Life insurance
- Paid time off
- Referral program
- Relocation assistance
- Tuition reimbursement
- Vision insurance
Experience level:
- 5 years
Schedule:
- Day shift
- Evening shift
- Monday to Friday
Experience:
- System administration: 5 years (Required)
Security clearance:
- Top Secret (Required)
Shift availability:
- Day Shift (Preferred)
- Night Shift (Preferred)
Ability to Commute:
- Sterling, VA 20164 (Required)
Ability to Relocate:
- Sterling, VA 20164: Relocate before starting work (Required)
Work Location: In person