Title: Site Reliability Manager/Lead
Location: Chicago, IL (Hybrid)
Type: FTE
Technical skills:
- Engage, influence, and evangelize SRE practices with development, operational, and product groups to align technology service/solution delivery.
- Drive quality accountability within the organization with well-defined processes, metrics, and goals for process quality. This includes leading effective postmortems and ensuring actions are followed up.
- Improve availability, latency, scalability, and efficiency by building reliability engineering into our development life cycle, with a focus on fault-tolerant approaches.
- Drive capacity planning, performance analysis, instrumentation, and other non-functional systems requirements.
- Must be able to define and report progress on strategic initiatives and project-level tasks to all stakeholders, including senior executives and clients, using a communication approach suited to each constituency.
- Implement metrics-driven processes to ensure service quality targets are met.
- Experience and expertise in Continuous Integration / Continuous Deployment (CI/CD) practice, including coding (Python), tooling, and techniques, with evidence of leading organizational and cultural change to adopt CI/CD practices (Bank's ecosystem: Jira, Confluence, Bitbucket, Git, Jenkins, Artifactory, Terraform, Packer, Rundeck, Ansible).
- Experience with at least one data analytics tool, such as Splunk or ELK.
Key qualifications:
- Expert knowledge in all aspects of designing, developing, and managing large real-time systems.
- Prior successful experience as a systems performance or site/systems reliability engineer.
- Strong experience with fault-tolerant approaches in large-scale distributed environments and high-performance systems.
- Hands-on experience working as a Site Reliability Engineer (SRE).
- Experience defining, negotiating, measuring, and meeting Service Level Objectives (SLOs).
- Hands-on expertise working with AWS cloud environments and managed services (Redis, RDS, S3, etc.).
- Demonstrated experience working in large, complex systems environments.
- A passion for performance excellence and robustness, and an engineering mindset.
Yash Rastogi
Last Word Consulting.
Direct:
Email: