Are you an experienced, passionate pioneer in technology who wants to work in a collaborative environment? As an experienced Project Delivery Manager- Site Reliability Engineer you will have the ability to share new ideas and collaborate on projects as a consultant without the extensive demands of travel. If so, consider an opportunity with Deloitte under our Project Delivery Talent Model. Project Delivery Model (PDM) is a talent model that is tailored specifically for long-term, onsite client service delivery.
Work you'll do/Responsibilities
Site Reliability Engineer will be playing a key role in building Observability and Resilience capabilities on cloud platform (Azure). Responsibilities of the SRE will be:
The Team
Our Core Technology Operations (CTO) team offers differentiated operate services for our clients with solutions to help organizations scale and optimize critical business operations, drive speed to outcome, deliver business transformation, and build resilience in an uncertain future.
Our operate services within CTO include:
Qualifications
Required
Work you'll do/Responsibilities
Site Reliability Engineer will be playing a key role in building Observability and Resilience capabilities on cloud platform (Azure). Responsibilities of the SRE will be:
- Build and configure alerts, tracing, telemetry, and instrumentation required for Infrastructure Monitoring and Application Performance Management.
- Role entails implementing dashboards to monitor and share Observability at various levels (engineering teams, portfolio, senior management).
- Support resilience engineering (application and infrastructure resilience) to meet availability requirements.
- Work with development engineers, cloud engineers, product teams, and support engineers to gather requirements, implement, and evolve observability and resilience solutions.
The Team
Our Core Technology Operations (CTO) team offers differentiated operate services for our clients with solutions to help organizations scale and optimize critical business operations, drive speed to outcome, deliver business transformation, and build resilience in an uncertain future.
Our operate services within CTO include:
- Foundry Services: Operate services providing flexible, recurring resource capacity for client initiatives, projects, tasks, and enhancement
- Managed Services: Operate services that provide ongoing maintenance, monitoring, and optimization for IT/Engineering applications & products
Qualifications
Required
- At least 4 years of experience defining and implementing Monitoring solutions - alerts, Telemetry, and instrumentation for on-premises and cloud platforms for large enterprises
- Good knowledge on Observability and Application Performance Monitoring best practices, KPIs/metrics on Cloud platforms
- Experience in monitoring tools such as Splunk, Dyna Trace, Prometheus, Cloud Watch, Azure Monitor, New Relic, other open-source tools
- Experience building monitoring solutions for variety of workloads such as Micro services (Java / Spring boot desirable), databases, Kafka, Kubernetes
- Experience in resilience engineering, and implementing high availability solutions
- Experience creating Monitoring dashboards using tools such as Grafana (Preferred), Splunk, Kibana, Power BI
- Ability to work in a fast paced and agile environment
- Bachelor's degree, preferably in Computer Science, Information Technology, Computer Engineering, or related IT discipline; or equivalent experience
- Limited immigration sponsorship may be available
- Ability to travel 10%, on average, based on the work you do and the clients and industries/sectors you serve