maintenance and support of Data Lake Hadoop on-prem and Azure cloud implementations.
- Coordination of platform maintenance, including change and release management (planned and emergency deployments, configuration and patching).
- Real-time system monitoring (custom and off-the-shelf tools). Engineer and implement custom monitoring solutions if needed.
- Participation in runbook development.
- End-to-end Incident and Problem resolution for Data Product Teams.
- Responsible for ensuring that applications meet their required service levels.
- Responsible for supporting process improvements that increase the stability and performance of the platform.
- Identify system bottlenecks and opportunities for process improvements.
- Perform in-depth research and identify sources of production issues.
- Effectively perform root cause analysis of issues and report the outcome to the business community and management.
- Develop and use core support tools and processes to perform work while improving day-to-day practices for support team members, with the goal of delivering service improvements to the business.