Senior Site Reliability Engineer (Remote)
As an SRE team member, you will participate in all of the day to day activities of operating the payment infrastructure to help maintain high stability, reduced Service Downtime and improve Quality of Service for FIS clients.
This could involve:
- Working with bleeding edge technology while coding, configuring, implementing, maintaining and supporting
- Participate and Lead the Day-to-Day Operations of the Team for Changes and Incident
- Lead Incident Triage Calls and Follow up on Root Cause Analysis
- Configuring and Deploying Product Releases into Production and verifying for Quality work
- Participating in Data Center and Infrastructure Change activities
- Developing Monitoring and Alerting for Payments Platform
- Help identify repetitive non-value adding tasks and be hands-on in coding and scripting to automate them
- Actively Participate and Contribute in Team Meetings and Team Building activities
What you will need:
- At least 2-3 years of experience and knowledge of building and supporting application infrastructure in a private and/or public cloud
- 2-3 years of Change Management experience on ITSM Tools like Service Now
- Experience Initiating and leading or participating in Resolution of Incidents
- 2-3 of Experience developing and or implementing effective Monitoring on standard tools similar to Splunk and Dynatrace
- Knowledge and Skills to comfortably work on Unix Systems, IBM Technologies like WebSphere AS and MQ and Oracle
- to be aGood Team Player who values others' opinions and suggestions and willing to work in a socially and geographically diverse team
- to be a self-starter and someone who can run with minimal supervision
- Hands on experience in developing and implementing production monitoring
- Experience and good understanding Networking
- Hands on Unix Shell Scripting, Perl/Python programming experience