Role: Java SRE
Location: Phoenix, AZ
Fulltime
Job Description
Must Have Technical/Functional Skills
• Core Java, Splunk, Kibana, Grafana
• Databases: Postgres, MongoDB
• Experience in Production support engineering or SRE roles, preferably within the banking industry.
• Skilled in L1/L2 support, debugging, performance monitoring, and working in Agile/Scrum environments. Hands-on with ServiceNow, Spring Boot, REST APIs, and CI/CD pipelines.
• Strong knowledge of cloud services.
Roles & Responsibilities
• Excellent problem-solving skills and the ability to work under pressure in a fast-paced environment.
• Monitor and maintain the health, availability, and performance of production systems and applications.
• Troubleshoot and resolve production incidents, ensuring minimal downtime and service disruption.
• Identifying Defects and working with Dev to get them fixed based on priority.
• Taking care of implementation of RFCs.
• Doing pre and post validation of servers during traffic diversion.
• Collaborate with engineering teams to implement reliability best practices and improve system performance.
• Develop and maintain monitoring alerts and dashboards to ensure visibility into system metrics.
• Participate in on-call rotation and provide timely support for high-impact incidents.
• Implement automation tools and processes to streamline operations and reduce manual workloads.
• Document incidents and solutions for knowledge management and continuous improvement.