Job Type: Contract
Job Category: IT
Job Description
Role: Site Reliability Engineer (SRE)
Location: Brentwood, TN (Onsite)
Contract
Experience: 6–8+ years
Role Description:
Combines software engineering and IT operations to ensure the reliability, scalability, and performance of systems, with a strong focus on automation, stability, and proactive problem-solving to reduce operational issues and improve system efficiency.
Responsibilities:
- Continuously monitor system health and performance using Service Level Indicators (SLIs) and Service Level Objectives (SLOs).
- Establish and maintain effective monitoring and alerting mechanisms to identify potential issues early.
- Set up alerts to notify teams of performance degradation, failures, or outages to enable quick incident response.
- Analyze incidents and system behavior to improve reliability, availability, and overall system performance.
Required Skills
DevOps Engineer