smartrecruiters

Senior Site Reliability Engineer (SRE) @ Experian

Nottingham, gbOnsiteFull-timePosted 16 days ago

Opens on smartrecruiters

About this role

We are looking for a Site Reliability Engineer (SRE) to improve the reliability, and performance of business-critical systems. You will focus on AWS cloud infrastructure, DevOps tooling, and core SRE practices within a distributed, production environment. Reporting to our Lead, you will work with development, platform, and operations teams to ensure systems are stable, scalable, well-monitored and meet defined reliability targets.

Main Responsibilities

Reliability and Operations:

Support high availability, scalability and performance of production systemsWork with defined SLIs, SLOs and SLAs, ensuring services meet agreed reliability targetsIdentify and reduce operational toil through automation and process improvementContribute to the design and implementation of fault-tolerant and resilient systemsParticipate in resilience and failure testing activities to validate system behaviour under fault conditions and improve recoveryAWS & Cloud Operations:

Manage and operate systems hosted on AWS (EC2, EKS/ECS, RDS, S3, Lambda, CloudWatch, IAM, and VPC)Support cloud deployments and infrastructure changes following best practicesHelp with backup, disaster recovery and resiliency planningDevOps & Automation:

Work with CI/CD pipelines and DevOps practices to ensure reliable and repeatable deployments, including build, test and release automation processesUse Infrastructure as Code tools such as Terraform or CloudFormation to manage and provision infrastructureDevelop automation using scripting languages (Python, Bash or similar) to reduce operational toil and improve efficiencyIncident Management:

Participate in production incident response, troubleshooting, and service restorationPerform root cause analysis (RCA) and contribute to post-incident reviewsHelp implement preventive actions to avoid incident recurrenceObservability:

Configure and maintain monitoring, logging, and alerting using tools like CloudWatch, Prometheus, Grafana, Splunk, or DynatraceDevelop dashboards to track system and platform health and reliability metrics across the user journeyImprove alert quality to reduce noise and improve response timesCollaboration:

Work with application and engineering teams to embed reliability into system designCollaborate within a globally distributed team, using clear handovers to ensure continuityShare knowledge and contribute to team-wide best practicesCommunicate with all kinds of stakeholders, influencing decisions through reliability-focused insights Experience in production support, DevOps, SRE, cloud operations, or systems engineeringCloud ExpertiseHands-on experience with AWS cloud services, including compute, container and serverless workloadsPractical experience with CI/CD pipelines and DevOps practices, including Git-based version control, pull request workflows, code reviews, and deployment automationExperience with SRE principles, monitoring, and reliability engineering practicesProficiency in scripting (Python, Bash, or similar) for automation and operational toolingExperience with Linux systems and troubleshooting production issuesAdditionalPreferred Experience

Exposure to data platforms and data pipelinesUnderstanding of data reliability conceptsExperience supporting or operating complex distributed systems Benefits package includes:

Hybrid workingGreat compensation and discretionary bonusCore benefits include pension, Bupa healthcare, Sharesave scheme and more25 days annual leave with 8 bank holidays and 3 volunteering days. You can purchase additional annual leave.We take our people agenda very seriously and focus on what matters; DEI, work/life balance, development, authenticity, collaboration, wellness, reward & recognition, volunteering... the list goes on. Experian's people first approach is award-winning; World's Best Workplaces™ 2024 (Fortune Top 25), Great Place To Work™ in 24 countries, and Glassdoor Best Places to Work 2024 to name a few. Check out Experian Life on social or our Careers Site to understand why.

Experian is proud to be an Equal Opportunity and Affirmative Action employer. Innovation is an important part of Experian's DNA and practices, and our diverse workforce drives our success. Everyone can succeed at Experian and bring their whole self to work, irrespective of their gender, ethnicity, religion, colour, sexuality, physical ability or age. If you have a disability or special need that requires accommodation, please let us know at the earliest opportunity.

Experian Careers - Creating a better tomorrow together

Find out what its like to work for Experian by clicking here

#LI-Hybrid

This is a hybrid remote/in-office role.

Experian Careers - Creating a better tomorrow together

Find out what its like to work for Experian by clicking here

Skills

TechnologyInformation TechnologyNot ApplicableFinancial Services

Ready to apply?

Install the ResuMinder extension and we'll auto-fill the application in seconds — no rewriting.

Get the extension →