Junior Site Reliability Engineer Job at accesso

Lead Site Reliability Engineer

Groupon is a marketplace where customers discover new experiences and services e...

Location

India , Bangalore

Salary:

Not provided

Groupon

Expiration Date

Until further notice

Requirements

10+ years in systems engineering
at least 5+ years in SRE or DevOps roles
expertise in cloud platforms (GCP, AWS) and container orchestration (Kubernetes, Docker)
proficiency in programming and scripting languages like Python, Go, and Bash
advanced knowledge of Infrastructure as Code (IaC) tools such as Terraform and Ansible
deep understanding of networking, DNS, load balancing, and security principles
proven track record of managing high-availability systems in demanding environments
exceptional analytical and problem-solving skills

Job Responsibility

Architect and maintain fault-tolerant systems, ensuring uptime SLAs of 99.9% or higher
drive automation in infrastructure management and deployment using Terraform, Ansible, Kubernetes, and similar tools
create and optimize CI/CD pipelines to ensure reliable, secure, and efficient software delivery
build and enhance comprehensive observability solutions, including monitoring, logging, and alerting systems using Prometheus, Grafana, and the ELK stack
collaborate with stakeholders to define and achieve SLIs, SLOs, and error budgets aligned with business needs
lead incident response during on-call rotations, ensuring rapid resolution and root cause analysis for critical issues
design and execute performance testing, capacity planning, and scalability strategies for evolving workloads
proactively identify and resolve bottlenecks, increasing system performance and developer efficiency
mentor junior engineers, fostering a collaborative and growth-oriented team environment
guide architectural decisions that drive innovation and enhance system reliability

What we offer

The opportunity to work with cutting-edge technologies in a transformative environment
a collaborative and innovative work values alignment that values your expertise and contributions
professional growth and leadership development pathways tailored to your aspirations
a chance to leave a lasting impact by shaping the future of reliable and scalable systems

Staff Engineer, Site Reliability

LearnUpon is looking for a Staff Site Reliability Engineer to join our team in I...

Location

Ireland , Dublin

Salary:

Not provided

LearnUpon

Expiration Date

Until further notice

Requirements

7+ years of experience in a software or Ops role
5+ years of cloud engineering experience, with at least 2 years experience with AWS
Experience deploying Microservice environments, using containerisation technologies such as Kubernetes and Docker
Experience in designing and implementing Observability tech stacks
Have championed the benefits of Observability to Engineering teams
Can architect the design of SLO/SLI implementation that balances the needs of different teams
Familiar with cost analysis of Observability metrics gathering, Engineering effort, and tooling
Experience building and supporting large-scale distributed systems that back a consumer app or website with associated requirements of performance, security and disaster recovery
Experience with implementing IaaC (e.g. CloudFormation, Terraform etc.), automation tooling (e.g. Puppet, Ansible etc.), CI/CD (e.g. Jenkins, Travis CI, GitLab etc.)
Able to effectively communicate technical ideas to and collaborate with both technical and non-technical peers

Job Responsibility

Identifying opportunities to improve and scale our infrastructure for performance, observability, maintainability, and cost, by creating innovative solutions
Leading our efforts to build an observability function that incorporates application metrics, application transaction tracking, and event log management
Driving the processes to maintain resilient, scalable and cost-effective infrastructure
Working with other Engineering teams to provide infrastructure solutions that meet their ongoing requirements
Building tools focused on measuring, monitoring and alerting, with an eye towards self-service in order to promote Engineers’ ownership of observability
Reacting quickly to changing customer and business needs
Participate in on-call rota
Mentoring junior talent

What we offer

Work in a fun and supportive environment with regular team events
Excellent career progression
Structured learning environment
Competitive salary and company ESOP
Private health insurance
26 days annual leave

Fulltime

Senior Site Reliability Engineer

We're looking for a Senior Site Reliability Engineer for our Currents team, resp...

Location

United States , Austin

Salary:

129600.00 - 232200.00 USD / Year

Braze

Expiration Date

Until further notice

Requirements

Bachelor’s in Computer Science, Software Engineering, or a related STEM field
Five (5) years of experience in any role/occupation/position involving software engineering or site reliability engineering
Experience using distributed systems to deploy and monitor live applications such as Kubernetes or Docker Swarm
Experience working with alerting software (Sentry, Datadog, and/or PagerDuty)
Experience utilizing programming languages (Java, Kotlin, and/or Ruby) to understand and contribute to the codebase
Experience storing data in relational and non-relational databases such as Postgres and MongoDb
Experience with data streaming or queuing systems to build data pipelines with technologies like Kafka, Sidekiq or SQS and SNS
Experience leveraging continuous integration tools such as Jenkins or Buildkite
Experience collaborating with engineers through pull requests and code reviews in version control software such as GitHub or GitLab

Job Responsibility

Solve live performance and reliability issues and prevent their recurrence
Write and review code, educating engineers and building a culture of reliability
Practice sustainable incident response and blameless postmortems
Define and enable standards for monitoring, reliability, and performance
Bridge the gap between infrastructure and platform engineering teams
Support and improve services by planning for scale and reliability
Guide junior engineers in SRE best practices, software engineering, and agile project leadership

What we offer

Competitive compensation that may include equity
Retirement and Employee Stock Purchase Plans
Flexible paid time off
Comprehensive benefit plans covering medical, dental, vision, life, and disability
Family services that include fertility benefits and equal paid parental leave
Professional development supported by formal career pathing, learning platforms, and a yearly learning stipend
A curated in-office employee experience, designed to foster community, team connections, and innovation
Opportunities to give back to your community, including an annual company-wide Volunteer Week and donation matching
Employee Resource Groups that provide supportive communities within Braze

Fulltime

Senior Site Reliability Engineer

We're looking for a Senior Site Reliability Engineer for our Currents team, resp...

Location

United States , San Francisco

Salary:

129600.00 - 232200.00 USD / Year

Braze

Expiration Date

Until further notice

Requirements

Bachelor’s in Computer Science, Software Engineering, or a related STEM field
Five (5) years of experience in any role/occupation/position involving software engineering or site reliability engineering
Experience using distributed systems to deploy and monitor live applications such as Kubernetes or Docker Swarm
Experience working with alerting software (Sentry, Datadog, and/or PagerDuty)
Experience utilizing programming languages (Java, Kotlin, and/or Ruby) to understand and contribute to the codebase
Experience storing data in relational and non-relational databases such as Postgres and MongoDb
Experience with data streaming or queuing systems to build data pipelines with technologies like Kafka, Sidekiq or SQS and SNS
Experience leveraging continuous integration tools such as Jenkins or Buildkite
Experience collaborating with engineers through pull requests and code reviews in version control software such as GitHub or GitLab

Job Responsibility

Solve live performance and reliability issues and prevent their recurrence
Write and review code, educating engineers and building a culture of reliability
Practice sustainable incident response and blameless postmortems
Define and enable standards for monitoring, reliability, and performance
Bridge the gap between infrastructure and platform engineering teams
Support and improve services by planning for scale and reliability
Guide junior engineers in SRE best practices, software engineering, and agile project leadership

What we offer

Competitive compensation that may include equity
Retirement and Employee Stock Purchase Plans
Flexible paid time off
Comprehensive benefit plans covering medical, dental, vision, life, and disability
Family services that include fertility benefits and equal paid parental leave
Professional development supported by formal career pathing, learning platforms, and a yearly learning stipend
A curated in-office employee experience, designed to foster community, team connections, and innovation
Opportunities to give back to your community, including an annual company-wide Volunteer Week and donation matching
Employee Resource Groups that provide supportive communities within Braze

Fulltime

Senior Site Reliability Engineer

We're looking for a Senior Site Reliability Engineer for our Currents team, resp...

Location

United States , New York City

Salary:

129600.00 - 232200.00 USD / Year

Braze

Expiration Date

Until further notice

Requirements

Bachelor’s in Computer Science, Software Engineering, or a related STEM field
Five (5) years of experience in any role/occupation/position involving software engineering or site reliability engineering
Experience must include: Using distributed systems to deploy and monitor live applications such as Kubernetes or Docker Swarm
Working with alerting software (Sentry, Datadog, and/or PagerDuty)
Utilizing programming languages (Java, Kotlin, and/or Ruby) to understand and contribute to the codebase
Storing data in relational and non-relational databases such as Postgres and MongoDb
Data streaming or queuing systems to build data pipelines with technologies like Kafka, Sidekiq or SQS and SNS
Leveraging continuous integration tools such as Jenkins or Buildkite
Collaborating with engineers through pull requests and code reviews in version control software such as GitHub or GitLab

Job Responsibility

Solve live performance and reliability issues and prevent their recurrence
Write and review code, educating engineers and building a culture of reliability
Practice sustainable incident response and blameless postmortems
Define and enable standards for monitoring, reliability, and performance
Bridge the gap between infrastructure and platform engineering teams
Support and improve services by planning for scale and reliability
Guide junior engineers in SRE best practices, software engineering, and agile project leadership

What we offer

Competitive compensation that may include equity
Retirement and Employee Stock Purchase Plans
Flexible paid time off
Comprehensive benefit plans covering medical, dental, vision, life, and disability
Family services that include fertility benefits and equal paid parental leave
Professional development supported by formal career pathing, learning platforms, and a yearly learning stipend
A curated in-office employee experience, designed to foster community, team connections, and innovation
Opportunities to give back to your community, including an annual company-wide Volunteer Week and donation matching
Employee Resource Groups that provide supportive communities within Braze

Fulltime

Site Reliability Engineer

As a member of Kalshi’s engineering team, you’ll help build the next-generation ...

Location

United States , New York

Salary:

100000.00 - 250000.00 USD / Year

Kalshi

Expiration Date

Until further notice

Requirements

4+ years of software engineering experience
Experience designing, building, scaling, and maintaining production services and service-oriented architectures
Strong system design, coding, debugging, performance-tuning, and observability skills
High-quality coding practices with strong testing discipline
Excellent written and verbal communication
comfort working transparently across teams
Strong interpersonal skills across junior-to-principal engineering levels
Ability to think clearly under pressure and dive into any layer of the stack
Passion for building an open financial system that connects the world
Willingness to participate in on-call rotations and swiftly resolve issues

Job Responsibility

Improve observability, reliability, and service availability by defining and measuring key metrics
Build automation and systems that eliminate toil and reduce operational burden
Collaborate with core infrastructure engineers to performance-tune and optimize cloud deployments (Docker, Terraform, Kubernetes, EC2, etc.)
Partner with product teams to minimize service disruptions and automate incident response
Identify and analyze reliability problems across the stack, designing and implementing software for significant, long-term improvements
Mentor engineers and drive a culture where reliability is a core engineering value
Write high-quality, well-tested code that supports internal and external customer needs
Debug complex technical issues and improve system usability, operability, and diagnosability
Review feature designs across the company and ensure security, safety, scalability, and architectural clarity
Build and maintain integrations with third-party vendors

What we offer

equity and benefits

Fulltime

Principle SRE

The Principal Site Reliability Engineer will be a senior technical expert respon...

Location

India , Pune

Salary:

Not provided

Barclays

Expiration Date

Until further notice

Requirements

12+ years in software engineering or infrastructure roles
at least 5 years focused on reliability engineering or SRE
proven experience building and operating fault-tolerant, highly available systems at scale
strong knowledge of distributed systems, resiliency patterns (circuit breakers, retries, failover), and disaster recovery strategies
expertise across infrastructure (compute, storage, networking), application architecture, databases, and integration patterns
ability to troubleshoot complex technical issues across distributed systems and perform deep root cause analysis
skilled at working with development, operations, and architecture teams to embed reliability into design and delivery

Job Responsibility

Drive strategies to improve reliability, maintainability, and scalability across payment flows and platform components
conduct deep technical assessments of system architectures, identifying risks and recommending improvements for fault tolerance and disaster recovery
act as a senior escalation point for production incidents, lead RCA, and implement permanent fixes to prevent recurrence
define and enforce reliability patterns, frameworks, and best practices
advocate and implement chaos engineering principles to validate system resilience under real-world failure scenarios
design and implement full-stack observability solutions, including metrics, logging, distributed tracing, and alerting
develop automation for failover, capacity management, and self-healing mechanisms to reduce operational risk
partner with development, infrastructure, and production support teams to embed reliability into the SDLC
analyze service risk assessments and production incidents to identify systemic issues and drive long-term improvements
promote operational excellence and a mindset of designing for failure across all engineering teams

What we offer

Competitive holiday allowance
Life assurance
Private medical care
Pension contribution

Fulltime

Senior UI Engineer

Engineer the future of global finance. At Citi, our Tech team doesn’t just suppo...

Location

United Kingdom , London

Salary:

Not provided

Citi

Expiration Date

Until further notice

Requirements

Significant progressive experience in backend software development, with a proven track record of owning the design and delivery of complex, large-scale software projects as a senior individual contributor
Deep, hands-on expertise and architectural understanding of enterprise-level middleware technologies including Java, Spring Boot, Kafka, Microservices architecture, GraphQL, and NoSQL databases. Demonstrated experience with high-volume, low-latency distributed systems. Experience with Apache Flink is a significant advantage
Demonstrated ability to architect, design, and implement highly scalable, resilient, secure, and performant distributed systems
Expert-level understanding of the modern Software Development Lifecycle (SDLC), CI/CD pipelines, DevSecOps, and Site Reliability Engineering (SRE) best practices, with extensive, hands-on experience in their practical application
Exceptional verbal and written communication skills, capable of articulating complex technical concepts to diverse audiences and working effectively with business and technical stakeholders to drive positive outcomes
Bachelor’s degree in Computer Science, Engineering, or a related technical field is required
a Master’s degree or equivalent advanced professional qualifications are a plus

Job Responsibility

Actively contribute to and uphold the long-term technical vision and architectural roadmap for core platforms within the Banking Technology middleware ecosystem, encompassing Java, Spring Boot, Kafka, Microservices, GraphQL, and NoSQL databases. Implement and advocate for organizational engineering standards, best practices, and architectural patterns to ensure scalability, reliability, security, and maintainability across all engineering initiatives
Lead by example in coding, design, and problem-solving. Mentor and provide technical guidance to senior and junior software engineers, fostering a culture of innovation, continuous learning, and technical excellence. Share knowledge, best practices, and innovative solutions with the team
Apply and champion DevSecOps and Site Reliability Engineering (SRE) principles in daily work, ensuring high standards of system availability, performance, security, and operational efficiency for critical production platforms. Proactively identify and address technical debt, mitigate system risks, and implement robust disaster recovery capabilities
Act as a primary technical advisor to senior business leaders and product owners, translating complex business requirements into clear, actionable technical designs and innovative solutions. Effectively articulate technical insights, architectural decisions, and development progress to diverse audiences
Actively drive the exploration, evaluation, and hands-on application of emerging technologies, advanced architectural patterns, and innovative solutions (e.g., Apache Flink, Artificial Intelligence) to enhance product offerings and improve engineering productivity
Collaborate extensively with other engineers and technical leads across engineering, product management, and operations to ensure alignment of technical designs, seamless integration of solutions, and achievement of broader organizational goals. Influence technical decisions through deep expertise and well-reasoned arguments
Take ultimate accountability for the successful, on-time delivery of complex, high-quality, and user-centric software components. Uphold rigorous engineering standards through thorough design, code, and security reviews, and contribute to comprehensive technical documentation, ensuring a culture of engineering excellence

What we offer

27 days annual leave (plus bank holidays)
A discretional annual performance related bonus
Private Medical Care & Life Insurance
Employee Assistance Program
Pension Plan
Paid Parental Leave
Special discounts for employees, family, and friends
Access to an array of learning and development resources

Fulltime

Junior Site Reliability Engineer

accesso

Location:
United Kingdom

Category:
IT - Software Development

Contract Type:
Not provided

Salary:

Job Description:

Job Responsibility:

Requirements:

Nice to have:

Additional Information:

Job Posted:
December 05, 2025

Looking for more opportunities? Search for other job offers that match your skills and interests.

Similar Jobs for Junior Site Reliability Engineer

Lead Site Reliability Engineer

Staff Engineer, Site Reliability

Senior Site Reliability Engineer

Senior Site Reliability Engineer

Senior Site Reliability Engineer

Site Reliability Engineer

Principle SRE

Senior UI Engineer

Junior Site Reliability Engineer

accesso

Location:United Kingdom

Category:IT - Software Development

Contract Type:Not provided

Salary:

Job Description:

Job Responsibility:

Requirements:

Nice to have:

Additional Information:

Job Posted:December 05, 2025

Looking for more opportunities? Search for other job offers that match your skills and interests.

Similar Jobs for Junior Site Reliability Engineer

Lead Site Reliability Engineer

Staff Engineer, Site Reliability

Senior Site Reliability Engineer

Senior Site Reliability Engineer

Senior Site Reliability Engineer

Site Reliability Engineer

Principle SRE

Senior UI Engineer

Location:
United Kingdom

Category:
IT - Software Development

Contract Type:
Not provided

Job Posted:
December 05, 2025