Senior Software Engineer, SRE

Abridge

Location:
United States, SF Office, NYC Office

Contract Type:
Not provided

Salary:

210800.00 - 248000.00 USD / Year

Job Description:

Abridge’s services and engineering team are in hyperscale mode. We are looking for experienced SREs to join our team and help improve the performance, stability, and scalability of our software by multiples. This is a distributed-systems-oriented role that is approximately 80% software focused and 20% cloud infrastructure focused. You will help us build load testing and chaos engineering into our CI pipelines, use observability and profiling tools to identify and resolve performance bottlenecks, work with diverse teams to rehome their applications onto more scalable infrastructure, and ensure a smooth ride as we hyperscale adoption of our application in the healthcare space. You may be embedded with other teams for weeks or months. The platform we are building needs to maximize both engineering velocity and security, will operate at tremendous scale, and presents many opportunities to apply creativity, autonomy, and leadership to take things from 0 to 1. This is a unique opportunity to grow your career quickly at a rapidly growing company that leverages the best of emerging technologies.

Job Responsibility:

  • Leverage load testing, chaos engineering, and other test practices to identify performance and latency bottlenecks across all of our systems, and make changes to application code to resolve them
  • Drive software changes that can rehome applications at the code level onto new infrastructure (runtimes, event-driven infrastructure, databases, and more) in order to dramatically improve scalability as well as enable multi-tenant deployments
  • Identify and implement software configuration changes and performance tuning parameters that will dramatically improve performance and scalability
  • Build developer tools and software modules that help engineers across the entire engineering organization build code faster and more effectively
  • Work with the Platform team to develop, and application teams to adopt, emerging elements of our internal developer platform, such as service templates and self-serve infrastructure
  • Work with application teams to establish and adopt SLOs and error budgets, and drive better metrics for application health that can drive automated canary releases, improved health monitoring, and better engineering practices
  • Uplevel our ability to respond to incidents by improving observability, runbooks, and incident response muscle across the organization
  • Evangelize, document, and train the engineering team on the solutions being built and uplevel them on cloud native design strategies and tools
  • Be a public evangelist for Abridge in the global platform engineering community, including conferences, open source, and research, as we pioneer new AI-first, cloud-native-first, security-first implementations at scale

Requirements:

  • 8+ years of software engineering experience focused on distributed systems or tooling, with an interest in engineering enablement and software scaling
  • At least 2 years of experience as a back-end engineer focused on system performance and scalability
  • Experience reducing latency in software by multiples through leveraging observability and profiling tools
  • Experience building on Kubernetes and scaling compute services on Kubernetes
  • Experience with related cloud native technologies including ArgoCD, Argo Rollouts, Istio, etc.
  • Comfortable implementing and securing services in Google Cloud Platform with Infrastructure as Code, including GCP Projects, VPC Networks, Google Kubernetes Engine, and IAM Roles, Groups and policies
  • Experience building software with backend languages (e.g., Python, Go, Node.js, or Rust)
  • Experience monitoring distributed systems with Prometheus, OpenTelemetry Collector, and Grafana (or something similar), including metrics collection, visualization, alerting, and using observability data to drive performance optimizations
  • Passion for engineering enablement and solving software and distributed systems scaling challenges under pressure
  • Must be willing to travel up to 10%

Nice to have:

Candidates without GCP experience but who have experience with Kubernetes are encouraged to apply

What we offer:

  • Generous Time Off: 14 paid holidays, flexible PTO for salaried employees, and accrued time off for hourly employees
  • Comprehensive Health Plans: Medical, Dental, and Vision coverage for all full-time employees and their families
  • Generous HSA Contribution: If you choose a High Deductible Health Plan, Abridge makes monthly contributions to your HSA
  • Paid Parental Leave: Generous paid parental leave for all full-time employees
  • Family Forming Benefits: Resources and financial support to help you build your family
  • 401(k) Matching: Contribution matching to help invest in your future
  • Personal Device Allowance: Tax free funds for personal device usage
  • Pre-tax Benefits: Access to Flexible Spending Accounts (FSA) and Commuter Benefits
  • Lifestyle Wallet: Monthly contributions for fitness, professional development, coworking, and more
  • Mental Health Support: Dedicated access to therapy and coaching to help you reach your goals
  • Sabbatical Leave: Paid Sabbatical Leave after 5 years of employment
  • Compensation and Equity: Competitive compensation and equity grants for full-time employees

Additional Information:

Job Posted:
January 20, 2026

Employment Type:
Full-time
Work Type:
Hybrid work

Similar Jobs for Senior Software Engineer, SRE

Senior Software Engineer, Infrastructure

The InfraOps team’s primary goal is to enable and empower Kiddom’s engineering b...
Location:
United States, New York City
Salary:
160000.00 - 200000.00 USD / Year
Kiddom
Expiration Date:
Until further notice
Requirements:
  • BS or MS in Computer Science or a related field
  • 5+ years professional software engineering experience
  • Experience with Java, Python, Go, or Clojure in a production environment
  • Experience designing and building REST APIs
  • Exposure to authorization technologies (OAuth)
  • Experience with continuous integration and automation tools and processes
  • Strong knowledge of design patterns and software engineering best practices
  • Excellent problem solving and debugging skills
  • Strong acumen or exposure to DevOps or SRE methodologies
  • Keen sense for SecOps.
Job Responsibility:
  • Evangelizing and fostering a healthy DevOps culture here at Kiddom, working with teams to establish best practices and help guide new and existing services.
  • Practicing Infrastructure as Code (IaC) wherever possible, giving us the confidence in repeatable processes that can be automated.
  • Grow our DevOps efforts from small scale to large scale multi-region
  • Share ownership of the entire infrastructure architecture
  • Aim for high availability, high resiliency
  • Support the engineering team with tools to evaluate the performance of their code in production environments, speed up the CI/CD pipeline, and verify features
  • Design and build a scalable, generalized framework for third-party API integrations
  • Leverage existing infrastructure and components to build RESTful web services
  • Build APIs and robust testing environments for internal and external developers
What we offer:
  • Competitive salary
  • Meaningful equity
  • Health insurance benefits: medical (various PPO/HMO/HSA plans), dental, vision, disability and life insurance
  • One Medical membership (in participating locations)
  • Flexible vacation time policy (subject to internal approval). Average use is 4 weeks off per year.
  • 10 paid sick days per year (prorated depending on start date)
  • Paid holidays
  • Paid bereavement leave
  • Paid family leave after birth/adoption. Minimum of 16 paid weeks for birthing parents, 10 weeks for caretaker parents. Meant to supplement benefits offered by State.
  • Commuter and FSA plans
  • Fulltime

Senior Software Engineer, Infrastructure

You’ll help shape the future of infrastructure automation for law enforcement sy...
Location:
United States, Seattle; Boston
Salary:
141000.00 - 225600.00 USD / Year
Axon
Expiration Date:
Until further notice
Requirements:
  • Bachelor’s degree in Computer Science, Engineering, or related field (or equivalent experience)
  • 8+ years of professional software development experience
  • Strong background building cloud-native, distributed solutions
  • Experience designing tooling and automation to simplify the operational management of SaaS/PaaS systems
  • Proficiency in backend services with multiple managed languages (e.g., Java, Scala, Go, C#, or similar)
  • Expertise with Infrastructure as Code (IaC) tools (e.g., Terraform, CloudFormation) and building modular, reusable, testable components
  • Familiarity with Kubernetes platforms (e.g., AKS, EKS, or similar)
  • Hands-on experience with CI/CD platforms for automating infrastructure, builds, testing, and releases
  • Strong collaboration and communication skills, with empathy for the needs of engineering teams
Job Responsibility:
  • Lead engineering architecture design reviews
  • Set a high technical bar for the team through code and architecture design reviews
  • Mentoring engineers
  • Working across teams with Product, Design, and Engineering to create integrated solutions that delight our customers
  • Improve our Engineering process, including long-term thinking, sprint planning and stand-ups
  • Building services that adhere to our high bar on availability and latency in this mission-critical space
  • Working with the latest open source technologies
What we offer:
  • Competitive salary and 401k with employer match
  • Discretionary paid time off
  • Paid parental leave for all
  • Medical, Dental, Vision plans
  • Fitness Programs
  • Emotional & Mental Wellness support
  • Learning & Development programs
  • Snacks in our offices
  • Fulltime

Senior Software Engineer - Postgres

ClickHouse is launching a strategic Postgres initiative to extend our developer-...
Location:
United States
Salary:
140000.00 - 208000.00 USD / Year
ClickHouse
Expiration Date:
Until further notice
Requirements:
  • 6+ years in software engineering, ideally with experience building or operating database or cloud platform systems
  • Deep understanding of Postgres — configuration, extensions, operations, and performance tuning
  • Strong programming experience in Ruby, Go, or Python (or willingness to work across languages)
  • Familiarity with cloud infrastructure, APIs, and automation tools (Terraform, Kubernetes, CI/CD)
  • Understanding of distributed systems, data replication, and service orchestration patterns
  • Pragmatic, detail-oriented, and comfortable with both greenfield development and operational ownership
  • Happy to contribute where needed — from backend APIs and platform automation to Postgres internals and debugging
  • Strong communicator who works effectively across teams in a fast-paced, cross-functional environment
  • Operate with a founder’s mindset — take initiative, move quickly, and care deeply about outcomes
Job Responsibility:
  • Design and build backend services that orchestrate and manage database clusters in ClickHouse Cloud
  • Extend our platform control plane — written in Ruby, Go, and TypeScript — to support new Postgres capabilities
  • Contribute to automation and tooling that simplify cluster provisioning, scaling, and lifecycle management
  • Collaborate with infrastructure, SRE, and product teams to ensure operational excellence, performance, and reliability
  • Develop APIs and integrations that expose new Postgres functionality to customers and internal systems
  • Improve observability, deployment safety, and debugging workflows for database services
  • Participate in design discussions, code reviews, and on-call rotations, contributing to the overall reliability and velocity of the team
  • Operate with autonomy — identifying opportunities, driving execution, and delivering meaningful impact
What we offer:
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites
  • Fulltime

Senior Software Engineer - Postgres

ClickHouse is launching a strategic Postgres initiative to extend our developer-...
Location:
Canada
Salary:
Not provided
ClickHouse
Expiration Date:
Until further notice
Requirements:
  • 6+ years in software engineering, ideally with experience building or operating database or cloud platform systems
  • Deep understanding of Postgres — configuration, extensions, operations, and performance tuning
  • Strong programming experience in Ruby, Go, or Python (or willingness to work across languages)
  • Familiarity with cloud infrastructure, APIs, and automation tools (Terraform, Kubernetes, CI/CD)
  • Understanding of distributed systems, data replication, and service orchestration patterns
  • Pragmatic, detail-oriented, and comfortable with both greenfield development and operational ownership
  • Happy to contribute where needed — from backend APIs and platform automation to Postgres internals and debugging
  • Strong communicator who works effectively across teams in a fast-paced, cross-functional environment
  • Operate with a founder’s mindset — take initiative, move quickly, and care deeply about outcomes
Job Responsibility:
  • Design and build backend services that orchestrate and manage database clusters in ClickHouse Cloud
  • Extend our platform control plane — written in Ruby, Go, and TypeScript — to support new Postgres capabilities
  • Contribute to automation and tooling that simplify cluster provisioning, scaling, and lifecycle management
  • Collaborate with infrastructure, SRE, and product teams to ensure operational excellence, performance, and reliability
  • Develop APIs and integrations that expose new Postgres functionality to customers and internal systems
  • Improve observability, deployment safety, and debugging workflows for database services
  • Participate in design discussions, code reviews, and on-call rotations, contributing to the overall reliability and velocity of the team
  • Operate with autonomy — identifying opportunities, driving execution, and delivering meaningful impact
What we offer:
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites
  • Fulltime

Senior Software Engineer - Postgres

ClickHouse is launching a strategic Postgres initiative to extend our developer-...
Location:
India
Salary:
Not provided
ClickHouse
Expiration Date:
Until further notice
Requirements:
  • 6+ years in software engineering, ideally with experience building or operating database or cloud platform systems
  • Deep understanding of Postgres — configuration, extensions, operations, and performance tuning
  • Strong programming experience in Ruby, Go, or Python (or willingness to work across languages)
  • Familiarity with cloud infrastructure, APIs, and automation tools (Terraform, Kubernetes, CI/CD)
  • Understanding of distributed systems, data replication, and service orchestration patterns
  • Pragmatic, detail-oriented, and comfortable with both greenfield development and operational ownership
  • Happy to contribute where needed — from backend APIs and platform automation to Postgres internals and debugging
  • Strong communicator who works effectively across teams in a fast-paced, cross-functional environment
  • Operate with a founder’s mindset — take initiative, move quickly, and care deeply about outcomes
Job Responsibility:
  • Design and build backend services that orchestrate and manage database clusters in ClickHouse Cloud
  • Extend our platform control plane — written in Ruby, Go, and TypeScript — to support new Postgres capabilities
  • Contribute to automation and tooling that simplify cluster provisioning, scaling, and lifecycle management
  • Collaborate with infrastructure, SRE, and product teams to ensure operational excellence, performance, and reliability
  • Develop APIs and integrations that expose new Postgres functionality to customers and internal systems
  • Improve observability, deployment safety, and debugging workflows for database services
  • Participate in design discussions, code reviews, and on-call rotations, contributing to the overall reliability and velocity of the team
  • Operate with autonomy — identifying opportunities, driving execution, and delivering meaningful impact
What we offer:
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites
  • Fulltime

Senior DevOps Engineer

We're looking for a seasoned Sr DevOps Engineer to help drive the reliability, s...
Location:
Bulgaria, Sofia
Salary:
Not provided
Brandwatch
Expiration Date:
Until further notice
Requirements:
  • 5-7 years of experience in DevOps, SRE, or Software Engineering roles, with increasing responsibility in system design and operations
  • Extensive experience with containerization (Docker) and orchestration (Kubernetes) in production environments, including managing and scaling clusters
  • Proficiency in Infrastructure as Code (Terraform, CloudFormation, etc.) and configuration management tools (Ansible, Puppet) to automate infrastructure provisioning
  • Strong coding and scripting skills in languages like Python, Go, or Ruby, with the ability to build automation tools for system management
  • Deep knowledge of cloud platforms (AWS and/or GCP) and their services, with experience designing and operating cloud-based infrastructure at scale
  • Solid understanding of networking and security fundamentals in cloud and on-prem environments
  • Experience setting up and tuning monitoring/alerting systems (Prometheus, Grafana, etc.), and a thorough understanding of SRE best practices (SLIs, SLOs, incident management)
  • Strong problem-solving and communication skills, with a track record of working effectively in collaborative team environments
Job Responsibility:
  • Oversee the reliability, performance, and security of critical production services from design to deployment, ensuring they meet our uptime and performance targets
  • Collaborate with development, QA, and product teams to build and maintain resilient infrastructure and efficient deployment pipelines
  • Automate infrastructure provisioning and software deployments using Infrastructure as Code and CI/CD tools, reducing manual work and errors
  • Participate in and improve our 24×7 on-call process, swiftly troubleshooting incidents and performing root cause analysis to prevent recurrence
  • Document and standardize processes and configurations, sharing knowledge to uplift the entire engineering team’s capabilities

Senior Site Reliability Engineer

HiveWatch is seeking a Staff Site Reliability Engineer to join our Platform Team...
Location:
United States, El Segundo
Salary:
183000.00 - 235000.00 USD / Year
HiveWatch
Expiration Date:
Until further notice
Requirements:
  • 7+ years of software engineering experience with strong coding skills in production environments
  • 5+ years of SRE, DevOps, or production operations experience
  • Expertise with cloud platforms (AWS preferred) and containerized applications (Docker, Kubernetes)
  • Experience with Infrastructure as Code (Terraform, CloudFormation, or similar)
  • Proficiency in at least one object oriented programming language in our tech stack (Java, Kotlin, Python)
  • Hands-on experience with relational databases and SQL performance optimization
  • Experience with monitoring and observability tools (Prometheus, Grafana, DataDog, or equivalent)
  • Strong debugging skills across distributed systems and microservices architectures
  • Bachelor's degree in Computer Science, Engineering, or equivalent practical experience
Job Responsibility:
  • Own the reliability of mission-critical systems including production monitoring, alerting, and capacity planning
  • Debug and resolve complex production issues across the full stack, from infrastructure to application code
  • Participate in a regular on-call rotation to provide 24/7 coverage for critical systems
  • Perform root cause analysis requiring deep code-level investigation and implement preventive measures
  • Build automation and tooling to reduce operational toil and improve system reliability
  • Maintain CI/CD pipelines, observability infrastructure, and database performance optimization
  • Increase the resiliency, scalability, and maintainability of production environments
  • Establish on-call procedures and disaster recovery processes
  • Provide technical leadership and mentorship to foster engineering excellence and reliability culture
What we offer:
  • Comprehensive health coverage: medical, dental, vision, and life insurance
  • Cutting-edge work in an emerging field with huge growth potential
  • Competitive compensation packages designed to reward top talent
  • A modern, newly renovated HQ right on Main Street in El Segundo, CA
  • 401(k) with a 4% company match to help you invest in your future (match launches in 2026)
  • Flexible paid time off so you can recharge when you need it
  • Additional benefits include ClassPass credits and a discount on pet insurance
  • A family-friendly, compassionate culture that values balance and belonging
  • Eligible to participate in HiveWatch Equity Incentive Plan
  • Fulltime

Senior DevOps Engineer - Developer Experience

We're changing how the world works with data. We built the Data Productivity Clo...
Location:
India, Hyderabad
Salary:
Not provided
Matillion
Expiration Date:
Until further notice
Requirements:
  • 5+ years of experience with DevOps and/or SRE
  • Experience with code signing and build management tools (Gradle, Maven, Ant, etc.)
  • Familiarity with software configuration management systems and source code version control systems (specifically, GitHub)
  • Some experience with Developer Portals (specifically Backstage)
  • Metrics driven: you are proficient in measuring success, value and gaps
  • A strong background in software engineering and release engineering, with coding experience in one or more of the following languages/libraries/platforms: Java, JavaScript/TypeScript, Node.js, Python, Go, AWS, Docker, Serverless, React, Terraform
  • Inquisitiveness - digging into problems and solutions to understand the underlying technology
  • Autonomy - ability to work on a task and solve problems independently
  • Motivation - sets personal challenges and constantly looks to stretch themselves
  • Problem solving - recognition of problems and recasting difficult-to-solve problems in order to find unique and innovative solutions
Job Responsibility:
  • Developer Empathy - Gather holistic feedback from Developers and design solutions to improve their lived experience
  • Generative AI - Innovate ways to use Generative AI for Developer assistance and guide the internal Developer community on AI tools
  • Governance and enforcement of policies and procedures around source control, changes, releases, etc.
  • Design and implement powerful, well-maintained, and user-friendly development tools, IDEs, and infrastructure that drive high velocity in code development, testing and version control
  • Streamline workflows - Simplify and optimise development workflows to incorporate automation and eliminate unnecessary processes
  • Perform administration and troubleshooting, and provide support to all engineers
  • Enhance collaboration - Work closely among the different Engineering teams to work as one cohesive unit
  • Measurement-driven efficiency - Familiarity with Engineering metrics such as DORA and how to use them to proactively drive efficiency
  • Develop new skills by working with other members of the team
  • Work with the Team Lead and Manager to identify training goals
What we offer:
  • Company Equity
  • 27 days paid time off
  • 12 days of Company Holiday
  • 5 days paid volunteering leave
  • Group Mediclaim (GMC)
  • Enhanced parental leave policies
  • MacBook Pro
  • Access to various tools to aid your career development
  • Fulltime