Customer Reliability Engineer

Endor Labs

Location:
United States

Contract Type:
Not provided

Salary:
Not provided

Job Description:

As a Customer Reliability Engineer at Endor Labs on our Customer Success team, you will serve as the highest-level technical support resource, handling complex, high-priority issues that require deep product and systems expertise.

Job Responsibility:

  • Own technical escalations from Customer Success Engineers, Solution Architects, and Implementation Engineers, ensuring swift reproduction and resolution of critical issues
  • Collaborate with Engineering and Product teams to triage and resolve bugs or architectural issues
  • Provide insight to our engineering teams and build closely with them, translating customer feedback and troubleshooting insights into tangible product improvements
  • Act promptly when technical issues emerge, applying your advanced troubleshooting skills and understanding of programming and DevOps practices to ensure our customers are successful
  • Conduct deep diagnostics, including logs, APIs, and infrastructure troubleshooting
  • Serve as a bridge between the customer and R&D for complex or systemic issues
  • Document and share solutions for long-term knowledge management and root cause prevention

Requirements:

  • Strong background in software engineering, with 4-10 years of experience and a deep understanding of programming languages, application security, and DevOps practices
  • Demonstrated experience in developing custom technical solutions and actively engaging in customer-facing roles, with a proven ability to handle project-based work effectively
  • A passionate advocate for customer success, with a focus on building secure, scalable solutions from the ground up
  • Exceptional communication skills, capable of breaking down complex technical topics into clear, understandable terms for a variety of audiences
  • Proactive and anticipatory approach to problem-solving, with the ability to foresee customer needs and craft strategic solutions that align with their overarching goals

What we offer:

  • Competitive salary and comprehensive benefits package including Health, Dental, Vision and Mental Health plans
  • 401(k) plan to support your long-term financial goals
  • Flexible PTO to maintain a healthy work-life balance
  • Opportunities for co-working and team meetups to foster collaboration
  • A dog-friendly office environment for those who love to bring their fur babies along

Additional Information:

Job Posted:
December 27, 2025

Work Type:
Remote work

Similar Jobs for Customer Reliability Engineer

Database Reliability Engineer

We are committed to providing our customers with reliable and secure services at...
Location
Netherlands
Salary:
Not provided
ClickHouse
Expiration Date
Until further notice
Requirements
  • Bachelor’s or Master’s degree in Computer Science or a related field
  • At least 5 years of experience in Reliability Engineering, QA, or customer-facing engineering
  • Previous experience operating ClickHouse or other SQL databases in production
  • Excellent understanding of distributed database internals and SQL; knowledge of ClickHouse in particular is a major plus
  • Scripting experience with Shell or Python, and ability to read and understand C++ code
  • Knowledge of cloud computing platforms such as AWS, Azure, or Google Cloud Platform
  • You are a strong problem-solver and have solid production debugging skills
  • You thrive in a fast-paced environment as part of a global team, and you see yourself as a partner with the business with the shared goal of moving the business forward
  • You have a high level of responsibility, ownership, and accountability
  • Excellent communication skills
Job Responsibility
  • Continuously improve the reliability and performance of ClickHouse core
  • Improve and create metrics and alerts for ClickHouse to be able to identify and prevent problems in production before they affect customers
  • Dig deeper into the most common problems encountered by customers in ClickHouse core to identify their root causes, submit bug fixes and issue reports, and suggest improvements
  • Enhance and refine incident response processes and post-mortem analysis for ClickHouse core-related outages, including working with support and Cloud teams to communicate with the impacted customers
  • Plan, enable, and drive Chaos initiatives across Engineering teams, based upon internal priorities
  • Manage on-call processes to respond to performance and reliability issues, and establish best practices for coordinating escalation to resolve issues and minimize customer impact
What we offer
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites
  • Full-time

Database Reliability Engineer

We are committed to providing our customers with reliable and secure services at...
Location
Germany
Salary:
Not provided
ClickHouse
Expiration Date
Until further notice
Requirements
  • Bachelor’s or Master’s degree in Computer Science or a related field
  • At least 5 years of experience in Reliability Engineering, QA, or customer-facing engineering
  • Previous experience operating ClickHouse or other SQL databases in production
  • Excellent understanding of distributed database internals and SQL; knowledge of ClickHouse in particular is a major plus
  • Scripting experience with Shell or Python, and ability to read and understand C++ code
  • Knowledge of cloud computing platforms such as AWS, Azure, or Google Cloud Platform
  • You are a strong problem-solver and have solid production debugging skills
  • You thrive in a fast-paced environment as part of a global team, and you see yourself as a partner with the business with the shared goal of moving the business forward
  • You have a high level of responsibility, ownership, and accountability
  • Excellent communication skills
Job Responsibility
  • Continuously improve the reliability and performance of ClickHouse core
  • Improve and create metrics and alerts for ClickHouse to be able to identify and prevent problems in production before they affect customers
  • Dig deeper into the most common problems encountered by customers in ClickHouse core to identify their root causes, submit bug fixes and issue reports, and suggest improvements
  • Enhance and refine incident response processes and post-mortem analysis for ClickHouse core-related outages, including working with support and Cloud teams to communicate with the impacted customers
  • Plan, enable, and drive Chaos initiatives across Engineering teams, based upon internal priorities
  • Manage on-call processes to respond to performance and reliability issues, and establish best practices for coordinating escalation to resolve issues and minimize customer impact
What we offer
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites

Database Reliability Engineer - Core Team

We are committed to providing our customers with reliable and secure services at...
Location
United Kingdom
Salary:
Not provided
ClickHouse
Expiration Date
Until further notice
Requirements
  • Bachelor’s or Master’s degree in Computer Science or a related field
  • At least 5 years of experience in Reliability Engineering, QA, or customer-facing engineering
  • Previous experience operating ClickHouse or other SQL databases in production
  • Excellent understanding of distributed database internals and SQL; knowledge of ClickHouse in particular is a major plus
  • Scripting experience with Shell or Python, and ability to read and understand C++ code
  • Knowledge of cloud computing platforms such as AWS, Azure, or Google Cloud Platform
  • You are a strong problem-solver and have solid production debugging skills
  • You thrive in a fast-paced environment as part of a global team, and you see yourself as a partner with the business with the shared goal of moving the business forward
  • You have a high level of responsibility, ownership, and accountability
  • Excellent communication skills
Job Responsibility
  • Continuously improve the reliability and performance of ClickHouse core
  • Improve and create metrics and alerts for ClickHouse to be able to identify and prevent problems in production before they affect customers
  • Dig deeper into the most common problems encountered by customers in ClickHouse core to identify their root causes, submit bug fixes and issue reports, and suggest improvements
  • Enhance and refine incident response processes and post-mortem analysis for ClickHouse core-related outages, including working with support and Cloud teams to communicate with the impacted customers
  • Plan, enable, and drive Chaos initiatives across Engineering teams, based upon internal priorities
  • Manage on-call processes to respond to performance and reliability issues, and establish best practices for coordinating escalation to resolve issues and minimize customer impact
What we offer
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites

Reliability & Maintainability Engineering Manager

At Boeing, we innovate and collaborate to make the world a better place. We’re c...
Location
United States, Everett; Renton
Salary:
147050.00 - 198950.00 USD / Year
Boeing
Expiration Date
January 16, 2026
Requirements
  • Bachelor of Science degree from an accredited course of study in engineering, engineering technology (includes manufacturing engineering technology), chemistry, physics, mathematics, data science, or computer science
  • 5+ years of experience leading engineering teams in R&M or related functional areas
  • Knowledge of the basic Principles, Processes and Lifecycle of Systems Engineering
  • Understanding of the concept of Technical Performance Measures (a customer-centric view of a product's performance)
  • Knowledge of basic definitions of Reliability, Maintainability, Durability, and Availability
  • General knowledge of probability & statistics and the basis of such in Reliability & Safety analysis
  • Knowledge of System Modeling methods and relation to R&M modeling & analysis (Model Based Engineering)
  • High level knowledge of Airplane Systems and Structures of commercial or military airplanes
  • Demonstrated ability to work in a multi-discipline engineering environment
Job Responsibility
  • Develops project plans aligned to an Airplane Development Program and R&M strategy and objectives
  • Implements plans to ensure business, technical and customer requirements are achieved
  • Develops and monitors appropriate metrics to ensure performance to plan
  • Provides technical direction and guidance to the team regarding processes, tools, technology and deliverables
  • Ensures team products and processes meet customer, company, and regulatory requirements for quality and safety
  • Coaches, counsels, mentors and provides developmental opportunities to improve employee satisfaction and retain a skilled and motivated team
  • Forecasts resource needs, negotiates them with internal customers and other R&M managers, and recruits personnel if needed
  • Collaborates with other SEIT managers and team members
  • Establishes partnerships and good working relationships with internal customers, stakeholders, peers and direct reports
What we offer
  • Generous company match to your 401(k)
  • Industry-leading tuition assistance program pays your institution directly
  • Fertility, adoption, and surrogacy benefits
  • Up to $10,000 gift match when you support your favorite nonprofit organizations
  • Relocation based on candidate eligibility
  • Opportunity to enroll in a variety of benefit programs, generally including health insurance, flexible spending accounts, health savings accounts, retirement savings plans, life and disability insurance programs, and a number of programs that provide for both paid and unpaid time away from work
  • Full-time

Reliability Engineer I

Founded in 1985, ATS is a company with a presence in the United States, Mexico a...
Location
United States, Northumberland
Salary:
Not provided
Advanced Technology Products
Expiration Date
Until further notice
Requirements
  • Bachelor’s degree in engineering (ABET accredited) or equivalent experience (e.g., heavy industrial maintenance, reliability, or operations experience)
  • Minimum of one year of reliability experience
  • Demonstrates ability to use reliability tool sets
  • Experience in Performance of RCA
  • Involvement with RCM & FMEA
  • Master Level Proficiency in Predictive Technology
  • Vibration I Certification
  • Machine Health Monitoring Intermediate Proficiency
  • Experience with Work Execution Management
  • Technical understanding of electrical or mechanical components, tools, and designs
Job Responsibility
  • Promotes and adheres to the ATS safety culture
  • Ensures compliance with regulatory requirements and ATS policies and procedures
  • Partners with internal/external customer for engineered solutions to improve reliability and throughput
  • Identifies opportunities for Capital Expenditures for equipment replacement (develops and communicates ROI)
  • Highly knowledgeable in operating systems, critical elements, and best practices to enable a precision reliability culture
  • Knowledgeable application of common precision tools and practices
  • Partners with peers to perform reliability-centered maintenance and deliverables (equipment-specific maintenance plan, ESMP)
  • Actively collaborates with maintenance team on the use of predictive, preventative, and precision maintenance technologies and strategies designed to identify or control risks prior to failure and ensure optimum maintenance execution
  • Partners with peers to perform failure mode & effects analysis
  • Understands Work Execution Management (WEM) & improvements identified through reliability strategy session performance
  • Full-time

Reliability Engineer

Founded in 1985, ATS is a company with a presence in the United States, Mexico a...
Location
United States, Tupelo, Mississippi
Salary:
Not provided
Advanced Technology Products
Expiration Date
Until further notice
Requirements
  • Bachelor’s degree in engineering (ABET accredited) or equivalent experience (e.g., heavy industrial maintenance, reliability, or operations experience)
  • Minimum of one year of reliability experience
  • Demonstrates ability to use reliability tool sets
  • Experience in Performance of RCA
  • Involvement with RCM & FMEA
  • Master Level Proficiency in Predictive Technology
  • Vibration I Certification
  • Machine Health Monitoring Intermediate Proficiency
  • Experience with Work Execution Management
  • Technical understanding of electrical or mechanical components, tools, and designs
Job Responsibility
  • Promotes and adheres to the ATS safety culture
  • Ensures compliance with regulatory requirements and ATS policies and procedures
  • Partners with internal/external customer for engineered solutions to improve reliability and throughput
  • Identifies opportunities for Capital Expenditures for equipment replacement (develops and communicates ROI)
  • Highly knowledgeable in operating systems, critical elements, and best practices to enable a precision reliability culture
  • Knowledgeable application of common precision tools and practices
  • Partners with peers to perform reliability-centered maintenance and deliverables (equipment-specific maintenance plan, ESMP)
  • Actively collaborates with maintenance team on the use of predictive, preventative, and precision maintenance technologies and strategies designed to identify or control risks prior to failure and ensure optimum maintenance execution
  • Partners with peers to perform failure mode & effects analysis
  • Understands Work Execution Management (WEM) & improvements identified through reliability strategy session performance
  • Full-time

Field Service Reliability Engineer

Founded in 1985, ATS is a company with a presence in the United States, Mexico a...
Location
United States, Milwaukee, Wisconsin
Salary:
50.96 - 65.19 USD / Hour
Advanced Technology Products
Expiration Date
Until further notice
Requirements
  • Bachelor’s degree in engineering (ABET accredited)
  • Eight or more years of reliability experience across 2 or more manufacturing sites
  • Demonstrates ability to perform full array of reliability tool sets
  • Strong technical understanding of electrical or mechanical components, tools, and designs
  • Ability to complete a failure mode effects analysis, cause and effect diagrams, root cause failure analysis, life-cycle costing, and risk analysis
  • Ability to research and apply new equipment technology / trends
  • Robust problem solving, mathematical, analytical, and decision making skills
  • Proficiency with computers, maintenance systems, and applications, including Microsoft Office
  • Excellent verbal communication, facilitation, and presentation skills
  • Strong reporting and technical writing capability
Job Responsibility
  • Extensive travel required (local, national)
  • Promotes and adheres to the ATS safety culture
  • Engages in various work environments and industries to lead reliability centered maintenance efforts
  • Mentors, coaches, and provides reliability best practices for applications in customer facilities, by customer personnel
  • Identifies top potential issues leading to lost production and preventable maintenance spending. Communicates findings with leadership
  • Provides solutions to root cause deficiencies and demonstrates economic benefits to their correction
  • Actively drives the implementation of equipment improvement projects
  • Identifies and implements current and new processes / technologies to increase equipment performance and uptime
  • Champions systems and best practice procedures towards a proactive manufacturing culture
  • Analyzes equipment performance, failure data, and corrective maintenance history to develop and deploy engineering solutions, improved maintenance strategies, preventative maintenance optimization, and other reliability techniques
  • Full-time

Field Service Reliability Engineer

Founded in 1985, ATS is a company with a presence in the United States, Mexico a...
Location
United States, Chicago, Illinois
Salary:
50.96 - 65.19 USD / Hour
Advanced Technology Products
Expiration Date
Until further notice
Requirements
  • Bachelor’s degree in engineering (ABET accredited)
  • Eight or more years of reliability experience across 2 or more manufacturing sites
  • Demonstrates ability to perform full array of reliability tool sets
  • Strong technical understanding of electrical or mechanical components, tools, and designs
  • Ability to complete a failure mode effects analysis, cause and effect diagrams, root cause failure analysis, life-cycle costing, and risk analysis
  • Ability to research and apply new equipment technology / trends
  • Robust problem solving, mathematical, analytical, and decision making skills
  • Proficiency with computers, maintenance systems, and applications, including Microsoft Office
  • Excellent verbal communication, facilitation, and presentation skills
  • Strong reporting and technical writing capability
Job Responsibility
  • Promotes and adheres to the ATS safety culture
  • Engages in various work environments and industries to lead reliability centered maintenance efforts
  • Mentors, coaches, and provides reliability best practices for applications in customer facilities, by customer personnel
  • Identifies top potential issues leading to lost production and preventable maintenance spending. Communicates findings with leadership
  • Provides solutions to root cause deficiencies and demonstrates economic benefits to their correction
  • Actively drives the implementation of equipment improvement projects
  • Identifies and implements current and new processes / technologies to increase equipment performance and uptime
  • Champions systems and best practice procedures towards a proactive manufacturing culture
  • Analyzes equipment performance, failure data, and corrective maintenance history to develop and deploy engineering solutions, improved maintenance strategies, preventative maintenance optimization, and other reliability techniques
  • Provides technical service to operations and manufacturing personnel on equipment related troubleshooting efforts
  • Full-time