Director, Service Reliability Engineering

Marriott Bonvoy

Location:
United States, Bethesda

Contract Type:
Employment contract

Salary:

125600.00 - 203700.00 USD / Year

Job Description:

As Director of SRE, you will lead the team responsible for accelerating and automating the flow of operational activities, ensuring the reliability, performance and scalability of Marriott's critical digital platforms. This position involves establishing reliability-focused engineering practices, mentorship, scale improvement, and collaboration across cross-functional teams.

Job Responsibility:

  • Define and execute Marriott’s SRE vision, aligning with business objectives and technology roadmaps
  • Build, mentor and lead a high-performing SRE team, fostering a culture of collaboration and innovation
  • Establish reliability, observability and automation goals to improve system uptime, performance and scalability
  • Partner with engineering, operations and security teams to drive best practices and continuous improvement
  • Implement reliability-focused engineering practices, including SLAs, SLOs/SLIs and error budgets
  • Design and maintain resilient, scalable and fault-tolerant architectures across cloud and hybrid environments
  • Develop strategies to proactively identify and mitigate risks to system performance and availability
  • Drive root cause analysis (RCA) and post-mortem processes to prevent recurring incidents
  • Champion automation in monitoring, deployment and incident resolution to reduce toil and enhance efficiency
  • Lead and optimize incident response processes, ensuring rapid detection, diagnosis, and resolution of system failures
  • Enhance observability by leveraging monitoring, logging and tracing solutions to provide real-time insights
  • Partner with DevOps teams to improve CI/CD pipelines and reduce deployment risk
  • Champion leaders’ vision for product and service delivery
  • Make and execute the decisions needed to keep progressing toward goals
  • Provide direction and assistance to other teams on projects
  • Determine the priorities, schedules, plans and resources needed to complete projects on schedule
  • Analyze information and evaluate results to choose the best solution and solve problems
  • Review vendor proposals and select appropriate vendors for services, technologies and hardware
  • Think creatively and practically to develop and implement new project plans
  • Produce accurate and timely results in the form of reports, presentations, etc.
  • Plan, develop, implement and evaluate the quality of operations

Requirements:

  • Undergraduate degree in computer science, software engineering, or a related field (or equivalent experience)
  • 10+ years of experience in SRE, DevSecOps or IT operations
  • At least 5 years’ experience in a leadership role within SRE, DevSecOps or IT operations
  • At least five years of experience in the following technologies: Presentation Management (HTML, CSS, JS, Backbone, Node.js, Android, iOS); Application Platforms (NGINX, Java, Akana, Play Framework, Tomcat, Docker, OpenShift); Application Data (PostgreSQL, Couchbase, Cassandra); Integration Services (Apache Kafka, Apache Spark, Akana); Analytics Platforms (Hadoop, dashDB, Cognos, Tableau); Security (ForgeRock, OpenID, OAuth, Ping Identity); Public Cloud (Azure, Google Cloud, AliCloud, Amazon Web Services); CI/CD (Harness)
  • Experience with test automation
  • Working knowledge of, and a proven track record implementing, disaster-indifferent architecture
  • Experience with CDN and Akamai tools
  • Linux/Unix system administration experience
  • Proficient in scripting and programming languages (like Python, Go, Bash, Shell)
  • Hands-on experience with infrastructure as code (like Terraform), container orchestration (like Kubernetes), and reliability automation
  • Working knowledge of networking, databases, distributed systems
  • Deep knowledge of monitoring, logging and incident response tools (like Dynatrace, Splunk, OpsGenie, BigPanda, Prometheus, etc.)
  • Experience implementing and maintaining CI/CD pipelines for large-scale applications
  • Experience creating system architectures for disaster recovery implementation and failover during disasters
  • Familiarity with AI/ML-driven observability and predictive maintenance techniques
  • Exceptional problem solving, communication and stakeholder management skills
  • Experience leading, mentoring and developing high-performing SRE teams
  • Experience managing large, cross-functional vendor teams
  • Experience defining SLOs/SLIs, error budgets, and KPIs to drive accountability and performance
  • Ability to foster a culture of continuous improvement
  • Proven record of staying ahead of industry trends and informed about emerging technologies to enhance system reliability and efficiency
  • Experience in hospitality is preferred

Nice to have:

  • Experience in hospitality

What we offer:
  • Bonus program
  • Comprehensive health care benefits
  • 401(k) plan with up to 5% company match
  • Employee stock purchase plan at 15% discount
  • Accrued paid time off (including sick leave where applicable)
  • Life insurance
  • Group disability insurance
  • Travel discounts
  • Adoption assistance
  • Paid parental leave
  • Health savings account (except for positions based out of or performed in Hawaii)
  • Flexible spending accounts
  • Tuition assistance
  • Pre-tax commuter benefits
  • Other life and work wellness benefits
  • Stock awards
  • Deferred compensation plans

Additional Information:

Job Posted:
March 21, 2025

Employment Type:
Full-time
Work Type:
On-site work

Similar Jobs for Director, Service Reliability Engineering

Director Engineering - Security Service Edge (SSE)

Join HPE’s Security Service Edge (SSE) organization as a senior engineering lead...
Location:
Israel, Tel Aviv
Salary:
Not provided
Hewlett Packard Enterprise
Expiration Date
Until further notice
Requirements:
  • Bachelor’s degree in Computer Science or a related field from a recognized university
  • Advanced degree preferred
  • 15+ years of experience in engineering, with 10+ years in leadership roles managing large-scale teams
  • Demonstrated expertise in cloud-native operations (AWS, Azure, or GCP), infrastructure-as-code, observability, and incident management
  • Strong background in data engineering platforms (Snowflake, Airflow, etc.), data governance, and analytics delivery
  • Experience overseeing security operations, including cloud security architecture, compliance, and access controls
  • Proven ability to drive high-scale, reliable, and efficient SaaS operations
  • Strong business and operational judgment with a track record of cross-functional impact
  • Excellent communication skills, comfortable engaging technical and executive audiences alike
Job Responsibility:
  • Provide strategic and technical leadership for Ops, Data Engineering and Management teams supporting the SSE platform
  • Lead and grow high-performing, geographically distributed engineering teams
  • Define organizational goals aligned with business and technology roadmaps
  • Drive execution against measurable outcomes
  • Champion engineering excellence through DevOps, automation, security-by-design, and modern development practices
  • Collaborate with product, architecture, customer success, and executive leadership to drive technical and business success
  • Foster a culture of innovation, operational excellence, and continuous improvement
  • Provide coaching, career development, and succession planning for leaders and senior engineers
What we offer:
  • A competitive salary and extensive social benefits
  • Diverse and dynamic work environment
  • Work-life balance and support for career development
  • An amazing life inside the element
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
Employment Type:
Full-time

Director of Engineering, Platform Engineering

In your role as ‘Director of Engineering, Platform Engineering’ you will guide t...
Location:
United States, Oakland, California
Salary:
241000.00 - 305000.00 USD / Year
Everlaw
Expiration Date
Until further notice
Requirements:
  • At least 4 years of experience managing and leading senior engineers, including technical workstream management and execution support
  • At least 2 years of experience managing and leading managers, coaching them on talent management, strategic planning, and execution, with a focus on platform engineering teams
  • At least 5 years of experience as a senior engineer building one or more of the following: developer productivity tools or highly available platform services (e.g., storage systems, pub-sub systems, search systems, caching solutions, observability solutions), and/or expertise and experience with infrastructure and/or cloud technologies (such as Ansible, Terraform, Kubernetes, Docker)
  • You have a good dynamic range that you apply to different situations - you can step back and empower, while also diving deep into the code to understand the details
  • You can communicate at the right altitude with both technical and non-technical stakeholders
  • You have experience working with stakeholder teams (internal and/or external) in setting and collaborating on technical roadmaps
  • You have experience explaining to customers how the platform handles reliability, security and compliance matters
  • You have a BS/MS or PhD in Computer Science (or equivalent)
  • You have a sound foundational understanding of a wide range of computer science topics and concerns relating to system and software design
  • You are authorized to work in the United States
Job Responsibility:
  • Inspire and empower your managers to cultivate high-performing teams, fostering a culture of continuous feedback and professional growth to ensure successful project delivery and career development
  • Use your technical knowledge to align stakeholders across Engineering and Product on the ideal path forward on complex technical decisions and roadmap decisions
  • Strategize, prioritize, resource, and execute against our Engineering roadmap
  • Work with Engineering Operations, cross-functional teams, team members and managers to improve various processes that affect infrastructure growth, support, alignment, collaboration, and accountability
  • Critically observe and understand Everlaw’s platform, tooling, and processes
What we offer:
  • Equity program
  • 401(k) retirement plan with company matching
  • Health, dental, and vision
  • Flexible Spending Accounts for health and dependent care expenses
  • Paid parental leave and approximately 10 days (80 hours) per year of sick leave
  • Seventeen paid vacation days plus 11 federal holidays
  • Membership to Modern Health to help employees prioritize mental health and wellness
  • Annual allocation for Learning & Development opportunities and applicable professional membership dues
  • Company-sponsored life and disability insurance
  • Work in Downtown Oakland, just steps from the BART line and dozens of restaurants
Employment Type:
Full-time

MTS Software Architecture - Reliability Engineering

Our team is searching for a Full Stack Member of Technical Staff to collaborate ...
Location:
United States, Frisco; Atlanta; Overland Park
Salary:
145400.00 - 262300.00 USD / Year
T-Mobile
Expiration Date
Until further notice
Requirements:
  • Bachelor's degree in computer science, engineering or a related field of study
  • 9+ years technical engineering experience, including full-stack web development (front-end and back-end)
  • 7+ years of experience in database schema design and writing SQL
  • 3+ years DevOps experience, including infrastructure as code
  • 4+ years hands-on experience with cloud services (AWS, Azure, GCP)
  • 3+ years experience mentoring and coaching team members
  • Expertise in multiple technologies and software stacks
  • Strong understanding of cloud capabilities and how to optimize them for team success
  • Ability to set up a completely new full stack environment from scratch, including build steps and backend infrastructure
  • Proficiency in HTML, CSS, webpack, JavaScript, at least one front end framework and one backend framework
Job Responsibility:
  • Imagines, designs and builds full stack web solutions including both the back end and front end
  • Reviews code and mentors other team members
  • Imagines, designs and builds advanced scheduled jobs and micro-services defining new patterns and orchestrations
  • Imagines, designs and implements advanced data storage mechanisms using relational and non-relational data stores
  • Explores, builds and configures cloud services using infrastructure as code. Recommends new cloud services and patterns
  • Presents ideas which improve an existing system/process/service. Presents new ideas which utilize new frameworks to improve an existing system/process/service
  • Collaborates with team to break down features into user stories and estimate them
  • Maintains awareness of the technology roadmap
  • Updates job knowledge by tracking and understanding emerging engineering practices
  • Continuously learns, creates content, and teaches others specific subject areas
  • Informally coaches and contributes to the development of others through mentoring or in-house workshops and learning sessions
  • Coaches and develops engineers across functional teams on technology decisions
  • Influences technology and policy decisions made at Director+ level across the organization
  • Understands financial decisions, including NPV and ROI, based on customer experience/business drivers
  • Presents highly technical concepts to both technical and non-technical decision-makers
  • Provides direction on creation of reliability practices, metrics and tooling based on industry best practices and incident data
What we offer:
  • Competitive base salary and compensation package
  • Annual stock grant
  • Employee stock purchase plan
  • 401(k)
  • Access to free, year-round money coaches
  • Medical, dental and vision insurance
  • Flexible spending account
  • Employee stock grants
  • Paid time off
Employment Type:
Full-time

Director of Engineering

The Engineering Director will lead and manage the engineering and maintenance fu...
Location:
Saudi Arabia, Riyadh
Salary:
120000.00 GBP / Year
Alfa-Executive Solutions
Expiration Date
Until further notice
Requirements:
  • 5+ years at senior management/directorship level with a demonstrable track record of successfully leading a culturally diverse team while establishing a high-performance culture
  • Proven experience in a senior engineering or technical operations leadership role, ideally within heavy plant hire, construction equipment, or related industries
  • Excellent knowledge of mechanical, hydraulic, electrical, and electronic systems in heavy equipment
  • Demonstrated ability to lead multi-disciplinary teams across several locations
  • Sound understanding of HSE regulations relevant to plant operations
  • Commercial acumen with experience managing budgets and capital investments
  • Proficient in asset management systems, telematics, and data analytics
  • Strong, confident people manager capable of creating and maintaining team spirit, motivating and developing staff, and managing performance as required
  • Applicants must be based in Saudi Arabia or have extensive experience (minimum of 5 years) working in the Saudi Arabian market
Job Responsibility:
  • Develop and implement the engineering strategy aligned with business goals and growth plans
  • Provide input into company-wide strategic planning, particularly regarding fleet investment and asset management
  • Lead CAPEX planning for fleet upgrades, replacements, and new acquisitions
  • Oversee the maintenance, repair, and servicing of all fleet, ensuring minimal downtime
  • Develop and implement predictive, preventive, and corrective repair programs
  • Direct planning and scheduling efforts for maintenance and repair work
  • Monitor implementation of Planned Preventive Maintenance (PPM) program
  • Analyse performance metrics and develop improvement plans
  • Implement reliability excellence practices and lead failure analysis
  • Conduct audits of critical asset maintenance
What we offer:
  • package
Employment Type:
Full-time

Director of Engineering - JFrog Security

We are seeking a visionary Director of Engineering with a strong product mindset...
Location:
Israel, Netanya/Tel Aviv
Salary:
Not provided
JFrog
Expiration Date
Until further notice
Requirements:
  • 10+ years of experience in software engineering
  • At least 5 years in leadership roles managing diverse R&D teams
  • Proficiency in Software Development: strong experience in building and managing scalable systems using languages such as Go, Python, Java, or Node.js
  • Proven track record of delivering scalable cloud / SaaS services, operating in multi-cloud (AWS, Azure, GCP) and hybrid-cloud environments
  • Experience working in Agile or DevOps-focused organizations
  • Exceptional leadership, communication, stakeholder, and customer management skills
  • Passion for empowering developers and enabling productivity through platform engineering
Job Responsibility:
  • Build, lead, and mentor multiple cross-functional teams of developers to deliver a scalable and reliable product
  • Foster a culture of collaboration, automation, and continuous improvement within the engineering organization
  • Define and implement procedures and workflows, enabling rapid innovation alongside ensuring enterprise-grade quality of the delivered product
  • Architect and deliver an enterprise-grade solution that operates in both SaaS and self-hosted environments
  • Partner with engineering leaders, product teams, and business stakeholders to ensure successful product adoption by the customers

Director of Engineering - AppTrust

We are looking for a visionary Director of Engineering with a strong product min...
Location:
Israel, Netanya/Tel Aviv
Salary:
Not provided
JFrog
Expiration Date
Until further notice
Requirements:
  • 10+ years of experience in software engineering
  • At least 5 years in leadership roles managing diverse R&D teams
  • Proficiency in software development: strong experience in building and managing scalable systems using languages such as Go, Python, Java, or Node.js
  • Proven track record of delivering scalable cloud / SaaS services, operating in multi-cloud (AWS, Azure, GCP) and hybrid-cloud environments
  • Experience working in Agile or DevOps-focused organizations
  • Exceptional leadership, communication, stakeholder, and customer management skills
  • Passion for empowering developers and enabling productivity through platform engineering
Job Responsibility:
  • Build, lead, and mentor multiple cross-functional teams of developers to deliver a scalable and reliable product
  • Foster a culture of collaboration, automation, and continuous improvement within the engineering organization
  • Define and implement procedures and workflows, enabling rapid innovation alongside ensuring enterprise-grade quality of the delivered product
  • Architect and deliver an enterprise-grade solution that operates in both SaaS and self-hosted environments
  • Partner with engineering leaders, product teams, and business stakeholders to ensure successful product adoption by customers
  • Play an active role in the group’s leadership team

Director of Engineering

This role involves driving the strategy, development, and seamless integration o...
Location:
India, Bangalore
Salary:
Not provided
Hewlett Packard Enterprise
Expiration Date
Until further notice
Requirements:
  • Proven leadership experience in systems integration around Private Cloud Solutions and related domains
  • Expertise in Kubernetes, containers, microservices, and cloud-native architectures
  • Strong expertise in systems engineering principles, design, and technologies, with demonstrated experience in managing complex engineered systems
  • Understanding of cloud security principles and identity management tools such as Keycloak
  • Experience in DevOps practices, CI/CD pipelines, and continuous integration
  • Familiarity with infrastructure monitoring tools (Prometheus, Grafana, ELK Stack)
  • Exceptional analytical and problem-solving skills, especially in navigating AI, private cloud-specific, and partner-driven challenges
  • Experience in Agile methodologies, particularly Scrum and SAFe frameworks
  • Track record in managing complex technical projects and cross-functional teams
  • Understanding and application of core storage, compute and networking technologies, along with virtualization, microservices, distributed systems architecture, etc.
Job Responsibility:
  • Lead engineering teams, promote innovation, collaboration, and improvement
  • Manage design, architecture, integration, release, and lifecycle of complex systems for compatibility, performance, and reliability
  • Develop integration strategies for Private Cloud Solutions aligned with organizational goals
  • Lead strategic initiatives to automate cloud operations, upgrades, and deployments
  • Manage cloud security strategies and ensure comprehensive identity and access management practices
  • Implement systems to monitor performance, ensuring reliability, scalability, and security of integrated private cloud solutions
  • Enforce strict standards for production readiness and release
  • Ensure adherence to internal and industry security and operational standards
  • Work with business stakeholders, technical leaders, cross-functional teams, and development teams to identify integration challenges, integrate hardware and software, and propose solutions for private cloud implementations
  • Implement and sustain Agile practices, process workflows, and documentation frameworks for systems integration
What we offer:
  • Health & Wellbeing
  • Personal & professional development
  • Unconditional inclusion
Employment Type:
Full-time

Director, North America Infrastructure Operations & Reliability

Alimentation Couche-Tard (Circle K) seeks a highly experienced, driven, and dyna...
Location:
United States of America, Tempe
Salary:
Not provided
Circle K
Expiration Date
Until further notice
Requirements:
  • Minimum of 10 years of demonstrated progressively responsible experience and successful Infrastructure and operations management of distributed global platforms
  • Strong ability to identify needs, take initiative, and prioritize work efforts, balancing operational tasks with longer-term strategic security efforts
  • Proven success in establishing key performance indicators, metrics, and focus to drive operational / service delivery best practices
  • Meticulous planning skills with a balance of risk management and efficient execution
  • Ability to establish and balance priorities between new initiatives and sustaining operations engineering work
  • Ability to establish and maintain trust and rapport with the team and external constituents
  • Experience leading and developing multiple team members and managed service providers
  • Strong knowledge and understanding of infrastructure operations and reliability best practices in a high-volume and critical production service environment
  • Experience managing vendor relationships for all infrastructure services and solutions and reviewing vendor contracts, statements of work, and related documents
  • Experience in DevOps and Infrastructure and Application migration to cloud
Job Responsibility:
  • Lead a multi-disciplinary North America focused team, in close partnership with managed service providers, to establish roadmaps and successful implementation of technology standards, including hosting, network, storage, workplace, desktop, and other datacenter infrastructure
  • Build strong relationships with company leaders and departments across the organization to understand the business, share knowledge, and foster a collaborative, supportive environment when recommending technology solutions to meet business objectives
  • Partner with cybersecurity and risk management teams to ensure the infrastructure meets security requirements and evolves over time to meet changing needs and best practices
  • Cloud Migration & DevOps: Drive application migration to the cloud, embedding DevOps and observability tooling to enhance delivery and monitoring
  • Implement Observability best practices and tooling to monitor the effectiveness of the delivery of application and infrastructure services
  • Working closely with the Operational Resiliency team, develop and implement infrastructure disaster recovery protocols to minimize disruption to business operations in the event of emergency situations
  • Develop and report on relevant KPIs and metrics to drive operational maturity, improved customer experience, and aid in transparency and understanding across the business of the infrastructure organization’s contributions to the business
  • Maintain a strong focus on leading and developing team members and extended team members from managed service partners: ensuring professional growth, setting direction and priorities, delegating tasks, resolving conflicts, and fostering a winning culture with high-performance-oriented team members
Employment Type:
Full-time