CrawlJobs Logo

Senior Software Engineer, Substrate

palantir.com Logo

Palantir Technologies

Location Icon

Location:
United States , New York

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

135000.00 - 200000.00 USD / Year

Job Description:

Substrate is the team responsible for Palantir’s core production infrastructure — 100s of K8s clusters — from on-prem to the major cloud hyperscalers, whether they are internet-connected or air-gapped, small hardware footprint or large. As a Senior Software Engineer on Substrate, you will design and build Palantir’s managed Kubernetes product offerings across all these environments. You and your team will be responsible for bootstrapping and operating the entire fleet of K8s clusters with zero manual steps by building industry leading tooling and contributing to core CNCF components. You will also be responsible for ensuring scale, stability and security across a matrix of compliance regimes and hosting infrastructure types. Your team culture emphasizes engineering rigor and operational excellence at scale. This means issues in production should be pre-empted and deeply root-caused, and investments in automation and self-healing systems are key.

Job Responsibility:

  • Deliver a container runtime to challenging new environment types - new clouds, on premise, edge devices
  • Build automation and establish standards for operating K8s securely at scale with zero manual ops overhead
  • Drive innovation through adoption of novel K8s features and CNCF tools, making upstream contributions as needed
  • Design the next generation of Palantir’s infrastructure through a deep understanding of internal systems and CNCF standards

Requirements:

  • 4+ years of professional software development experience on core infrastructure with emphasis on operational excellence
  • 2+ years of experience contributing to the system design or architecture (architecture, design patterns, reliability and scaling) of new and existing systems
  • Bachelor's degree in Computer Science or equivalent
  • Systems programming experience with strong proficiency in golang, C/C++ or equivalent
  • Working knowledge or hands on experience of infrastructure automation tools such as Terraform, ansible, puppet or K8s operators, and competent coding in Go, Java, or equivalent for the purposes of automation or scripting
  • Deep familiarity with hardware and OS configurations, diagnostic tooling, networking nuts and bolts
  • Deep familiarity with containers (Docker) and orchestration (Kubernetes) at scale
  • Experience working with a cloud provider (AWS/Azure/GCE), or sysadmin/SRE experience in data centers
  • Experience designing, building, and operating high-scale observability or infrastructure systems
  • Working knowledge of networking fundamentals, experience with CNIs or cloud networking infrastructure preferred
What we offer:
  • Employees (and their eligible dependents) can enroll in medical, dental, and vision insurance as well as voluntary life insurance
  • Employees are automatically covered by Palantir’s basic life, AD&D and disability insurance
  • Commuter benefits
  • Relocation assistance
  • Take what you need paid time off, not accrual based
  • 2 weeks paid time off built into the end of each year (subject to team and business needs)
  • 10 paid holidays throughout the calendar year
  • Supportive leave of absence program including time off for military service and medical events
  • Paid leave for new parents and subsidized back-up care for all parents
  • Fertility and family building benefits including but not limited to adoption, surrogacy, and preservation
  • Stipend to help with expenses that come with a new child
  • Employees can enroll in Palantir’s 401k plan

Additional Information:

Job Posted:
February 18, 2026

Employment Type:
Fulltime
Work Type:
Hybrid work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Senior Software Engineer, Substrate

New

Principal Site Reliability Engineering Manager

The Principal SRE Manager leads the team responsible for durable, high quality h...
Location
Location
Australia , Perth
Salary
Salary:
Not provided
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Doctorate Degree in Computer Science, Information Technology, or related field AND 2+ years technical experience in software engineering, network engineering, or systems administration
  • Master's Degree in Computer Science, Information Technology, or related field AND 3+ years technical experience in software engineering, network engineering, or systems administration
  • Bachelor's Degree in Computer Science, Information Technology, or related field AND 5+ years technical experience in software engineering, network engineering, or systems administration
  • equivalent experience
  • Proven experience leading teams through high severity production incidents in large, distributed systems
  • Demonstrated people leadership experience managing senior engineers or technical incident leaders
  • Strong understanding of incident management, reliability engineering, and live site operations at scale
  • Ability to drive clarity, accountability, and results in ambiguous, time critical situations
  • Ability to meet Microsoft, customer and/or government security screening requirements
  • Microsoft Cloud Background Check
Job Responsibility
Job Responsibility
  • Own execution quality for Substrate high severity incidents, ensuring clear command, decisive leadership, and forward momentum during high impact events
  • Act as the senior incident leader or sponsor for long running, high stakes, or cross service incidents, ensuring alignment on impact, risk, and recovery priorities
  • Partner closely with Incident Managers, Subject Matter Experts, and service leaders to ensure effective diagnosis, escalation, and mitigation when ownership is unclear or action is blocked
  • Ensure high quality post incident reviews and drive accountability for repair items that reduce recurrence and systemic risk
  • Ensure consistent application of severity and priority models, outage declaration criteria, and executive escalation paths
  • Lead, coach, and develop a team of Site Reliability Engineers serving as incident responders
  • Build a culture of calm execution, accountability, psychological safety, and continuous learning during and after incidents
  • Hire and grow senior talent capable of operating as trusted leaders in high pressure, executive visible situations
  • Serve as a trusted advisor to engineering leaders and executives on live site risk, readiness, and incident response maturity
  • Communicate clearly and credibly with senior leadership during customer impacting events
  • Fulltime
Read More
Arrow Right
New

Principal Site Reliability Engineer

The Principal SRE leads curial initiatives in the team responsible for durable, ...
Location
Location
Australia , Perth
Salary
Salary:
Not provided
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Doctorate Degree in Computer Science, Information Technology, or related field AND 2+ years technical experience in software engineering, network engineering, or systems administration
  • OR Master's Degree in Computer Science, Information Technology, or related field AND 3+ years technical experience in software engineering, network engineering, or systems administration
  • OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 5+ years technical experience in software engineering, network engineering, or systems administration
  • OR equivalent experience
  • Proven experience leading teams through high‑severity production incidents in large, distributed systems
  • Strong understanding of incident management, reliability engineering, and live‑site operations at scale
  • Ability to drive clarity, accountability, and results in ambiguous, time‑critical situations
Job Responsibility
Job Responsibility
  • Own execution quality for Substrate high severity incidents, ensuring clear command, decisive leadership, and forward momentum during high‑impact events
  • Act as the senior incident leader or sponsor for long‑running, high‑stakes, or cross‑service incidents, ensuring alignment on impact, risk, and recovery priorities
  • Partner closely with Incident Managers, Subject Matter Experts, and service leaders to ensure effective diagnosis, escalation, and mitigation when ownership is unclear or action is blocked
  • Ensure high‑quality post‑incident reviews and drive accountability for repair items that reduce recurrence and systemic risk. Ensure consistent application of severity and priority models, outage declaration criteria, and executive escalation paths
  • Coach and help develop a team of Site Reliability Engineers serving as incident responders
  • Build a culture of calm execution, accountability, psychological safety, and continuous learning during and after incidents
  • Help hire and grow senior talent capable of operating as trusted leaders in high‑pressure, executive‑visible situations
  • Serve as a trusted advisor to engineering leaders and executives on live‑site risk, readiness, and incident response maturity
  • Communicate clearly and credibly with senior leadership during customer‑impacting events
  • Fulltime
Read More
Arrow Right

Senior Software Engineer, Substrate

Substrate is the team responsible for Palantir’s core production infrastructure ...
Location
Location
United States , Seattle
Salary
Salary:
135000.00 - 200000.00 USD / Year
palantir.com Logo
Palantir Technologies
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 4+ years of professional software development experience on core infrastructure with emphasis on operational excellence
  • 2+ years of experience contributing to the system design or architecture (architecture, design patterns, reliability and scaling) of new and existing systems
  • Bachelor's degree in Computer Science or equivalent
  • Systems programming experience with strong proficiency in golang, C/C++ or equivalent
  • Working knowledge or hands on experience of infrastructure automation tools such as Terraform, ansible, puppet or K8s operators, and competent coding in Go, Java, or equivalent for the purposes of automation or scripting
  • Deep familiarity with hardware and OS configurations, diagnostic tooling, networking nuts and bolts
  • Deep familiarity with containers (Docker) and orchestration (Kubernetes) at scale
  • Experience working with a cloud provider (AWS/Azure/GCE), or sysadmin/SRE experience in data centers
  • Experience designing, building, and operating high-scale observability or infrastructure systems
  • Working knowledge of networking fundamentals, experience with CNIs or cloud networking infrastructure preferred
Job Responsibility
Job Responsibility
  • Deliver a container runtime to challenging new environment types - new clouds, on premise, edge devices
  • Build automation and establish standards for operating K8s securely at scale with zero manual ops overhead
  • Drive innovation through adoption of novel K8s features and CNCF tools, making upstream contributions as needed
  • Design the next generation of Palantir’s infrastructure through a deep understanding of internal systems and CNCF standards
What we offer
What we offer
  • Employees (and their eligible dependents) can enroll in medical, dental, and vision insurance as well as voluntary life insurance
  • Employees are automatically covered by Palantir’s basic life, AD&D and disability insurance
  • Commuter benefits
  • Relocation assistance
  • Take what you need paid time off, not accrual based
  • 2 weeks paid time off built into the end of each year (subject to team and business needs)
  • 10 paid holidays throughout the calendar year
  • Supportive leave of absence program including time off for military service and medical events
  • Paid leave for new parents and subsidized back-up care for all parents
  • Fertility and family building benefits including but not limited to adoption, surrogacy, and preservation
  • Fulltime
Read More
Arrow Right

Senior Software Engineer, Substrate

Substrate is the team responsible for Palantir’s core production infrastructure ...
Location
Location
United Kingdom , London
Salary
Salary:
Not provided
palantir.com Logo
Palantir Technologies
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 4+ years of professional software development experience on core infrastructure with emphasis on operational excellence
  • 2+ years of experience contributing to the system design or architecture (architecture, design patterns, reliability and scaling) of new and existing systems
  • Bachelor's degree in Computer Science or equivalent
Job Responsibility
Job Responsibility
  • Deliver a container runtime to challenging new environment types - new clouds, on premise, edge devices
  • Build automation and establish standards for operating K8s securely at scale with zero manual ops overhead
  • Drive innovation through adoption of novel K8s features and CNCF tools, making upstream contributions as needed
  • Design the next generation of Palantir’s infrastructure through a deep understanding of internal systems and CNCF standards
  • Fulltime
Read More
Arrow Right
New

Senior Site Reliability Engineer

The Senior SRE leads curial initiatives in the team responsible for durable, hig...
Location
Location
Australia , Perth
Salary
Salary:
Not provided
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Master's Degree in Computer Science, Information Technology, or related field AND 2+ years technical experience in software engineering, network engineering, or systems administration
  • OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 4+ years technical experience
  • OR equivalent experience
  • Proven experience leading teams through high‑severity production incidents in large, distributed systems
  • Solid understanding of incident management, reliability engineering, and live‑site operations at scale
  • Ability to drive clarity, accountability, and results in ambiguous, time‑critical situations
  • Ability to meet Microsoft, customer and/or government security screening requirements
  • Microsoft Cloud Background Check
Job Responsibility
Job Responsibility
  • Own execution quality for Substrate high severity incidents, ensuring clear command, decisive leadership, and forward momentum during high‑impact events
  • Act as the senior incident leader or sponsor for long‑running, high‑stakes, or cross‑service incidents
  • Partner closely with Incident Managers, Subject Matter Experts, and service leaders to ensure effective diagnosis, escalation, and mitigation
  • Ensure high‑quality post‑incident reviews and drive accountability for repair items
  • Ensure consistent application of severity and priority models, outage declaration criteria, and executive escalation paths
  • Coach and help develop a team of Site Reliability Engineers serving as incident responders
  • Build a culture of calm execution, accountability, psychological safety, and continuous learning
  • Help hire and grow senior talent
  • Serve as a trusted advisor to engineering leaders and executives on live‑site risk, readiness, and incident response maturity
  • Communicate clearly and credibly with senior leadership during customer‑impacting events
  • Fulltime
Read More
Arrow Right

Senior Software Engineer

Are you passionate about developing cutting-edge technology that impacts million...
Location
Location
India , Hyderabad
Salary
Salary:
Not provided
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
  • OR equivalent experience
  • Solid C++/C#/Java skills with at least 5+ years of C++/C#/Java programming experience
  • 5+ years of experience in distributed systems and agile development environment
  • Excellent communications and cross-group collaboration skills which facilitate interactions across team
  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role
  • These requirements include but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter
Job Responsibility
Job Responsibility
  • Design and develop search and platform features
  • Operate and manage live site for substrate search cloud service
  • Collaborate with the team on building a highly scalable and high-performance search stack
  • Collaborate with product manager and partners to understand user requirements and design features to enable rich search experiences
What we offer
What we offer
  • Impact Millions: Your work will directly enhance the search and assistance experience for millions of M365 users across various platforms
  • Collaborative Environment: Work alongside a team of highly skilled and passionate professionals in a collaborative and innovative environment
  • Cutting-Edge Technology: Be at the forefront of technological advancements and contribute to the development of powerful enterprise search and assistance solutions
  • Fulltime
Read More
Arrow Right

Senior Software Engineer

Microsoft Teams is the hub for modern collaboration—bringing together everything...
Location
Location
Canada , Vancouver
Salary
Salary:
114400.00 - 203900.00 CAD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
  • Experience working with Service Fabric or Kubernetes
  • Experience working on large-scale distributed systems, client-server architectures, and distributed database systems
  • Cross group collaboration, negotiation and communication skills
  • Ability to deal with the ambiguity associated with working in a fast-paced and changing environment
  • Experience working with M365 components like AAD, Exchange, Substrate, SharePoint
  • Drive to improve performance, availability and supportability of services
  • Drive to increase efficiencies through automation
Job Responsibility
Job Responsibility
  • Design, develop, and operate high-scale services that power the core messaging infrastructure of Microsoft Teams
  • Apply advanced in‑house AI tools to streamline development workflows, accelerate delivery, and improve system scalability
  • Dive deep into Azure technologies and distributed database systems
  • Collaborate with internal and external partners to design features that drive user growth and engagement
  • Develop features that delight customers while upholding the highest standards of availability, reliability, performance, and scalability
  • Influence and define new designs, architectures, standards, and reusable service libraries that empower teams across Microsoft to build at scale
  • Fulltime
Read More
Arrow Right
New

Staff Scientist - Reservations

The Reservations data science team owns the experience and algorithms powering t...
Location
Location
United States , Seattle
Salary
Salary:
216000.00 - 240000.00 USD / Year
uber.com Logo
Uber
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Ph.D., M.S. or Bachelor's degree in Statistics, Economics, Mathematics, Computer Science, Machine Learning, Operations Research, or other quantitative fields
  • 6+ years of industry experience as an Applied or Data Scientist or equivalent
  • Proficiency in programming languages (Python, Java, Scala) and ML frameworks (TensorFlow, PyTorch, Scikit-Learn), underpinned by a solid grasp of MLOps practices, including design documentation, testing, and source code management with Git
  • Agile project management capabilities, adept at using tools like JIRA, and a driven problem-solver with a passion for impacting the retail sector at scale
  • Advanced skills in the development and deployment of large-scale ML models
  • Experience in experimental design and analysis (e.g., A/B and market-level experiments), causal inference
  • Strong business and product sense: delight in shaping vague questions into well-defined analyses and success metrics that drive business decisions
Job Responsibility
Job Responsibility
  • Deploy a wide variety of methodologies, including causal inference techniques, funnel analyses, and econometric modeling to identify our largest business opportunities
  • Work together with Product, Operations, and Engineering partners to design a roadmap of features and initiatives as well as the long-term team strategy
  • Run large scale experiments to validate the impact of new features
  • Present findings to business and executive audiences
What we offer
What we offer
  • Eligible to participate in Uber's bonus program
  • May be offered an equity award & other types of comp
  • All full-time employees are eligible to participate in a 401(k) plan
  • Eligible for various benefits
  • Fulltime
Read More
Arrow Right