CrawlJobs Logo

Staff Systems Infrastructure Engineer

solomonpage.com Logo

Solomon Page

Location Icon

Location:
United States , Palo Alto

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

120000.00 - 200000.00 USD / Year

Job Description:

You will be an integral part of our engineering team, collaborating closely with backend, data, and AI/ML engineers to design and implement infrastructure solutions that support our rapid growth and evolving product needs. You'll work directly with our security and compliance teams to ensure our infrastructure meets stringent healthcare regulations, including HIPAA and SOC 2. Members located around the San Francisco Bay Area come to office once or more weekly. While relocation is encouraged, we are a remote-first company. You must be able to work during the core hours in the Pacific timezone. For compliance reasons, we cannot employ you outside the United States.

Job Responsibility:

  • Design, implement, and maintain highly available, scalable, and secure cloud infrastructure on Google Cloud Platform (GCP) to support our Clinical Data Intelligence Platform and SMART on FHIR applications
  • Develop and implement Infrastructure as Code (IaC) solutions to automate provisioning, configuration, and management of our environments
  • Build and optimize CI/CD pipelines using tools like GitHub Actions to enable rapid and reliable deployment of our applications and services
  • Implement and manage monitoring, alerting, and logging solutions with a focus on OpenTelemetry to ensure system health, identify performance bottlenecks, and proactively address issues
  • Collaborate with engineering teams to optimize application performance, reliability, and cost efficiency
  • Ensure strict adherence to security best practices and compliance requirements (e.g., HIPAA, SOC 2) across all infrastructure components and processes
  • Manage and improve database infrastructure (e.g., PostgreSQL, AlloyDB, Cloud SQL) for performance and scalability
  • Take part in rotating on-call duties to maintain the stability and availability of our production systems

Requirements:

  • 7+ years of experience in DevOps, Site Reliability Engineering, or Infrastructure Engineering roles
  • Deep expertise in cloud platforms, with significant experience in Google Cloud Platform (GCP) services (e.g., Kubernetes (GKE), Cloud Run, Cloud SQL, AlloyDB, Pub/Sub, Cloud Storage, Compute Engine)
  • Strong proficiency with Infrastructure as Code (IaC) concepts and tools
  • Extensive experience with CI/CD pipeline development and management, specifically with GitHub Actions
  • Solid understanding of containerization technologies, especially Docker and Kubernetes
  • Proficiency in scripting languages (e.g., Python, Bash) for automation and system management
  • Experience with monitoring, logging, and alerting tools, with a focus on OpenTelemetry
  • Demonstrated knowledge of database administration and optimization, particularly PostgreSQL, AlloyDB, and Cloud SQL
  • A strong commitment to information security and privacy, with experience in implementing and maintaining systems in compliance with frameworks like HIPAA and SOC 2
  • Excellent problem-solving skills and the ability to troubleshoot complex infrastructure issues
  • Clear communication, documentation, and collaboration skills

Nice to have:

Familiarity with healthcare data standards (e.g., FHIR, HL7) and experience supporting SMART on FHIR applications

What we offer:

0.05% – 0.4% and Benefits

Additional Information:

Job Posted:
January 18, 2026

Employment Type:
Fulltime
Work Type:
Remote work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Staff Systems Infrastructure Engineer

Staff Infrastructure System Engineer

Staff Infrastructure System Engineer role at Ledger, focused on designing, confi...
Location
Location
France , Paris
Salary
Salary:
Not provided
https://www.ledger.com Logo
Ledger
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • At least 10 years of experience in Linux system administration and management in a high availability environment
  • Experience with virtualization and containerization environments (K8s, Docker, Proxmox)
  • Hands-on experience with configuration management tools such as Ansible and Infrastructure-as-code solutions such as SaltStack or Terraform
  • Working knowledge of system monitoring tools- especially Prometheus
  • Knowledge of encryption and PKI technologies
  • Highly motivated and self-driven
  • Proven level of autonomy
  • Ability to communicate, convince, explain, and justify choices
  • Honest and realistic
  • Creative problem solving and solutions assessment skills with an ability to identify develop and implement solutions to meet the needs of the business
Job Responsibility
Job Responsibility
  • Design, configuration and administration of systems infrastructure including core services, internal tools, monitoring solutions and bare metal servers
  • Researching, piloting, integrating, and implementing new technologies and infrastructure solutions
  • Supporting and contributing to the delivery of source code version management, continuous integration tools, and package management solutions
  • Accurately sizing and forecasting systems work packages within the infrastructure domain
  • Act as point of escalation for the design or setup of any systems delivery
  • Monitoring, optimizing and troubleshooting, diagnosing and resolving hardware or software incidents and problems
  • Protecting data, software, and hardware by coordinating, planning and implementing security measures
  • Configuration, monitoring and maintenance of backup and replication routines and organizing disaster recovery readiness
  • Ensure documentation of Ledger's infrastructure systems is up to date
What we offer
What we offer
  • Flexible work options - work from home up to 3 times per week
  • Health & Wellness support - Health and Life Insurance
  • Financial growth opportunities - employees can become shareholders in Ledger
  • Commuter allowance - contribution to preferred means of transportation
  • Learning & Development - comprehensive suite of training solutions providing personalised learning experience
  • Fulltime
Read More
Arrow Right

Staff Infrastructure Engineer

Porter is a Platform as a Service that runs in the user's own cloud. We allow us...
Location
Location
United States , New York
Salary
Salary:
200000.00 - 280000.00 USD / Year
porter.run Logo
Porter
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Senior backend engineer
  • Experience architecting internal developer/infrastructure platforms
  • Experience programming against hyperscaler and k8s APIs
  • 3+ years experience
  • Go experience is a plus
Job Responsibility
Job Responsibility
  • Own our infrastructure management system
  • Stay up to date on the latest in cloud infrastructure, Kubernetes and DevOps best-practices
  • Raise the standard for code quality and our engineering culture
What we offer
What we offer
  • Medical, vision, dental insurance
  • 401k
  • 6 weeks of PTO, 6 weeks of remote work
  • Free lunch and office snacks
  • Fulltime
Read More
Arrow Right

Staff Software Engineer, Database Systems

Zilliz is a fast-growing startup developing the industry’s leading vector databa...
Location
Location
Salary
Salary:
175000.00 - 250000.00 USD / Year
zilliz.com Logo
Zilliz
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 4+ years of experience in developing database systems
  • Active contributor to one or more infrastructure software such as Snowflake, CockroachDB, Oracle RDBMS, Google BigQuery, Spanner, Redshift, Aurora, Cosmos DB, MySQL, PostgreSQL, Hudi, Delta Lake, Iceberg, Ray, Spark, Flink, Kafka, Redis, ElasticSearch, etc.
  • Bachelor’s degree or above in computer science, software engineering, or other relevant disciplines
  • Willing to adapt to a fast-changing environment (i.e. the different stages of a startup company)
Job Responsibility
Job Responsibility
  • Develop distributed database systems using Zilliz’s innovative data science platforms
  • Create request plans, develop new systems, and perform prototype verification and testing
  • Design and write core architecture code
  • Provide creative solutions to technical issues that arise during the product development process
  • Take ownership of product performance and stability
  • Research emerging technology to optimize the performance of underlying distributed platforms
  • Manage the Milvus open-source community and broaden Zilliz’s reach worldwide
What we offer
What we offer
  • Competitive compensation (cash + equity)
  • Regular bonus and equity refresh opportunities
  • Medical, dental, and vision insurance
  • Paid time off, including vacation, sick leave, and global reset/wellbeing days
  • Generous 401(k) and regional retirement plans
  • Fulltime
Read More
Arrow Right

Staff Software Engineer, Compute

Play a key role in building our platform from zero to one. Partner across teams ...
Location
Location
United States
Salary
Salary:
200000.00 - 275000.00 USD / Year
getdbt.com Logo
dbt Labs
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years of experience in software engineering, with expertise in database systems, query engines, or storage systems
  • Strong coding skills at the systems level C++, Rust, Go, Python, or Java
  • Experience designing and scaling distributed systems or SaaS platforms
  • Expertise with cloud infrastructure (AWS, GCP, Azure, Kubernetes, Terraform)
  • Proven ability to lead complex projects and collaborate across functions
  • Excellent problem-solving skills, clear communication, and a strong sense of ownership
Job Responsibility
Job Responsibility
  • Design, build, and maintain the Compute layer that powers dbt’s ability to optimize queries across ingestion, transformation, and consumption
  • Lead technical architecture discussions with a focus on query engines, storage systems, and distributed database design
  • Collaborate with Product, Design, Operations, and Security to deliver well-architected, scalable compute solutions
  • Build services, APIs, and experiences that support user delight, quality, high availability, and performance
  • Tackle ambiguous, open-ended technical challenges with strategic thinking, balancing technical constraints with user needs and product goals
  • Define and drive best practices in testing, observability, and system reliability
  • Mentor engineers across the company, fostering technical growth and collaboration
  • Champion a culture of technical excellence and innovation, influencing engineering direction across multiple teams or domains
What we offer
What we offer
  • Unlimited vacation
  • 401k
  • Pension Plan
  • 16 weeks Paid Parental Leave
  • Wellness stipend
  • Home office stipend
  • Equity Stake
  • Fulltime
Read More
Arrow Right

Systems Engineer

This position plays a pivotal role in designing, implementing, and maintaining t...
Location
Location
United States , Sacaton
Salary
Salary:
Not provided
https://www.roberthalf.com Logo
Robert Half
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in Information Technology, Computer Science, or related field (or equivalent work experience)
  • Minimum of 3+ years of experience in systems engineering, IT infrastructure, or related roles
  • Comprehensive knowledge of operating systems, networking, virtualization, and cloud platforms
  • Hands-on experience with tools for system monitoring, automation, and troubleshooting
  • Strong analytical skills with the ability to solve complex technical problems
  • Excellent verbal and written communication skills to interact with both technical and non-technical stakeholders
Job Responsibility
Job Responsibility
  • Plan and implement IT systems to align with both technical needs and business goals
  • Collaborate with cross-functional teams to integrate, customize, and optimize existing systems
  • Monitor performance and reliability of IT systems, ensuring minimal downtime
  • Identify and resolve software, hardware, and network issues in a timely manner
  • Develop and enforce system security measures to safeguard sensitive data
  • Ensure all systems comply with regulatory and industry standards
  • Contribute to system modernization projects by addressing technical debt and optimizing legacy infrastructure
  • Identify areas for automation to streamline processes and improve efficiency
  • Support the migration of core systems to cloud environments such as AWS, Azure, or Google Cloud
  • Implement scalable and resilient infrastructure to meet future growth
What we offer
What we offer
  • medical, vision, dental, and life and disability insurance
  • eligibility to enroll in our company 401(k) plan
  • Fulltime
Read More
Arrow Right

Staff Software Engineer, Reliability

Join us in building the future of finance. Our mission is to democratize finance...
Location
Location
United States , Menlo Park
Salary
Salary:
217000.00 - 255000.00 USD / Year
robinhood.com Logo
Robinhood
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years of experience designing and scaling distributed systems in production environments
  • Deep technical expertise in one or more programming languages (e.g., Python, Go, C++) and strong systems engineering fundamentals
  • Experience leading major infrastructure or reliability initiatives across multiple teams or domains
  • Track record of improving reliability metrics such as SLO adherence, MTTD/MTTR, or cost efficiency at scale
  • Strong mentorship and communication skills, with a focus on collaboration, clarity, and impact
Job Responsibility
Job Responsibility
  • Develop and build software, infrastructure and tools that improve observability, alerting, incident response, and system readiness
  • Serve as a technical leader and reliability domain expert across multiple teams, driving architectural decisions and cross-functional initiatives
  • Design and lead large-scale reliability efforts that impact Robinhood’s most critical systems and services
  • Lead Production Readiness Reviews, championing best practices in pre-production testing, SLO development, and incident response metrics
  • Mentor engineers, foster a reliability-first culture, and drive long-term improvements that reduce operational overhead and improve system health
What we offer
What we offer
  • Performance driven compensation with multipliers for outsized impact, bonus programs, equity ownership, and 401(k) matching
  • 100% paid health insurance for employees with 90% coverage for dependents
  • Lifestyle wallet - a highly flexible benefits spending account for wellness, learning, and more
  • Employer-paid life & disability insurance, fertility benefits, and mental health benefits
  • Time off to recharge including company holidays, paid time off, sick time, parental leave, and more
  • Exceptional office experience with catered meals, events, and comfortable workspaces
  • Fulltime
Read More
Arrow Right

Engineering Manager, Infrastructure

As an Engineering Manager for the Infrastructure team, you’ll lead the engineers...
Location
Location
Canada; United States
Salary
Salary:
195000.00 - 285000.00 USD / Year
apollo.io Logo
Apollo.io
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of hands-on software or infrastructure engineering experience
  • 2+ years of experience leading teams of senior and staff-level engineers in platform, SRE, or infrastructure domains
  • Proven ability to design and operate large-scale distributed systems in cloud environments (preferably GCP or AWS)
  • Expertise with Kubernetes, Docker, Terraform, Ubuntu, and CI/CD pipelines
  • Familiarity with observability tools (Grafana, Prometheus, ELK, Datadog, NewRelic) and performance tuning
  • Strong grounding in networking, security, and reliability principles
  • Experience managing infrastructure costs, availability SLAs, and high-throughput systems at scale
Job Responsibility
Job Responsibility
  • Lead, coach, and grow a distributed team of high-impact Infrastructure Engineers
  • Partner with senior engineering leadership on strategic initiatives such as cloud migration, infrastructure scaling, platform reliability, and cost efficiency
  • Define and implement modern operational excellence practices, including SLOs, error budgets, incident reviews, and performance monitoring
  • Guide technical decision-making across key areas like Kubernetes, GCP, observability, networking, CI/CD, and IaC (Terraform, Ansible)
  • Collaborate with AI, Data, and Product Engineering teams to ensure infrastructure scalability for ML and AI-native workloads
  • Run effective 1:1s, career development conversations, and quarterly performance reviews
  • Support recruiting efforts to attract top engineering talent across time zones
What we offer
What we offer
  • Equity
  • Company bonus or sales commissions/bonuses
  • 401(k) plan
  • At least 10 paid holidays per year
  • Flex PTO
  • Parental leave
  • Employee assistance program and wellbeing benefits
  • Global travel coverage
  • Life/AD&D/STD/LTD insurance
  • FSA/HSA and medical, dental, and vision benefits
  • Fulltime
Read More
Arrow Right

Staff Software Engineer, Cloud Capacity

The Cloud Capacity team plays a critical role in ensuring the Temporal Cloud is ...
Location
Location
United States
Salary
Salary:
170000.00 - 250000.00 USD / Year
temporal.io Logo
Temporal
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Proven experience contributing to large-scale infrastructure efforts spanning cloud compute, storage, and networking systems
  • Strong product and operational intuition around managing cloud costs, utilization tracking, and workload forecasting
  • A track record of designing distributed systems and services in a production cloud environment (preferably AWS, GCP, or Azure)
  • Hands-on experience with container orchestration technologies (e.g., Kubernetes) and the surrounding ecosystem
  • Exceptional collaboration and communication skills
  • Comfortable aligning cross-functional stakeholders on complex infrastructure problems, including executives and finance partners
  • 6+ years of experience building production software using Go, Java, or similar languages
Job Responsibility
Job Responsibility
  • Drive the technical vision and roadmap for Temporal’s Cloud Capacity systems in partnership with engineering and product leadership
  • Design and implement infrastructure to track resource utilization, forecast consumption, and support automated capacity planning at scale
  • Lead development of a resource manager that optimizes infrastructure efficiency based on usage trends, cost insights, and evolving customer needs
  • Collaborate cross-functionally with Product, Cloud Infrastructure, and Finance to inform business-critical decisions around provisioning, pricing, and scaling
  • Guide long-term strategy to support intelligent autoscaling, workload isolation, and predictable performance in a multi-tenant cloud environment
What we offer
What we offer
  • Unlimited PTO, 12 Holidays + 2 Floating Holidays
  • 100% Premiums Coverage for Medical, Dental, and Vision
  • AD&D, LT & ST Disability, and Life Insurance (Standard & Supplemental Available)
  • Empower 401K Plan
  • Additional Perks for Learning & Development, Lifestyle Spending, In-Home Office Setup, Professional Memberships, WFH Meals, Internet Stipend and more
  • $3,600 / Year Work from Home Meals
  • $1,500 / Year Career Development & Learning
  • $1,200 / Year Lifestyle Spending Account
  • $1,000 / Year In-Home Office Setup (In addition to Temporal issued equipment)
  • $500 / Year Professional Memberships
  • Fulltime
Read More
Arrow Right