CrawlJobs Logo

Senior Site Reliability Engineer - GCP & Container Platforms

https://www.wellsfargo.com/ Logo

Wells Fargo

Location Icon

Location:
United States , CHARLOTTE, North Carolina / CHANDLER, Arizona

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided
Save Job
Save Icon
Job offer has expired

Job Description:

We are seeking a Senior Site Reliability Engineer (SRE) to help develop our cybersecurity platform operations across Windows, Linux, and cloud-native environments. This role is central to our transformation from app-specific support to platform-wide reliability engineering. You will bring deep expertise in Google Cloud Platform (GCP), container orchestration, and automation, enabling scalable, secure, and resilient infrastructure that supports diverse applications across our enterprise.

Job Responsibility:

  • Ensure high availability, performance, and security of production systems across Windows, Linux, and GCP environments
  • Engineer and support containerized workloads using Kubernetes (GKE) and Docker, enabling scalable microservices architectures
  • Lead infrastructure provisioning and configuration using Terraform, Ansible, and GCP-native tools
  • Develop automation scripts and pipelines to eliminate manual toil and accelerate incident response
  • Implement observability frameworks using SLIs/SLOs, Prometheus, Grafana, and GCP Operations Suite
  • Drive proactive monitoring, alerting, and telemetry across hybrid environments
  • Lead incident response, root cause analysis, and postmortems
  • Build self-healing systems and automated remediation workflows using GCP-native services and scripting
  • Collaborate with InfoSec to enforce hardening standards, manage vulnerabilities, and support compliance initiatives
  • Integrate security into CI/CD pipelines and container platforms using IAM, encryption, and policy enforcement
  • Partner with developers, application owners, and infrastructure teams to deliver reliable, cloud-native platforms
  • Document configurations, runbooks, and operational procedures to enable cross-team reuse and transparency

Requirements:

  • 4+ years of Technology Infrastructure Engineering and Solutions experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education
  • 2+ years of experience with GCP services, including GKE, IAM, Cloud Functions, and Cloud Monitoring

Nice to have:

  • 4+ years of experience in Windows Server administration and production support
  • Strong scripting skills in PowerShell, Python, or Shell
  • Proficiency in container technologies: Docker and Kubernetes
  • Familiarity with Linux system administration and hybrid cloud environments
  • Experience with infrastructure-as-code tools: Terraform, Ansible
  • Strong understanding of Active Directory, DNS, DHCP, and Windows security principles
  • Security certifications (e.g., CISSP, Security+, GCP Professional Cloud Security Engineer)
  • Experience with CI/CD tools (e.g., GitLab CI and Jenkins)
  • Familiarity with ITIL practices and change management
  • Exposure to ServiceNow, load balancers, certificate management, and endpoint protection tools

Additional Information:

Job Posted:
February 08, 2026

Expiration:
February 13, 2026

Employment Type:
Fulltime
Work Type:
Hybrid work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Senior Site Reliability Engineer - GCP & Container Platforms

Senior Vice President, Cloud Security Site Reliability Engineer

This role sits within the Cloud Security team which is responsible for Private a...
Location
Location
Singapore , Singapore
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree or equivalent work experience
  • 8+ years of relevant work experience
  • Highly motivated self-starter with excellent interpersonal and communication skills. Able to communicate efficiently at multiple levels of seniority
  • Certification or formal training in site reliability engineering concepts and practices
  • Prior experience working towards SLIs, SLOs and observability capabilities at a large scale
  • 5+ years experience in Python (preferable) or Java, on large scale systems alongside Linux based scripting languages
  • Experience working on observability, logging and metrics toolsets
  • Experience of k8s and container technologies such as Docker, Openshift and EKS.
  • Experience with public cloud technologies such as AWS, GCP or Azure
  • Experience with Secrets products such as HashiCorp Vault or CyberArk
Job Responsibility
Job Responsibility
  • Working across Container products and Secrets products, across Public and Private Cloud, as well as Cloud native specific products
  • Architecting and building tools and platforms that provide capabilities for SRE
  • Collaboration with multiple stakeholders and partners across Engineering and Operations as well as partner teams within the wider Citi organization
  • Actively owning production level incidents till resolution.
  • Fulltime
Read More
Arrow Right

Senior Site Reliability Engineer Cloud Platform

Zilliz is a fast-growing startup developing the industry’s leading vector databa...
Location
Location
Salary
Salary:
175000.00 - 225000.00 USD / Year
zilliz.com Logo
Zilliz
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 4+ years of experience in site reliability engineering or similar roles with a focus on cloud-native systems
  • Proficiency in scripting languages such as Python, Go, or Java
  • Strong knowledge of container orchestration technologies like Kubernetes and Docker
  • Expertise with cloud platforms such as AWS, GCP, or Azure, and their respective monitoring and management tools
  • Experience with infrastructure as code tools such as Terraform or Ansible
  • Familiarity with CI/CD tools such as Jenkins, GitLab CI, or Argo
  • Proven ability to troubleshoot complex distributed systems and resolve issues promptly
  • Bachelor’s degree or above in computer science, software engineering, or other relevant disciplines
  • Ability to thrive in a fast-paced, startup environment and handle multiple projects simultaneously
Job Responsibility
Job Responsibility
  • Work at the intersection of development and site reliability. Creating SRE tools and systems, as well as supporting existing infrastructure and platforms
  • Ensure the reliability, availability, and performance of Zilliz’s distributed database systems
  • Develop and implement strategies for monitoring, incident management, and disaster recovery
  • Automate system operations and maintenance tasks to improve efficiency and reduce manual intervention
  • Design and build tools to manage and monitor infrastructure, ensuring scalability and robustness
  • Collaborate with software engineers to enhance system reliability, scalability, and performance
  • Maintain and improve the CI/CD pipeline to ensure smooth and rapid deployment of changes
  • Actively contribute to the Milvus Vector Database open-source community, focusing on improving reliability and operational efficiency
  • Fulltime
Read More
Arrow Right

Vice President - Cloud Security Site Reliability Engineer

This role sits within the Cloud Security team which is responsible for Private a...
Location
Location
Singapore , Singapore
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree or equivalent work experience
  • 6+ years of relevant work experience
  • Highly motivated self-starter with excellent interpersonal and communication skills. Able to communicate efficiently at multiple levels of seniority
  • Certification or formal training in site reliability engineering concepts and practices
  • Prior experience working towards SLIs, SLOs and observability capabilities at a large scale
  • 4+ years experience in Python (preferable) or Java, on large scale systems alongside Linux based scripting languages
  • Experience working on observability, logging and metrics toolsets
  • Experience of k8s and container technologies such as Docker, Openshift and EKS
  • Experience with public cloud technologies such as AWS, GCP or Azure
  • Experience with Secrets products such as HashiCorp Vault or CyberArk
Job Responsibility
Job Responsibility
  • Working across Container products and Secrets products, across Public and Private Cloud, as well as Cloud native specific products
  • Architecting and building tools and platforms that provide capabilities for SRE
  • Collaboration with multiple stakeholders and partners across Engineering and Operations as well as partner teams within the wider Citi organisation
  • Actively owning production level incidents till resolution.
  • Fulltime
Read More
Arrow Right
New

Senior DevOps Engineer

As a Senior DevOps Engineer on the Payments team, you will play a critical role ...
Location
Location
Salary
Salary:
Not provided
Polygon Labs
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6 or more years of experience building and operating cloud-native infrastructure in production environments
  • Strong hands-on experience with infrastructure as code, container orchestration, and CI/CD systems, including Terraform and Kubernetes
  • Professional experience operating workloads on Google Cloud Platform or another major cloud provider such as AWS or Azure
  • Demonstrated ability to design secure, highly available systems and apply site reliability engineering best practices
  • Experience working in fast-paced, product-focused engineering teams with a strong bias toward automation
Job Responsibility
Job Responsibility
  • Design, build, and operate the cloud infrastructure powering Polygon Labs’ payments platform, with an initial focus on Google Cloud Platform
  • Implement and maintain infrastructure as code using Terraform, emphasizing modularity, reusability, and long-term scalability
  • Partner closely with payments application engineers to define infrastructure requirements and support CI/CD pipelines, observability, and runtime operations
  • Apply security-first principles across networking, compute, and deployment workflows to ensure high availability, integrity, and performance
  • Automate deployment and operational processes to improve reliability and developer experience
  • Establish and evolve infrastructure best practices aligned with site reliability engineering principles
What we offer
What we offer
  • Remote first global workforce
  • Industry leading Medical, Dental and Vision health insurance
  • Company matching 401k with 3% match
  • $1,500 Home Office Set Up Allowance (life-time max)
  • $75 Monthly internet or phone reimbursement
  • Flexible Time Off
  • 1 company wide wellness Friday day off per quarter
  • Company issued laptop
  • Egg freezing, mental health, and employee wellness benefits
  • Fulltime
Read More
Arrow Right

Senior Full Stack Engineer

The Senior Full Stack Engineer will support the modernization of IRS mission-cri...
Location
Location
United States , McLean
Salary
Salary:
Not provided
bln24.com Logo
BLN24
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in Computer Science, Engineering, or related field
  • Minimum 6 years of experience in full-stack software development and architecture
  • Demonstrated expertise in designing and implementing RESTful and GraphQL APIs, and building service-oriented architectures
  • Proficiency with front-end frameworks such as React, Angular, or Vue, and backend technologies such as Node.js, Python, Java, or Spark
  • Solid working knowledge of core web technologies, including HTML, CSS, JavaScript, and modern UI component libraries
  • Hands-on experience with cloud platforms (AWS, GCP, Azure) and container orchestration tools including Kubernetes and OpenShift
  • Familiarity with platforms such as Databricks for data engineering, pipeline integration, or ML model support
  • Experience designing scalable, secure web applications and microservices architectures with considerations for caching, authentication, and maintainability
  • Working knowledge of SQL and NoSQL databases, CI/CD pipelines, infrastructure-as-code, and cloud monitoring tools
  • Experience collaborating in Agile delivery environments, and contributing to code reviews, documentation, and team-based development workflows
Job Responsibility
Job Responsibility
  • Design and develop scalable APIs using REST, GraphQL, and gRPC in compliance with IRS enterprise architecture and security standards (OAuth, JWT)
  • Lead full-stack development of modern, modular web applications that interface with IRS systems and external users
  • Decompose and migrate legacy system functionality (e.g., COBOL-based command codes) into modern service-oriented components
  • Integrate AI-driven services, including ML model endpoints, auto-generated documentation, code conversion workflows, and intelligent test automation
  • Implement CI/CD pipelines and automated testing tools (e.g., Postman, Newman) to ensure secure, validated, and maintainable code
  • Collaborate with DevOps and Site Reliability Engineers to embed observability tools (e.g., Prometheus, Datadog, New Relic) and monitoring dashboards
  • Translate business and functional requirements into API contracts and reusable service patterns, working within Agile Scrum teams
  • Maintain backward compatibility with legacy systems while building toward scalable, cloud-optimized services
  • Ensure IRS and Treasury IT governance compliance, including Section 508 accessibility and cybersecurity policies
What we offer
What we offer
  • Generous medical, dental, and vision plans
  • Opportunity to work in different sectors
  • Flexibility to balance quality work and personal lives
  • Remote working opportunities
  • Fulltime
Read More
Arrow Right
New

Senior Network and Security Engineer

Join a company in the middle of an exciting technology transformation, where you...
Location
Location
United States , King of Prussia
Salary
Salary:
Not provided
davidsbridal.com Logo
David's Bridal
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Information Technology, Network Engineering, or related field
  • or equivalent combination of education and experience
  • Minimum 7 years of progressive experience in enterprise network engineering, administration, and security
  • Extensive hands-on experience with Cisco routing and switching technologies, including configuration, troubleshooting, and optimization of enterprise-grade equipment
  • Demonstrated experience managing Cisco Meraki cloud-managed networking solutions (MR, MS, MX) in a multi-site environment
  • Proficiency with F5 BIG-IP load balancers, including LTM configuration, iRules, SSL certificate management, and health monitoring
  • Experience designing and managing network connectivity to public cloud platforms (AWS, Azure, GCP), including VPCs, VPNs, Direct Connect/ExpressRoute, and hybrid architectures
  • Strong understanding of network security principles including firewall management, IDS/IPS, network segmentation, and zero-trust concepts
  • Working experience with SIEM platforms (Splunk preferred) for security monitoring, log analysis, and incident detection
  • Hands-on experience with vulnerability scanning tools and remediation processes
Job Responsibility
Job Responsibility
  • Design, implement, configure, and maintain enterprise network infrastructure including routers, switches, firewalls, load balancers, and wireless systems across all company locations
  • Manage and optimize Cisco Meraki wireless access points, switches, and security appliances across 190+ retail store locations, ensuring consistent connectivity and performance for point-of-sale systems, inventory management, and customer WiFi
  • Configure, maintain, and troubleshoot Cisco routing and switching infrastructure at corporate headquarters and distribution center, including VLANs, spanning tree, OSPF/BGP, QoS policies, and access control lists
  • Administer and optimize F5 load balancers (LTM/GTM) to ensure high availability, traffic distribution, SSL offloading, and optimal application delivery for critical business systems
  • Manage network connectivity and express routes to AWS, Azure, and Google Cloud Platform (GCP), ensuring secure, high-performance hybrid cloud architecture
  • Design and implement SD-WAN solutions to optimize traffic routing, reduce costs, and improve application performance across distributed retail locations
  • Plan and execute network capacity planning, ensuring infrastructure scales to meet business growth and seasonal demand fluctuations
  • Develop and maintain comprehensive network documentation including topology diagrams, IP address management (IPAM), configuration standards, and runbooks
  • Own and manage enterprise firewall infrastructure, including rule creation, modification, auditing, and lifecycle management to ensure least-privilege access and defense-in-depth security
  • Administer and monitor Splunk SIEM platform, developing and tuning correlation rules, dashboards, alerts, and reports to detect and respond to security threats
What we offer
What we offer
  • Rewarding Environment and Competitive Pay
  • Generous Dream Maker Discount After First Pay Period
  • Referral Incentive Program
  • Dayforce Wallet – Get Paid Early!
  • Health/Dental/Vision Insurance
  • 401K Program
  • Paid Vacation, Wellness Days & Holidays, including your Birthday off!
  • Pet Benefits
  • Fulltime
Read More
Arrow Right
New

SEN Teacher

Are you a motivated SEN Teacher looking for a role in the heart of Manchester? D...
Location
Location
United Kingdom , Manchester
Salary
Salary:
Not provided
https://www.randstad.com Logo
Randstad
Expiration Date
March 05, 2026
Flip Icon
Requirements
Requirements
  • UK QTS is a must
  • Experience in an SEN setting (references required)
  • UK work eligibility
What we offer
What we offer
  • Excellent local parking options
  • Spacious staff room with biscuits and cakes provided
  • Reduced childcare costs
  • Fulltime
Read More
Arrow Right
New

Indirect Procurement Manager

Lead strategic sourcing and build key relationships in a dynamic, global environ...
Location
Location
Japan , Tokyo
Salary
Salary:
9000000.00 - 12700000.00 JPY / Year
https://www.randstad.com Logo
Randstad
Expiration Date
April 17, 2026
Flip Icon
Requirements
Requirements
  • Bachelor's degree in a relevant field (e.g., Supply Chain Management, Business Administration)
  • Minimum 10 years of experience in indirect procurement, with a proven track record of success
  • Strong negotiation and contract management skills
  • Experience working with global teams and stakeholders
  • Excellent communication and interpersonal skills, with fluency in English and Japanese
  • Proficient in relevant procurement software and systems
  • Analytical skills with the ability to develop and implement strategic plans
  • Experience managing a significant financial spend budget
  • Understanding of Japanese business culture and regulatory environment
  • Strong problem-solving and decision-making abilities
Job Responsibility
Job Responsibility
  • Lead the development and execution of indirect procurement strategies for Japan, aligned with global objectives
  • Manage the full procurement lifecycle, from sourcing to contract negotiation and supplier management
  • Develop and maintain strong relationships with key internal stakeholders and external suppliers
  • Drive cost savings and efficiency improvements across the procurement process
  • Ensure compliance with all relevant procurement policies and regulations
  • Collaborate with regional and global procurement teams to share best practices and leverage synergies
  • Analyze market trends and identify opportunities for innovation and improvement
  • Prepare and manage budgets, ensuring effective financial control
What we offer
What we offer
  • 健康保険
  • 厚生年金保険
  • 雇用保険
  • 土曜日
  • 日曜日
  • 祝日
  • Fulltime
Read More
Arrow Right