This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
This role requires the ability to work lawfully in the U.S. without employment-based immigration sponsorship, now or in the future. Do you thrive on architecting and supporting mission-critical platforms that focuses on power reporting, analytics, and engineering across a large organization? At Spectrum, you’ll lead the design and execution of innovative solutions with a focus on stability and rapid delivery, ensuring our platforms are reliable for millions of users. In this role, you will build the core infrastructure and platforms that advance Autonomous Network Operations at Spectrum—ingesting ubiquitous network telemetry, powering large-scale anomaly detection, and maintaining a graph-based digital twin that captures topology, dependencies, and service health. You will operationalize agentic systems that anticipate issues before they become customer-impacting, automatically triage and diagnose root causes, and generate safe, auditable remediation recommendations for our network engineering teams. Your work will define observability standards, SLOs, and automation guardrails, translating advanced analytics into resilient, production-grade capabilities that measurably reduce incidents and MTTR while shaping the engineering landscape at Spectrum.
Job Responsibility:
Lead building and maintaining systems, supporting millions of active users, with hundreds of millions of daily API calls
Design, implement, monitor, enhance, and troubleshoot the infrastructure and APIs supporting our family of applications
Lead maintaining uptime and upholding SLAs for the platform, monitoring and troubleshooting of platform quality issues, identifying bottlenecks and bugs, and devising solutions to these problems, and implementing security and being an advocate for data governance in the company
Develop the infrastructure architecture strategies, standards, target architectures and roadmaps
Always keeps an eye towards reducing compute, bandwidth and storage costs
Diagnose highly complex issues and evaluates, recommends, as well as executes the best resolution
Ensure alignment to drive cross-platform consistency
Architect, review, and harden the overall platform, minimizing application outages
Deliver quality results to the platform engineering team
Requirements:
Bachelor's Degree in computer science or equivalent experience
8+ years of software development experience (JavaScript, TypeScript, Java, Python)
6+ years of exposure to SQL and non-relational databases
6+ years of cloud-based platform engineering role experience
Ability to read, write, speak and understand English
Extensive understanding of code versioning tools such as Git
Extensive knowledge of Computer Science fundamentals (object-oriented design, data structures and algorithm design, problem solving, and complexity analysis)
Extensive experience with building cloud-based, highly-automated, near real time data processing platforms
Demonstrates in-depth problem-solving skills, especially in complex systems
Ability to learn new technologies quickly
Extensive foundation in “bridging the divide” between software products and the infrastructure that runs it
Effective written and verbal communication skills
Demonstrated in-depth analytical and troubleshooting abilities
Nice to have:
Experience developing and operating large scale (+10k nodes) graph systems
Experience developing, testing, and maintaining agentic systems and platforms
3+ years leading or mentoring engineering teams in a platform or infrastructure-focused environment
Hands-on experience designing and operating large-scale, highly available systems handling hundreds of millions of daily API calls
Deep experience with at least one major cloud provider (AWS, GCP, or Azure), including cost optimization and capacity planning
Proven track record implementing observability at scale (metrics, tracing, logging) using tools such as Prometheus, Grafana, Datadog, New Relic, or similar
Experience with infrastructure-as-code and automation frameworks (e.g., Terraform, CloudFormation, Ansible, or similar)
Strong background in security and data governance for cloud platforms, including identity and access management, encryption, and compliance best practices
Experience architecting event-driven and streaming data platforms (e.g., Kafka, Kinesis, or Pub/Sub) for real-time analytics and reporting
Prior success driving cross-team technical alignment and establishing platform standards, patterns, and reference architectures
Master’s degree in Computer Science, Software Engineering, or related field with a focus on distributed systems, data-intensive applications, or cloud architecture
What we offer:
A comprehensive pay and benefits package that rewards employees for their contributions to our success, supporting all aspects of their well-being at every stage of life