This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
We are seeking a skilled and motivated Site Reliability Engineer to join our team in Trimble’s Core Cloud Platform. The ideal candidate will have a strong background in cloud platforms, infrastructure as code, and automation via programming/scripting languages. You will embed with a product delivery team to drive the reliability, scalability, and security of the team’s services and infrastructure. The Core Cloud Platform group builds the foundational common services used by dozens of Trimble products and millions of users.
Job Responsibility:
Develop and maintain infrastructure as code (IaC) using Terraform to ensure reliable and scalable cloud environments
Implement and enhance observability solutions using tools like New Relic, DataDog, Sumologic and Splunk for monitoring, logging, and alerting
Perform code deployments and manage CI/CD pipelines using Jenkins, Github, and related tooling to ensure smooth and efficient delivery processes
Automate routine tasks and workflows to increase operational efficiency and reduce manual intervention
Evaluate system designs and architectures for reliability, performance, security, and efficiency, ensuring best practices are followed
Lead incident response efforts, conduct root cause analysis, and implement long-term solutions for complex issues
Develop and maintain comprehensive runbooks and procedures for incident response and operational tasks
Collaborate with cross-functional teams to review and provide feedback on technical designs, ensuring alignment with SRE principles
Participate in on-call rotations and handle critical incidents with confidence and expertise
Continuously improve documentation for systems and services, contributing to a knowledge-sharing culture within the team
Requirements:
Bachelor's degree in a relevant field of study (e.g., Computer Science, Computer Engineering, Software Engineering, Information Technology, Information Systems)
7-10 years of relevant work experience
Hands-on experience automating and improving processes in a software development & production environment
Working knowledge of one or more programming languages, such as Python, Go, Javascript, or similar
Ability to evaluate and troubleshoot technical issues with an attention to detail in problem-solving
Interest in automation and optimization of workflows for improved efficiency
Effective verbal and written communication skills
Ability to take on new challenges, with a willingness to receive both general and detailed instructions
Flexibility to adapt to evolving project requirements and timelines
This position requires a flexible schedule, which may include early mornings, late nights, and/or weekends to meet business needs
This position participates in an oncall rotation for 24x7 support