CrawlJobs Logo

DevOps Engineer - Observability Focus

amaris.com Logo

Amaris Consulting

Location Icon

Location:
Sweden , Gothenburg

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

Join our team dedicated to driving operational excellence and reliability through advanced monitoring and observability practices. As a key member of the DevOps and engineering community, your mission is to ensure system transparency, performance, and resilience by embedding observability across the technology stack.

Job Responsibility:

  • Design, implement, and maintain comprehensive monitoring solutions using Splunk, Grafana Stack (Grafana, Loki, Tempo), Prometheus, and Apica to ensure system reliability and performance transparency
  • Configure and optimize alerting workflows with PagerDuty for fast detection, efficient triage, and proactive incident management
  • Build and maintain actionable dashboards and KPIs that deliver insights into service health, user experience, and infrastructure performance
  • Develop scripts and integrations to automate data ingestion, correlation, and analysis across various monitoring sources
  • Continuously improve monitoring coverage and observability maturity
  • collaborate with teams to define SLIs/SLOs and identify opportunities for tuning
  • Explore new tools and techniques to enhance observability and automate monitoring workflows, leveraging AI-assisted operations where applicable
  • Partner closely with engineering, DevOps, and security teams to ensure observability is embedded across the full stack and aligned with organizational goals

Requirements:

  • Experience in monitoring and observability using: Splunk, Grafana, Loki, Tempo, Prometheus, Apica, and PagerDuty
  • Experience with infrastructure technologies such as: AWS, Kubernetes, and Docker
  • Proficiency in automation and configuration tools: Terraform, Ansible, Python
  • Experience with collaboration and communication tools: Slack, Confluence, Jira
  • Proficiency in programming languages: Python, Java, JavaScript, TypeScript
  • Strong analytical and problem-solving skills with attention to detail
  • Demonstrated DevOps mindset and ability to work collaboratively across teams
  • English fluent. Additional languages are a plus
  • You demonstrate strong communication skills and a proactive approach to continuous improvement

Nice to have:

Additional languages are a plus

What we offer:
  • Experience working in a young and international atmosphere, with colleagues on all 5 continents
  • Access a variety of training courses and continuously improve your skills
  • Take part in the events organized by our team: work socials, team building events... moments you won't want to miss!

Additional Information:

Job Posted:
February 14, 2026

Employment Type:
Fulltime
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for DevOps Engineer - Observability Focus

Senior Observability Engineer

Coralogix is a modern, full-stack observability platform transforming how busine...
Location
Location
Germany , Berlin
Salary
Salary:
Not provided
coralogix.com Logo
Coralogix
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience in Site Reliability, DevOps, or Platform Engineering with a focus on observability
  • Proven expertise with at least one major observability platform (e.g., Prometheus, Victoria Metrics, OpenSearch)
  • Hands-on experience with Kubernetes, including deep knowledge of controllers, operators, and Helm
  • Experience writing Kubernetes controllers (controller-runtime, KubeBuilder)
  • Strong programming skills in Go or Python (Rust is a plus)
  • Experience designing, scaling, and operating observability systems at enterprise scale
  • Familiarity with at least one major cloud provider (AWS, Azure, or GCP)
  • Strong understanding of distributed systems, telemetry pipelines, and instrumentation standards (e.g., OpenTelemetry)
  • Excellent communication skills with the ability to explain complex topics to diverse stakeholders
Job Responsibility
Job Responsibility
  • Design, implement, and maintain observability features such as Alerting, SLOs, Reporting, and Synthetic Tests
  • Manage and scale OpenTelemetry Collectors and other observability agents across Kubernetes environments
  • Write and maintain Kubernetes Controllers using frameworks like controller-runtime and KubeBuilder
  • Operate and optimize the internal Coralogix account, ensuring proper usage, cost efficiency, and best practices adoption
  • Define and enforce observability guidelines and standards across the organization
  • Partner with engineering teams to embed observability by default into products and services
  • Control observability-related costs while maximizing performance, visibility, and value
  • Contribute to upstream projects such as OpenTelemetry, helping shape industry standards
  • Explore and implement cutting-edge observability technologies, including eBPF-based approaches
  • Fulltime
Read More
Arrow Right

DevOps Engineer

HPE Operations is our innovative IT services organization. It provides the exper...
Location
Location
India , Bangalore
Salary
Salary:
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 7+ years of DevOps engineering experience with strong focus on CI/CD
  • Proficiency with CI/CD tools: Jenkins, GitLab CI/CD, Azure DevOps, Bamboo, or CircleCI
  • Strong knowledge of version control (Git) and branching strategies
  • Experience with containerization and orchestration (Docker, Kubernetes, Helm)
  • Scripting/programming in Python, Bash, Groovy, or similar
  • Familiarity with cloud platforms: AWS (CodePipeline, EKS), Azure (DevOps, AKS), or GCP (Cloud Build, GKE)
  • Bachelor's or Master's degree in Computer Science, IT, or related field
  • Strong troubleshooting and analytical skills
  • Excellent communication and collaboration with global teams
  • Ability to thrive in a fast-paced, agile environment
Job Responsibility
Job Responsibility
  • CI/CD Pipeline Design & Management: Build and maintain robust CI/CD pipelines
  • Automate code integration, build, testing, and deployment processes
  • Implement GitOps workflows with ArgoCD/Tekton
  • Infrastructure as Code & Automation: Use Terraform, Ansible, or CloudFormation for provisioning
  • Develop reusable automation scripts for deployment and environment setup
  • Manage artifact repositories (Nexus, Artifactory)
  • Monitoring & Reliability: Implement observability frameworks (Prometheus, Grafana, ELK, OpenTelemetry)
  • Ensure system reliability, scaling, and resilience
  • Conduct root-cause analysis and implement improvements
  • Security & Compliance: Integrate security testing tools (SAST, DAST, dependency scanning)
What we offer
What we offer
  • Health & Wellbeing: comprehensive suite of benefits supporting physical, financial and emotional wellbeing
  • Personal & Professional Development: programs to help reach career goals
  • Unconditional Inclusion: inclusive work environment celebrating individual uniqueness
  • Flexibility to manage work and personal needs
  • Fulltime
Read More
Arrow Right

Cloud Technical Architect / Data DevOps Engineer

The role involves designing, implementing, and optimizing scalable Big Data and ...
Location
Location
United Kingdom , Bristol
Salary
Salary:
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • An organised and methodical approach
  • Excellent time keeping and task prioritisation skills
  • An ability to provide clear and concise updates
  • An ability to convey technical concepts to all levels of audience
  • Data engineering skills – ETL/ELT
  • Technical implementation skills – application of industry best practices & designs patterns
  • Technical advisory skills – experience in researching technological products / services with the intent to provide advice on system improvements
  • Experience of working in hybrid environments with both classical and DevOps
  • Excellent written & spoken English skills
  • Excellent knowledge of Linux operating system administration and implementation
Job Responsibility
Job Responsibility
  • Detailed development and implementation of scalable clustered Big Data solutions, with a specific focus on automated dynamic scaling, self-healing systems
  • Participating in the full lifecycle of data solution development, from requirements engineering through to continuous optimisation engineering and all the typical activities in between
  • Providing technical thought-leadership and advisory on technologies and processes at the core of the data domain, as well as data domain adjacent technologies
  • Engaging and collaborating with both internal and external teams and be a confident participant as well as a leader
  • Assisting with solution improvement activities driven either by the project or service
  • Support the design and development of new capabilities, preparing solution options, investigating technology, designing and running proof of concepts, providing assessments, advice and solution options, providing high level and low level design documentation
  • Cloud Engineering capability to leverage Public Cloud platform using automated build processes deployed using Infrastructure as Code
  • Provide technical challenge and assurance throughout development and delivery of work
  • Develop re-useable common solutions and patterns to reduce development lead times, improve commonality and lowering Total Cost of Ownership
  • Work independently and/or within a team using a DevOps way of working
What we offer
What we offer
  • Extensive social benefits
  • Flexible working hours
  • Competitive salary
  • Shared values
  • Equal opportunities
  • Work-life balance
  • Evolving career opportunities
  • Comprehensive suite of benefits that supports physical, financial and emotional wellbeing
  • Fulltime
Read More
Arrow Right

DevOps Engineer

You will be part of MDOTM’s Platform Team. As a DevOps Engineer, you'll play a k...
Location
Location
Salary
Salary:
Not provided
mdotm.ai Logo
MDOTM Ltd
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Professional DevOps Experience: Solid professional experience in a DevOps or Site Reliability Engineering (SRE), with a focus on automation, infrastructure, and CI/CD
  • CI/CD Familiarity: Hands-on experience working with and contributing to CI/CD pipelines
  • Containerization & Orchestration Basics: Practical experience with Docker and exposure to Kubernetes, including writing Dockerfiles and debugging containerized applications
  • Infrastructure as Code (IaC) Exposure: Familiarity with IaC tools like Terraform or Ansible for managing infrastructure
  • Observability Basics: Experience with at least one monitoring, logging, or alerting system (e.g., Prometheus, Grafana, ELK Stack, Zabbix) and a curiosity for SRE principles
  • Scripting & Automation: Proficiency in at least one scripting language (e.g., Python, Bash) for automating common tasks
  • Security Awareness: Basic understanding of security best practices in the SDLC
  • Problem Solving & Troubleshooting: Good analytical skills and a methodical approach to troubleshooting system and infrastructure issues
  • Soft skills: Clear English communication, collaborative spirit, and a growth mindset aligned with DevOps culture of shared ownership and rapid feedback
  • Good knowledge of English (both written and spoken)
Job Responsibility
Job Responsibility
  • Support & enhance CI/CD pipelines: Contribute to the design, implementation, and automation of CI/CD pipelines (GitLab CI/CD) to ensure fast, safe, and repeatable releases across all environments
  • Infrastructure as Code (IaC) implementation: Assist in the adoption and management of infrastructure using Terraform and Ansible, helping provision and manage on-premise and AWS resources in a controlled, auditable, and reproducible manner
  • Containerization & orchestration operation: Work with Docker and Kubernetes to package services and operate them, contributing to declarative configuration, GitOps practices, and efficient resource utilization
  • Observability & reliability monitoring: Help implement and maintain comprehensive monitoring, logging, and tracing solutions (Prometheus, Grafana, ELK, Zabbix). Participate in on-call rotations as part of a team to ensure high availability and support low Mean Time To Recovery (MTTR)
  • Security & compliance support: Contribute to embedding security best practices (OWASP Top 10, least privilege IAM, shift-left testing) into the SDLC and assist in automating security controls within pipelines
  • Data Infrastructure maintenance: Support the management and automation of database infrastructure (MySQL, MongoDB), including contributing to provisioning, backups, and automated schema migrations
  • Platform Service Operations: Automate and support the deployment, scaling, and operational health of core platform services (authentication, data pipelines, messaging) for internal consumers
  • Collaborative contribution: Participate in technical design discussions, learn from architectural decisions related to infrastructure and operations, and help document runbooks & standards
  • Continuous improvement: Embrace and help champion DevOps culture ("you build it, you run it") and share knowledge through demos, pair programming, and internal workshops
Read More
Arrow Right

Staff DevOps Engineer – DevX

We're looking for an accomplished Staff DevOps Engineer (DevX) to spearhead the ...
Location
Location
Bulgaria , Sofia
Salary
Salary:
Not provided
brandwatch.com Logo
Brandwatch
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 7–10 years' experience in DevOps, Software Engineering, or related roles, with a focus on build/release engineering and developer productivity
  • Strong proficiency in designing CI/CD pipelines and using related tools (e.g. Jenkins, GitLab CI, GitHub Actions) to automate software delivery
  • Extensive experience with Infrastructure as Code (Terraform, CloudFormation, etc.) and configuration management tools (Ansible, Puppet)
  • Proficient in one or more programming languages (such as Python, Java, or C#) and scripting, with an ability to build internal tools and integrations
  • Deep understanding of cloud platforms (AWS and/or GCP) and containerization technologies (Docker, Kubernetes) at scale
  • Familiarity with monitoring and logging tools to ensure the solutions you build are observable and reliable
  • Excellent problem-solving skills, a collaborative mindset, and experience mentoring other engineers or leading technical projects
Job Responsibility
Job Responsibility
  • Design, build, and maintain internal platforms and tools (CI/CD pipelines, developer portals, automation frameworks) that improve developer productivity and software quality
  • Collaborate with software engineering teams to understand workflow pain points and deliver solutions that streamline coding, testing, and deployment
  • Implement and refine continuous integration and delivery processes to enable rapid, reliable releases, including build automation, test frameworks, and deployment strategies
  • Provide expert guidance and mentorship on DevOps best practices, tooling, and automation to engineers across the organization
  • Drive innovation in developer experience by evaluating new technologies and championing continuous improvement in our engineering processes
Read More
Arrow Right

DevOps Engineer

Radix is building the most trusted data and analytics platform in multifamily. J...
Location
Location
Kosovo , Prishtine
Salary
Salary:
Not provided
radix.com Logo
Radix (AZ)
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Proven experience in a DevOps, SRE, or infrastructure-focused engineering role with a strong understanding of CI/CD concepts and tools such as Jenkins, GitLab CI, or CircleCI
  • Hands-on experience working in cloud environments like AWS, Azure, or Google Cloud and are comfortable designing, deploying, and managing scalable cloud infrastructure
  • Proficiency in scripting languages such as Python, Bash, or Ruby to automate tasks and improve operational efficiency
  • Practical experience with containerization using Docker and orchestration using Kubernetes
  • Experienced with configuration management tools such as Ansible, Chef, or Puppet to maintain consistent environments
  • Naturally approach problems with curiosity, seeking to understand root causes and explore innovative solutions
  • Demonstrate resilience in fast-moving, high-growth environments and remain effective when priorities or conditions change quickly
  • Adapt easily to ambiguity, shifting requirements, and evolving technologies, adjusting your approach with confidence
  • Thrive in a startup environment where ownership, iteration, and continuous improvement are core to how you work
  • Bring additional value through familiarity with infrastructure-as-code tools (Terraform, CloudFormation, Pulumi), cloud security best practices, and observability tools such as Prometheus, Grafana, ELK, or Splunk
Job Responsibility
Job Responsibility
  • Design and maintain CI/CD pipelines that accelerate software delivery and improve release reliability
  • Collaborate with engineering teams to streamline development workflows and strengthen DevOps best practices
  • Ensure high availability, scalability, and performance across production and development environments
  • Implement infrastructure as code (IaC) using Terraform, Ansible, CloudFormation, or similar tools
  • Enhance the security and compliance posture of Radix infrastructure and applications
  • Troubleshoot and resolve issues across development, staging, and production systems with urgency and clarity
  • Build and maintain monitoring, logging, and alerting systems to proactively detect and respond to incidents
  • Improve observability and system visibility to support data-driven operational decisions
  • Introduce automation to reduce operational toil and increase engineering efficiency
  • Stay current with emerging DevOps technologies and recommend improvements to infrastructure, tooling, and processes
What we offer
What we offer
  • Medical, dental and vision coverage designed to support your wellbeing
  • Unlimited PTO
  • Pre-IPO Equity
  • Performance Bonus
  • Learn From the Best
  • Build Category-Defining Products
Read More
Arrow Right

DevOps Engineer

As our Platform Engineer / Site Reliability Engineer (SRE), you’ll build and own...
Location
Location
Poland , Warsaw
Salary
Salary:
100000.00 - 165000.00 USD / Year
dualentry.com Logo
DualEntry
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years experience overall OR 5+ years deep AWS-focused engineering experience
  • Deep expertise across AWS (EC2, ECS/EKS, RDS, IAM, networking, etc.)
  • Strong experience with IaC (Terraform, CloudFormation)
  • Strong command of CI/CD (GitHub Actions, Jenkins, similar)
  • Experience with Docker, Kubernetes, and modern deployment workflows
  • Hands-on experience with observability tools (Prometheus, Grafana, Datadog)
  • Comfortable building systems from scratch with high autonomy
  • Time zone flexibility to support partial 24-hour coverage
  • Pragmatic, hard-working, curious, and high-velocity by default
Job Responsibility
Job Responsibility
  • Lead the initial setup of our DevOps and platform engineering practices
  • Design and deliver an internal platform for personal or feature environments to boost developer velocity
  • Build and maintain AWS-based infrastructure for performance, scale, and security
  • Build CI/CD pipelines and automate release processes end to end
  • Implement observability tooling (logging, monitoring, alerting) to detect and resolve issues early
  • Collaborate with developers to remove bottlenecks and improve reliability
  • Ensure high availability through monitoring, incident response, and recovery strategies
  • Drive Infrastructure as Code best practices and infrastructure automation
  • Document systems and processes for clarity and reproducibility
What we offer
What we offer
  • Significant equity ownership
  • Equity: ~$50K
  • Remote-first team
  • Visa sponsorship option for relocation to NYC within 2+ years
  • Time Off: 15 PTO days + 12 public holidays
  • Early-stage role with high autonomy and real long-term upside
  • Fulltime
Read More
Arrow Right

Staff Engineer - DevOps

As a Staff Engineer - DevOps you will maintain, manage and improve day-to-day op...
Location
Location
India , Mumbai
Salary
Salary:
Not provided
https://clevertap.com/ Logo
CleverTap
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • BS/MS degree in Computer Science or a related field from a premier institute with top grades or equivalent practical experience
  • 5+ years of hands-on experience in DevOps, with a focus on Azure cloud infrastructure and tools
  • Infrastructure as Code (IaC): Proven experience in provisioning and managing infrastructure using Terraform or Bicep in production environments
  • On-Call Support: Hands-on experience with production incident handling and being part of on-call rotations
  • Containerization: Expertise in packaging and deploying applications using Docker
  • Familiarity with orchestrating containers using Azure Kubernetes Service (AKS) or Kubernetes
  • CI/CD Pipelines: Experience in automating build and deployment pipelines using tools like Azure DevOps, Jenkins, Bamboo, or TeamCity
  • Monitoring & Observability: Experience in implementing and managing monitoring systems like Azure Monitor, Prometheus, Grafana, or equivalent tools to ensure service health
  • Web Servers: Experience with HTTP web servers such as Apache, Nginx, Tomcat, or Jetty
  • Networking Concepts: Solid understanding of at least four of the following: DNS mechanics and configuration, TCP handshake process, IPtables filtering, TCP states and their significance, Analyzing TCP traffic, Routing, switching, subnets, VPN, and NAT
Job Responsibility
Job Responsibility
  • Execute standard operating procedures designed to ensure platform security and availability
  • Recommend and implement improvements towards standard operating procedures
  • Execute planned stories where the implementation is straightforward
  • Participate in platform architecture development discussions
  • Work with members from rest of the engineering team and customer success
  • Work with vendor support teams to solve issues
What we offer
What we offer
  • Be a part of a global growth stage startup
  • Work in a fast-paced, dynamic environment where your contribution matters
  • You are passionate about technology and its impact on the high growth mobile technology space
  • Innovate at scale, with learning opportunities
  • Fulltime
Read More
Arrow Right