Infrastructure Kubernetes Specialist

Inetum

Location:
Portugal, Lisbon

Contract Type:
Not provided

Salary:

Not provided

Job Description:

As an Infrastructure Kubernetes Specialist, you will define and implement the containerized platform in collaboration with various technical departments (Infrastructure, Production, Security). You will also establish platform standards and support project teams in migrating their applications to the new environment.

Job Responsibilities:

  • Design and implement Kubernetes clusters (sizing, high availability, automated installation, IAM, etc.)
  • Manage SDN and CNI technologies (e.g., Calico, Cilium)
  • Implement backup solutions (e.g., Velero)
  • Handle persistent volume management (e.g., CEPH, Portworx)
  • Integrate CI/CD tools (e.g., Kustomize, ArgoCD)
  • Collaborate cross-functionally to define platform standards
  • Support project teams in container migration efforts
  • Participate in strategic planning and technical evaluations

Requirements:

  • Proven expertise in Kubernetes infrastructure
  • Experience with CI/CD tools and container orchestration
  • Proficiency in Python development
  • Familiarity with Agile methodologies
  • Strong communication skills in English (minimum B2 level)
  • Autonomous and proactive mindset
  • Knowledge of Microsoft Azure is a plus

Nice to have:

Knowledge of Microsoft Azure

Additional Information:

Job Posted:
October 07, 2025

Employment Type:
Full-time

Similar Jobs for Infrastructure Kubernetes Specialist

Founding Infrastructure Engineer

As the first dedicated Infrastructure Engineer at Reducto, you will influence ev...
Location:
United States, San Francisco
Salary:
150000.00 - 300000.00 USD / Year
Reducto
Expiration Date:
Until further notice
Requirements:
  • Have 5+ years of hands-on experience in building or supporting production-grade infrastructure and reliability processes for high-throughput systems
  • Are comfortable with Python or similar languages
  • Are exceptional at working across cloud platforms, container orchestration (e.g., Kubernetes), networking, and storage technologies
  • Build your own tools on the fly to diagnose, experiment, and address reliability problems
  • Bring a quantitative, hands-on approach to system operations, automation, and continuous improvement
  • Are your own worst critic—have an extremely high bar for quality and always aim for robust solutions rather than quick fixes
Job Responsibilities:
  • Designing, building, and maintaining highly available, scalable infrastructure to support intensive AI/ML workloads and real-time model deployments
  • Implementing robust monitoring, alerting, and observability systems to ensure system health, performance, and uptime across cloud and on-prem environments
  • Debugging, optimizing, and automating infrastructure for fast iteration and rapid deployment cycles, focusing on both reliability and developer velocity
  • Proactively identifying, investigating, and resolving incidents to minimize downtime and maintain world-class service levels for enterprise customers
  • Collaborating closely with engineers, ML specialists, and founders to shape product, infrastructure, and security strategies
What we offer:
  • Unlimited PTO
  • Free lunch daily at the office
  • Reimbursed Transportation
  • Generous health insurance covering medical, dental, and vision
  • Health and Wellness Budget up to $150/mo reimbursement
  • Parental Leave
Employment Type:
Full-time

Senior Technical Operations Specialist

We're on the hunt for a talented and proactive individual to join our team, some...
Location:
Poland, Gdańsk
Salary:
Not provided
NAVBLUE Limited
Expiration Date:
Until further notice
Requirements:
  • Successful completion of a post-secondary degree or diploma in computer science or technology (or equivalent)
  • Five or more years of proven experience in a technical role supporting multiple systems
  • Experience using scripting languages to perform and automate tasks
  • Experience using Infrastructure as Code and Configuration as Code tools such as Terraform and Ansible
  • Experience deploying production services into cloud environments
  • AWS Cloud Practitioner
  • AWS SysOps Administrator or Solutions Architect Associate preferred
  • Linux Foundation Certified Systems Administrator (LFCS) or similar is a bonus
  • Solid knowledge of Operating Systems & ability to perform troubleshooting required
  • Proven track record building and maintaining infrastructure in cloud environments
Job Responsibilities:
  • Ensuring availability across numerous services, whether they are custom software, commercial software, or free and open source solutions
  • Monitoring system and application performance, and logs
  • Creating and testing backup and recovery procedures
  • Responding to alerts and incidents when they occur
  • Investigating and finding solutions to operational issues at the infrastructure, network, OS, and application levels
  • Escalating issues to vendors or partners when appropriate
  • Follow and improve the best practices and standards that help us keep services safe, secure, and reliable
  • Improve or create our best practices to ensure the smooth operation of services and execution of procedures
  • Develop and improve SOPs for the maintenance of our services and their underlying systems
  • Develop and improve Infrastructure as Code (IaC) and Configuration as Code (CaC) used to maintain services and systems
What we offer:
  • Stable employment based on a full-time job contract
  • Flexible working hours and work-from-home opportunities (3 days in office)
  • International working environment in a dynamic company
  • Access to the latest knowledge and technologies enabling professional development
  • Training and development possibilities
  • Participating in international projects and international trips
  • Competitive salary dependent on experience and qualifications
  • Private medical coverage for you and your family
  • Sport card
  • Life insurance for you and your family
Employment Type:
Full-time

Infrastructure Engineer

We are working with a Global Professional Services client as they look to add to...
Location:
United Kingdom, Manchester
Salary:
55000.00 - 60000.00 GBP / Year
Eutopia
Expiration Date:
Until further notice
Requirements:
  • Proven hands-on commercial experience with Kubernetes in a production environment (preferably Red Hat OpenShift)
  • Experience in automating infrastructure processes using scripting and Infrastructure as Code (IaC) tools
  • Experience building and managing CI/CD pipelines (e.g., Azure DevOps, ArgoCD)
  • Expertise in Microsoft Azure cloud services and solutions
  • Expertise in the VMware Cloud Foundation suite
  • Experience with automation tools such as Terraform or Ansible
  • Kubernetes certification, such as Red Hat Certified Specialist in OpenShift Administration or an equivalent (e.g., CKA/CKAD)
  • Certification in on-prem solutions, such as VMware Certified Professional (VCP), Red Hat Certified System Administrator (RHCSA), or similar
  • Cloud certification(s), e.g., Azure Administrator Associate, Azure DevOps Engineer Expert, or Azure Security Engineer Associate
  • Must live within easy commute of central Manchester
Job Responsibilities:
  • Work with both on-prem and cloud infrastructure
  • Design, implement and manage solutions
What we offer:
  • Annual bonus
  • Healthcare
  • Continual professional development opportunities
  • Supportive environment
Employment Type:
Full-time

AI Platform Site Reliability Engineering Specialist

The AI Platform Site Reliability Engineering Specialist will operate and maintai...
Location:
India, Bengaluru
Salary:
Not provided
NTT DATA
Expiration Date:
Until further notice
Requirements:
  • Bachelor's or Master's degree in Computer Science or related field, or equivalent job experience
  • 5 years of production experience in SRE / Infrastructure / ops for large-scale systems
  • Strong programming/scripting skills (Python, Go, Java, or equivalent)
  • Deep experience with containerization (Docker), orchestration (Kubernetes, etc.)
  • Infrastructure-as-code (Terraform, Helm, CloudFormation, Ansible, etc.)
  • Familiarity with GPU / AI compute clusters, high-performance data storage, and distributed architectures
  • Experience with monitoring / observability / logging / alerting tools (Prometheus, Grafana, ELK / EFK, Datadog, etc.)
  • Networking and systems engineering knowledge (TCP/IP, DNS, routing, load balancing, distributed storage)
  • Solid experience in capacity planning, performance tuning, scaling, and incident response
  • Demonstrated ability to lead RCAs, deploy fixes, and drive reliability improvements
Job Responsibilities:
  • Operate, monitor, and maintain the infrastructure supporting GenAI applications (training, inference, feature store, data ingestion, model serving)
  • Design and build automation for core platform capabilities, reducing manual toil
  • Develop and maintain infrastructure-as-code (IaC) for provisioning and managing compute, storage, network, GPU clusters, Kubernetes / container orchestration, etc.
  • Establish, monitor, and enforce SLOs/SLIs/SLAs, error budgets, alerting, and dashboards
  • Lead incident response, root cause analysis (RCA), postmortems, and systemic remediation
  • Perform capacity planning, scaling strategies, workload scheduling and resource forecasting
  • Optimize cost vs. performance trade-offs in large-scale compute environments
  • Harden systems for security, compliance, auditability, and data governance
  • Collaborate across teams (cloud engineers, data engineers, infrastructure, security) to ensure safe deployment, rollout, rollback, and integration of new systems
  • Define disaster recovery (DR) strategies, backup/restore practices, and fault tolerance mechanisms

DevOps Architect

Join NLS as a DevOps Architect! Lead cloud migrations, implement DevOps practice...
Location:
United States, Springfield
Salary:
Not provided
Next Level Solutions ltd.
Expiration Date:
Until further notice
Requirements:
  • Expertise with major cloud platforms, with a strong focus on Azure
  • Deep experience with CI/CD practices, tools, and pipelines
  • Proficiency with containerization and orchestration (Docker, Kubernetes)
  • Strong Infrastructure as Code (IaC) experience using tools like Terraform, Ansible, Chef, or Puppet
  • Ability to implement and maintain security and compliance across the cloud ecosystem
  • Strong scripting and automation skills
  • Experience with monitoring and logging solutions
  • Knowledge of RDBMS and NoSQL databases
  • Strong understanding of Software Development Lifecycle
  • Experience with system performance tuning and automation of deployment/configuration baselines
Job Responsibilities:
  • Design and configure cloud infrastructures to support migrations and modernization initiatives
  • Partner with technical and business stakeholders to translate requirements into scalable architectures
  • Collaborate with architects and engineers to develop integration strategies across systems
  • Work with infrastructure teams to map software solutions to effective hardware implementations
  • Ensure all solutions meet corporate and regulatory security requirements in partnership with IT Security
  • Provide strategic and technical guidance to leadership, including evaluating alternatives, assessing risk, and planning resources
  • Support tool and metrics development, process improvement, data interpretation, and performance analysis
  • Monitor system performance and maintain solution integrity
  • Lead proofs-of-concept to evaluate new technologies and approaches
  • Assess legacy and current applications and recommend enhancements for performance, design, and quality
What we offer:
  • Competitive salary package with an annual bonus
  • Comprehensive and inclusive health benefits
  • Flexible working arrangements
  • Opportunities for internal advancement and career development
  • Relaxed office environment with casual dress code
  • Company sponsored activities and team building events
  • PTO & paid company holidays
  • Traditional & Roth 401k with a 5% company match
  • Physical and mental wellness program

System Engineering Specialist

We are seeking a System Engineering Specialist to join our Software Infrastructu...
Location:
Romania, Bucharest
Salary:
Not provided
Vodafone
Expiration Date:
Until further notice
Requirements:
  • Has 3–5 years of experience in Dev/DevOps environments
  • Experienced in managing public cloud infrastructure (AWS, Azure, GCP, OCI) and on-prem environments (VMware)
  • Proficient in scripting languages such as Bash and Python
  • Skilled in CI/CD tools including Git, GitLab, GitHub, Jenkins, FluxCD, and ArgoCD
  • Familiar with microservices architecture and Kubernetes environments
  • Experienced with SQL databases (Oracle) and integration infrastructure
  • Knowledgeable in monitoring and observability tools such as Instana, Prometheus, Grafana, Splunk, and Dynatrace
  • Comfortable administering Atlassian tools (Jira and Confluence)
  • Understands Agile methodologies (Scrum, Kanban) and IT compliance frameworks (ITIL, SOX)
  • Demonstrates strong documentation and communication skills in English (minimum B1 level)
Job Responsibilities:
  • Maintain and improve infrastructure across on-premises and cloud environments
  • Supervise deployments of new application releases and microservices
  • Collaborate with DevOps and Development teams to resolve incidents and maintain system stability
  • Facilitate communication between stakeholders involved in application operations
  • Evaluate infrastructure performance and propose improvements or cost-saving measures
  • Develop and maintain integration infrastructure between applications
  • Mentor teams on DevOps best practices and methodologies
  • Provide availability for exceptional situations including night shifts, overtime, and stand-by support
What we offer:
  • Hybrid way of working
  • Medical and dental services
  • Life and hospitalization insurance
  • Dedicated employee phone subscription
  • Take control of your benefits and choose any of the following options within the budget: meal tickets, private pension, vacation vouchers, or cultural vouchers
  • Special discounts for gyms and retailers
  • Annual Company Bonus
  • Loyalty Programme
  • Ongoing Education – we continuously invest in you to ensure you have everything needed to excel on the job and enhance your skills
  • You get to work with tried and trusted web-technology
Employment Type:
Full-time

Software Development Senior Specialist

The Software Development Senior Specialist role at NTT DATA involves designing a...
Location:
Mexico, GDL
Salary:
Not provided
NTT DATA
Expiration Date:
Until further notice
Requirements:
  • 4–8 years of software engineering experience
  • Exposure to platform engineering, developer experience, cloud infrastructure, or DevOps practices
  • Hands-on experience with TypeScript/JavaScript and Node.js
  • Experience with Infrastructure as Code using Terraform, OpenTofu, or Pulumi
  • Working knowledge of at least one major cloud provider (AWS, GCP, or Azure)
  • Familiarity with containers and orchestration frameworks (Docker, Kubernetes, EKS/GKE preferred)
  • Exposure to observability stacks such as Prometheus, Grafana, OpenTelemetry, or Datadog
  • Experience creating or consuming service scaffolding, platform templates, or internal developer tooling
  • Ability to write clean, maintainable code supported by strong testing practices
  • A customer-centric approach to internal tooling
Job Responsibilities:
  • Design, build, and evolve platform capabilities including Infrastructure as Code modules, Kubernetes automation, service templates, and observability integrations
  • Develop reusable frameworks and self-service tooling in TypeScript/Node.js that simplify service provisioning, deployment workflows, and operational readiness
  • Contribute to the creation and adoption of golden paths and paved-road templates
  • Implement platform components focused on availability, security, and cost efficiency
  • Work closely with product teams and internal stakeholders to understand pain points, gather feedback, and iterate on platform features
  • Participate in roadmap planning, technical design reviews, and platform KPI development
  • Contribute to engineering culture by sharing best practices, participating in code reviews, and mentoring junior engineers
What we offer:
  • Collaborative culture
  • Flexible working hours
Employment Type:
Full-time

Lead Platform Engineer

Embark on a transformative journey as a Lead Platform Engineer. At Barclays, our...
Location:
United States, Whippany, New Jersey
Salary:
170000.00 - 230000.00 USD / Year
Barclays
Expiration Date:
Until further notice
Requirements:
  • Programming in Python; familiarity with Java is a plus
  • Designing and deploying solutions with AWS services (S3, Glue, Kinesis, Lambda, ECS, IAM, KMS, API Gateway, Step Functions, MSK, CloudFormation, etc.)
  • Applying Generative AI concepts to develop and deploy AI applications
  • Leading engineering teams and setting technical direction
  • Designing secure, scalable, and cost-efficient systems
Job Responsibilities:
  • Development and delivery of high-quality software solutions by using industry aligned programming languages, frameworks, and tools. Ensuring that code is scalable, maintainable, and optimized for performance
  • Cross-functional collaboration with product managers, designers, and other engineers to define software requirements, devise solution strategies, and ensure seamless integration and alignment with business objectives
  • Collaboration with peers, participation in code reviews, and promotion of a culture of code quality and knowledge sharing
  • Stay informed of industry technology trends and innovations and actively contribute to the organization’s technology communities to foster a culture of technical excellence and growth
  • Adherence to secure coding practices to mitigate vulnerabilities, protect sensitive data, and ensure secure software solutions
  • Implementation of effective unit testing practices to ensure proper code design, readability, and reliability
  • Contribute to or set strategy, drive requirements, and make recommendations for change. Plan resources, budgets, and policies; manage and maintain policies/processes; deliver continuous improvements and escalate breaches of policies/procedures
  • If managing a team, they define jobs and responsibilities, planning for the department’s future needs and operations, counselling employees on performance and contributing to employee pay decisions/changes
What we offer:
  • medical, dental and vision coverage
  • 401(k)
  • life insurance
  • other paid leave for qualifying circumstances
  • incentive award
  • Competitive holiday allowance
  • Life assurance
  • Private medical care
  • Pension contribution
Employment Type:
Full-time