This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Our agentic process automation platform helps enterprises automate complex, decision-heavy processes that traditional automation can’t handle and GenAI can’t be trusted with. We enable organizations to scale operations, resist hallucinations, and bring end-to-end visibility and control to your most complex processes. Powered by a new kind of computing platform, Maisa combines AI-driven problem solving with programmatic execution, so every action is reliable, auditable, and built for enterprise scale.
Job Responsibility:
Build and maintain AWS cloud infrastructure using Terraform, Pulumi, and Helm charts
Manage and scale Kubernetes clusters and container orchestration
Design and implement infrastructure-as-code for repeatable, reliable deployments
Support both cloud-based and on-premise installation requirements
Optimize cloud costs while maintaining performance and reliability targets
Plan and execute infrastructure capacity and scaling strategies
Implement comprehensive monitoring and logging using Grafana, Prometheus, and (future) ElasticSearch/Kibana
Define and track SLIs, SLOs, and error budgets for critical services
Build alerting strategies that enable proactive incident response
Lead incident response, post-mortems, and continuous improvement initiatives
Create and maintain runbooks and operational documentation
Configure and maintain CI/CD pipelines in GitHub Actions
Automate deployment, scaling, and recovery processes
Implement infrastructure security best practices (encryption at rest/in transit, network policies, IAM)
Manage disaster recovery and business continuity procedures
Collaborate with development teams to optimize application performance and reliability
Work with enterprise infrastructure teams on deployment requirements and integration
Support technical discussions around architecture and deployment models
Respond to infrastructure and availability questions during vendor assessments
Requirements:
Strong demonstrable experience in DevOps, SRE, or cloud infrastructure engineering roles
Strong hands-on experience with AWS cloud services and infrastructure
Proficiency with infrastructure-as-code tools (Terraform, Pulumi)
Experience managing Kubernetes in production environments
Knowledge of CI/CD pipelines and deployment automation
Experience with monitoring and observability tools (Grafana, Prometheus)
Understanding of networking, security best practices, and system hardening
Strong troubleshooting and problem-solving skills for complex distributed systems
Ability to balance reliability, performance, and velocity
Fluent Spanish (essential—you'll interact directly with client infrastructure teams)
Experience with Helm charts, programming (Go, Python, Rust), container security, compliance frameworks (SOC 2, ISO 27001), and penetration testing are all valuable additions
Nice to have:
Experience with Helm charts, programming (Go, Python, Rust), container security, compliance frameworks (SOC 2, ISO 27001), and penetration testing