We are looking for an experienced cloud development engineer to work on our HPC CSM Manageability solution. The role involves designing, implementing, and maintaining our HPC CSM manageability platform, which is hosted on Kubernetes infrastructure. The position requires in-depth expertise in cloud-native technologies, particularly Kubernetes, along with a strong background in automation and DevOps practices. A good understanding of security for cloud-native applications is expected.
Job Responsibilities:
Design, implement, and execute comprehensive test plans for the CSM platform, including functional, regression, integration, and performance testing
Validate HPC system management capabilities such as node provisioning, monitoring, workload orchestration, and system upgrades
Develop automated test suites using Python, Bash, and CI/CD frameworks to ensure rapid and repeatable test execution
Integrate automated testing into the development pipeline to support continuous delivery
Identify, document, and track defects, and work with engineering teams to resolve issues
Perform stress testing and scale testing on large HPC clusters
Monitor and analyze system metrics to assess stability under load
Requirements:
Bachelor's degree preferred, or an Associate degree in a technical field, with 8-12 years of working experience in related fields
Strong understanding of Linux (RHEL, SLES, Ubuntu) system administration
Experience with Kubernetes, containers (Docker/Podman), and networking fundamentals
Proficiency in scripting languages (Python, Bash) for automation
Familiarity with HPC architectures, job schedulers (Slurm, PBS Pro), and workload management concepts
Experience with test automation frameworks (e.g., pytest, Robot Framework, Jenkins CI/CD)
Hands-on experience in system-level testing, API testing, and performance validation
Familiarity with Git, Jira, Confluence, and defect tracking workflows
Nice to have:
Experience with monitoring and log analysis tools (Grafana, Prometheus, ELK stack)
Cloud Architectures
Cross Domain Knowledge
Design Thinking
Development Fundamentals
DevOps
Distributed Computing
Microservices Fluency
Full Stack Development
Security-First Mindset
Solutions Design
Testing & Automation
User Experience (UX)
What we offer:
Comprehensive suite of benefits that supports physical, financial and emotional wellbeing
Specific programs tailored to helping you reach your career goals