This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
We are looking for an experienced Lead Big Data Developer with strong expertise in Big Data framework. The ideal candidate should have hands-on experience working with Big Data technologies along Java programming language and AWS EMR and other services.
Job Responsibility:
Design, develop, and maintain data pipelines on AWS EMR (Elastic MapReduce) to support data processing and analytics
Implement data ingestion processes from various sources including APIs, databases, and flat files
Optimize and tune big data workflows for performance and scalability
Collaborate with data scientists, analysts, and other stakeholders to understand data requirements and deliver solutions
Manage and monitor EMR clusters, ensuring high availability and reliability
Develop ETL (Extract, Transform, Load) processes to cleanse, transform, and store data in data lakes and data warehouses
Implement data security best practices to ensure data is protected and compliant with relevant regulations
Create and maintain technical documentation related to data pipelines, workflows, and infrastructure
Troubleshoot and resolve issues related to data processing and EMR cluster performance
Requirements:
Bachelor’s degree in computer science, Information Technology, or a related field
7 – 9 years of experience in data engineering, with a focus on big data technologies
Strong experience with AWS services, particularly EMR, S3, Redshift, Lambda, and Glue
Proficiency in programming languages Java
Experience with big data frameworks and tools such as Hadoop, Spark, Hive, and Pig
Solid understanding of data modelling, ETL processes, and data warehousing concepts
Experience with SQL and NoSQL databases
Familiarity with CI/CD pipelines and version control systems (e.g., Git)
Strong problem-solving skills and the ability to work independently and collaboratively in a team environment