This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Local only. Job consists of setting up Change Data Capture (or CDC) for multiple types of databases for the purpose of hydrating a data lake. Along with data hydration, job requires knowledge on ETL transformations using Apache spark, both streaming and batch processing of data. Engineer needs to know how to work with Apache Spark Data Frames, ETL jobs, and streaming data pipelines that will orchestrate raw CDC data and transform it into useable and query-able data for analytics.
Job Responsibility:
setting up Change Data Capture (or CDC) for multiple types of databases for the purpose of hydrating a data lake
orchestrate raw CDC data and transform it into useable and query-able data for analytics
Requirements:
Java – Mid to Senior level experience
Python – Mid level experience (pyspark)
Apache Spark – Data Frames, Spark SQL, Spark Streaming and ETL pipelines
Apache Airflow
Extensive knowledge with S3 and S3 operations (CRUD)
EMR & EMR Serverless
Glue Data Catalog
Step Functions
MWAA (Managed Workflows Apache Airflow)
Lambdas (Python)
AWS Batch
Debezium or other CDC knowledge required
knowledge on ETL transformations using Apache spark, both streaming and batch processing of data
know how to work with Apache Spark Data Frames, ETL jobs, and streaming data pipelines
Welcome to CrawlJobs.com – Your Global Job Discovery Platform
At CrawlJobs.com, we simplify finding your next career opportunity by bringing job listings directly to you from all corners of the web. Using cutting-edge AI and web-crawling technologies, we gather and curate job offers from various sources across the globe, ensuring you have access to the most up-to-date job listings in one place.
We use cookies to enhance your experience, analyze traffic, and serve personalized content. By clicking “Accept”, you agree to the use of cookies.