CrawlJobs Logo

Data Engineer

enormousenterprise.com Logo

Enormous Enterprise

Location Icon

Location:
United States, Plano

Category Icon
Category:
IT - Software Development

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

Local only. Job consists of setting up Change Data Capture (or CDC) for multiple types of databases for the purpose of hydrating a data lake. Along with data hydration, job requires knowledge on ETL transformations using Apache spark, both streaming and batch processing of data. Engineer needs to know how to work with Apache Spark Data Frames, ETL jobs, and streaming data pipelines that will orchestrate raw CDC data and transform it into useable and query-able data for analytics.

Job Responsibility:

  • setting up Change Data Capture (or CDC) for multiple types of databases for the purpose of hydrating a data lake
  • orchestrate raw CDC data and transform it into useable and query-able data for analytics

Requirements:

  • Java – Mid to Senior level experience
  • Python – Mid level experience (pyspark)
  • Apache Spark – Data Frames, Spark SQL, Spark Streaming and ETL pipelines
  • Apache Airflow
  • Extensive knowledge with S3 and S3 operations (CRUD)
  • EMR & EMR Serverless
  • Glue Data Catalog
  • Step Functions
  • MWAA (Managed Workflows Apache Airflow)
  • Lambdas (Python)
  • AWS Batch
  • Debezium or other CDC knowledge required
  • knowledge on ETL transformations using Apache spark, both streaming and batch processing of data
  • know how to work with Apache Spark Data Frames, ETL jobs, and streaming data pipelines
  • Big Data concepts, including performance tuning

Nice to have:

  • Scala – not required but a plus
  • Apache Hudi – not required, but a plus
  • Apache Griffin – not required, but a plus
  • AWS Deequ – not required, but a plus

Additional Information:

Job Posted:
December 08, 2025

Work Type:
On-site work
Job Link Share:
Welcome to CrawlJobs.com
Your Global Job Discovery Platform
At CrawlJobs.com, we simplify finding your next career opportunity by bringing job listings directly to you from all corners of the web. Using cutting-edge AI and web-crawling technologies, we gather and curate job offers from various sources across the globe, ensuring you have access to the most up-to-date job listings in one place.