CrawlJobs Logo

Software Engineer, Data Acquisition

openai.com Logo

OpenAI

Location Icon

Location:
United States , San Francisco

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

293000.00 - 385000.00 USD / Year

Job Description:

The Data Acquisition team within the Foundations organization at OpenAI is responsible for all aspects of data collection to support our model training operations. Our team manages web crawling and GPTBot services and works closely with Data Processing, Architecture, and Scaling teams. We are looking for a skilled Software Engineer to join our Data Acquisition team.

Job Responsibility:

  • Own and lead engineering projects in the area of data acquisition including web crawling, data ingestion, and search
  • Collaborate with other sub-teams, such as Data Processing, Architecture, and Scaling, to ensure smooth data flow and system operability
  • Work closely with the legal team to handle any compliance or data privacy-related matters
  • Develop and deploy highly scalable distributed systems capable of handling petabytes of data
  • Architect and implement algorithms for data indexing and search capabilities
  • Build and maintain backend services for data storage, including work with key-value databases and synchronization
  • Deploy solutions in a Kubernetes Infrastructure-as-Code environment and perform routine system checks
  • Conduct and analyze experiments on data to provide insights into system performance

Requirements:

  • BS/MS/PhD in Computer Science or a related field
  • 4+ years of industry experience in software development
  • Strong expertise in large stateful distributed systems and data processing
  • Proficiency in Kubernetes, and Infrastructure-as-Code concepts
  • Willingness and enthusiasm for trying new approaches and technologies
  • Ability to handle multiple tasks and adapt to changing priorities
  • Strong communication skills, both written and verbal

Nice to have:

Experience with large web crawlers a plus

What we offer:
  • Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts
  • Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)
  • 401(k) retirement plan with employer match
  • Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks)
  • Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees
  • 13+ paid company holidays, and multiple paid coordinated company office closures throughout the year for focus and recharge, plus paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)
  • Mental health and wellness support
  • Employer-paid basic life and disability coverage
  • Annual learning and development stipend to fuel your professional growth
  • Daily meals in our offices, and meal delivery credits as eligible
  • Relocation support for eligible employees
  • Additional taxable fringe benefits, such as charitable donation matching and wellness stipends, may also be provided
  • Offers Equity
  • performance-related bonus(es) for eligible employees

Additional Information:

Job Posted:
February 21, 2026

Employment Type:
Fulltime
Work Type:
On-site work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Software Engineer, Data Acquisition

Data Engineer

We are recruiting a Data Engineer to join the Rally Engineering team. The role t...
Location
Location
United Kingdom , Banbury
Salary
Salary:
Not provided
prodrive.com Logo
Prodrive
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Proficiency in data analysis software programs (e.g., Motec i2, WinDarab, ATLAS, etc.)
  • Hands-on experience with data acquisition hardware (loom, data logger, sensor technologies, etc.) and relevant software
  • Good understanding of vehicle dynamics theory and application
  • Ability to develop internal tools/software for data processing and analysis
  • Educated to Degree or equivalent in a relevant discipline (e.g., Mechanical or Automotive Engineering)
  • Candidate should ideally have experience of working in a similar role engineering for a Manufacturer Team in the WRC or Rally Raid. Circuit racing background would also be considered (F2, F3 or similar).
Job Responsibility
Job Responsibility
  • Travelling role: Data Engineer at all test and rally events of the World Rally Raid Championship
  • Car sensor specification, calibration, and testing for Events and pre-Event testing in accordance with guidelines set by the Rally Engineering Group. Ensure that car commences each event, and is maintained throughout, with the optimum configuration. Ensure that spare parts are readily available and track sensors spare parts purchases
  • Prompt publication of accurate and detailed Engineering documents and reports before (simulation analysis, sensitivity studies, system configuration) and after each Event and test (data findings, simulation correlation, performance of failsafe strategies, etc.)
  • Ability to effectively communicate data findings to Rally Engineers and lead data troubleshooting efforts at test and rally events. Capable of using data processing and analysis as a decision-making tool
  • Ensure good communication with the crew, engine, and design groups regarding all channel strategies, whether to monitor, control, or failsafe contingencies. Additionally, ensure that driver requests are communicated appropriately within the team.
What we offer
What we offer
  • An attractive salary which will grow in line with your ongoing development and impact
  • 25 days holiday (which increases with long service) with an opportunity to purchase up to 15 extra days
  • Training opportunities for continuing professional development
  • Onsite subsidised staff restaurant
  • Car and pension salary sacrifice schemes
  • Cyclescheme
  • Exercise classes
  • Paid time off for volunteering
  • Consultations with our Fit 4 Life expert
  • Social events throughout the year
  • Fulltime
Read More
Arrow Right

Senior Software Engineer, Customer Acquisition Team

As a Senior Full-Stack Engineer on our CAT team, you will build industry-first p...
Location
Location
United States
Salary
Salary:
190000.00 - 220000.00 USD / Year
humaninterest.com Logo
Human Interest
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6+ years of experience building and maintaining software in production
  • Desire to work with the following technologies: Node, TypeScript, React, AWS and PostgreSQL
  • Top notch communication skills. You can communicate well with engineers and non-engineers alike
  • Strong desire to learn, think creatively, and share knowledge with others
  • Enjoy mentoring other engineers and deeply review their code
  • Proactive and empathetic mindset - you love to roll up your sleeves to fix problems for our customers
  • Past experience working in startups and/or fintech
Job Responsibility
Job Responsibility
  • Design, build, and maintain our user experiences for both external and internal customers
  • Work on business-critical, foundational services which serve as the data funnel of our entire platform
  • Improve complex processes and systems to make them more robust and require less human intervention
  • Collaborate with other engineers and stakeholders to share knowledge and build expertise
  • Develop ownership over domains in our system and make informed engineering tradeoffs
  • Advocate for and delight internal and external users
  • Write clean, high-quality code and tests to keep our system fast, reliable, and monitorable
  • Lead and participate in development life cycle activities like design, coding, testing and production release
  • Contribute to our evolving engineering standards, tooling, and processes
What we offer
What we offer
  • A great 401(k) plan: Our own! Our 401(k) includes a dollar-for-dollar employer match up to 4% of compensation (immediately vested) and $0 plan fees
  • Top-of-the-line health plans, as well as dental and vision insurance
  • Competitive time off and parental leave
  • Addition Wealth: Unlimited access to digital tools, financial professionals, and a knowledge center to help you understand your equity and support your financial wellness
  • Lyra: Enhanced Mental Health Support for Employees and dependents
  • Carrot: Fertility healthcare and family forming benefits
  • Candidly: Student loan resource to help you and your family plan, borrow, and repay student debt
  • Monthly work-from-home stipend
  • quarterly lifestyle stipend
  • Engaging team-building experiences, ranging from virtual social events to team offsites, promoting collaboration and camaraderie
  • Fulltime
Read More
Arrow Right

Data Engineer

Barbaricum is seeking a Data Engineer to provide support an emerging capability ...
Location
Location
United States , Omaha
Salary
Salary:
Not provided
barbaricum.com Logo
Barbaricum
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Active DoD Top Secret/SCI clearance required
  • 8+ years of demonstrated experience in software engineering
  • Bachelor’s degree in computer science or a related field
  • 8+ years of experience working with AWS big data technologies (S3, EC2) and demonstrate experience in distributed data processing, Data Modeling, ETL Development, and/or Data Warehousing
  • Demonstrated mid-level knowledge of software engineering best practices across the development lifecycle
  • 3+ years of experience using analytical concepts and statistical techniques
  • 8+ years of demonstrated experience across Mathematics, Applied Mathematics, Statistics, Applied Statistics, Machine Learning, Data Science, Operations Research, or Computer Science especially around software engineering and/or designing/implementing machine learning, data mining, advanced analytical algorithms, programming, data science, advanced statistical analysis, artificial intelligence
Job Responsibility
Job Responsibility
  • Design, implement, and operate data management systems for intelligence needs
  • Use Python to automate data workflows
  • Design algorithms databases, and pipelines to access, and optimize data retrieval, storage, use, integration and management by different data regimes and digital systems
  • Work with data users to determine, create, and populate optimal data architectures, structures, and systems
  • and plan, design, and optimize data throughput and query performance
  • Participate in the selection of backend database technologies (e.g. SQL, NoSQL, etc.), its configuration and utilization, and the optimization of the full data pipeline infrastructure to support the actual content, volume, ETL, and periodicity of data to support the intended kinds of queries and analysis to match expected responsiveness
  • Assist and advise the Government with developing, constructing, and maintaining data architectures
  • Research, study, and present technical information, in the form of briefings or written papers, on relevant data engineering methodologies and technologies of interest to or as requested by the Government
  • Align data architecture, acquisition, and processes with intelligence and analytic requirements
  • Prepare data for predictive and prescriptive modeling deploying analytics programs, machine learning and statistical methods to find hidden patterns, discover tasks and processes which can be automated and make recommendations to streamline data processes and visualizations
Read More
Arrow Right

Senior Data Engineer

A typical day may involve collaborating with partners, you will design data mode...
Location
Location
Australia , Sydney
Salary
Salary:
Not provided
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • BS in Computer Science or equivalent experience with 5+ years as Data Engineer or similar role
  • Programming skills in Python & Java (good to have)
  • Design data models for storage and retrieval to meet product and requirements
  • Build scalable data pipelines using Spark, Airflow, AWS data services (Redshift, Athena, EMR), Apache projects (Spark, Flink, Hive, and Kafka)
  • Familiar with modern software development practices (Agile, TDD, CICD) applied to data engineering
  • Enhance data quality through internal tools/frameworks detecting DQ issues
  • Working knowledge of relational databases and SQL query authoring
Job Responsibility
Job Responsibility
  • Design data models, acquisition processes, and applications to address needs
  • Lead business growth and enhance product experiences
  • Collaborate with Product, Engineering, Research and Data Scientists across programs
  • Take ownership of problems from end-to-end: extracting/cleaning data, and understanding source systems
  • Improve the quality of data by adding sources, coding rules, and producing metrics
What we offer
What we offer
  • Health coverage
  • Paid volunteer days
  • Wellness resources
  • Fulltime
Read More
Arrow Right

Senior Data Engineer

Atlassian is looking for a Senior Data Engineer to join our Data Engineering Tea...
Location
Location
United States , San Francisco
Salary
Salary:
135600.00 - 217800.00 USD / Year
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • BS in Computer Science or equivalent experience with 5+ years as Data Engineer or similar role
  • Programming skills in Python & Java (good to have)
  • Design data models for storage and retrieval to meet product and requirements
  • Build scalable data pipelines using Spark, Airflow, AWS data services (Redshift, Athena, EMR), Apache projects (Spark, Flink, Hive, and Kafka)
  • Familiar with modern software development practices (Agile, TDD, CICD) applied to data engineering
  • Enhance data quality through internal tools/frameworks detecting DQ issues
  • Working knowledge of relational databases and SQL query authoring
Job Responsibility
Job Responsibility
  • Collaborating with partners, you will design data models, acquisition processes, and applications to address needs
  • Lead business growth and enhance product experiences
  • Collaborate with Technology Teams, Global Analytical Teams, and Data Scientists across programs
  • Extracting/cleaning data, understanding generating systems
  • Improve data quality by adding sources, coding rules, and producing metrics as requirements evolve
What we offer
What we offer
  • health coverage
  • paid volunteer days
  • wellness resources
  • Fulltime
Read More
Arrow Right

Data Engineer

Atlassian is looking for a Data Engineer to join our Data Engineering Team. You ...
Location
Location
United States , San Francisco
Salary
Salary:
186800.00 USD / Year
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • BS in Computer Science or equivalent experience with 3+ years as a Data Engineer or a similar role
  • Programming skills in Python & Java (good to have)
  • Design data models for storage and retrieval to meet product and requirements
  • Build scalable data pipelines using Spark, Airflow, AWS data services (Redshift, Athena, EMR), Apache projects (Spark, Flink, Hive, and Kafka)
  • Familiar with modern software development practices (Agile, TDD, CICD) applied to data engineering
  • Enhance data quality through internal tools/frameworks detecting DQ issues
  • Working knowledge of relational databases and SQL query authoring
Job Responsibility
Job Responsibility
  • Design data models, acquisition processes, and applications to address needs
  • Lead business growth and enhance product experiences
  • Collaborate with Technology Teams, Global Analytical Teams, and Data Scientists across programs
  • Extract/clean data and understand generating systems
  • Improve the quality of data by adding sources, coding rules, and producing metrics
What we offer
What we offer
  • Health coverage
  • Paid volunteer days
  • Wellness resources
  • Fulltime
Read More
Arrow Right

Senior Data Engineer

Collaborate with engineering and TPM leaders, developers, and process engineers ...
Location
Location
India , Bengaluru
Salary
Salary:
Not provided
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • BS in Computer Science or equivalent experience with 8+ years as a Senior Data Engineer or similar role
  • 10+ Years of progressive experience in building scalable datasets and reliable data engineering practices.
  • Proficiency in Python, SQL, and data platforms like DataBricks
  • Proficiency in relational databases and query authoring (SQL).
  • Demonstrable expertise designing data models for optimal storage and retrieval to meet product and business requirements.
  • Experience building and scaling experimentation practices, statistical methods, and tools in a large scale organization
  • Excellence in building scalable data pipelines using Spark (SparkSQL) with Airflow scheduler/executor framework or similar scheduling tools.
  • Expert experience working with AWS data services or similar Apache projects (Spark, Flink, Hive, and Kafka).
  • Understanding of Data Engineering tools/frameworks and standards to improve the productivity and quality of output for Data Engineers across the team.
  • Well versed in modern software development practices (Agile, TDD, CICD)
Job Responsibility
Job Responsibility
  • Collaborate with engineering and TPM leaders, developers, and process engineers to create data solutions that extract actionable insights from incident and post-incident management data, supporting objectives of incident prevention and reducing detection, mitigation, and communication times.
  • Work with diverse stakeholders to understand their needs and design data models, acquisition processes, and applications that meet those requirements.
  • Add new sources, implement business rules, and generate metrics to empower product analysts and data scientists.
  • Serve as the data domain expert, mastering the details of our incident management infrastructure.
  • Take full ownership of problems from ambiguous requirements through rapid iterations.
  • Enhance data quality by leveraging and refining internal tools and frameworks to automatically detect issues.
  • Cultivate strong relationships between teams that produce data and those that build insights.
  • Fulltime
Read More
Arrow Right
New

Head of Factory Software & Vehicle Diagnostics

At Mach Industries, we are designing and building the world’s most advanced prod...
Location
Location
United States , Huntington Beach
Salary
Salary:
170000.00 - 250000.00 USD / Year
machindustries.com Logo
Mach Industries
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in Computer Science, Electrical Engineering, Mechanical Engineering, Robotics, or a related engineering field
  • 10+ years of experience in software engineering, controls engineering, automated testing, manufacturing software, or firmware systems
  • 5+ years of experience leading technical teams or engineering organizations
  • Proven track record of shipping production-critical software or managing large-scale automated test systems
  • Strong systems-level thinking across software, hardware, networks, and manufacturing workflows
  • Deep expertise in one or more of the following areas: Manufacturing Execution Systems (MES)
  • PLCs and industrial controls (Beckhoff, Siemens, B&R, Allen-Bradley)
  • Firmware flashing, bootloaders, and secure signing
  • Vehicle or embedded diagnostics (CAN, LIN, Ethernet, UDS, custom protocols)
  • Test automation frameworks, HIL systems, or end-of-line validation
Job Responsibility
Job Responsibility
  • Build, lead, and develop a cross-functional organization including manufacturing software engineers, controls engineers, firmware-tools engineers, diagnostic engineers, and data platform engineers
  • Own the end-to-end architecture for factory software, including MES-like systems, build tracking, serialization, and production workflow tools
  • Lead the design and implementation of vehicle flashing, commissioning, and diagnostics pipelines inside the factory
  • Define and deliver the vehicle–factory communication framework (CAN, Ethernet, custom protocols, telemetry ingestion, APIs)
  • Oversee all end-of-line (EOL) software, automated test stands, calibration systems, and data acquisition infrastructure
  • Partner with manufacturing engineering, build engineering, design engineering, flight software, and NPI teams to integrate software tools and processes across the vehicle lifecycle
  • Implement highly reliable production-grade software with redundancy, observability, and real-time data health monitoring
  • Drive rapid iteration and continuous improvement of test coverage, automation, and factory efficiency
  • Own uptime, performance, and correctness for all software critical to production and test operations
  • Establish coding standards, architecture strategies, and long-range roadmaps for factory software and diagnostics
What we offer
What we offer
  • Offers Equity
  • healthcare
  • dental and vision plans
  • retirement savings
  • paid time off
  • funds for continuing education, training, and career growth
  • Fulltime
Read More
Arrow Right