CrawlJobs Logo

Data Curator and Annotator

aciinfotech.com Logo

ACI Infotech

Location Icon

Location:

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

The Data Curator and Annotator will be responsible for curating, labeling, and maintaining high-quality datasets to support ML training, RAG pipelines, and evaluation. This role requires precision in annotation, strong attention to detail, and the ability to establish reliable guidelines and workflows. The ideal candidate will collaborate closely with engineers and data scientists to ensure datasets are accurate, secure, and aligned with business and research needs.

Job Responsibility:

  • Curate and label datasets for ML training and evaluation
  • Define annotation guidelines and quality control processes
  • Develop efficient labeling workflows with quality gates
  • Ensure privacy, security, and bias mitigation in datasets
  • Collaborate with engineers and data scientists to improve data utility
  • Build trusted evaluation datasets for ranking and RAG tasks

Requirements:

  • Experience labeling or curating datasets for NLP or search
  • Familiarity with annotation tools such as Label Studio or Prodigy
  • Strong attention to detail and commitment to labeling consistency
  • Comfort working with enterprise domain data
  • Experience with QA processes for annotation quality
  • Strong written communication for guideline creation
  • Respect for privacy, security, and ethical data principles

Nice to have:

  • Domain knowledge in BFSI, retail, or healthcare
  • Experience creating evaluation datasets for LLMs
  • Multi-lingual annotation experience
  • Comfort with basic Python scripting

Additional Information:

Job Posted:
December 14, 2025

Employment Type:
Fulltime
Work Type:
Remote work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Data Curator and Annotator

AI Data Manager

This is not a standard data management role; it’s a rare opportunity to be at th...
Location
Location
United States , Palo Alto
Salary
Salary:
140000.00 - 260000.00 USD / Year
lumalabs.ai Logo
Luma AI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 2+ years of hands-on experience in AI data operations, human data annotation, or a similar data-centric role within a top-tier AI company
  • direct experience translating complex researcher needs into effective data curation and annotation workflows
  • highly adaptable and thrive on cross-functional collaboration, with a proven ability to work across a comprehensive data pipeline, not just within a single vertical like human annotation
  • experience working with vision or multimodal data pipelines
  • a hands-on individual contributor who is driven by the work, not by people management
Job Responsibility
Job Responsibility
  • Translate researcher needs into actionable data annotation and curation strategies for our SOTA vision, 3D, and audio models
  • own and manage end-to-end data pipelines and annotation workflows, collaborating with external partners and labeling teams to ensure the highest quality data
  • provide horizontal management across multiple data pipelines, ensuring consistency and quality as we expand into new modalities
  • develop innovative data curation strategies, working with a diverse mix of human-annotated, raw, and synthetic data to solve complex model challenges
  • partner directly with researchers to diagnose model performance issues and propose data-driven solutions to improve results
  • define the standards for data quality and annotation excellence, establishing the foundation for how Luma scales its data operations
  • Fulltime
Read More
Arrow Right

Software Engineer, Robotics

Scale's Robotics business unit is dedicated to solving the data bottleneck in Ph...
Location
Location
Argentina; Uruguay
Salary
Salary:
Not provided
scale.com Logo
Scale
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • At least 6 years of high-proficiency software engineering experience
  • Strong programming skills in Python and TypeScript/Node.js for production systems
  • Experience with React and modern frontend development for 3D interfaces
  • Concurrent and real-time systems, with special attention to timing constraints
  • Understanding of distributed systems, workflow orchestration, and cloud infrastructure (AWS, Temporal, Kubernetes, Docker)
  • Experience with databases (MongoDB, PostgreSQL) and data processing at large scale
  • Track record of working with cross-functional teams including ML engineers, researchers, and customers
  • Strong communication skills and ability to operate with high autonomy
Job Responsibility
Job Responsibility
  • Own and architect large-scale data processing pipelines for robotics and autonomous vehicle datasets
  • Build ML training and fine-tuning pipelines using Scale's robotics data
  • Work across backend (Python, Node.js, C++), and frontend (React, TypeScript) stacks to build end-to-end solutions
  • Develop tools and real-time systems for robotics data collection, teleoperation, model evaluation, data curation, and data annotation
  • Interact directly with robotics and AV stakeholders to understand their technical needs and drive product development
  • Design comprehensive monitoring and evaluation frameworks for robotics models and data quality
  • Solving complex, late-stage industry challenges in concurrent and real-time robotic systems, with strict attention to timing constraints and data integrity
  • Collaborate with ML engineers and researchers to bring robotics research into production
  • Deliver features at high velocity while maintaining system reliability and performance
Read More
Arrow Right

Software Engineer, Robotics

Scale's Robotics business unit is dedicated to solving the data bottleneck in Ph...
Location
Location
Mexico , Mexico City
Salary
Salary:
Not provided
scale.com Logo
Scale
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 3+ years of high-proficiency software engineering experience, with a strong background in complex systems and the ability to independently research, analyze, and unblock hard technical problems
  • Strong programming skills in Python and TypeScript/Node.js for production systems
  • Experience with React and modern frontend development for 3D interfaces
  • Concurrent and real-time systems, with special attention to timing constraints
  • Understanding of distributed systems, workflow orchestration, and cloud infrastructure (AWS, Temporal, Kubernetes, Docker)
  • Experience with databases (MongoDB, PostgreSQL) and data processing at large scale
  • Track record of working with cross-functional teams including ML engineers, researchers, and customers
  • Strong communication skills and ability to operate with high autonomy
Job Responsibility
Job Responsibility
  • Own and architect large-scale data processing pipelines for robotics and autonomous vehicle datasets
  • Build ML training and fine-tuning pipelines using Scale's robotics data
  • Work across backend (Python, Node.js, C++), and frontend (React, TypeScript) stacks to build end-to-end solutions
  • Develop tools and real-time systems for robotics data collection, teleoperation, model evaluation, data curation, and data annotation
  • Interact directly with robotics and AV stakeholders to understand their technical needs and drive product development
  • Design comprehensive monitoring and evaluation frameworks for robotics models and data quality
  • Solving complex, late-stage industry challenges in concurrent and real-time robotic systems, with strict attention to timing constraints and data integrity. This often involves deep investigation, reviewing academic papers, and direct collaboration with robotics vendors
  • Collaborate with ML engineers and researchers to bring robotics research into production
  • Deliver features at high velocity while maintaining system reliability and performance
Read More
Arrow Right

Senior Expert for Industry Foundation Models

You drive the development of domain specific AI and shape the next generation of...
Location
Location
Germany , Munich
Salary
Salary:
Not provided
bmw.de Logo
BMW
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Advanced degree in computer science, mathematics, data science, or a related field, or equivalent senior industry experience
  • Five to ten years of professional experience, including several years in advanced AI and technical leadership roles
  • Ability to translate complex technical concepts into AI strategies, products, roadmaps, and measurable impact
  • Deep expertise in foundation models, particularly multimodal and reasoning models, including adaptation and post training methods
  • Proven experience building large scale AI systems, including distributed training, high throughput inference, GPU acceleration, and cost optimization
  • Strong background in industrial or automotive data, processes, and IT systems
  • Excellent software engineering skills in Python and modern ML frameworks, with the ability to adapt to internal platforms and toolchains
Job Responsibility
Job Responsibility
  • You define the technical and business product vision and system architecture for Large Industry Models, covering model, data, and platform layers
  • You lead and technically mentor an engineering team building multimodal foundation model stacks, including language, vision, and action models
  • You guide core technical and architectural decisions, selecting and adapting foundation models and designing scalable AI systems
  • You develop a comprehensive industrial data strategy, including data sourcing, curation, annotation, and feedback loops
  • You ensure reliable delivery of LIMs into cloud production environments with strong MLOps, evaluation, safety, and compliance standards
What we offer
What we offer
  • Challenging projects with which we shape the mobility of tomorrow together
  • Wide range of personal and professional development opportunities
  • Attractive, fair and performance-related remuneration
  • High level of job security
  • Annual special payments such as vacation pay, Christmas bonus, and profit sharing
  • Flexible working hours including six weeks annual leave and overtime compensation
  • Discounted BMW & MINI conditions
  • Fulltime
Read More
Arrow Right

Research Intern - GenAI

Appen is seeking Research Interns to support innovative research in Generative A...
Location
Location
Australia , Chatswood, Sydney
Salary
Salary:
Not provided
appen.com Logo
Appen
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Postgraduate students in Linguistics, Computer Science, AI, Data Science, or similar disciplines preferred
  • strong final-year and recent undergraduate candidates in these fields will also be considered
  • Familiarity with programming languages such as Python, R, or similar tools used in data analysis and machine learning
  • Experience with data annotation, model evaluation, or prompt engineering
  • Understanding of multilingual NLP, speech technologies, or agentic AI systems
  • Strong written communication skills, especially for summarizing research and drafting technical content
  • Ability to work independently and collaboratively in a remote research environment
Job Responsibility
Job Responsibility
  • Conduct literature reviews on topics such as adversarial prompting, multilingual evaluation, and agentic AI
  • Assist in dataset curation, annotation, and quality assurance for speech, text, and multimodal data
  • Support model evaluation experiments, including prompt engineering and red teaming
  • Develop scripts and tools for data analysis, visualization, and automation
  • Contribute to internal documentation, research reports, and thought leadership content
  • Participate in team meetings and cross-functional collaborations
  • Help prepare materials for conferences, publications, and workshops
What we offer
What we offer
  • Hands-on experience in applied AI research with real-world impact
  • Mentorship from experienced researchers and exposure to industry workflows
  • Opportunities to contribute to publications, datasets, and thought leadership
  • A collaborative and inclusive research environment
Read More
Arrow Right

Metabolomics Scientist

Join Enveda as a Metabolomics Scientist in Boulder, CO, and help us transform na...
Location
Location
United States , Boulder
Salary
Salary:
131000.00 - 140000.00 USD / Year
enveda.com Logo
Enveda
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • PhD in chemistry or a related scientific discipline with 2+ years of experience in LC-MS-based metabolomics
  • Strong expertise in metabolite annotation and identification
  • Proficiency with one or more metabolomics data processing tools (e.g., MZmine, XCMS, MS-DIAL, MetaboScape)
  • Hands-on experience with large-scale metabolomics studies, ideally involving human biospecimens
Job Responsibility
Job Responsibility
  • Serve as a subject-matter expert in metabolite annotation and identification from LC-MS-based metabolomics data
  • Lead follow-up investigations on mis-annotations and unknown features
  • Build, curate, and maintain internal spectral libraries to strengthen metabolite annotation capabilities
  • Contribute to data QC, review, and troubleshooting, helping to continuously improve robustness and reproducibility of the platform
What we offer
What we offer
  • 90% Medical, Dental, Vision
  • 401k Match
  • Flexible PTO
  • Adoption Assistance
  • Fulltime
Read More
Arrow Right

Music Directors and Composers - Ai trainer

Join our team as a Music Directors and Composers - Ai trainer, where your expert...
Location
Location
India , Noida
Salary
Salary:
Not provided
aqusag.com Logo
AquSag Technologies
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Proven experience as a music director, composer, or arranger across diverse genres
  • Advanced knowledge of music theory, orchestration, composition techniques, and digital music technologies
  • Exceptional written and verbal communication skills with a collaborative, detail-oriented approach
  • Demonstrated ability to curate, annotate, and critique music for educational or professional purposes
  • Comfort with remote collaboration tools and digital audio workstations (DAWs)
  • Passion for innovation at the intersection of music and technology
  • Strong organizational skills with the ability to manage multiple priorities in a fast-paced environment
  • Experience: 2+ years
Job Responsibility
Job Responsibility
  • Design and deliver high-quality training data, guiding AI systems in understanding musical direction, composition, and performance nuances
  • Review, annotate, and curate music samples, scores, and notations to support AI model development
  • Collaborate with engineers and data scientists to interpret model results and suggest creative improvements
  • Develop guidelines for data curation with a strong focus on musicality, genre, and emotional expression
  • Provide actionable feedback on AI-generated music outputs to ensure musical integrity and industry relevance
  • Lead workshops or knowledge-sharing sessions to upskill team members on music theory, arrangement, and emerging trends
  • Communicate complex musical concepts clearly and effectively in both written and verbal forms
  • Fulltime
Read More
Arrow Right

Research Technician II

As a community, the University of Rochester is defined by a deep commitment to M...
Location
Location
United States of America , Rochester
Salary
Salary:
19.96 - 27.94 USD / Hour
urmc.rochester.edu Logo
University of Rochester
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree (BS/BA) in Psychology, Neuroscience, Cognitive Science, Computer Science, or related field
  • Min 1-year research experience OR undergraduate honors thesis
  • Proficiency in Matlab or Python
  • familiarity with the other
Job Responsibility
Job Responsibility
  • Regularly provides routine technical support for research activities and/or laboratory support
  • Assists with the design of new experiments, multimedia sampling, maintenance, operation and calibration of scientific monitoring equipment for research projects
  • Responsible for general data collection and assisting with analysis, recruiting participants, and may support and assist other technicians
  • Managing day-to-day research activities for the laboratories
  • Helping to run experiments with human participants and training new lab members in use of lab techniques and equipment
  • Help write and update lab IRB protocols
  • Keep track of equipment needs, help order essential supplies
  • Maintain lab calendars, schedule lab meetings/events
  • Design and/or maintain lab websites across URMC, BCS, external platforms
  • Set up and manage psychophysics machines, remote links, and rooms
  • Fulltime
Read More
Arrow Right