CrawlJobs Logo

Research Engineer, Evaluations - Meta Superintelligence Labs

meta.com Logo

Meta

Location Icon

Location:
United States , Menlo Park

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

219000.00 - 301000.00 USD / Year

Job Description:

Meta is seeking Research Engineers to join the Evaluations team within Meta Superintelligence Labs. Evaluations are the core of AI progress at MSL, determining what capabilities get built, which features get prioritized, and how fast our models improve. As a Research Engineer on this team, you will curate and build the benchmarks for our most advanced AI models, across text, vision, audio, and beyond. You'll work alongside world-class researchers and engineers to collect, develop, and deploy novel benchmarks and reinforcement learning environments. This is a technical role requiring research engineering skills and the ability to work independently on a variety of open-ended machine learning challenges with high reliability. The evaluations you build will directly impact the research direction and major model lines within MSL, making engineering reliability, rigor, and scalability paramount. You will excel by maintaining high velocity while adapting to rapidly shifting priorities as we advance the technical research frontier. You'll need to be flexible and adaptive, tackling a wide variety of problems in the evaluations space, from implementing existing benchmarks to developing novel benchmarks and environments to implementing evaluation tooling at scale.

Job Responsibility:

  • Curate and integrate publicly available and internal benchmarks to direct the capabilities of frontier model development
  • Develop and implement evaluation environments, including environments for novel model capabilities and modalities
  • Collaborate with external data vendors to source and prepare high-quality evaluation datasets
  • Execute on the technical vision of research scientists designing new benchmarks and evaluations
  • Build robust, reusable evaluation pipelines that scale across multiple model lines and product areas
  • Contribute to evaluation tooling that measures the quality and reliability of evaluation suites

Requirements:

  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • 5+ years of experience in machine learning engineering, machine learning research, or a related technical role
  • Proficiency in Python and experience with ML frameworks such as PyTorch
  • Experience identifying, designing and completing medium to large technical features independently, without guidance
  • Software engineering practices including version control, testing, and code review practices
  • Demonstrated experience of working independently and adapting to rapidly changing priorities

Nice to have:

  • Publications at peer-reviewed venues (NeurIPS, ICML, ICLR, ACL, EMNLP, or similar) related to language model evaluation, benchmarking, or deep learning
  • Hands-on experience with language model post-training and deep learning systems, or building reinforcement learning environments
  • Experience implementing or developing evaluation benchmarks for large language models and multimodal models (e.g., vision-language, audio, video)
  • Experience working with large-scale distributed systems and data pipelines
  • Familiarity with language model evaluation frameworks and metrics
  • Track record of open-source contributions to ML evaluation tools or benchmarks
What we offer:
  • bonus
  • equity
  • benefits

Additional Information:

Job Posted:
February 17, 2026

Employment Type:
Fulltime
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Research Engineer, Evaluations - Meta Superintelligence Labs

New

Research Engineering Manager, Evaluations, Meta Superintelligence Labs

Meta is seeking a Research Engineering Manager to lead the Evaluations team with...
Location
Location
United States , Menlo Park
Salary
Salary:
219000.00 - 301000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's or Master's degree in Computer Science, Machine Learning, or a related technical field
  • 4+ years of experience in machine learning engineering, machine learning research, or a related technical role
  • 3+ years of experience managing or leading technical teams, including hiring, mentoring, and performance management
  • Proficiency in Python and experience with ML frameworks such as PyTorch
  • Proven track record of leading medium to large-scale technical projects from conception to deployment
  • Demonstrated experience balancing hands-on technical work with people management and strategic planning
  • Clear communication and experience influencing cross-functional stakeholders
Job Responsibility
Job Responsibility
  • Build, mentor, and grow a team of research engineers and scientists focused on evaluation infrastructure and benchmarking
  • Conduct performance reviews, career development conversations, and provide technical mentorship to team members
  • Foster a culture of engineering excellence, research rigor, and rapid iteration within the team
  • Partner with recruiting to hire world-class research engineering talent
  • Curate and integrate publicly available and internal benchmarks to direct the capabilities of frontier model development
  • Oversee the development and implementation of evaluation environments, including environments for novel model capabilities and modalities
  • Establish partnerships with external data vendors to source and prepare high-quality evaluation datasets
  • Influence the technical roadmap for evaluation infrastructure in collaboration with MSL Infra team
  • Translate the technical vision of research scientists into actionable engineering plans and execution strategies
  • Partner with research scientists, product teams, and other engineering teams to align evaluation priorities with organizational goals
What we offer
What we offer
  • bonus
  • equity
  • benefits
  • Fulltime
Read More
Arrow Right

AI Research Scientist, Personalization, Meta SuperIntelligence Labs

Meta is seeking AI research scientists to help us build the solutions for Person...
Location
Location
United States , Menlo Park
Salary
Salary:
154000.00 - 217000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • Phd in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • Experience in Generative AI models and building LLM technologies particularly post training
  • Experience solving complex problems and comparing alternative solutions, tradeoffs, and different perspectives to determine a path forward. Proven experience of proactively identifying, scoping and implementing innovative research solutions
  • Programming experience in Python and hands-on experience with frameworks like Pytorch, Spark
  • Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment
Job Responsibility
Job Responsibility
  • Collaborate with cross-functional teams to develop and improve personalization in Meta’s frontier foundation models
  • Directly contribute to experiments, including designing experimental details, authoring reusable code, running evaluations, and organizing results
  • Prioritize research that can be applied to Meta's product development
  • Lead complex research projects end-to-end
What we offer
What we offer
  • bonus
  • equity
  • benefits
Read More
Arrow Right
New

Product Analyst

We are looking for a Mid-Level Product Data Analyst to work closely with the Pro...
Location
Location
Salary
Salary:
Not provided
rubylabs
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Degree in an analytical field (Math, Data Science, Economics, or similar)
  • 2–4 years of experience as a Product or Data Analyst
  • Strong experience with Mixpanel (funnels, cohorts, custom events, breakdowns)
  • Experience with Product Analytics and user behavior analysis
  • Strong SQL skills for extracting and validating product data
  • Ability to work with ad hoc requests and ambiguous product questions
  • Strong analytical thinking and attention to detail
  • Clear communication skills — able to explain insights to non-technical stakeholders
  • Comfortable working in a fast-moving product environment
Job Responsibility
Job Responsibility
  • Own product analytics and user behavior tracking across the platform
  • Build, maintain, and optimize Mixpanel dashboards, funnels, cohorts, and reports
  • Monitor core product KPIs (activation, retention, engagement, conversion, feature adoption)
  • Perform deep user behavior analysis to identify friction points and opportunities
  • Analyze feature performance before and after releases
  • Run ad hoc analyses for Product Managers and Leadership with fast turnaround
  • Identify trends, anomalies, and performance changes in product metrics
  • Support A/B tests and experiments with proper measurement and post-analysis
  • Work closely with Product and Engineering to ensure correct event tracking, naming conventions, and data quality
  • Translate data into clear, actionable insights — not just charts
What we offer
What we offer
  • Remote Work Environment
  • Unlimited PTO
  • Paid National Holidays
  • Company-provided MacBook
  • Flexible Independent Contractor Agreement with tax advantages, networking opportunities, reduced employment obligations, and the freedom to work from anywhere
  • Fulltime
Read More
Arrow Right
New

Engineer, Supplier Quality Engineering

We are seeking a detail-oriented and analytical Supplier Quality Engineer to joi...
Location
Location
Malaysia , Penang
Salary
Salary:
Not provided
sandisk.com Logo
Sandisk
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Engineering (Electrical Electronic, Material Science, Physics or equivalent)
  • Internship with working experience of supplier management for components or product engineer where gets to learn manufacturing processes, reliability and quality requirement for components such as Digital/Analog IC, semiconductor packaging, crystals, diodes, transistors, capacitors, inductors, and regulators
  • Hands-on experience in process, test, and quality
  • Experience in ASIC, PMIC and memory devices is an added advantage
Job Responsibility
Job Responsibility
  • Own all aspects of supplier quality and reliability for Electrical Components, along with program management of supplier readiness in support of new product development
  • Work with suppliers, and internal hardware, firmware, product quality and reliability engineering teams to ensure electrical components perform to expectations in Sandisk products
  • Understand and be familiar with supplier manufactory processes to identify KPIV parameters
  • Audit supplier QMA and manufacturing processes
  • Drive supplier resolution of device failures via the 8D method
  • Coordinate with Program Management, Design Engineering, Procurement and Factory for successful program execution with multiple suppliers, making recommendations on supplier selection
  • Conduct supplier audits, process assessments to ensure compliance with quality standards
  • Evaluate and score for periodic business reviews
  • Review supplier process changes notices and plans the appropriate level of qualification activities
  • Collaborate with cross-functional teams to resolve quality issues and improve processes
  • Fulltime
Read More
Arrow Right
New

Software Engineer, Data Governance

Join us in building the future of finance. Our mission is to democratize finance...
Location
Location
United States , Bellevue
Salary
Salary:
166000.00 - 195000.00 USD / Year
robinhood.com Logo
Robinhood
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong coding and problem-solving skills with proficiency in Python or Go (or similar languages)
  • Experience with server-side frameworks such as Django or GoLang
  • Familiarity with Kubernetes, AWS, and cloud-native development
  • Excellent communication skills with a proven ability to work cross-functionally
  • Curiosity and drive to navigate complex systems, regulatory requirements, and fast-changing technology
Job Responsibility
Job Responsibility
  • Design and build backend services and automation frameworks that instrument, govern, and enforce data usage, retention, and access policies across Robinhood’s online and analytical systems
  • Partner with AI/ML, Risk, and Privacy teams to operationalize governance and compliance in emerging AI systems, including ML Models and Agentic AI workflows
  • Enable governance Robinhood’s offline analytical systems and new data infrastructure workflows
  • Build internal tools and automation that strengthen our enterprise data governance posture by enabling auditability, data integrity, and privacy respecting design across our infrastructure
  • Own end-to-end delivery of governance solutions from design and prototyping to production deployment driving measurable impact in data reliability, compliance readiness, and trust
What we offer
What we offer
  • Performance-driven compensation with multipliers for outsized impact, bonus programs, equity ownership, and 401(k) matching
  • 100% paid health insurance for employees with 90% coverage for dependents
  • Lifestyle wallet — a highly flexible benefits spending account for wellness, learning, and more
  • Employer-paid life & disability insurance, fertility benefits, and mental health benefits
  • Time off to recharge including company holidays, paid time off, sick time, parental leave, and more
  • Exceptional office experience with catered meals, events, and comfortable workspaces
  • Fulltime
Read More
Arrow Right
New

Fashion Associate

Are you passionate about creating an exceptional shopping experience? Do you enj...
Location
Location
Canada , Kingston
Salary
Salary:
Not provided
rw-co.com Logo
R&W
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience in retail, sales, or customer service is preferred
  • Ability to multitask and stay organized in a fast-paced, dynamic environment
  • Customer-focused mindset with the ability to create a welcoming and engaging experience
  • Strong communication and interpersonal skills
  • Ability to work in a fast-paced environment and adapt to changing priorities
  • Team player with a positive attitude and attention to detail
  • Passion for fashion: good sense of style and solid knowledge of fashion trends
  • Proficient in POS, ERP, ATS systems and Microsoft Office Suite
  • Flexible availability
  • able to work evenings, weekends and holidays
Job Responsibility
Job Responsibility
  • Deliver an outstanding shopping experience that builds strong customer relationships and loyalty
  • Process transactions efficiently and accurately to ensure customers feel valued at checkout
  • Promote a customer-first approach by providing personalized assistance, answering questions, and contributing to an inclusive, welcoming environment
  • Use product knowledge and current promotions to drive add-on sales and support store performance
  • Maintain a clean, organized, and visually appealing sales floor
  • Support daily store operations to ensure smooth and consistent execution
What we offer
What we offer
  • Flexible Hours: We adapt to your availability to offer a schedule that fits your needs
  • Career Advancement: Opportunities for professional growth and career development*
  • Paid Time Off: Flexible days and vacation time to help you balance school, work and personal life*
  • Enjoy up to 70% off on personal purchases in your store and 50% off all RCL brands (Reitmans, RW&CO, PENN. Penningtons)*
  • Generous Referral Policy: Refer your professional network and earn rewards for every successful hire – the more you refer, the more you earn!
  • Parttime
Read More
Arrow Right
New

Software Engineering Intern, Android

Join us in building the future of finance. Our mission is to democratize finance...
Location
Location
United States , Menlo Park
Salary
Salary:
48.00 USD / Hour
robinhood.com Logo
Robinhood
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently enrolled in a full-time, degree-seeking program with an expected graduation date in Winter 2026/Spring 2027
  • A solid foundation in software engineering
  • An interest in front-end work, specifically visually driven product design and shaping user experience
  • Familiarity with Android components and experience building or contributing to at least one Android app
  • Experience building, maintaining, or contributing to open-source projects
  • Familiarity with Kotlin, RxJava 2, Dagger 2, and other libraries in our tech stack
  • A self-starter mentality and a proactive approach
Job Responsibility
Job Responsibility
  • Collaborate with our award-winning design team throughout the full product lifecycle
  • Contribute to new features and play a role in building products that will change how America views finance
  • Help shape the future of our Android platform
  • Explore and build innovative Android UI/UX
  • Work with technologies such as Kotlin, RxJava 2, Retrofit/OkHttp, Dagger 2, and Room
What we offer
What we offer
  • Market competitive compensation structure
  • Quarterly lifestyle wallet for personal wellness, learning and development, and more
  • Time away including company holidays, paid time off, and sick time
  • Lively office environment with catered meals, fully stocked kitchens, and geo-specific commuter benefits
  • Fulltime
Read More
Arrow Right
New

UX Researcher

We’re building a category-leading product in the QR code space, and we’re lookin...
Location
Location
Salary
Salary:
Not provided
rubylabs
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 2–4 years of experience in UX research, funnel analysis, growth optimization, or product analytics
  • Strong experience analyzing user behavior using session recording tools (e.g., Hotjar, FullStory, Clarity) and web analytics (e.g., GA4, Amplitude, Mixpanel)
  • Ability to synthesize both qualitative and quantitative data into actionable insights
  • Experience collaborating with Product, Design, and Growth teams in a fast-moving environment
  • Strong understanding of conversion funnels, A/B testing, and iterative optimization processes
  • Excellent communication skills and the ability to advocate for the user while driving business outcomes
  • Proactive, entrepreneurial mindset — able to take ownership and drive projects end-to-end
Job Responsibility
Job Responsibility
  • Analyze user session recordings, heatmaps, and analytics to identify pain points and UX friction
  • Conduct competitor research to benchmark best practices and identify differentiation opportunities
  • Suggest and prioritize UX improvements across the entire customer funnel — from landing pages to onboarding to product engagement
  • Collaborate with the Growth Manager to design, prioritize, and run A/B tests and experiments
  • Work with Product, Growth, and Engineering teams to implement improvements
  • Continuously monitor results, validate hypotheses, and refine strategies based on data
What we offer
What we offer
  • Remote Work Environment
  • Unlimited PTO
  • Paid National Holidays
  • Company-provided MacBook
  • Flexible Independent Contractor Agreement
  • Fulltime
Read More
Arrow Right