CrawlJobs Logo

Research Engineer, Frontier Evals & Environments - Finance

openai.com Logo

OpenAI

Location Icon

Location:
United States , San Francisco

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

205000.00 - 380000.00 USD / Year

Job Description:

The Frontier Evals team builds north star model evaluations to drive progress towards safe AGI/ASI. This team builds ambitious evaluations to measure and steer our models, and creates self-improvement loops to steer our training, safety, and launch decisions. Some of the team's open-sourced evaluations include SWE-bench Verified, MLE-bench, PaperBench, and SWE-Lancer, and the team built and ran frontier evaluations for GPT4o, o1, o3, GPT 4.5, ChatGPT Agent, and GPT5. If you are interested in feeling firsthand the fast progress of our models, and steering them towards good, this is the team for you.

Job Responsibility:

  • Identify important model capabilities, skills, and behaviors that are crucial to financial workflows, and design methods to quantify performance in these areas
  • Own and pursue a research agenda to identify an important model capability (especially as it relates to financial reasoning) and build evals to measure it
  • Continuously refine evaluations of frontier AI models to assess the extent of frontier capabilities

Requirements:

  • Strong engineering and statistical analysis skills (with at least 2-3 years of full-time technical experience)
  • Passionate about evals for real world applications and knowledge work
  • Detail-oriented and thorough
  • Team player / willing to do a variety of tasks to move the team forward
  • Passionate and knowledgeable about AGI/ASI measurement
  • Able to operate effectively in a dynamic and extremely fast-paced research environment as well as scope and deliver projects end-to-end

Nice to have:

  • An ability to work cross-functionally
  • Excellent communication skills
What we offer:
  • Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts
  • Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)
  • 401(k) retirement plan with employer match
  • Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks)
  • Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees
  • 13+ paid company holidays, and multiple paid coordinated company office closures throughout the year for focus and recharge, plus paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)
  • Mental health and wellness support
  • Employer-paid basic life and disability coverage
  • Annual learning and development stipend to fuel your professional growth
  • Daily meals in our offices, and meal delivery credits as eligible
  • Relocation support for eligible employees
  • Additional taxable fringe benefits, such as charitable donation matching and wellness stipends, may also be provided
  • Offers Equity

Additional Information:

Job Posted:
February 21, 2026

Employment Type:
Fulltime
Work Type:
Hybrid work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Research Engineer, Frontier Evals & Environments - Finance

AI Architect

We’re hiring an AI Architect to sit at the intersection of frontier AI research,...
Location
Location
United States , San Francisco; New York
Salary
Salary:
201600.00 - 241920.00 USD / Year
scale.com Logo
Scale
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Deep technical background in applied AI/ML: 5–10+ years in research, engineering, solutions engineering, or technical product roles working on LLMs or multimodal systems, ideally in high-stakes, customer-facing environments
  • Hands-on experience with model improvement workflows: demonstrated experience with post-training techniques, evaluation design, benchmarking, and model quality iteration
  • Ability to work on hard, ambiguous technical problems: proven track record of partnering directly with advanced customers or research teams to scope, reason through, and execute on deep technical challenges involving frontier models
  • Strong technical fluency: you can read papers, interrogate metrics, write or review complex Python/SQL for analysis, and reason about model-data trade-offs
  • Executive presence with world-class researchers and enterprise leaders
  • excellent writing and storytelling
  • Bias to action: you ship, learn, and iterate.
Job Responsibility
Job Responsibility
  • Translate research → product: work with client side researchers on post-training, evals, safety/alignment and build the primitives, data, and tooling they need
  • Partner deeply with core customers and frontier labs: work hands-on with leading AI teams and frontier research labs to tackle hard, open-ended technical problems related to frontier model improvement, performance, and deployment
  • Shape and propose model improvement work: translate customer and research objectives into clear, technically rigorous proposals—scoping post-training, evaluation, and safety work into well-defined statements of work and execution plans
  • Translate research into production impact: collaborate with customer-side researchers on post-training, evaluations, and alignment, and help design the data, primitives, and tooling required to improve frontier models in practice
  • Own the end-to-end lifecycle: lead discovery, write crisp PRDs and technical specs, prioritize trade-offs, run experiments, ship initial solutions, and scale successful pilots into durable, repeatable offerings
  • Lead complex, high-stakes engagements: independently run technical working sessions with senior customer stakeholders
  • define success metrics
  • surface risks early
  • and drive programs to measurable outcomes
  • Partner across Scale: collaborate closely with research (agents, browser/SWE agents), platform, operations, security, and finance to deliver reliable, production-grade results for demanding customers
What we offer
What we offer
  • Comprehensive health, dental and vision coverage
  • retirement benefits
  • a learning and development stipend
  • generous PTO
  • commuter stipend
  • equity based compensation.
  • Fulltime
Read More
Arrow Right
New

Technical Author-Technical Publications

Position Title: Technical Author-Technical Publications. Reports to: Principal E...
Location
Location
India , Bangalore
Salary
Salary:
Not provided
https://www.randstad.com Logo
Randstad
Expiration Date
April 25, 2026
Flip Icon
Requirements
Requirements
  • Sound technical expertise in the field of technical publications, preferably heavy engineering or automotive
  • Ability to understand/interpret the engineering drawings
  • Good working knowledge creating/updating Operator & Maintenance Manuals (OMM), Service Manuals, Instruction & Training Manuals
  • Proficiency in authoring tools such as Adobe InDesign, Adobe Frame Maker, and Arbortext Editor
  • Knowledge on illustration tools such as Arbortext IsoDraw, Corel Draw, Adobe Illustrator, Adobe Photoshop
  • Working knowledge on CAD tools such as Solidworks, NX would be an added advantage
  • Good communication, and interpersonal skills
  • Graduate in Mechanical Engineering/Automobile, or equivalent from a reputed college
  • Relevant experience of 3 to 5 Years
  • Good in teamwork and co-ordination with other teams
Job Responsibility
Job Responsibility
  • Service and Repair manuals [SRM], Operator & Maintenance Manuals [OMM] using various authoring tools
Read More
Arrow Right
New

Pcb technician

Job Description: Expertise in all type of components soldering Ex. Chip Compone...
Location
Location
India , Pune
Salary
Salary:
Not provided
https://www.randstad.com Logo
Randstad
Expiration Date
April 06, 2026
Flip Icon
Requirements
Requirements
  • Expertise in all type of components soldering Ex. Chip Component, IC’s, through hole components
  • Expertise in component value measurement & component body marking
  • Knowledge about PCB assembly, inspection & Testing
  • Knowledge of pick, Place, Stencil Printer & Reflow Machine
  • Knowledge about electronic circuits & harness diagram
  • Knowledge of assembled PCB board bring up test
  • Maintain 5S
  • Knowledge of electronic components & Connectors
  • experience 5
Read More
Arrow Right
New

TMT Technician

We are seeking a qualified TMT Technician with a B.Sc. degree and 2-6 years of e...
Location
Location
India , Jamshedpur
Salary
Salary:
Not provided
https://www.randstad.com Logo
Randstad
Expiration Date
April 10, 2026
Flip Icon
Requirements
Requirements
  • B.Sc. degree
  • 2-6 years of experience in healthcare
  • Treadmill Test (TMT)
  • Echocardiography (Echo)
  • Strong communication and interpersonal skills
  • Attention to detail
Job Responsibility
Job Responsibility
  • Perform TMT (Treadmill Test) procedures
  • Conduct Echocardiograms (Echo)
  • Ensure patient safety and comfort during tests
  • Maintain and calibrate equipment
  • Record and document test results accurately
  • Collaborate with healthcare professionals
What we offer
What we offer
  • Competitive contract compensation
  • Opportunity to work in a reputable healthcare setting
  • Gain valuable experience in cardiac diagnostics
Read More
Arrow Right
New

Chauffeur ce international - adr

We are looking for an international CE driver with ADR certification for the tra...
Location
Location
Belgium , Fleurus
Salary
Salary:
Not provided
https://www.randstad.com Logo
Randstad
Expiration Date
May 13, 2026
Flip Icon
Requirements
Requirements
  • Valid Category CE driving license with Code 95
  • Medical certificate
  • Driver card
  • Valid ADR certificate
  • Good command of English and/or Dutch, French, or German
Job Responsibility
Job Responsibility
  • Transport of radioactive and nuclear materials across Europe
  • Management of logistical documents such as route paperwork and checklists
Read More
Arrow Right
New

First aid attendant

HIRING FOR A SAFETY AMBASSADOR! Are you eager and excited to start a new job in ...
Location
Location
Canada , Surrey
Salary
Salary:
21.00 CAD / Hour
https://www.randstad.com Logo
Randstad
Expiration Date
March 29, 2026
Flip Icon
Requirements
Requirements
  • Proven experience or strong interest in workplace health and safety
  • Current first aid and CPR certification is required
  • Excellent observational and reporting skills, excellent attention to detail
  • Ability to lift up to 50lbs repetitively
  • Ability to follow precise verbal and written instructions
  • Good organizational skills
  • Must be able to read, write, speak and receive instruction and directions in English
  • Must be eligible to work in Canada
  • Must have basic computer knowledge
  • Holding an active Intermediate First Aid Certification is a must
Job Responsibility
Job Responsibility
  • Worksite Inspections: Assist managers with regular worksite inspections to identify potential hazards and ensure compliance with safety protocols
  • Training and Onboarding: Support the delivery of health and safety training for new and existing employees, ensuring everyone understands and adheres to safety policies
  • First Aid and Incident Response: Provide first aid support for minor injuries and illnesses as needed. Maintain accurate records of all first aid treatments, incidents, and safety observations and report them to Randstad management
  • Hazard Monitoring and Reporting: Actively monitor the warehouse for safety concerns and hazards, and promptly report any issues to management
  • Safe Environment Maintenance: Champion efforts to maintain a clean, organized, and safe working environment for everyone
  • Shipping and Receiving: Scan and track incoming and outgoing orders, receive and process new inventory, and inspect all receivables for defects, damages, or missing items
  • Inventory Management: Perform detailed manual inventory counts, properly scan inventory and incoming goods, and label warehouse stock for quick and easy identification and retrieval
  • Order Fulfillment: Pick, prepare, package, and label goods for shipment, double-checking items to verify the accuracy of outgoing orders
  • General Support: Perform other warehouse tasks as assigned by management to support daily operations
What we offer
What we offer
  • Monday - Friday schedule
  • Free parking and transit accessible
  • Opportunity to work in a safety-driven and team-oriented environment
  • Gain valuable experience in material handling and workplace first aid
  • Be part of a company that values communication, innovation, and employee well-being
  • Fulltime
Read More
Arrow Right
New

Production Assistant

Are you a dedicated and detail-oriented individual looking to build a career as ...
Location
Location
Canada , Delta
Salary
Salary:
22.61 CAD / Hour
https://www.randstad.com Logo
Randstad
Expiration Date
March 30, 2026
Flip Icon
Requirements
Requirements
  • Must be comfortable with repetitive heavy lifting
  • Must be eligible to work in Canada
  • Must be punctual, dependable, and able to commit to full-time hours
  • Previous experience in food production is a strong asset
  • Strong verbal communication skills for effective collaboration within a team environment
  • Excellent attention to detail to ensure product quality and accurate documentation
  • Ability to work effectively and maintain composure in a fast-paced, high-pressure manufacturing setting
  • Proactive problem-solving abilities, particularly when clearing machine jams and addressing minor equipment issues
  • A strong sense of teamwork and cooperation, with a willingness to assist colleagues wherever needed
  • Good physical stamina and the ability to perform repetitive tasks while standing for extended periods - lifting up to 55 lbs
Job Responsibility
Job Responsibility
  • Operate packaging machinery, including baggers, forkers, and dividers, ensuring smooth and continuous production
  • Perform quality control checks on products before packaging, culling any items that do not meet customer specifications
  • Monitor equipment for jams or malfunctions and take prompt action to resolve issues to minimize downtime
  • Accurately package, tray, and box finished products according to specific guidelines for shipping and handling
  • Operate and monitor metal detectors, maintaining meticulous logs as required by food safety protocols
  • Assist with machine changeovers, including swapping bags and Quick Locks for different product runs
  • Maintain a clean, sanitary, and organized work area by sweeping floors, cleaning machinery, and removing waste
  • Adhere strictly to all Good Manufacturing Practices (GMPs) and safety procedures at all times
  • Collaborate with team members and communicate effectively with the Bagger Operator and Shift Supervisor regarding product quality, equipment issues, or production delays
  • Assist with material handling, including stacking trays and boxes on pallets and moving supplies to the production line
What we offer
What we offer
  • Weekly pay deposits with 4% vacation pay added
  • Friendly work environment with a strong, supportive company culture
  • On-site training and orientation provided
  • Long-term opportunity with the potential for permanent hire
  • Fulltime
Read More
Arrow Right
New

Reach Truck Operator

We are seeking a skilled and safety-conscious Forklift Operator to join our clie...
Location
Location
Canada , Surrey
Salary
Salary:
21.00 - 25.00 CAD / Hour
https://www.randstad.com Logo
Randstad
Expiration Date
April 02, 2026
Flip Icon
Requirements
Requirements
  • Minimum of 1-2 years of proven forklift experience in a fast-paced warehouse environment
  • Valid and current forklift operator certification
  • Strong understanding of warehouse safety regulations and best practices
  • Experience with a Warehouse Management System (WMS)
  • Ability to work in narrow aisles and at heights (spatial awareness is key)
  • Proficiency in English with the ability to read and follow work orders/safety guidelines
  • Excellent attention to detail and commitment to accuracy
  • Ability to work independently and as part of a team
Job Responsibility
Job Responsibility
  • Safely and efficiently operate a stand-up reach truck to move products and materials throughout the warehouse
  • Execute order picking and put-away tasks with a high degree of accuracy
  • Load, unload, and stack pallets in high-bay racking systems, ensuring stability and safety
  • Use an RF scanner or other inventory management tools to track product movement and maintain accurate records
  • Conduct daily pre-operational safety checks on the forklift and report any issues immediately
  • Assist in physical inventories and cycle counts
  • Maintain a clean, neat, and orderly work area, adhering to all company safety and OSHA standards
  • Assist with other warehouse duties as assigned
What we offer
What we offer
  • Competitive Pay: Earn a great hourly wage at leading local companies
  • Schedule Variety: We have various schedules and shifts available to fit different lifestyles
  • Team Environment: Be part of a professional and friendly, supportive team
  • Skill Enhancement: Gain valuable knowledge and enhance your skills within the logistics industry
  • Fulltime
Read More
Arrow Right