Expert Prompt Curators for Advanced AI Evaluation Dataset Job at Mercor, Remote

  • Mercor
  • Remote

Job Description

Role Description

Mercor is collaborating with a leading AI research lab to develop a next-generation evaluation dataset for frontier AI models. We are seeking experts with advanced domain knowledge across diverse fields to design extremely challenging prompts that cannot be solved by existing AI systems without internet search or browsing capabilities. The goal is to create a benchmark dataset that pushes the limits of current AI reasoning and retrieval. This is a short-term research engagement with significant impact on AI evaluation.

Key Responsibilities

  • Create original, expert-level prompts that require tool use (e.g., search, browse, or code execution).
  • Ensure prompts are objective, self-contained, and yield clear, unambiguous answers.
  • Test prompts against advanced AI models and document failures/successes.
  • Provide reasoning steps and solutions for each prompt.
  • Classify prompts into subject domains for dataset organization.
  • Collaborate with reviewers for expert validation and prompt refinement.

Qualifications

  • Advanced academic or professional expertise in a specialized subject (STEM, law, finance, history, cultural studies, etc.).
  • Strong ability to design precise, high-difficulty questions requiring deep knowledge and external references.
  • Experience in academic research, benchmarking, or test question design preferred.
  • Attention to detail and ability to provide concise reasoning explanations.
  • Familiarity with AI models and their limitations is a plus.

Requirements

  • Remote and asynchronous — set your own hours.
  • Expected commitment: ~10–20 hours/week.
  • Project duration: ~2 months, with possible extensions based on dataset needs.
  • Opportunity to contribute to high-impact AI safety and evaluation research.

Compensation & Contract Terms

  • Competitive hourly compensation based on expertise.
  • Independent contractor engagement.
  • Payments for services rendered are processed weekly via Stripe Connect.

Application Process

  • Submit your resume or CV highlighting your subject matter expertise.
  • Complete a brief questionnaire about your background and areas of specialization.
  • Selected applicants may be asked to draft a short test prompt.
  • You’ll receive a follow-up within a few days regarding next steps.

Job Tags

Remote job, Hourly pay, Full time, Contract work, Temporary work, For contractors
