Mathematics Model Prompt Evaluator Job at SaidGig, Remote

Q3FaRDZZVm53SEM4SWIxcnNORTNOR0dHanc9PQ==
  • SaidGig
  • Remote

Job Description

Role Overview

Expert mathematicians are invited to author and verify high-quality open-ended prompts for AI model evaluation. In this role, you will craft and review challenging, unambiguous mathematical problems across core subdomains, assessing AI reasoning quality and helping establish rigorous evaluation standards for frontier language models.

Task Types

You will be assigned one of two task types:

  • Authoring Task: Create 5 original, open-ended prompts from your assigned subdomain at varying difficulty levels (undergraduate, advanced undergraduate, or graduate/professional). Prompts should require human judgment to evaluate the quality of the AI''s response, such as chain-of-thought reasoning or proof construction.
  • Verification Task: Review 5 authored prompts for clarity, scope alignment, difficulty accuracy, and uniqueness. Edit prompts and difficulty ratings where needed.
Mathematics Subdomains Covered

Probability & Statistics, Algebra (including Linear Algebra), Ordinary/Partial Differential Equations & Dynamical Systems, Geometry, Graph Theory, Number Theory.

Key Responsibilities
  • Author clear, unambiguous, open-ended mathematical prompts that elicit evaluable AI responses.
  • Verify prompts are within the scope of the assigned subdomain and correctly rated for difficulty.
  • Ensure all 5 prompts in a task are sufficiently distinct from one another with varying difficulty levels.
  • Apply expert judgment to assess the depth and quality of mathematical reasoning required.
  • Edit prompts and difficulty assignments where standards are not met.
Ideal Qualifications
  • Master''s degree or higher in Mathematics, Applied Mathematics, Statistics, or a closely related field.
  • 2–6 years of professional or research experience in a quantitative field.
  • Strong command of graduate-level mathematical concepts including proof writing, analysis, and formal reasoning.
  • Experience in academic research, mathematical competition design, or quantitative industry roles is a plus.
  • Excellent written English and ability to craft precise, well-scoped technical questions.
Work Terms

Expected commitment: 10+ hours/week. Asynchronous, fully remote work.

Job Tags

Remote job

Similar Jobs

TOTAL CARE CONNECT

Community Paramedic Job at TOTAL CARE CONNECT

 ...company-supplied devices / technology, onboarding / community paramedic academy training. About Total Care Connect Total Care Connect...  ...Conditions Non-transport, in-home care settings. You will travel between patient homes. Flexible scheduling: shifts may vary... 

Johns Hopkins Medicine

PHYSICIAN - INTERNAL MEDICINE Job at Johns Hopkins Medicine

 ...Johns Hopkins Community Physicians (JHCP) is looking for an Internal Medicine Physician to join our new West Falls Church practice. Johns Hopkins Community Physicians serves Maryland, Northern Virginia and Washington DC with over 40 locations. Benefits: Full-time... 

StarLight Scholars Academy

Mini (13 Seater) School Bus Driver (2025-2026 School Yr) NO CDL REQUIRED! Job at StarLight Scholars Academy

 ...Summary We are looking for a dependable, safety-focused, and professional School Bus Driver to become a valued member of our transportation team for the 20252026 academic year. The ideal candidate will demonstrate a strong commitment to student safety, exhibit punctuality... 

Samsung SDS America

MDM Systems Implementation Engineer Job at Samsung SDS America

 ...About Us: Samsung SDS, the IT and innovation hub of Samsung, delivers innovative cloud, AI, digital logistics, cybersecurity, and enterprise solutions to transform the way businesses work and operate. We serve organizations across industries and are driving digital... 

David Protein

Art Director Job at David Protein

 ...packaging, advertising, website, retail, OOH, and miscellaneous creative projects Ideate campaigns and translate concepts and creative...  ...PTO ~ We work in the office 5 days per week in New York City when culture lines up, it is fun to be in the office together....