Swift Engineer (5+ YOE) – AI / LLM Code Evaluation (Remote, Contract)

Posted 2026-05-06
Remote · Contract · Immediate start

Company: Mercor.
Type: Contract (Full-time or Part-time).
Location: Remote (Worldwide).
Language: Professional English required.

    Compensation:
  • $30–$90/hour USD (depending on experience & evaluation performance).
  • Weekly payments via Stripe or Wise.
  • Flexible workload (project-based, scalable hours).
    Mission:
  • Work directly with leading AI teams to improve how large language models reason about code, systems design, and technical problem-solving.
  • You will evaluate and refine AI-generated responses, making them more accurate, reliable, and aligned with real-world engineering standards.
    Responsibilities:
  • Evaluate AI-generated answers to coding and system design problems.
  • Execute and validate code outputs.
  • Identify bugs, inefficiencies, and incorrect reasoning.
  • Assess code quality & readability.
  • Assess algorithmic correctness.
  • Assess system design logic.
  • Annotate responses with structured, actionable feedback.
  • Follow defined evaluation frameworks and quality benchmarks.
    Required Skills:
  Core:
  • Swift (expert level).
  • Software Engineering (5+ years).
  • Data Structures & Algorithms.
  • Systems Design.
  • Debugging & Code Review.
  • Problem Solving (Medium–Hard level).
  Technical:
  • Code Execution & Testing.
  • API Design & Backend Logic.
  • Performance Optimization.
  • Version Control (Git).
  AI / Evaluation Context:
  • Experience using LLMs in development workflows.
  • Ability to evaluate reasoning, not just outputs.
    Nice-to-Have Skills:
  • RLHF / AI Model Evaluation.
  • Competitive Programming.
  • Open-source contributions (merged PRs).
  • Multi-language experience (Python, JS, etc.).
  • Technical writing / explaining complex concepts.
    Ideal Candidate:
  • Degree in Computer Science or related field (BS/MS/PhD).
  • Strong real-world engineering background.
  • Detail-oriented and highly analytical.
  • Comfortable identifying subtle logic flaws and edge cases.
  • Able to work independently in async environments.
    What You Will Achieve:
  • Improve the quality and reasoning of AI-generated code.
  • Influence how AI systems assist developers globally.
  • Deliver high-quality evaluation outputs that directly impact model performance.


