[Remote] Online Generalist Jobs in Canada
Posted 2026-05-06
Remote, USA
Full-time
Immediate Start
Note: The job is a remote job and is open to candidates in USA. Rex.zone is hiring full-time remote Online Generalists to support AI/ML data operations for large language model training pipelines. The role involves completing online tasks such as data labeling, RLHF preference ranking, prompt evaluation, and QA evaluation across various datasets.
Responsibilities
- Perform data labeling for NLP tasks (classification, summarization, named entity recognition) and LLM evaluation workflows
- Create preference data for RLHF by ranking/scoring model outputs and writing clear rationales
- Conduct prompt evaluation and response scoring using web-based evaluation interfaces
- Execute QA evaluation, adjudication, and consistency checks to improve training data quality
- Handle content safety labeling using policy-based decisions and detailed guidelines
- Complete computer vision annotation including image tagging, bounding boxes, and segmentation as needed
- Document edge cases, follow versioned guidelines, and escalate ambiguity appropriately
Skills
- Based in Canada and able to work full-time remotely
- Strong reading comprehension, attention to detail, and consistent decision-making
- Comfortable following detailed annotation guidelines and meeting quality standards
- Experience in data labeling, QA evaluation, RLHF, or LLM evaluation
Company Overview