Red Team Harmful Manipulation Evaluation AI Trainer
LinkedIn · United States
Job description
About the role
We are looking for experienced professionals in behavioral science, trust & safety, or human‑computer interaction to join our AI Labor Marketplace as remote, part‑time consultants. As an AI Trainer, you will help improve large language model safety by designing and evaluating adversarial prompts that simulate harmful manipulation scenarios.
Key responsibilities
- Design realistic adversarial prompts that target manipulation and influence risks.
- Execute prompts against AI systems and capture model outputs.
- Apply structured annotation rubrics to assess model behavior.
- Provide clear written justifications for each evaluation.
- Review peer submissions to ensure quality and consistency.
- Identify edge cases and nuanced failure modes.
- Incorporate feedback and maintain calibration over time.
Required profile
- Background in behavioral science, social psychology, trust & safety, HCI, disinformation research, or a related field.
- 3–10+ years of relevant professional or research experience.
- Strong analytical writing skills and sound decision‑making under ambiguity.
- Experience with AI evaluation, red‑team activities, or content policy is preferred.
- Ability to apply structured guidelines consistently across tasks.
What we offer
- Hourly compensation ranging from $100 to $120.
- Flexible, self‑managed schedule.
- Remote work with no location‑specific constraints.
- Project‑based engagement allowing simultaneous work with other vendors, subject to their policies.
Published 3 hours ago
Expires 1 month from now