Analyze flagged AI conversations to assess malicious intent across four domains: data exfiltration, ransomware, worms, and exploits. Apply offensive security expertise to distinguish legitimate research from genuine attack intent. Directly train AI safety classifiers through ground-truth labeling.
Key Highlights
Key Responsibilities
Technical Skills Required
Benefits & Perks
Job Description
Cyberattacks cause billions in damages annually โ ransomware cripples hospitals, data exfiltration exposes millions. As a Cybersecurity Labeling Expert, you'll be on the front lines of AI safety: reviewing real-world conversations flagged as potentially malicious and determining whether they represent genuine threats. Your judgments directly train the systems that keep AI out of the hands of bad actors.
What you'll doAnalyze flagged AI conversations โ ranging from plain text to code-heavy exchanges โ and apply your security expertise to assess intent and harm across four domains:
- Scaled data exfiltration
- Ransomware
- Worms / self-replicating code
- Local & remote exploits
Some conversations may involve POC exploit development; your expertise will determine what crosses the line.
Interested in remote work opportunities in Cyber Security? Discover Cyber Security Remote Jobs featuring exclusive positions from top companies that offer flexible work arrangements.
The difference between a security researcher and a threat actor often comes down to context, specificity, and intent โ exactly what automated systems struggle to detect. Your ground-truth labels directly improve the classifiers that decide what AI will and won't help with.
What we're looking for- Hands-on offensive security background: red team, malware analysis, pen testing, or exploit research
- Ability to read between the lines โ distinguishing legitimate security work from genuine attack intent
- Comfort interpreting code-heavy conversations
- Tier 2โ3 experience: Masters / early-career through Senior / Principal
You're a strong fit if you've done red team consulting, threat intelligence analysis, vulnerability research, or AI safety labeling where nuanced judgment under ambiguity is routine.
Logistics- Requires access to a secure review interface and ability to handle PII
Browse our curated collection of remote jobs across all categories and industries, featuring positions from top companies worldwide.
We consider all qualified applicants without regard to legally protected characteristics and provide reasonable accommodations upon request.
Contract and Payment Terms- You will be engaged as an independent contractor.
- This is a fully remote role that can be completed on your own schedule.
- Projects can be extended, shortened, or concluded early depending on needs and performance.
- Your work at Mercor will not involve access to confidential or proprietary information from any employer, client, or institution.
- Payments are weekly on Stripe or Wise based on services rendered.
- Please note: We are unable to support H1-B or STEM OPT candidates at this time.
Similar Jobs
Explore other opportunities that match your interests
Senior Staff Engineer - AI Security
GEICO
Alignerr
Senior Product Security Engineer - Cryptography