Design and evaluate autonomous AI agents across multiple LLMs, providing expert human feedback to leading AI organisations. Assess production-grade modular software architecture and provide high-density technical feedback for LLM training. Work on complex, multi-step architectural workflows.
Key Highlights
Key Responsibilities
Technical Skills Required
Benefits & Perks
Nice to Have
Job Description
- Job Title: AI Agent Developer (Remote)
- Location: Remote (INDIA)
- Work Mode: Fully Remote
Role Overview
Help design and evaluate autonomous AI agents across multiple LLMs, spanning health, education, daily life, and other real-world domains (all coding work). Shape the future of agentic AI systems by providing expert human feedback to leading AI organisations. Help train Large Language Models (LLMs) for complex, multi-step architectural workflows.
Key Responsibilities
AI Agent Evaluation
- Write evaluation rubrics with objective pass/fail criteria
- Debug agent traces to identify failure patterns
- Stress test agents against edge cases, prompt injection, and tool misuse
Technical Assessment
Interested in remote work opportunities in Development & Programming? Discover Development & Programming Remote Jobs featuring exclusive positions from top companies that offer flexible work arrangements.
- Assess production-grade modular software architecture
- Analyse multi-turn system interactions and behaviours
- Provide high-density technical feedback for LLM training
Project Workflow
- Create an account and upload a resume/ID
- Complete the onboarding assessment
- Start earning through flexible task assignments
Qualifications
- Experience in backend engineering, AI automation, or complex systems integration
- Proven ability to build and maintain production-grade software with modular separation (e.g., distinct services for data parsing, logic processing, and reporting)
- Strong command of at least two major languages (e.g., Python, JavaScript, Go, or Java) and experience working with SQL databases
- Practical experience building for live, non-mocked environments and handling multi-turn system interactions
Browse our curated collection of remote jobs across all categories and industries, featuring positions from top companies worldwide.
Preferred (Nice to Have)
- Experience integrating agents with live tools such as Supabase, Gmail, and other APIs
- Familiarity with persistent state and session-tracking patterns
- Experience in identifying privacy leaks, authority escalation, or indirect prompt injection vulnerabilities
Compensation
- Hourly compensation ranges from USD $30–$50, depending on experience and task complexity
- Payments are issued weekly via supported payout platforms (e.g., PayPal or AirTM)
- Full compensation details are provided prior to task acceptance
Equal Opportunity Statement
Selection decisions are based solely on skills, qualifications, and project requirements. We are committed to inclusive and fair engagement practices and consider all qualified applicants without regard to legally protected characteristics.
Apply Now!
Similar Jobs
Explore other opportunities that match your interests
job returns
Tech Lead, Large Language Model Evaluation and Training Datasets
agilegrid solutions