Freelance AI Evaluation Engineer (Python/Full-Stack)
Mindrift
About This Role
Create challenging coding test cases for AI systems, review and refine production codebases, and analyze AI failures. Work on part-time, non-permanent projects for leading tech companies.
Requirements
• Degree in Computer Science, Software Engineering, or related fields
• 5+ years in software development, primarily Python (pytest, async/await, subprocess, file operations)
• Background in full-stack development, with equal focus on building React-based interfaces and robust back-end systems
• Experience writing tests (functional and integration), not just running them
• Experience with Docker (running evaluations locally in containers)
• Understanding of CI/CD (GitHub Actions as a user: triggers, labels, reading results)
• English proficiency at B2 level
Benefits
• Flexible work schedule
• Opportunity to work on challenging projects with leading tech companies
• Potential earnings equivalent to up to $30 per hour
Originally posted on Himalayas