This page was automatically translated and may contain errors. View in English.
Mindrift

Freelance Agent Evaluation Engineer

Mindrift

Qatar · Freelance

가장 먼저 지원하세요

경험
5+ yrs
샐러리
USD 50 / hour
채용 공고
1
게시됨
3일 전

직무 설명

About Mindrift

Mindrift specializes in connecting skilled professionals with project-based opportunities in artificial intelligence, focusing on the testing, evaluation, and enhancement of AI systems for prominent technology firms. Participation is structured around specific projects rather than permanent employment.

Project Overview: AI Coding Agent Evaluation

This project involves the creation of a comprehensive dataset designed to assess the capabilities of AI coding agents. The goal is to determine how effectively these agents can handle authentic developer tasks.

Key Responsibilities

  • Construct realistic developer environments, simulating a virtual company with a complete codebase, necessary infrastructure, and contextual information (including tickets, documentation, and communications) to establish a credible development history.
  • Develop challenging tasks and define precise evaluation criteria within these simulated environments. This includes crafting effective prompts and establishing clear definitions of what constitutes a

답변을 원하시면 남겨주세요. 다른 용도로는 사용하지 않습니다.

클릭하여 살펴보세요드래그 앤 드롭 또는 반죽 스크린샷

PNG, JPG, GIF, MP4, WebM, MOV · 파일당 최대 20MB · 최대 5개 파일