Freelance Agent Evaluation Engineer
About This Gig
Please submit your CV in English and indicate your level of English proficiency.

Mindrift connects specialists with project-based AI opportunities at leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment.

What this opportunity involves
We're building a dataset to evaluate AI coding agents: how well a model handles real-world developer tasks. You'll create challenging tasks and evaluation criteria within realistic simulated environments:

- Build virtual companies following a high-level plan: a codebase, infrastructure, and context (conversations, documentation, tickets) that together form a realistic environment with development history
- Assemble and calibrate tasks from intermediate states of the virtual company: craft the prompt, define the evaluation criteria, and ensure the task is solvable and the evaluation is fair
- Design tasks set in isolated environments that emulate a developer's workstation: a Linux machine
Skills & Tags
About the Seller
Mindrift on Himalayas