Job Posting
Posted today
Closes Aug 3, 2026

Reinforcement Learning Environments Engineer - Cybersecurity

Preference Model - Toronto, Ontario

Listing sourced from adzuna on 7/5/2026. CVCraft does not host this job; clicking Apply redirects to the source.

Job Description

About Us

Preference Model is building automated ML research engineering. Existing frontier models are brittle when applied to real-world ML tasks. The present bottleneck is the lack of high-quality RL training environments. Our first step is to build RL environments that reflect real-world complexity, with diverse tasks and robust reward functions. Our founding team previously worked on Anthropic’s data team, building the data infrastructure and datasets behind Claude. We are partnering with …

Salary Context for Machine Learning Engineer

The US national median for Machine Learning Engineer is $162,000.
