AI Safety & Evaluation Engineer

DeWinter BH · www.dewintergroup.com

Location Campbell, CA
Work type Remote
Salary USD 50–175 / hour
Type Full time (contract)
Level Mid
Source Shazamme
Information Technology · Accepting Candidates
Apply direct

Title: AI Safety and Evaluations Engineer
Job Type: Contract
Contract Length: 12 Months
Pay Range: $50/hr – $175/hr
Start Date: ASAP
Location: Remote

About the Opportunity:

Our client, a leader in AI testing and Generative AI solutions, is looking for a skilled AI Safety and Evaluations Engineer to join their team for a 12-month engagement. This project involves designing and building rigorous evaluation frameworks to measure model bias, hallucinations, and toxicity, ensuring models are safe and compliant before deployment. This is a high-impact role that requires a self-motivated professional who can hit the ground running and deliver results quickly.

Key Responsibilities & Deliverables:

This role is focused on the successful completion of specific tasks and deliverables. Your responsibilities will include:

  • Designing and building rigorous evaluation frameworks to measure model bias, hallucinations, and toxicity.
  • Creating automated "Eval" datasets to benchmark new models before they are promoted to production.
  • Developing metrics for "Grounding" and "Faithfulness" in RAG-based systems.
  • Building monitoring tools that flag harmful or non-compliant AI outputs in real time.
  • Partnering with legal and ethics teams to translate policy into technical safety constraints.

Required Skills & Experience:

We are looking for someone with a proven track record of successful contract engagements. The ideal candidate will have:

  • 3+ years of experience in AI Research or Quality Engineering.
  • Deep expertise in model evaluation techniques and NLP metrics (ROUGE, BLEU, BERTScore). This is not a learning role; you need to be a subject-matter expert.
  • Demonstrated ability to work autonomously and manage your own time effectively to meet project goals.
  • Experience with Python, data analysis tools, and LLM-as-a-Judge frameworks.
  • Strong communication skills to provide clear and concise status updates to the project team.

Frequently asked questions

Who is hiring for the AI Safety & Evaluation Engineer role?
DeWinter BH, a Shazamme client, is hiring for the AI Safety & Evaluation Engineer position. Apply directly on the employer's career site.
Where is the AI Safety & Evaluation Engineer job located?
The AI Safety & Evaluation Engineer role with DeWinter BH is listed in Campbell, CA, US, and is remote.
Is the AI Safety & Evaluation Engineer role remote?
Yes, the AI Safety & Evaluation Engineer position at DeWinter BH is remote. Candidates based in the US are preferred.
What does the AI Safety & Evaluation Engineer role pay?
DeWinter BH lists the AI Safety & Evaluation Engineer role at up to USD 175 per hour.
Is the AI Safety & Evaluation Engineer role full-time or contract?
This is a full-time, 12-month contract position at DeWinter BH.
What experience level is the AI Safety & Evaluation Engineer role?
The AI Safety & Evaluation Engineer position is aimed at mid-level candidates.
How do I apply for the AI Safety & Evaluation Engineer role at DeWinter BH?
Apply directly on DeWinter BH's career page via the Apply button on this listing. ZammeJobs links straight through to the employer's ATS, with no third-party form and no resume database.