feat(vr): setup daily run for VR (!792) · Merge requests · GitLab.org / ModelOps / AI Model Validation and Research / AI Evaluation / Prompt Library

Hongtao Yang requested to merge vr-daily-run into main Oct 08, 2024

What does this merge request do and why?

This MR adds a daily run of the LLM judge for vulnerability resolve (VR).

The results will be written to a new BigQuery (BQ) dataset: dev-ai-research-0e2f8974.vulnerability_resolve_daily_runs
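
For reviewers who want to spot-check the output, here is a minimal sketch of building a query against the new dataset. The table name `results` and the helper `build_latest_rows_query` are hypothetical (this MR description does not name the tables), so substitute whatever table the daily run actually writes.

```python
# Dataset ID taken from this MR description.
DATASET_ID = "dev-ai-research-0e2f8974.vulnerability_resolve_daily_runs"


def build_latest_rows_query(dataset_id: str, table: str, limit: int = 10) -> str:
    """Return a BigQuery standard SQL query that samples rows from a table.

    `table` is an assumption supplied by the caller; this MR does not
    state which tables the daily run creates.
    """
    if limit <= 0:
        raise ValueError("limit must be a positive integer")
    # Split "project.dataset" into its two parts.
    project, dataset = dataset_id.split(".", 1)
    # Backticks are required because the project ID contains hyphens.
    return f"SELECT * FROM `{project}.{dataset}.{table}` LIMIT {limit}"


if __name__ == "__main__":
    # "results" is a placeholder table name, not confirmed by this MR.
    print(build_latest_rows_query(DATASET_ID, "results", limit=5))
```

The printed query can be pasted into the BigQuery console or passed to any BigQuery client once the real table name is known.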

This MR also updates the example configs to use the latest judge and dataset.


Edited Oct 09, 2024 by Hongtao Yang