feat(vr): setup daily run for VR
What does this merge request do and why?
This MR adds daily run on LLM judge for vulnerability resolve.
The results will be written to a new BQ dataset: dev-ai-research-0e2f8974.vulnerability_resolve_daily_runs
This MR also update example configs to use the latest judge and dataset.
Ref: #437 (closed)
How to set up and validate locally
Numbered steps to set up and validate the change are strongly suggested.
Merge request checklist
-
I've ran the affected pipeline(s) to validate that nothing is broken. -
Tests added for new functionality. If not, please raise an issue to follow up. -
Documentation added/updated, if needed.
Edited by Hongtao Yang