-
feat: add script for latency summaries in code suggestions evals 0 of 2 checklist items completed
-
Add resolved and completed SWE evaluators 0 of 2 checklist items completed
- Merged
- 13
- Approved
updated -
feat: add SWE evaluation to Duo Workflow 0 of 2 checklist items completed
- Merged
- 6
- Approved
updated -
feat: update evaluators to Claude 3.5 Sonnet 1 of 2 checklist items completed
- Merged
- 3
- Approved
updated -
feat: add fireworks inference provider support 0 of 2 checklist items completed
- Merged
- 3
- Approved
updated -
feat: add codesuggestions client for litellm 0 of 2 checklist items completed
- Merged
- 8
- Approved
updated -
Draft: Add job for code suggestions test 0 of 2 checklist items completed
-
feat: add cerebras client support 0 of 2 checklist items completed
- Merged
- 6
- Approved
updated -
feat: add gemini client 0 of 2 checklist items completed
- Merged
- 6
- Approved
updated -
feat: add openai as cs eval client 0 of 2 checklist items completed
- Merged
- 2
- Approved
updated -
feat: add groq provider for CS 0 of 2 checklist items completed
- Merged
- 7
- Approved
updated -
feat: update code-suggestions evaluation scripts 0 of 2 checklist items completed
- Merged
- 4
- Approved
updated -
Draft: Codestral latency test scripts 0 of 2 checklist items completed
-
Draft: Add production AIGW client for code-suggesetions 0 of 2 checklist items completed!105
-
Draft: add latency aggregation script 0 of 2 checklist items completed
-
Add support for eval with additional context 0 of 2 checklist items completed
- Merged
- 16
- Approved
updated -
feat: support model_provider argument in code suggestions evaluation 0 of 2 checklist items completed
- Merged
- 7
updated -
fix: wrap vertexai gcloud calls within functions 0 of 2 checklist items completed
-
chore: update code suggestions evaluation guide 0 of 2 checklist items completed
- Merged
- 16
updated -
docs: add guide to testing x-ray rag with code gen 1 of 2 checklist items completed
- Merged
- 9
- Approved
updated