Actions: stanford-crfm/helm

Test

2,456 workflow runs

Fix Qwen2-Audio-Instruct empty outputs (#3474)
Test #8268: Commit ce04810 pushed by teetone
March 26, 2025 07:55 9m 40s main
Fix Qwen2-Audio-Instruct empty outputs
Test #8267: Pull request #3474 synchronize by ImKeTT
March 26, 2025 06:07 10m 3s ImKeTT:fix_qwen2
Fix Qwen2-Audio-Instruct empty outputs
Test #8266: Pull request #3474 synchronize by ImKeTT
March 26, 2025 00:42 10m 16s ImKeTT:fix_qwen2
Fix Qwen2-Audio-Instruct empty outputs
Test #8265: Pull request #3474 synchronize by ImKeTT
March 26, 2025 00:40 4m 14s ImKeTT:fix_qwen2
Fix Qwen2-Audio-Instruct empty outputs
Test #8264: Pull request #3474 opened by ImKeTT
March 25, 2025 21:40 10m 21s ImKeTT:fix_qwen2
Set trust_remote_code for TyDiQA (#3473)
Test #8263: Commit 9fa4cfb pushed by yifanmai
March 25, 2025 21:06 10m 41s main
Set trust_remote_code for TyDiQA
Test #8262: Pull request #3473 synchronize by yifanmai
March 25, 2025 20:41 10m 39s yifanmai/tydiqa-trust-remote-code
feat: add complete implementation of CLEAR dataset (#3466)
Test #8260: Commit 82f9d58 pushed by yifanmai
March 25, 2025 05:32 10m 10s main
feat: add complete implementation of CLEAR dataset
Test #8259: Pull request #3466 synchronize by suhana13
March 25, 2025 05:09 9m 41s medhelm-clear-full
Add ConvFinQACalc (#3453)
Test #8258: Commit 7f44dd3 pushed by yifanmai
March 25, 2025 00:10 9m 54s main
Added the MIMIC-IV-BHC benchmark to MedHelm scenarios (#3459)
Test #8257: Commit d556e18 pushed by yifanmai
March 25, 2025 00:10 9m 27s main
Add MedHELM to documentation for downloading raw results (#3462)
Test #8256: Commit 8c50c99 pushed by yifanmai
March 25, 2025 00:09 10m 4s main
Update requirements.txt (#3472)
Test #8255: Commit b466325 pushed by yifanmai
March 24, 2025 21:52 10m 3s main
Use Anthropic tokenizer from Hugging Face (#3467)
Test #8254: Commit e6fa7eb pushed by yifanmai
March 24, 2025 21:41 9m 49s main
feat: add complete implementation of CLEAR dataset
Test #8253: Pull request #3466 synchronize by suhana13
March 24, 2025 21:29 4m 4s medhelm-clear-full
feat: add complete implementation of CLEAR dataset
Test #8252: Pull request #3466 synchronize by suhana13
March 24, 2025 21:25 4m 8s medhelm-clear-full
Move pytrec_eval to its own optional dependencies section (#3470)
Test #8250: Commit 4ec117f pushed by yifanmai
March 24, 2025 19:14 9m 51s main
Allow using alternate annotator models for AIR-Bench 2024 (#3468)
Test #8249: Commit 16452fa pushed by yifanmai
March 24, 2025 17:41 10m 15s main
Move pytrec_eval to its own optional dependencies section
Test #8248: Pull request #3470 synchronize by yifanmai
March 24, 2025 17:37 10m 26s yifanmai/remove-pytrec