Skip to content

Actions: stanford-crfm/helm

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
3,806 workflow runs
3,806 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

feat: add complete implementation of CLEAR dataset
Test #8243: Pull request #3466 opened by suhana13
March 23, 2025 22:38 7m 20s medhelm-clear-full
March 23, 2025 22:38 7m 20s
3323 adaptive evaluation
Test #8242: Pull request #3397 synchronize by yuhengtu
March 23, 2025 15:46 Action required sangttruong:3323-adaptive_evaluation
March 23, 2025 15:46 Action required
Scenario tests
Scenario tests #318: Scheduled
March 23, 2025 15:34 9m 22s main
March 23, 2025 15:34 9m 22s
Scenario tests
Scenario tests #317: Scheduled
March 22, 2025 15:34 10m 53s main
March 22, 2025 15:34 10m 53s
Added the MIMIC-IV-BHC benchmark to MedHelm scenarios
Test #8240: Pull request #3459 synchronize by asad-aali
March 22, 2025 06:48 Action required asad-aali:main
March 22, 2025 06:48 Action required
Added the MIMIC-IV-BHC benchmark to MedHelm scenarios
Test #8239: Pull request #3459 synchronize by asad-aali
March 22, 2025 06:27 Action required asad-aali:main
March 22, 2025 06:27 Action required
Add bert-score to dependencies (#3463)
Update requirements.txt #98: Commit 46dacf0 pushed by yifanmai
March 22, 2025 05:26 9m 35s main
March 22, 2025 05:26 9m 35s
Add bert-score to dependencies (#3463)
Test #8238: Commit 46dacf0 pushed by yifanmai
March 22, 2025 05:26 10m 2s main
March 22, 2025 05:26 10m 2s
Add metadata for MMLU-Pro (#3458)
Test #8237: Commit f0f6a0a pushed by yifanmai
March 22, 2025 04:35 10m 45s main
March 22, 2025 04:35 10m 45s
Add bert-score to dependencies
Test #8236: Pull request #3463 opened by yifanmai
March 22, 2025 04:33 10m 47s yifanmai/bert-score-dep
March 22, 2025 04:33 10m 47s
Added the MIMIC-IV-BHC benchmark to MedHelm scenarios
Test #8234: Pull request #3459 synchronize by asad-aali
March 21, 2025 23:10 10m 23s asad-aali:main
March 21, 2025 23:10 10m 23s
Add proxy static files to manifest (#3461)
Test #8233: Commit 8da5466 pushed by yifanmai
March 21, 2025 23:07 10m 58s main
March 21, 2025 23:07 10m 58s
Add proxy static files to manifest
Test #8232: Pull request #3461 opened by yifanmai
March 21, 2025 23:06 11m 13s yifanmai/proxy-manifest
March 21, 2025 23:06 11m 13s
March 21, 2025 21:03 9m 46s
3323 adaptive evaluation
Test #8229: Pull request #3397 synchronize by yuhengtu
March 21, 2025 17:28 Action required sangttruong:3323-adaptive_evaluation
March 21, 2025 17:28 Action required
Scenario tests
Scenario tests #316: Scheduled
March 21, 2025 15:35 10m 2s main
March 21, 2025 15:35 10m 2s
3323 adaptive evaluation
Test #8228: Pull request #3397 synchronize by yuhengtu
March 21, 2025 08:49 Action required sangttruong:3323-adaptive_evaluation
March 21, 2025 08:49 Action required
Added the MIMIC-IV-BHC benchmark to MedHelm scenarios
Test #8227: Pull request #3459 opened by asad-aali
March 21, 2025 02:17 4m 14s asad-aali:main
March 21, 2025 02:17 4m 14s
Add metadata for MMLU-Pro
Test #8226: Pull request #3458 opened by liamjxu
March 21, 2025 00:40 10m 15s jialiang/add_mmlu_pro_metadata
March 21, 2025 00:40 10m 15s
Update stale URL https://crfm.stanford.edu/helm/latest/ (#3457)
Test #8225: Commit d14768d pushed by yifanmai
March 20, 2025 21:43 9m 40s main
March 20, 2025 21:43 9m 40s