feat: add complete implementation of CLEAR dataset #3466

suhana13 · 2025-03-23T22:38:47Z

Added the full implementation of the CLEAR Dataset (https://www.nature.com/articles/s41746-024-01377-1). Earlier, we had just implemented it for checking "alcohol dependence". Now, it integrates all 13 conditions from the source paper (including bipolar, chronic pain etc.)

@MiguelAFH, @aunell, @HennyJie (tagging just as FYI)

@yifanmai - let me know if any additional changes are needed!

yifanmai

Looks good, thanks!

One thing to note is that this will send 1000 requests per condition for a total of 12k requests. If you want to reduce the total number of requests, you can set a lower number of requests on the run entry using the max_eval_instances=100 run expander.

yifanmai · 2025-03-24T19:18:00Z

src/helm/benchmark/scenarios/clear_scenario.py

+
+        self.condition = condition
+        self.name = f"clear_{condition}"
+        self.description = f"A dataset for evaluating {self.CONDITION_PROMPTS[condition]} detection from patient notes with yes/no/maybe classifications."


Add # noqa: E501 at the end of this line to make the linter happy.

Thank you! Fixed!

yifanmai · 2025-03-24T22:59:00Z

You need two spaces before the #:

-        self.description = f"A dataset for evaluating {self.CONDITION_PROMPTS[condition]} detection from patient notes with yes/no/maybe classifications." # noqa: E501
+        self.description = f"A dataset for evaluating {self.CONDITION_PROMPTS[condition]} detection from patient notes with yes/no/maybe classifications."  # noqa: E501

yifanmai · 2025-03-24T22:59:38Z

src/helm/benchmark/run_specs/medhelm_run_specs.py

        ),
        input_noun=None,
        output_noun="Respond only with 'A', 'B', or 'C'. Do not add any other text, punctuation, or symbols",
        max_train_instances=0,
+        max_eval_instances=100,


Do this in the run entry instead of the run expander.

yifanmai · 2025-03-24T23:00:04Z

src/helm/benchmark/presentation/run_entries_medhelm.conf

-  {description: "clear:model=qwen/qwen2.5-7b-instruct,model_deployment=huggingface/qwen2.5-7b-instruct-4bit", priority: 1},
-  {description: "clear:model=microsoft/phi-3.5-mini-instruct,model_deployment=huggingface/phi-3.5-mini-instruct-4bit", priority: 1},
+  #Alcohol Dependence
+  {description: "clear:condition=alcohol_dependence,model=google/gemini-1.5-pro-001,model_deployment=stanfordhealthcare/gemini-1.5-pro-001", priority: 1},


add max_eval_instances=100, here and below. (if desired)

feat: add complete implementation of CLEAR dataset

0902892

yifanmai approved these changes Mar 24, 2025

View reviewed changes

suhana13 added 2 commits March 24, 2025 21:25

style: linter issue

6187fc5

feat: add max eval instances

4df553a

yifanmai reviewed Mar 24, 2025

View reviewed changes

fix: linting and eval instances

73481cd

yifanmai merged commit 82f9d58 into main Mar 25, 2025
8 checks passed

yifanmai deleted the medhelm-clear-full branch March 25, 2025 05:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add complete implementation of CLEAR dataset #3466

feat: add complete implementation of CLEAR dataset #3466

suhana13 commented Mar 23, 2025

yifanmai left a comment

yifanmai Mar 24, 2025

suhana13 Mar 24, 2025

yifanmai commented Mar 24, 2025

yifanmai Mar 24, 2025

suhana13 Mar 25, 2025

yifanmai Mar 24, 2025

suhana13 Mar 25, 2025

feat: add complete implementation of CLEAR dataset #3466

feat: add complete implementation of CLEAR dataset #3466

Conversation

suhana13 commented Mar 23, 2025

yifanmai left a comment

Choose a reason for hiding this comment

yifanmai Mar 24, 2025

Choose a reason for hiding this comment

suhana13 Mar 24, 2025

Choose a reason for hiding this comment

yifanmai commented Mar 24, 2025

yifanmai Mar 24, 2025

Choose a reason for hiding this comment

suhana13 Mar 25, 2025

Choose a reason for hiding this comment

yifanmai Mar 24, 2025

Choose a reason for hiding this comment

suhana13 Mar 25, 2025

Choose a reason for hiding this comment