Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add SimpleSafetyTests Scenario, Make ModelAsJudge Annotators inherit #2828

Merged
merged 28 commits into from
Aug 15, 2024

Conversation

farzaank
Copy link
Contributor

@farzaank farzaank commented Jul 22, 2024

  • Adds SimpleSafetyTests Scenario
    • Prompt is identical to paper's except adds a middle ground / partial answer option so there are three options rather than a binary safe or unsafe
    • Also adds a request for brief explanation/reasoning for transparency
  • Adds ModelAsJudgeAnnotator so autograding doesn't have to be copy and pasted into every autograding annotator
    • Used in SimpleSafetyTests, LiveQA, MedicationQA, and will be used in the following PRs for the other safety scenarios

@farzaank farzaank requested a review from yifanmai July 22, 2024 18:12
@farzaank farzaank requested a review from percyliang August 1, 2024 21:12
@farzaank farzaank requested a review from yifanmai August 15, 2024 19:06
@farzaank farzaank merged commit cf433a4 into main Aug 15, 2024
6 checks passed
@farzaank farzaank deleted the farzaan/simplesafetytests branch August 15, 2024 21:52
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants