Have certain large-scale models been fine-tuned specifically for the current test set? #566
Unanswered
Alwaysoffline
asked this question in
Q&A
Replies: 1 comment
-
Use a data contamination detector to check whether a model's scores are genuine or the result of exposure to the test set.
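A minimal sketch of one such check, assuming you have access to some of the text the model may have been trained on (often not the case for closed models). It flags test items whose word n-grams largely reappear in that text. The function names, the n-gram size, and the 0.5 threshold are illustrative assumptions, not part of any particular tool:

```python
import re


def ngrams(text, n):
    """Return the set of lowercase word n-grams in `text`."""
    tokens = re.findall(r"\w+", text.lower())
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}


def contamination_rate(test_items, corpus_docs, n=8, threshold=0.5):
    """Flag test items whose n-grams mostly appear in the corpus.

    Hypothetical helper: returns (fraction of items flagged, flagged items).
    Items shorter than `n` tokens have no n-grams and are never flagged,
    but they still count in the denominator.
    """
    corpus_ngrams = set()
    for doc in corpus_docs:
        corpus_ngrams |= ngrams(doc, n)

    flagged = []
    for item in test_items:
        item_ngrams = ngrams(item, n)
        if not item_ngrams:
            continue
        overlap = len(item_ngrams & corpus_ngrams) / len(item_ngrams)
        if overlap >= threshold:
            flagged.append(item)

    rate = len(flagged) / len(test_items) if test_items else 0.0
    return rate, flagged
```

Exact n-gram overlap only catches verbatim or near-verbatim leakage; paraphrased or translated test items would need a different signal, so a low rate here does not prove a model is clean.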
-
How can we address the problem of some large models being fine-tuned specifically on existing evaluation sets in order to achieve high scores?