Have certain large-scale models been fine-tuned specifically for the current test set? #566

Alwaysoffline · 2023-11-10T06:52:01Z

Alwaysoffline
Nov 10, 2023

How to address the issue of certain large models being fine-tuned specifically for existing evaluation sets in order to achieve high scores?

tonysy · 2023-11-14T08:32:07Z

Utilize a data containment detector to verify its authenticity.

0 replies