I worry about language models being trained on test sets. Recently, we emailed [email protected] to opt out of having our (test) data used to improve models. This isn't enough though: others running evals could still inadvertently contribute those test sets to training.
A better solution would be to have all the LM providers agree on a common repository of examples that should be excluded from any training run.
But this might not be enough either: if we want to measure cross-task generalization, we have to ensure that no examples of a task/domain are represented in the training data. This is essentially impossible.
@percyliang Maybe some variations on robots tags? E.g., <meta name="robots" content="notraining">?
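A minimal sketch of how a training crawler might honor such a hypothetical `notraining` robots directive (the tag name and semantics here are assumptions, not an existing standard):

```python
from html.parser import HTMLParser

class NoTrainingTagParser(HTMLParser):
    """Looks for a hypothetical <meta name="robots" content="notraining"> tag."""
    def __init__(self):
        super().__init__()
        self.no_training = False

    def handle_starttag(self, tag, attrs):
        if tag == "meta":
            d = dict(attrs)
            if d.get("name", "").lower() == "robots":
                # robots content is a comma-separated list of directives
                directives = [c.strip().lower() for c in d.get("content", "").split(",")]
                if "notraining" in directives:
                    self.no_training = True

def allowed_for_training(html: str) -> bool:
    """Return False if the page opts out of training via the hypothetical tag."""
    parser = NoTrainingTagParser()
    parser.feed(html)
    return not parser.no_training

opted_out = '<html><head><meta name="robots" content="noindex, notraining"></head></html>'
plain = '<html><head><title>hello</title></head></html>'
print(allowed_for_training(opted_out))  # False
print(allowed_for_training(plain))      # True
```

Like `noindex`, this would only work if providers agreed to respect it, and it wouldn't protect copies of the test set hosted by third parties.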
@percyliang Besides "The Dark Side of Language" observing the problem, we do have another, simpler solution... currently under peer review! Hopefully we can discuss it publicly soon. Happy to describe it if you're interested.
@percyliang Isn't a license enough for that? I wonder if there are @creativecommons licenses that can prevent that... or if any other @OpenSourceOrg license does... or has the law simply been outpaced by the current speed of crawling/gathering data?