optimise packaging and calling of benchmark #140
Re installing and running the benchmark, possible workflow:
(Maybe it would be easier to have |
(copying over from discussion on mattermost, so that it is properly archived here) We could kind of swap things around: We have a canonical repository for the benchmark, which holds: a) all the games that are officially in the benchmark, b) a scripts folder, that contains ba) This is what people clone if they want to run the core benchmark. The requirements in that repo contain everything that the individual games need, plus of course If students now want to create their own game, they create their own repo and model it on those repos; that is, they create their own (I don't know what would be the best way to add backends... Either clone the clemgame repo and work there (and install via |
Does anyone have a comment on the last part (would it be possible to define a backend outside of the directory where the other backends live)? @phisad, @Gnurro, @AnneBeyer?
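To make the question concrete, here is a minimal sketch of how backends could in principle be picked up from directories outside the main backends folder, using only the standard library's `importlib`. It is not how the framework currently loads backends: the function names `load_backend_module` and `discover_backends`, and the `*_api.py` file-name pattern, are assumptions made purely for illustration.

```python
import importlib.util
from pathlib import Path
from types import ModuleType


def load_backend_module(path: Path) -> ModuleType:
    """Import a single Python file as a module, wherever it lives on disk."""
    spec = importlib.util.spec_from_file_location(path.stem, path)
    if spec is None or spec.loader is None:
        raise ImportError(f"cannot load backend from {path}")
    module = importlib.util.module_from_spec(spec)
    spec.loader.exec_module(module)
    return module


def discover_backends(search_dirs: list[Path]) -> dict[str, ModuleType]:
    """Collect backend modules from several directories.

    Assumption (hypothetical convention): backend files match '*_api.py'.
    Directories are scanned in order, so a later directory overrides an
    earlier one when both contain a file with the same name.
    """
    found: dict[str, ModuleType] = {}
    for directory in search_dirs:
        for file in sorted(directory.glob("*_api.py")):
            found[file.stem] = load_backend_module(file)
    return found
```

With something like this, a student's game repo could ship its own backend file and pass its folder as an extra entry in `search_dirs`, without touching the canonical repository.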
(Actually, it looks like it wouldn't be a big problem to also install scripts (which |
I assigned this to myself, so I will think about it when I have finished writing (hopefully in Feb.).
To enable inclusion of clembench into standard benchmark runs made by others (e.g., in Playpen evaluations, but also for people training a new model and wanting to evaluate it), it would be good if our packaging could still be improved:
We're not far away from this, but my understanding is that running the benchmark at the moment still requires a bit of babysitting.