Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Contributing to OpenCompass #24

Open
bittersweet1999 opened this issue May 8, 2024 · 3 comments
Open

Contributing to OpenCompass #24

bittersweet1999 opened this issue May 8, 2024 · 3 comments

Comments

@bittersweet1999
Copy link

bittersweet1999 commented May 8, 2024

First of all, thank you for your high-quality open-source project. We are very interested in your EQ bench and creative writing bench, as they share many similarities with our subjective evaluations in OpenCompass. I would like to know if you would be willing to integrate these two benches into OpenCompass to enable a more diverse range of evaluations?
Here is the link of OpenCompass: https://github.com/open-compass/opencompass
And here is a demo for subjective evaluation in Opencompass: https://github.com/open-compass/opencompass/blob/main/configs/eval_subjective_alignbench.py

@sam-paech
Copy link
Contributor

Hello! Glad you are liking the benchmarks. I'm more than happy for them to be included in your OpenCompass eval suite. I don't have a lot of free time at the moment to integrate them myself, however I can reassess in ~a month. Otherwise if you want to get started on it, I can answer questions & assist when I have time. :)

@bittersweet1999
Copy link
Author

Hi! I will also try to integrate them when some free time. And after integration, you can make your leaderboard here: https://hub.opencompass.org.cn/home
just like this
image

@sam-paech
Copy link
Contributor

Sounds good!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants