Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Which mmrpoj file for Beepo 32b? (or which one for Qwen 32b?) #1326

Open
illtellyoulater opened this issue Jan 20, 2025 · 2 comments
Open

Which mmrpoj file for Beepo 32b? (or which one for Qwen 32b?) #1326

illtellyoulater opened this issue Jan 20, 2025 · 2 comments

Comments

@illtellyoulater
Copy link

illtellyoulater commented Jan 20, 2025

A couple of questions:

  • README recommends Beepo-22b as the largest and most powerful LLM. Is there an mmproj that can be used with it? Or are there any plans to release one?

  • yi-34b is the biggest LLM supporting vision through mmproj. However Qwen 32b is the highest scoring 32b LLM (and the only one capable of even surpassing a few 72b models). So is there an mmproj that could be used with it? Or do you plan releasing one?

Thanks!

@LostRuins
Copy link
Owner

Unfortunately while there is Pixtral, nobody has bothered to add llama.cpp support for it yet. So for now your best options will be Qwen2VL or MiniCPM.

@illtellyoulater
Copy link
Author

illtellyoulater commented Jan 20, 2025

I’m sorry, but I’m not sure which part of your answer applies to each of my questions.
Could you clarify, please?
Thanks!

EDIT - making my way through it...

  • Beepo-22b and mmproj: as of now, there’s no mmproj available or compatible with Beepo-22b.
  • Qwen-32b and mmproj: while Pixtral is a multi-modal framework, it currently lacks llama.cpp support. This makes using Qwen-32b with mmproj infeasible at the moment.

So, the best available paths for now are:

  • MiniCPM (SOTA vision model paired with a small LLM)
  • Qwen2VL (good vision model paired with a significantly larger LLM)
  • yi-34b (the only ~30b multi-modal solution for now; not sure how good the vision is for this one, but perhaps the best compromise, overall, for now)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants