Add Qwen2VL's ViT #787

Open
parth1313 opened this issue Feb 11, 2025 · 0 comments
parth1313 commented Feb 11, 2025

Hi everyone,

I would like to integrate the vision transformer (ViT) from the Qwen2-VL model into the LAVIS model module. The documentation states:
"The LAVIS library includes a standard model module that builds the foundation for many major language-vision models such as ALBEF, BLIP, ALPRO, and CLIP."

Could anyone guide me on how to add this custom model (the Qwen2-VL ViT) to the model module?
Do I need to implement the entire architecture and its other components myself?
