Add Qwen2VL's ViT #787

parth1313 · 2025-02-11T11:49:58Z

Hi everyone,

I want to integrate the ViT from the Qwen2-VL model into the module. According to the documentation, it states:
"The LAVIS library includes a standard model module that builds the foundation for many major language-vision models such as ALBEF, BLIP, ALPRO, and CLIP."

Could anyone guide me on how to add this custom model (ViT of Qwen2-VL) to the module?
Do I need to implement the entire architecture and other components?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Qwen2VL's ViT #787

Add Qwen2VL's ViT #787

parth1313 commented Feb 11, 2025 •

edited

Loading

Add Qwen2VL's ViT #787

Add Qwen2VL's ViT #787

Comments

parth1313 commented Feb 11, 2025 • edited Loading

parth1313 commented Feb 11, 2025 •

edited

Loading