AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding Paper • 2502.01341 • Published Feb 3, 2025
ParGo: Bridging Vision-Language with Partial and Global Views Paper • 2408.12928 • Published Aug 23, 2024