Hugging Face
Po Hsiang Yu

EasyMoneySniper66

AI & ML interests

None yet

Organizations

None yet

Collections 4

Multi-modality LVM
  • VoCo-LLaMA: Towards Vision Compression with Large Language Models

    Paper • 2406.12275 • Published Jun 18, 2024 • 32
  • TroL: Traversal of Layers for Large Language and Vision Models

    Paper • 2406.12246 • Published Jun 18, 2024 • 36
  • Multimodal Task Vectors Enable Many-Shot Multimodal In-Context Learning

    Paper • 2406.15334 • Published Jun 21, 2024 • 9
  • Benchmarking Multi-Image Understanding in Vision and Language Models: Perception, Knowledge, Reasoning, and Multi-Hop Reasoning

    Paper • 2406.12742 • Published Jun 18, 2024 • 15
Multi-modality LVM Datasets
  • MMDU: A Multi-Turn Multi-Image Dialog Understanding Benchmark and Instruction-Tuning Dataset for LVLMs

    Paper • 2406.11833 • Published Jun 17, 2024 • 64
  • Multimodal Needle in a Haystack: Benchmarking Long-Context Capability of Multimodal Large Language Models

    Paper • 2406.11230 • Published Jun 17, 2024 • 35
  • Two Giraffes in a Dirt Field: Using Game Play to Investigate Situation Modelling in Large Multimodal Models

    Paper • 2406.14035 • Published Jun 20, 2024 • 13
  • Needle In A Multimodal Haystack

    Paper • 2406.07230 • Published Jun 11, 2024 • 55

Models 0

None public yet

Datasets 0

None public yet