TIGER-Lab's Collections
MoCha
General-Reasoner
VL-Rethinker
Vamba
TheoremExplain
ABC
VisualWebInstruct
PixelWorld
AceCoder
CritiqueFineTuning
MAmmoTH-VL
ScholarCopilot
VISTA
OmniEdit
MEGA-Bench
VLM2Vec
TIGERScore
MAmmoTH
UniIR
ImagenHub
Science
StructLM
ConsistI2V
Mantis
MAmmoTH2
VideoScore
Long-Context

VisualWebInstruct

updated 16 days ago

Scaling up multimodal (MM) data

  • TIGER-Lab/VisualWebInstruct-Recall

    Dataset • Viewer • Updated Mar 16 • 361k rows • 966 downloads • 3 likes

  • TIGER-Lab/VisualWebInstruct-Seed

    Dataset • Viewer • Updated Mar 16 • 60.3k rows • 242 downloads • 16 likes

  • TIGER-Lab/VisualWebInstruct

    Dataset • Viewer • Updated Apr 10 • 1.91M rows • 442 downloads • 34 likes

  • VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search

    Paper • arXiv 2503.10582 • Published Mar 13 • 23 upvotes

  • TIGER-Lab/MAmmoTH-VL2

    Model • Image-Text-to-Text • Updated 14 days ago • 184 downloads • 12 likes

  • MAmmoTH-VL2 🐠

    Space • Running on Zero • 2

    Strong vision-language model trained with VisualWebInstruct
