Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
gary109 's Collections
DeepSeek
video segmentation
Generation 3D
Text-to-Audio
LLM
Prompting
Text-to-Image
Representations
Transformers
Robot
Vision Transformers
Diffusion Model
text-to-3D
Text-to-Video
ML
RLHF
Video 優化
Image Completion
Others
multimodal
Auto
Vision-Language
Application
Optimization
Cost
Semantic Segmentation
Video Generation
Code Generation
ASR
Generative
Whisper
AGI
Funny
music
SVC
Datasets
yolo
Watermarking
生成式AI導論 2024
Text-to-Embedding
RAG
image-to-3D
Music Captions
OCR
Audio

video segmentation

updated Aug 19, 2024
Upvote
-

  • Tracking Anything with Decoupled Video Segmentation

    Paper • 2309.03903 • Published Sep 7, 2023 • 28

  • ProPainter: Improving Propagation and Transformer for Video Inpainting

    Paper • 2309.03897 • Published Sep 7, 2023 • 27

  • UniRef++: Segment Every Reference Object in Spatial and Temporal Spaces

    Paper • 2312.15715 • Published Dec 25, 2023 • 21

  • SAM 2: Segment Anything in Images and Videos

    Paper • 2408.00714 • Published Aug 1, 2024 • 115

  • Medical SAM 2: Segment medical images as video via Segment Anything Model 2

    Paper • 2408.00874 • Published Aug 1, 2024 • 52

  • Surgical SAM 2: Real-time Segment Anything in Surgical Video by Efficient Frame Pruning

    Paper • 2408.07931 • Published Aug 15, 2024 • 22
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs