- Direct-a-Video: Customized Video Generation with User-Directed Camera Movement and Object Motion
  Paper • 2402.03162 • Published • 19
- ShortGPT: Layers in Large Language Models are More Redundant Than You Expect
  Paper • 2403.03853 • Published • 65
- OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation
  Paper • 2407.02371 • Published • 55
- Large Language Diffusion Models
  Paper • 2502.09992 • Published • 113
Pengxiang Li
pengxiang
AI & ML interests
Video generation, Image editing, AD
Recent Activity
published a model about 11 hours ago: pengxiang/Qwen2.5-1.5B-Open-R1-GRPO
updated a dataset about 16 hours ago: pengxiang/coins_new
authored a paper 1 day ago: InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners
Organizations
None yet
Collections
1
Models 8

pengxiang/Qwen2.5-1.5B-Open-R1-GRPO • Updated
pengxiang/LNS_1B • Updated • 6 • 1
pengxiang/TrackDiffusion_SVD_Stage2 • Text-to-Video • Updated
pengxiang/TrackDiffusion_SVD_Stage1 • Text-to-Video • Updated
pengxiang/TrackDiffusion_Pretrain • Updated • 1 • 1
pengxiang/GLIGEN_1_4 • Updated • 1
pengxiang/TrackDiffusion_ModelScope • Text-to-Video • Updated
pengxiang/trackdiffusion_ytvis • Text-to-Video • Updated • 2
Datasets 16
pengxiang/coins_new • Viewer • Updated • 4.91k • 335
pengxiang/COIN • Viewer • Updated • 528 • 5
pengxiang/tvqa • Preview • Updated • 80
pengxiang/COINs • Viewer • Updated • 1.59k • 573
pengxiang/sthv2 • Updated • 25
pengxiang/youcook2 • Updated • 88
pengxiang/UVO • Viewer • Updated • 799 • 115
pengxiang/youcook • Viewer • Updated • 407 • 95
pengxiang/clevrer • Viewer • Updated • 10k • 37
pengxiang/oops • Updated • 39