Edit Away and My Face Will not Stay: Personal Biometric Defense against Malicious Generative Editing Paper • 2411.16832 • Published Nov 25, 2024 • 2
Bigger is not Always Better: Scaling Properties of Latent Diffusion Models Paper • 2404.01367 • Published Apr 1, 2024 • 23
Unraveling Cross-Modality Knowledge Conflict in Large Vision-Language Models Paper • 2410.03659 • Published Oct 4, 2024 • 6
AIS 2024 Challenge on Video Quality Assessment of User-Generated Content: Methods and Results Paper • 2404.16205 • Published Apr 24, 2024
AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous Driving Paper • 2412.15206 • Published Dec 19, 2024
Re-Align: Aligning Vision Language Models via Retrieval-Augmented Direct Preference Optimization Paper • 2502.13146 • Published Feb 18 • 1
On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective Paper • 2502.14296 • Published Feb 20 • 46
OpenEMMA: Open-Source Multimodal Model for End-to-End Autonomous Driving Paper • 2412.15208 • Published Dec 19, 2024
UniOcc: A Unified Benchmark for Occupancy Forecasting and Prediction in Autonomous Driving Paper • 2503.24381 • Published Mar 31 • 1
NTIRE 2025 Challenge on UGC Video Enhancement: Methods and Results Paper • 2505.03007 • Published about 1 month ago
The Tenth NTIRE 2025 Efficient Super-Resolution Challenge Report Paper • 2504.10686 • Published Apr 14
DisCO: Reinforcing Large Reasoning Models with Discriminative Constrained Optimization Paper • 2505.12366 • Published 18 days ago
VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction Paper • 2505.20279 • Published 10 days ago • 4
Generative AI for Autonomous Driving: Frontiers and Opportunities Paper • 2505.08854 • Published 23 days ago • 1