Media Summary: Are diffusion policies in robot learning too brittle for the real world? In this video, we introduce REACH (Recovery through ... CausalLens: Sensitivity-Guided Multi-Head Causal Intervention for Hallucination Mitigation in Large Vision-Language Models. Presentation video of the paper Unsafe2Safe: Controllable Image Anonymization for Downstream Utility. Authors: Minh T. Dinh, ...
Cvpr 2026 When Safety Collides - Detailed Analysis & Overview
Are diffusion policies in robot learning too brittle for the real world? In this video, we introduce REACH (Recovery through ... CausalLens: Sensitivity-Guided Multi-Head Causal Intervention for Hallucination Mitigation in Large Vision-Language Models. Presentation video of the paper Unsafe2Safe: Controllable Image Anonymization for Downstream Utility. Authors: Minh T. Dinh, ... Adapting In-context Generation for Enhanced Composed Image Retrieval. Significant advancements made in reconstructing hands from images have delivered accurate single-frame estimates, yet they ... Learning to Drive is a Free Gift: Large-Scale Label-Free Autonomy Pretraining from Unposed In-The-Wild Videos.
TIGeR: A Unified Framework for Time, Images and Geolocation Retrieval This 5-minute presentation gives a high-level overview ... Disentangle-then-Align: Non-Iterative Hybrid Multimodal Image Registration via Cross-Scale Feature Disentanglement. GeoRelight: Learning Joint Geometrical Relighting and Reconstruction with Flexible Multi-Modal Diffusion Transformers Y. Xue, ...