Media Summary: Adaptive Spatial-Temporal Window: Unlocking the Potential of Event Cameras in Heterogeneous Velocity Scenarios Zhipeng Sui, ... MixerCSeg: An Efficient Mixer Architecture for Crack Segmentation via Decoupled Mamba Attention. Disentangle-then-Align: Non-Iterative Hybrid Multimodal Image Registration via Cross-Scale Feature Disentanglement.

Cvpr2026 Flashcache - Detailed Analysis & Overview

Adaptive Spatial-Temporal Window: Unlocking the Potential of Event Cameras in Heterogeneous Velocity Scenarios Zhipeng Sui, ... MixerCSeg: An Efficient Mixer Architecture for Crack Segmentation via Decoupled Mamba Attention. Disentangle-then-Align: Non-Iterative Hybrid Multimodal Image Registration via Cross-Scale Feature Disentanglement. In this video, we introduce a novel video object detection framework called D2FANet. D2FANet is the first framework to jointly ... Chengxing Lin, Jinhong Deng, Yinjie Lei, Wen Li. "Deformation-based In-Context Learning for Point Cloud Understanding. [CVPR 2026]SFR-Net: Steering-Fusion-Refining Network in Multi-label Zero-Shot Sewer Defect Detection

PA-Attack: Guiding Gray-Box Attacks on LVLM Vision Encoders with Prototypes and Attention. This video presents GHPT, a novel framework for real-time relightable Gaussian Splatting using hybrid path tracing. Project Page: ... The 5-minute introduction video of IntrinsicWeather. Towards Open-Vocabulary Industrial Defect Understanding with a Large-Scale Multimodal Dataset. TAPE: Task-Adaptive Prototype Evolution in Audio-Language Models for Fully Few-shot Class-incremental Audio Classification. This is the official 5-minute poster presentation video for

FlashDecoder: Real-Time Latent-to-Pixel Streaming Decoder with Transformers. [CVPR 2026] Content-Adaptive Hierarchical Hyperprior for Neural Video Coding This is an introduction video for our work submitted to

Photo Gallery

[CVPR 2026]  Adaptive Spatial-Temporal Window
[CVPR 2026] MixerCSeg
[CVPR 2026]
CVPR 2026 Poster Presentation
[CVPR 2026] 44354_MMCP-GEN_YouTube video
[CVPR 2026] Deformation-based In-Context Learning for Point Cloud Understanding
[CVPR 2026]SFR-Net: Steering-Fusion-Refining Network in Multi-label Zero-Shot Sewer Defect Detection
[CVPR 2026] Diagnose, Correct, and Learn from Manipulation Failures via Visual Symbols - ViFailback
CVPR 2026 PA-Attack
[CVPR 2026] Beyond Perceptual Shortcuts: Causal-Inspired Debiasing Optimization for Video Reasoning
[CVPR 2026] GHPT
[CVPR 2026] IntrinsicWeather: Controllable Weather Editing in Intrinsic Space
View Detailed Profile
[CVPR 2026]  Adaptive Spatial-Temporal Window

[CVPR 2026] Adaptive Spatial-Temporal Window

Adaptive Spatial-Temporal Window: Unlocking the Potential of Event Cameras in Heterogeneous Velocity Scenarios Zhipeng Sui, ...

[CVPR 2026] MixerCSeg

[CVPR 2026] MixerCSeg

MixerCSeg: An Efficient Mixer Architecture for Crack Segmentation via Decoupled Mamba Attention.

[CVPR 2026]

[CVPR 2026]

Disentangle-then-Align: Non-Iterative Hybrid Multimodal Image Registration via Cross-Scale Feature Disentanglement.

CVPR 2026 Poster Presentation

CVPR 2026 Poster Presentation

In this video, we introduce a novel video object detection framework called D2FANet. D2FANet is the first framework to jointly ...

[CVPR 2026] 44354_MMCP-GEN_YouTube video

[CVPR 2026] 44354_MMCP-GEN_YouTube video

[

[CVPR 2026] Deformation-based In-Context Learning for Point Cloud Understanding

[CVPR 2026] Deformation-based In-Context Learning for Point Cloud Understanding

Chengxing Lin, Jinhong Deng, Yinjie Lei, Wen Li. "Deformation-based In-Context Learning for Point Cloud Understanding.

[CVPR 2026]SFR-Net: Steering-Fusion-Refining Network in Multi-label Zero-Shot Sewer Defect Detection

[CVPR 2026]SFR-Net: Steering-Fusion-Refining Network in Multi-label Zero-Shot Sewer Defect Detection

[CVPR 2026]SFR-Net: Steering-Fusion-Refining Network in Multi-label Zero-Shot Sewer Defect Detection

[CVPR 2026] Diagnose, Correct, and Learn from Manipulation Failures via Visual Symbols - ViFailback

[CVPR 2026] Diagnose, Correct, and Learn from Manipulation Failures via Visual Symbols - ViFailback

Official video for the

CVPR 2026 PA-Attack

CVPR 2026 PA-Attack

PA-Attack: Guiding Gray-Box Attacks on LVLM Vision Encoders with Prototypes and Attention.

[CVPR 2026] Beyond Perceptual Shortcuts: Causal-Inspired Debiasing Optimization for Video Reasoning

[CVPR 2026] Beyond Perceptual Shortcuts: Causal-Inspired Debiasing Optimization for Video Reasoning

Video presentation of our

[CVPR 2026] GHPT

[CVPR 2026] GHPT

This video presents GHPT, a novel framework for real-time relightable Gaussian Splatting using hybrid path tracing. Project Page: ...

[CVPR 2026] IntrinsicWeather: Controllable Weather Editing in Intrinsic Space

[CVPR 2026] IntrinsicWeather: Controllable Weather Editing in Intrinsic Space

The 5-minute introduction video of IntrinsicWeather.

[CVPR 2026] Towards Open-Vocabulary Industrial Defect Understanding with a Large-Scale Dataset

[CVPR 2026] Towards Open-Vocabulary Industrial Defect Understanding with a Large-Scale Dataset

Towards Open-Vocabulary Industrial Defect Understanding with a Large-Scale Multimodal Dataset.

CVPR 2026 TAPE

CVPR 2026 TAPE

TAPE: Task-Adaptive Prototype Evolution in Audio-Language Models for Fully Few-shot Class-incremental Audio Classification.

CVPR 2026 Poster: A Denoising-Debiasing Framework for WS-VAD

CVPR 2026 Poster: A Denoising-Debiasing Framework for WS-VAD

This is the official 5-minute poster presentation video for

[CVPR 2026] FlashDecoder: Real-Time Latent-to-Pixel Streaming Decoder with Transformers

[CVPR 2026] FlashDecoder: Real-Time Latent-to-Pixel Streaming Decoder with Transformers

FlashDecoder: Real-Time Latent-to-Pixel Streaming Decoder with Transformers.

[CVPR 2026] Content-Adaptive Hierarchical Hyperprior for Neural Video Coding

[CVPR 2026] Content-Adaptive Hierarchical Hyperprior for Neural Video Coding

[CVPR 2026] Content-Adaptive Hierarchical Hyperprior for Neural Video Coding

CVPR 2026 AdaCluster: Adaptive Query-Key Clustering for Sparse Attention in Video Generation

CVPR 2026 AdaCluster: Adaptive Query-Key Clustering for Sparse Attention in Video Generation

This is an introduction video for our work submitted to

CVPR 2026: Domain-Skewed Federated Learning with Feature Decoupling and Calibration

CVPR 2026: Domain-Skewed Federated Learning with Feature Decoupling and Calibration

This is a talk about

[CVPR 2026] Dynamic Exposure Burst Image Restoration

[CVPR 2026] Dynamic Exposure Burst Image Restoration

[Project website]: https://woo525.github.io/DEBIR/ [Paper]: https://arxiv.org/abs/2603.21784 [Code]: ...