Computer Vision and Pattern Recognition
PhyGround: Benchmarking Physical Reasoning in ...
Computer Vision and Pattern Recognitionlibrarian
21 views
Image Generators are Generalist Vision Learners
Computer Vision and Pattern RecognitionVision Banana
53 views
MM-WebAgent: A Hierarchical Multimodal Web Age...
Computer Vision and Pattern Recognitionlibrarian
64 views
ActionParty: Multi-Subject Action Binding in G...
Computer Vision and Pattern RecognitionAlexander Pondaven
86 views
No Hard Negatives Required: Concept Centric Le...
Computer Vision and Pattern RecognitionHai Pham*
84 views
Do VLMs Need Vision Transformers? Evaluating S...
Computer Vision and Pattern Recognitionlibrarian
90 views
SAVeS: Steering Safety Judgments in Vision-Lan...
Computer Vision and Pattern Recognitionlibrarian
92 views
DreamPartGen: Semantically Grounded Part-Level...
Computer Vision and Pattern Recognitionlibrarian
95 views
Near-perfect photo-ID of the Hula painted frog...
Computer Vision and Pattern Recognitionyoavram
183 views
Multilayer Graph Approach to Deep Subspace Clu...
Computer Vision and Pattern Recognitionlovro-sindicic
169 views
Label-independent hyperparameter-free self-sup...
Computer Vision and Pattern Recognitionlovro-sindicic
174 views
PersonaLive! Expressive Portrait Image Animati...
Computer Vision and Pattern RecognitionGrisha Samokhin
183 views
Mull-Tokens: Modality-Agnostic Latent Thinking
Computer Vision and Pattern Recognitionlibrarian
197 views
Linear Gaussian Bounding Box Representation an...
Computer Vision and Pattern Recognitionrahulraj Kk
192 views
Point3R: Streaming 3D Reconstruction with Expl...
Computer Vision and Pattern Recognitionlibrarian
495 views
FADRM: Fast and Accurate Data Residual Matchin...
Computer Vision and Pattern Recognitionlibrarian
455 views
HalluSegBench: Counterfactual Visual Reasoning...
Computer Vision and Pattern Recognitionlibrarian
542 views
Whole-Body Conditioned Egocentric Video Prediction
Computer Vision and Pattern Recognitionlibrarian
531 views
Reinforcing Spatial Reasoning in Vision-Langua...
Computer Vision and Pattern Recognitionlibrarian
606 views
Outside Knowledge Conversational Video (OKCV) ...
Computer Vision and Pattern Recognitionlibrarian
484 views
Decoupling the Image Perception and Multimodal...
Computer Vision and Pattern Recognitionlibrarian
631 views
Direct Numerical Layout Generation for 3D Indo...
Computer Vision and Pattern Recognitionlibrarian
657 views
Refer to Anything with Vision-Language Prompts
Computer Vision and Pattern RecognitionShengcao Cao
644 views
Let Androids Dream of Electric Sheep: A Human-...
Computer Vision and Pattern RecognitionAnastasia Kokkanen
731 views
Delving into RL for Image Generation with CoT:...
Computer Vision and Pattern Recognitionlibrarian
597 views
Let Androids Dream of Electric Sheep: A Human-...
Computer Vision and Pattern Recognitionlibrarian
617 views
SpatialScore: Towards Unified Evaluation for M...
Computer Vision and Pattern RecognitionHaoning Wu
685 views
VTBench: Evaluating Visual Tokenizers for Auto...
Computer Vision and Pattern Recognitionlibrarian
670 views
Does Feasibility Matter? Understanding the Imp...
Computer Vision and Pattern Recognitionlibrarian
593 views
MathCoder-VL: Bridging Vision and Code for Enh...
Computer Vision and Pattern Recognitionlibrarian
664 views
StreamBridge: Turning Your Offline Video Large...
Computer Vision and Pattern Recognitionlibrarian
632 views