Vincentqyw/cv-arxiv-daily

Updated on 2025.12.06

Usage instructions: here

Table of Contents
  1. SLAM
  2. SFM
  3. Visual Localization
  4. Keypoint Detection
  5. Image Matching
  6. NeRF

SLAM

Publish Date Title Authors PDF Code
2025-12-04 TEMPO-VINE: A Multi-Temporal Sensor Fusion Dataset for Localization and Mapping in Vineyards Mauro Martini et.al. 2512.04772 null
2025-12-03 What Is The Best 3D Scene Representation for Robotics? From Geometric to Foundation Models Tianchen Deng et.al. 2512.03422 null
2025-12-02 VIGS-SLAM: Visual Inertial Gaussian Splatting SLAM Zihan Zhu et.al. 2512.02293 null
2025-12-01 KM-ViPE: Online Tightly Coupled Vision-Language-Geometry Fusion for Open-Vocabulary Semantic SLAM Zaid Nasser et.al. 2512.01889 null
2025-12-01 Register Any Point: Scaling 3D Point Cloud Registration by Flow Matching Yue Pan et.al. 2512.01850 null
2025-12-01 AgriLiRa4D: A Multi-Sensor UAV Dataset for Robust SLAM in Challenging Agricultural Fields Zhihao Zhan et.al. 2512.01753 null
2025-12-01 EGG-Fusion: Efficient 3D Reconstruction with Geometry-aware Gaussian Surfel on the Fly Xiaokun Pan et.al. 2512.01296 null
2025-11-30 Integration of UWB Radar on Mobile Robots for Continuous Obstacle and Environment Mapping Adelina Giurea et.al. 2512.01018 null
2025-11-30 EAG3R: Event-Augmented 3D Geometry Estimation for Dynamic and Extreme-Lighting Scenes Xiaoshan Wu et.al. 2512.00771 null
2025-11-29 Odometry Without Correspondence from Inertially Constrained Ruled Surfaces Chenqi Zhu et.al. 2512.00327 null
2025-11-26 Dual-Agent Reinforcement Learning for Adaptive and Cost-Aware Visual-Inertial Odometry Feiyang Pan et.al. 2511.21083 null
2025-11-25 Estimating Fog Parameters from a Sequence of Stereo Images Yining Ding et.al. 2511.20865 null
2025-11-25 The origin of B-type runaway stars based on kinematics Yanjun Guo et.al. 2511.20566 null
2025-11-25 Metric, inertially aligned monocular state estimation via kinetodynamic priors Jiaxin Liu et.al. 2511.20496 null
2025-11-25 AMB3R: Accurate Feed-forward Metric-scale 3D Reconstruction with Backend Hengyi Wang et.al. 2511.20343 null
2025-11-25 Stellar Parameters of BOSS M dwarfs in SDSS-V DR19 Dan Qiu et.al. 2511.20005 null
2025-11-26 Multi-Agent Monocular Dense SLAM With 3D Reconstruction Priors Yuchen Zhou et.al. 2511.19031 null
2025-11-24 AutoOdom: Learning Auto-regressive Proprioceptive Odometry for Legged Locomotion Changsheng Luo et.al. 2511.18857 null
2025-11-24 SP-VINS: A Hybrid Stereo Visual Inertial Navigation System based on Implicit Environmental Map Xueyu Du et.al. 2511.18756 null
2025-11-24 Splatonic: Architecture Support for 3D Gaussian Splatting SLAM via Sparse Processing Xiaotong Huang et.al. 2511.18755 null
2025-11-24 Stable Multi-Drone GNSS Tracking System for Marine Robots Shuo Wen et.al. 2511.18694 null
2025-11-23 Gaze Beyond the Frame: Forecasting Egocentric 3D Visual Span Heeseung Yun et.al. 2511.18470 null
2025-11-22 Unobservable Subspace Evolution and Alignment for Consistent Visual-Inertial Navigation Chungeng Tian et.al. 2511.17992 null
2025-11-21 Target-Bench: Can World Models Achieve Mapless Path Planning with Semantic Targets? Dingrui Wang et.al. 2511.17792 null
2025-11-21 IndustryNav: Exploring Spatial Reasoning of Embodied Agents in Dynamic Industrial Navigation Yifan Li et.al. 2511.17384 null
2025-11-21 MonoSpheres: Large-Scale Monocular SLAM-Based UAV Exploration through Perception-Coupled Mapping and Planning Tomáš Musil et.al. 2511.17299 null
2025-11-21 SING3R-SLAM: Submap-based Indoor Monocular Gaussian SLAM with 3D Reconstruction Priors Kunyi Li et.al. 2511.17207 null
2025-11-20 CRISTAL: Real-time Camera Registration in Static LiDAR Scans using Neural Rendering Joni Vanherck et.al. 2511.16349 null
2025-11-20 Building temporally coherent 3D maps with VGGT for memory-efficient Semantic SLAM Gergely Dinya et.al. 2511.16282 null
2025-11-20 LEGO-SLAM: Language-Embedded Gaussian Optimization SLAM Sibaek Lee et.al. 2511.16144 null
2025-11-20 Rad-GS: Radar-Vision Integration for 3D Gaussian Splatting SLAM in Outdoor Environments Renxiang Xiao et.al. 2511.16091 null
2025-11-20 Semantic Glitch: Agency and Artistry in an Autonomous Pixel Cloud Qing Zhang et.al. 2511.16048 null
2025-11-11 Real-time Point Cloud Data Transmission via L4S for 5G-Edge-Assisted Robotics Gerasimos Damigos et.al. 2511.15677 null
2025-11-19 Learning from Mistakes: Loss-Aware Memory Enhanced Continual Learning for LiDAR Place Recognition Xufei Wang et.al. 2511.15597 null
2025-11-18 A visual study of ICP variants for Lidar Odometry Sebastian Dingler et.al. 2511.14919 null
2025-11-18 SLAM-AGS: Slide-Label Aware Multi-Task Pretraining Using Adaptive Gradient Surgery in Computational Cytology Marco Acerbis et.al. 2511.14639 null
2025-11-23 Simultaneous Localization and 3D-Semi Dense Mapping for Micro Drones Using Monocular Camera and Inertial Sensors Jeryes Danial et.al. 2511.14335 null
2025-11-18 MA-SLAM: Active SLAM in Large-Scale Unknown Environment using Map Aware Deep Reinforcement Learning Yizhen Yin et.al. 2511.14330 null
2025-11-18 iGaussian: Real-Time Camera Pose Estimation via Feed-Forward 3D Gaussian Splatting Inversion Hao Wang et.al. 2511.14149 null
2025-11-17 GaRLILEO: Gravity-aligned Radar-Leg-Inertial Enhanced Odometry Chiyun Noh et.al. 2511.13216 null
2025-11-16 DPVO-QAT++: Heterogeneous QAT and CUDA Kernel Fusion for High-Performance Deep Patch Visual Odometry Cheng Liao et.al. 2511.12653 null
2025-11-14 Autonomous Underwater Cognitive System for Adaptive Navigation: A SLAM-Integrated Cognitive Architecture K. A. I. N Jayarathne et.al. 2511.11845 null
2025-11-12 DualVision ArthroNav: Investigating Opportunities to Enhance Localization and Reconstruction in Image-based Arthroscopy Navigation via External Cameras Hongchao Shu et.al. 2511.10699 null
2025-11-12 Generation-Agnostic Zero-Energy Devices for Sustainable Connectivity, Sensing, and Localization Navid Amani et.al. 2511.09372 null
2025-11-12 UMIGen: A Unified Framework for Egocentric Point Cloud Generation and Cross-Embodiment Robotic Imitation Learning Yan Huang et.al. 2511.09302 null
2025-11-12 SMF-VO: Direct Ego-Motion Estimation via Sparse Motion Fields Sangheon Yang et.al. 2511.09072 null
2025-11-10 Integration of Visual SLAM into Consumer-Grade Automotive Localization Luis Diener et.al. 2511.06919 null
2025-11-10 Robust and High-Fidelity 3D Gaussian Splatting: Fusing Pose Priors and Geometry Constraints for Texture-Deficient Outdoor Scenes Meijun Guo et.al. 2511.06765 null
2025-11-10 Semi-distributed Cross-modal Air-Ground Relative Localization Weining Lu et.al. 2511.06749 null
2025-11-08 ViTaMIn-B: A Reliable and Efficient Visuo-Tactile Bimanual Manipulation Interface Chuanyu Li et.al. 2511.05858 null
2025-11-08 3D Mapping Using a Lightweight and Low-Power Monocular Camera Embedded inside a Gripper of Limbed Climbing Robots Taku Okawara et.al. 2511.05816 null
2025-11-07 Multi-modal Loop Closure Detection with Foundation Models in Severely Unstructured Environments Laura Alejandra Encinar Gonzalez et.al. 2511.05404 null
2025-11-06 Synchronous Observer Design for Landmark-Inertial SLAM with Almost-Global Convergence Arkadeep Saha et.al. 2511.04531 null
2025-11-06 PUL-SLAM: Path-Uncertainty Co-Optimization with Lightweight Stagnation Detection for Efficient Robotic Exploration Yizhen Yin et.al. 2511.04180 null
2025-11-04 Analytical modelling of a stop-less modular bus service with an application to charging strategies comparison Haoran Zhao et.al. 2511.03754 null
2025-11-04 Self-Supervised Moving Object Segmentation of Sparse and Noisy Radar Point Clouds Leon Schwarzer et.al. 2511.02395 null
2025-11-03 TurboMap: GPU-Accelerated Local Mapping for Visual SLAM Parsa Hosseininejad et.al. 2511.02036 null
2025-11-03 CM-LIUW-Odometry: Robust and High-Precision LiDAR-Inertial-UWB-Wheel Odometry for Extreme Degradation Coal Mine Tunnels Kun Hu et.al. 2511.01379 null
2025-11-11 Tackling the Kidnapped Robot Problem via Sparse Feasible Hypothesis Sampling and Reliable Batched Multi-Stage Inference Muhua Zhang et.al. 2511.01219 null
2025-11-03 LiDAR-VGGT: Cross-Modal Coarse-to-Fine Fusion for Globally Consistent and Metric-Scale Dense Mapping Lijie Wang et.al. 2511.01186 null
2025-11-01 Multi-Mapcher: Loop Closure Detection-Free Heterogeneous LiDAR Multi-Session SLAM Leveraging Outlier-Robust Registration for Autonomous Vehicles Hyungtae Lim et.al. 2511.00635 null
2025-10-31 WildfireX-SLAM: A Large-scale Low-altitude RGB-D Dataset for Wildfire SLAM and Beyond Zhicong Sun et.al. 2510.27133 null
2025-10-30 AgriGS-SLAM: Orchard Mapping Across Seasons via Multi-View Gaussian Splatting SLAM Mirko Usuelli et.al. 2510.26358 null
2025-10-30 Exploring Object-Aware Attention Guided Frame Association for RGB-D SLAM Ali Caglayan et.al. 2510.26131 null
2025-10-29 EA3D: Online Open-World 3D Object Extraction from Streaming Videos Xiaoyu Zhou et.al. 2510.25146 null
2025-10-28 Spatiotemporal Calibration of Doppler Velocity Logs for Underwater Robots Hongxu Zhao et.al. 2510.24571 null
2025-10-28 GeVI-SLAM: Gravity-Enhanced Stereo Visual Inertial SLAM for Underwater Robots Yuan Shen et.al. 2510.24533 null
2025-10-28 A Survey on Collaborative SLAM with 3D Gaussian Splatting Phuc Nguyen Xuan et.al. 2510.23988 null
2025-10-26 TWC-SLAM: Multi-Agent Cooperative SLAM with Text Semantics and WiFi Features Integration for Similar Indoor Environments Chunyu Li et.al. 2510.22754 null
2025-10-26 Policies over Poses: Reinforcement Learning based Distributed Pose-Graph Optimization for Multi-Robot SLAM Sai Krishna Ghanta et.al. 2510.22740 null
2025-10-26 LVD-GS: Gaussian Splatting SLAM for Dynamic Scenes via Hierarchical Explicit-Implicit Representation Collaboration Rendering Wenkai Zhu et.al. 2510.22669 null
2025-10-26 RoGER-SLAM: A Robust Gaussian Splatting SLAM System for Noisy and Low-light Environment Resilience Huilin Yin et.al. 2510.22600 null
2025-10-26 UltraVoice: Scaling Fine-Grained Style-Controlled Speech Conversations for Spoken Dialogue Models Wenming Tu et.al. 2510.22588 null
2025-10-26 Bag-of-Word-Groups (BoWG): A Robust and Efficient Loop Closure Detection Method Under Perceptual Aliasing Xiang Fei et.al. 2510.22529 null
2025-10-24 Underwater Visual-Inertial-Acoustic-Depth SLAM with DVL Preintegration for Degraded Environments Shuoshuo Ding et.al. 2510.21215 null
2025-10-23 Deep Learning-Powered Visual SLAM Aimed at Assisting Visually Impaired Navigation Marziyeh Bamdad et.al. 2510.20549 null
2025-10-23 Degradation-Aware Cooperative Multi-Modal GNSS-Denied Localization Leveraging LiDAR-Based Robot Detections Václav Pritzl et.al. 2510.20480 null
2025-10-21 Underwater Dense Mapping with the First Compact 3D Sonar Chinmay Burgul et.al. 2510.18991 null
2025-10-21 DeepDetect: Learning All-in-One Dense Keypoints Shaharyar Ahmed Khan Tareen et.al. 2510.17422 null
2025-10-18 LightGlueStick: a Fast and Robust Glue for Joint Point-Line Matching Aidyn Ubingazhibov et.al. 2510.16438 null
2025-10-17 VAR-SLAM: Visual Adaptive and Robust SLAM for Dynamic Environments João Carlos Virgolino Soares et.al. 2510.16205 null
2025-10-17 Dynamic Recalibration in LiDAR SLAM: Integrating AI and Geometric Methods with Real-Time Feedback Using INAF Fusion Zahra Arjmandi et.al. 2510.15803 null
2025-10-17 LVI-Q: Robust LiDAR-Visual-Inertial-Kinematic Odometry for Quadruped Robots Using Tightly-Coupled and Efficient Alternating Optimization Kevin Christiansen Marsim et.al. 2510.15220 null
2025-10-16 3D Scene Prompting for Scene-Consistent Camera-Controllable Video Generation JoungBin Lee et.al. 2510.14945 null
2025-10-15 Accelerated Feature Detectors for Visual SLAM: A Comparative Study of FPGA vs GPU Ruiqi Ye et.al. 2510.13546 null
2025-10-15 Through the Lens of Doubt: Robust and Efficient Uncertainty Estimation for Visual Place Recognition Emily Miller et.al. 2510.13464 null
2025-10-15 DAMM-LOAM: Degeneracy Aware Multi-Metric LiDAR Odometry and Mapping Nishant Chandna et.al. 2510.13287 null
2025-10-14 SPORTS: Simultaneous Panoptic Odometry, Rendering, Tracking and Segmentation for Urban Scenes Understanding Zhiliu Yang et.al. 2510.12749 null
2025-10-14 PolygMap: A Perceptive Locomotion Framework for Humanoid Robot Stair Climbing Bingquan Li et.al. 2510.12346 null
2025-10-09 ARTDECO: Towards Efficient and High-Fidelity On-the-Fly 3D Reconstruction with Structured Scene Representation Guanghao Li et.al. 2510.08551 null
2025-10-09 RTGS: Real-Time 3D Gaussian Splatting SLAM via Multi-Level Redundancy Reduction Leshu Li et.al. 2510.06644 null
2025-10-07 Human3R: Everyone Everywhere All at Once Yue Chen et.al. 2510.06219 null
2025-11-02 Dropping the D: RGB-D SLAM Without the Depth Sensor Mert Kiray et.al. 2510.06216 null
2025-10-07 Coordinate-Consistent Localization via Continuous-Time Calibration and Fusion of UWB and SLAM Observations Tien-Dat Nguyen et.al. 2510.05992 null
2025-10-06 OKVIS2-X: Open Keyframe-based Visual-Inertial SLAM Configurable with Dense Depth or LiDAR, and GNSS Simon Boche et.al. 2510.04612 null
2025-10-04 TCB-VIO: Tightly-Coupled Focal-Plane Binary-Enhanced Visual Inertial Odometry Matthew Lisondra et.al. 2510.03919 null
2025-11-19 Visual Odometry with Transformers Vladimir Yugay et.al. 2510.03348 null
2025-10-02 RSV-SLAM: Toward Real-Time Semantic Visual SLAM in Indoor Dynamic Environments Mobin Habibpour et.al. 2510.02616 null
2025-10-02 EC3R-SLAM: Efficient and Consistent Monocular Dense SLAM with Feed-Forward 3D Reconstruction Lingxiang Hu et.al. 2510.02080 null
2025-10-02 Non-Rigid Structure-from-Motion via Differential Geometry with Recoverable Conformal Scale Yongbo Chen et.al. 2510.01665 null
2025-10-02 Statistical Uncertainty Learning for Robust Visual-Inertial State Estimation Seungwon Choi et.al. 2510.01648 null
2025-10-01 Instant4D: 4D Gaussian Splatting in Minutes Zhanpeng Luo et.al. 2510.01119 null
2025-10-01 Semantic Visual Simultaneous Localization and Mapping: A Survey on State of the Art, Challenges, and Future Directions Thanh Nguyen Canh et.al. 2510.00783 null
2025-09-30 Benchmarking Egocentric Visual-Inertial SLAM at City Scale Anusha Krishnan et.al. 2509.26639 null
2025-09-30 Graphite: A GPU-Accelerated Mixed-Precision Graph Optimization Framework Shishir Gopinath et.al. 2509.26581 null
2025-09-30 Radio-based Multi-Robot Odometry and Relative Localization Andrés Martínez-Silva et.al. 2509.26558 null
2025-09-30 DEPTHOR++: Robust Depth Enhancement from a Real-World Lightweight dToF and RGB Guidance Jijun Xiang et.al. 2509.26498 null
2025-09-30 Side Scan Sonar-based SLAM for Autonomous Algae Farm Monitoring Julian Valdez et.al. 2509.26121 null
2025-09-30 User-Centric Communication Service Provision for Edge-Assisted Mobile Augmented Reality Conghao Zhou et.al. 2509.25905 null
2025-09-29 PROFusion: Robust and Accurate Dense Reconstruction via Camera Pose Regression and Optimization Siyan Dong et.al. 2509.24236 null
2025-09-28 GRS-SLAM3R: Real-Time Dense SLAM with Gated Recurrent State Guole Shen et.al. 2509.23737 null
2025-09-28 From Fields to Splats: A Cross-Domain Survey of Real-Time Neural Scene Representations Javed Ahmad et.al. 2509.23555 null
2025-09-27 EKF-Based Fusion of Wi-Fi/LiDAR/IMU for Indoor Localization and Navigation Zeyi Li et.al. 2509.23118 null
2025-09-26 Good Weights: Proactive, Adaptive Dead Reckoning Fusion for Continuous and Robust Visual SLAM Yanwei Du et.al. 2509.22910 null
2025-09-26 IMU-Preintegrated Radar Factors for Asynchronous Radar-LiDAR-Inertial SLAM Johan Hatleskog et.al. 2509.22288 null
2025-09-25 Real-Time Indoor Object SLAM with LLM-Enhanced Priors Yang Jiao et.al. 2509.21602 null
2025-09-25 PL-VIWO2: A Lightweight, Fast and Robust Visual-Inertial-Wheel Odometry Using Points and Lines Zhixin Zhang et.al. 2509.21563 null
2025-09-25 AnywhereVLA: Language-Conditioned Exploration and Mobile Manipulation Konstantin Gubernatorov et.al. 2509.21006 null
2025-11-16 MASt3R-Fusion: Integrating Feed-Forward Visual Model with IMU, GNSS for High-Functionality SLAM Yuxuan Zhou et.al. 2509.20757 null
2025-09-25 SLAM-Free Visual Navigation with Hierarchical Vision-Language Perception and Coarse-to-Fine Semantic Topological Planning Guoyang Zhao et.al. 2509.20739 null
2025-09-24 Optical Ocean Recipes: Creating Realistic Datasets to Facilitate Underwater Vision Research Patricia Schöntag et.al. 2509.20171 null
2025-09-23 Bioinspired SLAM Approach for Unmanned Surface Vehicle Fabio Coelho et.al. 2509.19522 null
2025-09-23 CU-Multi: A Dataset for Multi-Robot Collaborative Perception Doncey Albin et.al. 2509.19463 null
2025-09-23 Towards Robust LiDAR Localization: Deep Learning-based Uncertainty Estimation Minoo Dolatabadi et.al. 2509.18954 null
2025-09-23 An Extended Kalman Filter for Systems with Infinite-Dimensional Measurements Maxwell M. Varley et.al. 2509.18749 null
2025-09-22 Semantic-Aware Particle Filter for Reliable Vineyard Robot Localisation Rajitha de Silva et.al. 2509.18342 null
2025-09-22 ProDyG: Progressive Dynamic Scene Reconstruction via Gaussian Splatting from Monocular Videos Shi Chen et.al. 2509.17864 null
2025-09-21 SLAM-Former: Putting SLAM into One Transformer Yijun Yuan et.al. 2509.16909 null
2025-09-21 ConfidentSplat: Confidence-Weighted Depth Fusion for Accurate 3D Gaussian Splatting SLAM Amanuel T. Dufera et.al. 2509.16863 null
2025-09-19 SLaM-DiMM: Shared Latent Modeling for Diffusion Based Missing Modality Synthesis in MRI Bhavesh Sandbhor et.al. 2509.16019 null
2025-09-19 Omni-LIVO: Robust RGB-Colored Multi-Camera Visual-Inertial-LiDAR Odometry via Photometric Migration and ESIKF Fusion Yinong Cao et.al. 2509.15673 null
2025-09-19 STARC: See-Through-Wall Augmented Reality Framework for Human-Robot Collaboration in Emergency Response Shenghai Yuan et.al. 2509.15507 null
2025-09-18 Human Interaction for Collaborative Semantic SLAM using Extended Reality Laura Ribeiro et.al. 2509.14949 null
2025-09-18 BEV-ODOM2: Enhanced BEV-based Monocular Visual Odometry with PV-BEV Fusion and Dense Flow Supervision for Ground Robots Yufei Wei et.al. 2509.14636 null
2025-09-18 Event-LAB: Towards Standardized Evaluation of Neuromorphic Localization Methods Adam D. Hines et.al. 2509.14516 null
2025-10-03 MCGS-SLAM: A Multi-Camera SLAM Framework Using Gaussian Splatting for High-Fidelity Mapping Zhihao Cao et.al. 2509.14191 null
2025-10-08 BIM Informed Visual SLAM for Construction Monitoring Asier Bikandi-Noya et.al. 2509.13972 null
2025-09-17 UM-Depth : Uncertainty Masked Self-Supervised Monocular Depth Estimation with Visual Odometry Tae-Wook Um et.al. 2509.13713 null
2025-09-17 Barometer-Aided Attitude Estimation Méloné Nyoba Tchonkeu et.al. 2509.13649 null
2025-09-16 Semantic 3D Reconstructions with SLAM for Central Airway Obstruction Ayberk Acar et.al. 2509.13541 null
2025-09-16 MemGS: Memory-Efficient Gaussian Splatting for Real-Time SLAM Yinlong Bai et.al. 2509.13536 null
2025-09-18 MATTER: Multiscale Attention for Registration Error Regression Shipeng Liu et.al. 2509.12924 null
2025-09-16 Match Chat: Real Time Generative AI and Generative Computing for Tennis Aaron Baughman et.al. 2509.12592 null
2025-09-15 See What I Mean? Mobile Eye-Perspective Rendering for Optical See-through Head-mounted Displays Gerlinde Emsenhuber et.al. 2509.11653 null
2025-09-15 Gaussian-Plus-SDF SLAM: High-fidelity 3D Reconstruction at 150+ fps Zhexi Peng et.al. 2509.11574 null
2025-09-28 Autonomous Close-Proximity Photovoltaic Panel Coating Using a Quadcopter Dimitri Jacquemont et.al. 2509.10979 null
2025-09-13 FastTrack: GPU-Accelerated Tracking for Visual SLAM Kimia Khabiri et.al. 2509.10757 null
2025-09-12 Robust Localization in Modern Cellular Networks using Global Map Features Junshi Chen et.al. 2509.10433 null
2025-09-12 Efficient and Accurate Downfacing Visual Inertial Odometry Jonas Kühne et.al. 2509.10021 null
2025-10-10 SMapper: A Multi-Modal Data Acquisition Platform for SLAM Benchmarking Pedro Miguel Bastos Soares et.al. 2509.09509 null
2025-09-11 S-BEVLoc: BEV-based Self-supervised Framework for Large-scale LiDAR Global Localization Chenghao Zhang et.al. 2509.09110 null
2025-09-10 Good Deep Features to Track: Self-Supervised Feature Extraction and Tracking in Visual Odometry Sai Puneeth Reddy Gottam et.al. 2509.08333 null
2025-09-10 Behaviorally Heterogeneous Multi-Agent Exploration Using Distributed Task Allocation Nirabhra Mandal et.al. 2509.08242 null
2025-09-10 Deep Visual Odometry for Stereo Event Cameras Sheng Zhong et.al. 2509.08235 null
2025-09-10 Online Dynamic SLAM with Incremental Smoothing and Mapping Jesse Morris et.al. 2509.08197 null
2025-09-09 Sensing with Mobile Devices through Radio SLAM: Models, Methods, Opportunities, and Challenges Yu Ge et.al. 2509.07775 null
2025-11-04 Radar-Based Odometry for Low-Speed Driving Luis Diener et.al. 2509.07683 null
2025-09-09 Aerial-ground Cross-modal Localization: Dataset, Ground-truth, and Benchmark Yandi Yang et.al. 2509.07362 null
2025-09-08 Detection and Recovery of Adversarial Slow-Pose Drift in Offloaded Visual-Inertial Odometry Soruya Saha et.al. 2509.07130 null
2025-09-08 Co-Located VR with Hybrid SLAM-based HMD Tracking and Motion Capture Synchronization Carlos A. Pinheiro de Sousa et.al. 2509.06582 null
2025-09-15 Real-time Photorealistic Mapping for Situational Awareness in Robot Teleoperation Ian Page et.al. 2509.06433 null
2025-09-07 DVLO4D: Deep Visual-Lidar Odometry with Sparse Spatial-temporal Fusion Mengmeng Liu et.al. 2509.06023 null
2025-09-06 Multi-LVI-SAM: A Robust LiDAR-Visual-Inertial Odometry for Multiple Fisheye Cameras Xinyu Zhang et.al. 2509.05740 null
2025-09-30 LiDAR-BIND-T: Improved and Temporally Consistent Sensor Modality Translation and Fusion for Robotic Applications Niels Balemans et.al. 2509.05728 null
2025-09-04 Stitching the Story: Creating Panoramic Incident Summaries from Body-Worn Footage Dor Cohen et.al. 2509.04370 null
2025-09-04 Odometry Calibration and Pose Estimation of a 4WIS4WID Mobile Wall Climbing Robot Branimir Ćaran et.al. 2509.04016 null
2025-09-03 IL-SLAM: Intelligent Line-assisted SLAM Based on Feature Awareness for Dynamic Environments Haolan Zhang et.al. 2509.02972 null
2025-09-02 Coral: A Unifying Abstraction Layer for Composable Robotics Software Steven Swanbeck et.al. 2509.02453 null
2025-09-02 Doctoral Thesis: Geometric Deep Learning For Camera Pose Prediction, Registration, Depth Estimation, and 3D Reconstruction Xueyang Kang et.al. 2509.01873 null
2025-09-01 ViSTA-SLAM: Visual SLAM with Symmetric Two-view Association Ganlin Zhang et.al. 2509.01584 null
2025-09-01 FGO-SLAM: Enhancing Gaussian SLAM with Globally Consistent Opacity Radiance Field Fan Zhu et.al. 2509.01547 null
2025-09-01 SR-SLAM: Scene-reliability Based RGB-D SLAM in Diverse Environments Haolan Zhang et.al. 2509.01111 null
2025-08-31 DyPho-SLAM : Real-time Photorealistic SLAM in Dynamic Environments Yi Liu et.al. 2509.00741 null
2025-08-30 AGS: Accelerating 3D Gaussian Splatting SLAM via CODEC-Assisted Frame Covisibility Detection Houshu He et.al. 2509.00433 null
2025-08-29 The Rosario Dataset v2: Multimodal Dataset for Agricultural Robotics Nicolas Soncini et.al. 2508.21635 null
2025-08-28 Observer Design for Optical Flow-Based Visual-Inertial Odometry with Almost-Global Convergence Tarek Bouazza et.al. 2508.21163 null
2025-08-28 Adam SLAM - the last mile of camera calibration with 3DGS Matthieu Gendrin et.al. 2508.20526 null
2025-08-24 SEER-VAR: Semantic Egocentric Environment Reasoner for Vehicle Augmented Reality Yuzhi Lai et.al. 2508.17255 null
2025-08-24 VROOM - Visual Reconstruction over Onboard Multiview Yajat Yadav et.al. 2508.17172 null
2025-08-23 DualReg: Dual-Space Filtering and Reinforcement for Rigid Registration Jiayi Li et.al. 2508.17034 null
2025-08-23 A Workflow for Map Creation in Autonomous Vehicle Simulations Zubair Islam et.al. 2508.16856 null
2025-09-12 COSMO-Bench: A Benchmark for Collaborative SLAM Optimization Daniel McGann et.al. 2508.16731 null
2025-08-22 GPL-SLAM: A Laser SLAM Framework with Gaussian Process Based Extended Landmarks Ali Emre Balcı et.al. 2508.16459 null
2025-08-21 GelSLAM: A Real-time, High-Fidelity, and Robust 3D Tactile SLAM System Hung-Jui Huang et.al. 2508.15990 null
2025-08-19 SLAM-based Safe Indoor Exploration Strategy Omar Mostafa et.al. 2508.14235 null
2025-09-05 Online 3D Gaussian Splatting Modeling with Novel View Selection Byeonggwon Lee et.al. 2508.14014 null
2025-08-19 ROVER: Robust Loop Closure Verification with Trajectory Prior in Repetitive Environments Jingwen Yu et.al. 2508.13488 null
2025-08-18 XR-NPE: High-Throughput Mixed-precision SIMD Neural Processing Engine for Extended Reality Perception Workloads Tejas Chaudhari et.al. 2508.13049 null
2025-08-16 DynamicPose: Real-time and Robust 6D Object Pose Tracking for Fast-Moving Cameras and Objects Tingbang Liang et.al. 2508.11950 null
2025-08-14 CVIRO: A Consistent and Tightly-Coupled Visual-Inertial-Ranging Odometry on Lie Groups Yizhi Zhou et.al. 2508.10867 null
2025-08-14 Super LiDAR Reflectance for Robotic Perception Wei Gao et.al. 2508.10398 null
2025-08-12 Transient Noise Removal via Diffusion-based Speech Inpainting Mordehay Moradi et.al. 2508.08890 null
2025-08-09 EGS-SLAM: RGB-D Gaussian Splatting SLAM with Events Siyu Chen et.al. 2508.07003 null
2025-08-07 A Multi-view Landmark Representation Approach with Application to GNSS-Visual-Inertial Odometry Tong Hua et.al. 2508.05368 null
2025-08-07 Speech LLMs in Low-Resource Scenarios: Data Volume Requirements and the Impact of Pretraining on High-Resource Languages Seraphina Fong et.al. 2508.05149 null
2025-08-06 Pseudo Depth Meets Gaussian: A Feed-forward RGB SLAM Baseline Linqing Zhao et.al. 2508.04597 null
2025-10-15 Inland-LOAM: Voxel-Based Structural Semantic LiDAR Odometry and Mapping for Inland Waterway Navigation Zhongbi Luo et.al. 2508.03672 null
2025-08-04 A Moment Matching-Based Method for Sparse and Noisy Point Cloud Registration Xingyi Li et.al. 2508.02187 null
2025-08-04 AID4AD: Aerial Image Data for Automated Driving Perception Daniel Lengerer et.al. 2508.02140 null
2025-08-01 CoProU-VO: Combining Projected Uncertainty for End-to-End Unsupervised Monocular Visual Odometry Jingchao Xie et.al. 2508.00568 null
2025-07-31 The Monado SLAM Dataset for Egocentric Visual-Inertial Tracking Mateo de Mayo et.al. 2508.00088 null
2025-07-31 Stereo 3D Gaussian Splatting SLAM for Outdoor Urban Scenes Xiaohan Li et.al. 2507.23677 null
2025-07-31 DRACo-SLAM2: Distributed Robust Acoustic Communication-efficient SLAM for Imaging Sonar Equipped Underwater Robot Teams with Object Graph Matching Yewei Huang et.al. 2507.23629 null
2025-07-31 GSFusion: Globally Optimized LiDAR-Inertial-Visual Mapping for Gaussian Splatting Jaeseok Park et.al. 2507.23273 null
2025-07-30 Modality-Aware Feature Matching: A Comprehensive Review of Single- and Cross-Modality Techniques Weide Liu et.al. 2507.22791 null
2025-07-30 UAVScenes: A Multi-Modal Dataset for UAVs Sijie Wang et.al. 2507.22412 null
2025-07-29 Impact of Underwater Image Enhancement on Feature Matching Jason M. Summers et.al. 2507.21715 null
2025-07-29 Adaptive Prior Scene-Object SLAM for Dynamic Environments Haolan Zhang et.al. 2507.21709 null
2025-08-01 Multi-robot LiDAR SLAM: a practical case study in underground tunnel environments Federica Di Lauro et.al. 2507.21553 null
2025-07-28 $S^3$LAM: Surfel Splatting SLAM for Geometrically Accurate Tracking and Mapping Ruoyu Fan et.al. 2507.20854 null
2025-07-28 Large-Scale LiDAR-Inertial Dataset for Degradation-Robust High-Precision Mapping Xiaofeng Jin et.al. 2507.20516 null
2025-07-26 DOA: A Degeneracy Optimization Agent with Adaptive Pose Compensation Capability based on Deep Reinforcement Learning Yanbin Li et.al. 2507.19742 null
2025-07-25 DINO-SLAM: DINO-informed RGB-D SLAM for Neural Implicit and Explicit Representations Ziren Gong et.al. 2507.19474 null
2025-07-25 The Eloquence team submission for task 1 of MLC-SLM challenge Lorenzo Concina et.al. 2507.19308 null
2025-07-31 SmartPNT-MSF: A Multi-Sensor Fusion Dataset for Positioning and Navigation Research Feng Zhu et.al. 2507.19079 null
2025-07-25 A Fast and Light-weight Non-Iterative Visual Odometry with RGB-D Cameras Zheng Yang et.al. 2507.18886 null
2025-07-24 G2S-ICP SLAM: Geometry-aware Gaussian Splatting ICP SLAM Gyuhyeon Pak et.al. 2507.18344 null
2025-07-23 Physics-based Human Pose Estimation from a Single Moving RGB Camera Ayce Idil Aytekin et.al. 2507.17406 null
2025-08-01 CasP: Improving Semi-Dense Feature Matching Pipeline Leveraging Cascaded Correspondence Priors for Guidance Peiqi Chen et.al. 2507.17312 null
2025-07-21 DiffPF: Differentiable Particle Filtering with Generative Sampling via Conditional Diffusion Models Ziyu Wan et.al. 2507.15716 null
2025-07-21 Dense-depth map guided deep Lidar-Visual Odometry with Sparse Point Clouds and Images JunYing Huang et.al. 2507.15496 null
2025-07-21 All-UWB SLAM Using UWB Radar and UWB AOA Charith Premachandra et.al. 2507.15474 null
2025-07-21 BenchDepth: Are We on the Right Way to Evaluate Depth Foundation Models? Zhenyu Li et.al. 2507.15321 null
2025-07-20 LoopNet: A Multitasking Few-Shot Learning Approach for Loop Closure in Large Scale SLAM Mohammad-Maher Nakshbandi et.al. 2507.15109 null
2025-11-04 Advances in Feed-Forward 3D Reconstruction and View Synthesis: A Survey Jiahui Zhang et.al. 2507.14501 null
2025-07-18 SaWa-ML: Structure-Aware Pose Correction and Weight Adaptation-Based Robust Multi-Robot Localization Junho Choi et.al. 2507.13702 null
2025-07-17 DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model Maulana Bisyir Azhari et.al. 2507.13145 null
2025-07-17 MoCap2GT: A High-Precision Ground Truth Estimator for SLAM Benchmarking Based on Motion Capture and IMU Fusion Zichao Shu et.al. 2507.12920 null
2025-07-17 Next-Gen Museum Guides: Autonomous Navigation and Visitor Interaction with an Agentic Robot Luca Garello et.al. 2507.12273 null
2025-07-16 Tree-SLAM: semantic object SLAM for efficient mapping of individual trees in orchards David Rapado-Rincon et.al. 2507.12093 null
2025-07-11 Towards Robust Sensor-Fusion Ground SLAM: A Comprehensive Benchmark and A Resilient Framework Deteng Zhang et.al. 2507.08364 null
2025-07-10 Hardware-Aware Feature Extraction Quantisation for Real-Time Visual Odometry on FPGA Platforms Mateusz Wasala et.al. 2507.07903 null
2025-07-10 IRAF-SLAM: An Illumination-Robust and Adaptive Feature-Culling Front-End for Visual SLAM in Challenging Environments Thanh Nguyen Canh et.al. 2507.07752 null
2025-07-09 g2o vs. Ceres: Optimizing Scan Matching in Cartographer SLAM Quanjie Qiu et.al. 2507.07142 null
2025-07-08 Mapping the Catacombs: An Underwater Cave Segment of the Devil's Eye System Michalis Chatzispyrou et.al. 2507.06397 null
2025-07-08 Cooperative Mapping, Localization, and Beam Management via Multi-Modal SLAM in ISAC Systems Hang Que et.al. 2507.05718 null
2025-07-07 Simultaneous Localization and Mapping Using Active mmWave Sensing in 5G NR Tao Du et.al. 2507.04662 null
2025-07-06 Lidar Variability: A Novel Dataset and Comparative Study of Solid-State and Spinning Lidars Doumegna Mawuto Koudjo Felix et.al. 2507.04321 null
2025-07-09 Gaussian-LIC2: LiDAR-Inertial-Camera Gaussian Splatting SLAM Xiaolei Lang et.al. 2507.04004 null
2025-07-04 Outdoor Monocular SLAM with Global Scale-Consistent 3D Gaussian Pointmaps Chong Cheng et.al. 2507.03737 null
2025-07-01 RaGNNarok: A Light-Weight Graph Neural Network for Enhancing Radar Point Clouds on Unmanned Ground Vehicles David Hunt et.al. 2507.00937 null
2025-07-01 Generation of Indoor Open Street Maps for Robot Navigation from CAD Files Jiajie Zhang et.al. 2507.00552 null
2025-06-30 VOCAL: Visual Odometry via ContrAstive Learning Chi-Yao Huang et.al. 2507.00243 null
2025-06-29 TVG-SLAM: Robust Gaussian Splatting SLAM with Tri-view Geometric Constraints Zhen Tan et.al. 2506.23207 null
2025-06-29 Event-based Stereo Visual-Inertial Odometry with Voxel Map Zhaoxing Zhang et.al. 2506.23078 null
2025-06-26 Adaptive Multipath-Based SLAM for Distributed MIMO Systems Xuhong Li et.al. 2506.21798 null
2025-06-24 Ark: An Open-source Python-based Framework for Robot Learning Magnus Dierking et.al. 2506.21628 null
2025-06-26 EndoFlow-SLAM: Real-Time Endoscopic SLAM with Flow-Constrained Gaussian Splatting Taoyu Wu et.al. 2506.21420 null
2025-06-26 CURL-SLAM: Continuous and Compact LiDAR Mapping Kaicheng Zhang et.al. 2506.21077 null
2025-06-25 SPARK: Graph-Based Online Semantic Integration System for Robot Task Planning Mimo Shirasaka et.al. 2506.20394 null
2025-06-25 Real-Time Obstacle Avoidance Algorithms for Unmanned Aerial and Ground Vehicles Jingwen Wei et.al. 2506.20311 null
2025-06-24 Posterior Cramér-Rao Bounds on Localization and Mapping Errors in Distributed MIMO SLAM Benjamin J. B. Deutschmann et.al. 2506.19957 null
2025-06-23 GRAND-SLAM: Local Optimization for Globally Consistent Large-Scale Multi-Agent Gaussian SLAM Annika Thomas et.al. 2506.18885 null
2025-06-23 MCN-SLAM: Multi-Agent Collaborative Neural SLAM with Hybrid Implicit Neural Scene Representation Tianchen Deng et.al. 2506.18678 null
2025-06-24 Multimodal Fusion SLAM with Fourier Attention Youjie Zhou et.al. 2506.18204 null
2025-06-22 ADA-DPM: A Neural Descriptors-based Adaptive Noise Point Filtering Strategy for SLAM Yongxin Shao et.al. 2506.18016 null
2025-06-21 Optimizing Exploration with a New Uncertainty Framework for Active SLAM Systems Sebastian Sansoni et.al. 2506.17775 null
2025-06-18 MCOO-SLAM: A Multi-Camera Omnidirectional Object SLAM System Miaoxin Pan et.al. 2506.15402 null
2025-06-24 RA-NeRF: Robust Neural Radiance Field Reconstruction with Accurate Camera Pose Estimation under Complex Trajectories Qingsong Yan et.al. 2506.15242 null
2025-06-18 SHeRLoc: Synchronized Heterogeneous Radar Place Recognition for Cross-Modal Localization Hanjun Kim et.al. 2506.15175 null
2025-06-18 VIMS: A Visual-Inertial-Magnetic-Sonar SLAM System in Underwater Environments Bingbing Zhang et.al. 2506.15126 null
2025-06-16 Slanted light-sheet array microscopy for large volume imaging at rates exceeding 100 Hz Kai Long et.al. 2506.13664 null
2025-06-16 Cognitive Synergy Architecture: SEGO for Human-Centric Collaborative Robots Jaehong Oh et.al. 2506.13149 null
2025-06-16 A Novel ViDAR Device With Visual Inertial Encoder Odometry and Reinforcement Learning-Based Active SLAM Method Zhanhua Xin et.al. 2506.13100 null
2025-06-16 SuperPoint-SLAM3: Augmenting ORB-SLAM3 with Deep Features, Adaptive NMS, and Learning-Based Loop Closure Shahram Najam Syed et.al. 2506.13089 link
2025-06-12 LRSLAM: Low-rank Representation of Signed Distance Fields in Dense Visual SLAM System Hongbeen Park et.al. 2506.10567 null
2025-06-11 VAULT: A Mobile Mapping System for ROS 2-based Autonomous Robots Miguel Á. González-Santamarta et.al. 2506.09583 null
2025-06-10 UFM: A Simple Path towards Unified Dense Correspondence with Flow Yuchen Zhang et.al. 2506.09278 null
2025-06-10 Princeton365: A Diverse Dataset with Accurate Camera Pose Karhan Kayan et.al. 2506.09035 null
2025-06-10 Planar Collisionless Shock Simulations with Semi-Implicit Particle-in-Cell Model FLEKS Hongyang Zhou et.al. 2506.08384 null
2025-06-09 ZeroVO: Visual Odometry with Minimal Assumptions Lei Lai et.al. 2506.08005 null
2025-06-08 Faster than Fast: Accelerating Oriented FAST Feature Detection on Low-end Embedded GPUs Qiong Chang et.al. 2506.07164 null
2025-06-08 UNO: Unified Self-Supervised Monocular Odometry for Platform-Agnostic Deployment Wentao Zhao et.al. 2506.07013 null
2025-06-06 GS4: Generalizable Sparse Splatting Semantic SLAM Mingqi Jiang et.al. 2506.06517 null
2025-06-06 Enhancing Situational Awareness in Underwater Robotics with Multi-modal Spatial Perception Pushyami Kaveti et.al. 2506.06476 null
2025-06-04 Seeing in the Dark: Benchmarking Egocentric 3D Vision with the Oxford Day-and-Night Dataset Zirui Wang et.al. 2506.04224 null
2025-06-03 LEG-SLAM: Real-Time Language-Enhanced Gaussian Splatting for SLAM Roman Titkov et.al. 2506.03073 null
2025-06-03 Online Performance Assessment of Multi-Source-Localization for Autonomous Driving Systems Using Subjective Logic Stefan Orf et.al. 2506.02932 null
2025-06-03 VTGaussian-SLAM: RGBD SLAM for Large Scale Scenes with Splatting View-Tied 3D Gaussians Pengchong Hu et.al. 2506.02741 null
2025-06-03 GeneA-SLAM2: Dynamic SLAM with AutoEncoder-Preprocessed Genetic Keypoints Resampling and Depth Variance-Guided Dynamic Region Removal Shufan Qing et.al. 2506.02736 link
2025-06-03 Olfactory Inertial Odometry: Methodology for Effective Robot Navigation by Scent Kordel K. France et.al. 2506.02373 null
2025-06-01 Globally Consistent RGB-D SLAM with 2D Gaussian Splatting Xingguang Zhong et.al. 2506.00970 link
2025-05-30 Black-box Adversarial Attacks on CNN-based SLAM Algorithms Maria Rafaela Gkeka et.al. 2505.24654 null
2025-05-28 Semantic Exploration and Dense Mapping of Complex Environments using Ground Robots Equipped with LiDAR and Panoramic Camera Xiaoyang Zhan et.al. 2505.22880 null
2025-05-28 4DTAM: Non-Rigid Tracking and Mapping via Dynamic Surface Gaussians Hidenobu Matsuki et.al. 2505.22859 null
2025-05-28 UP-SLAM: Adaptively Structured Gaussian SLAM with Uncertainty Prediction in Dynamic Environments Wancai Zheng et.al. 2505.22335 null
2025-05-27 HS-SLAM: A Fast and Hybrid Strategy-Based SLAM Approach for Low-Speed Autonomous Driving Bingxiang Kang et.al. 2505.20906 null
2025-05-27 ProBA: Probabilistic Bundle Adjustment with the Bhattacharyya Coefficient Jason Chui et.al. 2505.20858 null
2025-05-26 ADD-SLAM: Adaptive Dynamic Dense SLAM with Gaussian Splatting Wenhua Wu et.al. 2505.19420 null
2025-05-25 VPGS-SLAM: Voxel-based Progressive 3D Gaussian SLAM in Large-Scale Scenes Tianchen Deng et.al. 2505.18992 link
2025-05-23 CU-Multi: A Dataset for Multi-Robot Data Association Doncey Albin et.al. 2505.17576 null
2025-05-22 TAT-VPR: Ternary Adaptive Transformer for Dynamic and Efficient Visual Place Recognition Oliver Grainge et.al. 2505.16447 null
2025-05-20 A Methodological Framework for Measuring Spatial Labeling Similarity Yihang Du et.al. 2505.14128 link
2025-05-22 Place Recognition: A Comprehensive Review, Current Challenges and Future Directions Zhenyu Li et.al. 2505.14068 link
2025-05-19 eStonefish-scenes: A synthetically generated dataset for underwater event-based optical flow prediction tasks Jad Mansour et.al. 2505.13309 null
2025-05-23 VGGT-SLAM: Dense RGB SLAM Optimized on the SL(4) Manifold Dominic Maggio et.al. 2505.12549 null
2025-05-18 Is Semantic SLAM Ready for Embedded Systems? A Comparative Survey Calvin Galagain et.al. 2505.12384 null
2025-05-18 Structureless VIO Junlin Song et.al. 2505.12337 null
2025-05-16 EgoDex: Learning Dexterous Manipulation from Large-Scale Egocentric Video Ryan Hoque et.al. 2505.11709 null
2025-05-16 Improved Bag-of-Words Image Retrieval with Geometric Constraints for Ground Texture Localization Aaron Wilhelm et.al. 2505.11620 null
2025-05-16 Robust 2D lidar-based SLAM in arboreal environments without IMU/GNSS Paola Nazate-Burgos et.al. 2505.10847 null
2025-05-15 TartanGround: A Large-Scale Dataset for Ground Robot Perception and Navigation Manthan Patel et.al. 2505.10696 null
2025-05-15 A hybrid SLAM-Payne framework for atmospheric parameter and abundance determination of early-type Stars from LAMOST DR9 low-resolution Spectra Weijia Sun et.al. 2505.10310 null
2025-05-15 Large-Scale Gaussian Splatting SLAM Zhe Xin et.al. 2505.09915 null
2025-05-13 Automated Meta Prompt Engineering for Alignment with the Theory of Mind Aaron Baughman et.al. 2505.09024 null
2025-05-13 MDF: Multi-Modal Data Fusion with CNN-Based Object Detection for Enhanced Indoor Localization Using LiDAR-SLAM Saqi Hussain Kalan et.al. 2505.08388 null
2025-05-13 SKiD-SLAM: Robust, Lightweight, and Distributed Multi-Robot LiDAR SLAM in Resource-Constrained Field Environments Hogyun Kim et.al. 2505.08230 null
2025-05-12 RDD: Robust Feature Detector and Descriptor using Deformable Transformer Gonglin Chen et.al. 2505.08013 null
2025-05-12 Ranking-aware Continual Learning for LiDAR Place Recognition Xufei Wang et.al. 2505.07198 null
2025-05-07 Scalable Aerial GNSS Localization for Marine Robots Shuo Wen et.al. 2505.04095 link
2025-05-06 Thermal-LiDAR Fusion for Robust Tunnel Localization in GNSS-Denied and Low-Visibility Conditions Lukas Schichler et.al. 2505.03565 null
2025-05-06 AquaticVision: Benchmarking Visual SLAM in Underwater Environment with Events and Frames Yifan Peng et.al. 2505.03448 null
2025-05-06 LiftFeat: 3D Geometry-Aware Local Feature Matching Yepeng Liu et.al. 2505.03422 link
2025-05-05 LiDAR-Inertial SLAM-Based Navigation and Safety-Oriented AI-Driven Control System for Skid-Steer Robots Mehdi Heydari Shahna et.al. 2505.02598 null
2025-05-04 Robust Localization, Mapping, and Navigation for Quadruped Robots Dyuman Aditya et.al. 2505.02272 null
2025-05-04 SafeNav: Safe Path Navigation using Landmark Based Localization in a GPS-denied Environment Ganesh Sapkota et.al. 2505.01956 null
2025-05-03 GauS-SLAM: Dense RGB-D SLAM with Gaussian Surfels Yongxin Su et.al. 2505.01934 null
2025-05-02 Tightly Coupled Range Inertial Odometry and Mapping with Exact Point Cloud Downsampling Kenji Koide et.al. 2505.01017 null
2025-04-30 An Underwater, Fault-Tolerant, Laser-Aided Robotic Multi-Modal Dense SLAM System for Continuous Underwater In-Situ Observation Yaming Ou et.al. 2504.21826 null
2025-04-30 eNCApsulate: NCA for Precision Diagnosis on Capsule Endoscopes Henry John Krumb et.al. 2504.21562 null
2025-04-29 Large-scale visual SLAM for in-the-wild videos Shuo Sun et.al. 2504.20496 null
2025-04-28 Transformation & Translation Occupancy Grid Mapping: 2-Dimensional Deep Learning Refined SLAM Leon Davies et.al. 2504.19654 null
2025-04-28 GAN-SLAM: Real-Time GAN Aided Floor Plan Creation Through SLAM Leon Davies et.al. 2504.19653 null
2025-04-28 GSFF-SLAM: 3D Semantic Gaussian Splatting SLAM via Feature Field Zuxing Lu et.al. 2504.19409 null
2025-04-27 Beyond Physical Reach: Comparing Head- and Cane-Mounted Cameras for Last-Mile Navigation by Blind Users Apurv Varshney et.al. 2504.19345 null
2025-04-27 NANO-SLAM: Natural Gradient Gaussian Approximation for Vehicle SLAM Tianyi Zhang et.al. 2504.19195 null
2025-04-27 MISO: Multiresolution Submap Optimization for Efficient Globally Consistent Neural Implicit Reconstruction Yulun Tian et.al. 2504.19104 null
2025-04-25 Certifiably-Correct Mapping for Safe Navigation Despite Odometry Drift Devansh R. Agrawal et.al. 2504.18713 null
2025-04-25 Range-based 6-DoF Monte Carlo SLAM with Gradient-guided Particle Filter on GPU Takumi Nakao et.al. 2504.18056 null
2025-04-24 Autonomous Navigation Of Quadrupeds Using Coverage Path Planning Alexander James Becoy et.al. 2504.17880 null
2025-04-24 BIM-Constrained Optimization for Accurate Localization and Deviation Correction in Construction Monitoring Asier Bikandi et.al. 2504.17693 null
2025-04-24 Occlusion-Aware Self-Supervised Monocular Depth Estimation for Weak-Texture Endoscopic Images Zebo Huang et.al. 2504.17582 null
2025-04-24 Bias-Eliminated PnP for Stereo Visual Odometry: Provably Consistent and Large-Scale Localization Guangyang Zeng et.al. 2504.17410 null
2025-04-24 EdgePoint2: Compact Descriptors for Superior Efficiency and Accuracy Haodi Yao et.al. 2504.17280 null
2025-04-23 ToF-Splatting: Dense SLAM using Sparse Time-of-Flight Depth and Multi-Frame Integration Andrea Conti et.al. 2504.16545 null
2025-04-22 DERD-Net: Learning Depth from Event-based Ray Densities Diego de Oliveira Hitzges et.al. 2504.15863 null
2025-04-23 SLAM-Based Navigation and Fault Resilience in a Surveillance Quadcopter with Embedded Vision Systems Abhishek Tyagi et.al. 2504.15305 null
2025-04-20 Back on Track: Bundle Adjustment for Dynamic Scene Reconstruction Weirong Chen et.al. 2504.14516 null
2025-04-20 SG-Reg: Generalizable and Efficient Scene Graph Registration Chuhao Liu et.al. 2504.14440 link
2025-04-19 Unreal Robotics Lab: A High-Fidelity Robotics Simulator with Advanced Physics and Rendering Jonathan Embley-Riches et.al. 2504.14135 null
2025-04-16 An Online Adaptation Method for Robust Depth Estimation and Visual Odometry in the Open World Xingwu Ji et.al. 2504.11698 link
2025-04-18 Doppler-SLAM: Doppler-Aided Radar-Inertial and LiDAR-Inertial Simultaneous Localization and Mapping Dong Wang et.al. 2504.11634 link
2025-04-14 Region Based SLAM-Aware Exploration: Efficient and Robust Autonomous Mapping Strategy That Can Scale Megha Maheshwari et.al. 2504.10416 null
2025-04-14 RoboCup Rescue 2025 Team Description Paper UruBots Kevin Farias et.al. 2504.09778 null
2025-04-11 FindAnything: Open-Vocabulary and Object-Centric Mapping for Robot Exploration in Any Environment Sebastián Barbas Laina et.al. 2504.08603 null
2025-04-11 PNE-SGAN: Probabilistic NDT-Enhanced Semantic Graph Attention Network for LiDAR Loop Closure Detection Xiong Li et.al. 2504.08280 null
2025-04-11 II-NVM: Enhancing Map Accuracy and Consistency with Normal Vector-Assisted Mapping Chengwei Zhao et.al. 2504.08204 link
2025-04-10 UWB Anchor Based Localization of a Planetary Rover Andreas Nüchter et.al. 2504.07658 null
2025-04-10 Event Signal Filtering via Probability Flux Estimation Jinze Chen et.al. 2504.07503 null
2025-04-07 Embracing Dynamics: Dynamics-aware 4D Gaussian Splatting SLAM Zhicong Sun et.al. 2504.04844 link
2025-04-06 SELC: Self-Supervised Efficient Local Correspondence Learning for Low Quality Images Yuqing Wang et.al. 2504.04497 null
2025-04-06 VSLAM-LAB: A Comprehensive Framework for Visual SLAM Methods and Datasets Alejandro Fontan et.al. 2504.04457 link
2025-04-05 Nonlinear Observer Design for Landmark-Inertial Simultaneous Localization and Mapping Mouaad Boughellaba et.al. 2504.04239 null
2025-04-04 WildGS-SLAM: Monocular Gaussian Splatting SLAM in Dynamic Environments Jianhao Zheng et.al. 2504.03886 null
2025-04-03 SLACK: Attacking LiDAR-based SLAM with Adversarial Point Injections Prashant Kumar et.al. 2504.03089 null
2025-04-03 Multimodal Fusion and Vision-Language Models: A Survey for Robot Vision Xiaofeng Han et.al. 2504.02477 null
2025-04-03 MonoGS++: Fast and Accurate Monocular RGB Gaussian SLAM Renwu Li et.al. 2504.02437 null
2025-04-02 A Chefs KISS -- Utilizing semantic information in both ICP and SLAM framework Sven Ochs et.al. 2504.02086 null
2025-04-01 Semantic SLAM with Rolling-Shutter Cameras and Low-Precision INS in Outdoor Environments Yuchen Zhang et.al. 2504.01997 null
2025-04-02 Strengthening Multi-Robot Systems for SAR: Co-Designing Robotics and Communication Towards 6G Juan Bravo-Arrabal et.al. 2504.01940 null
2025-04-02 Dynamic Initialization for LiDAR-inertial SLAM Jie Xu et.al. 2504.01451 link
2025-04-02 ForestVO: Enhancing Visual Odometry in Forest Environments through ForestGlue Thomas Pritchard et.al. 2504.01261 link
2025-03-31 SuperEvent: Cross-Modal Learning of Event-based Keypoint Detection Yannick Burkhardt et.al. 2504.00139 null
2025-03-30 A Visual-Inertial Motion Prior SLAM for Dynamic Environments Weilong Sun et.al. 2503.23429 null
2025-03-30 AnyCam: Learning to Recover Camera Poses and Intrinsics from Casual Videos Felix Wimbauer et.al. 2503.23282 link
2025-03-27 HS-SLAM: Hybrid Representation with Structural Supervision for Improved Dense SLAM Ziren Gong et.al. 2503.21778 null
2025-03-27 STAMICS: Splat, Track And Map with Integrated Consistency and Semantics for Dense RGB-D SLAM Yongxu Wang et.al. 2503.21425 null
2025-03-25 Scene-agnostic Pose Regression for Visual Localization Junwei Zheng et.al. 2503.19543 null
2025-03-25 First Results on UAV-aided User Localization Using ToA and OpenAirInterface in 5G NR Omid Esrafilian et.al. 2503.19529 null
2025-03-25 MM-LINS: a Multi-Map LiDAR-Inertial System for Over-Degenerate Environments Yongxin Ma et.al. 2503.19506 link
2025-03-24 Cooperative Control of Multi-Quadrotors for Transporting Cable-Suspended Payloads: Obstacle-Aware Planning and Event-Based Nonlinear Model Predictive Control Tohid Kargar Tasooji et.al. 2503.19135 null
2025-03-24 GI-SLAM: Gaussian-Inertial SLAM Xulang Liu et.al. 2503.18275 null
2025-03-22 LightLoc: Learning Outdoor LiDAR Localization at Light Speed Wen Li et.al. 2503.17814 link
2025-03-21 Autonomous Exploration-Based Precise Mapping for Mobile Robots through Stepwise and Consistent Motions Muhua Zhang et.al. 2503.17005 null
2025-03-20 4D Gaussian Splatting SLAM Yanyan Li et.al. 2503.16710 null
2025-03-20 Speeding up design and making to reduce time-to-project and time-to-market: an AI-Enhanced approach in engineering education Giovanni Adorni et.al. 2503.16307 null
2025-03-20 Loop Closure from Two Views: Revisiting PGO for Scalable Trajectory Estimation through Monocular Priors Tian Yi Lim et.al. 2503.16275 null
2025-03-19 A Sigma Point-based Low Complexity Algorithm for Multipath-based SLAM in MIMO Systems Anna Masiero et.al. 2503.15286 null
2025-03-19 ChatStitch: Visualizing Through Structures via Surround-View Unsupervised Deep Image Stitching with Collaborative LLM-Agents Hao Liang et.al. 2503.14948 null
2025-03-18 3D Densification for Multi-Map Monocular VSLAM in Endoscopy X. Anadón et.al. 2503.14346 null
2025-03-18 GeoFlow-SLAM: A Robust Tightly-Coupled RGBD-Inertial Fusion SLAM for Dynamic Legged Robotics Tingyang Xiao et.al. 2503.14247 link
2025-03-18 A-SCoRe: Attention-based Scene Coordinate Regression for wide-ranging scenarios Huy-Hoang Bui et.al. 2503.13982 link
2025-03-17 Digital Beamforming Enhanced Radar Odometry Jingqi Jiang et.al. 2503.13252 link
2025-03-17 Dynamic-Dark SLAM: RGB-Thermal Cooperative Robot Vision Strategy for Multi-Person Tracking in Both Well-Lit and Low-Light Scenes Tatsuro Sakai et.al. 2503.12768 null
2025-03-16 KISS-SLAM: A Simple, Robust, and Accurate 3D LiDAR SLAM System With Enhanced Generalization Capabilities Tiziano Guadagnino et.al. 2503.12660 null
2025-03-16 Deblur Gaussian Splatting SLAM Francesco Girlanda et.al. 2503.12572 null
2025-03-16 M2UD: A Multi-model, Multi-scenario, Uneven-terrain Dataset for Ground Robot with Localization and Mapping Evaluation Yanpeng Jia et.al. 2503.12387 null
2025-03-13 OSMa-Bench: Evaluating Open Semantic Mapping Under Varying Lighting Conditions Maxim Popov et.al. 2503.10331 null
2025-03-12 Online Language Splatting Saimouli Katragadda et.al. 2503.09447 null
2025-03-12 MonoSLAM: Robust Monocular SLAM with Global Structure Optimization Bingzheng Jiang et.al. 2503.09296 null
2025-03-11 Keypoint Detection and Description for Raw Bayer Images Jiakai Lin et.al. 2503.08673 null
2025-03-11 GigaSLAM: Large-Scale Monocular SLAM with Hierarchical Gaussian Splats Kai Deng et.al. 2503.08071 link
2025-03-10 POp-GS: Next Best View in 3D-Gaussian Splatting with P-Optimality Joey Wilson et.al. 2503.07819 null
2025-03-08 HIPPO-MAT: Decentralized Task Allocation Using GraphSAGE and Multi-Agent Deep Reinforcement Learning Lavanya Ratnabala et.al. 2503.07662 null
2025-03-10 AirSwarm: Enabling Cost-Effective Multi-UAV Research with COTS drones Xiaowei Li et.al. 2503.06890 link
2025-03-08 InfoFusion Controller: Informed TRRT Star with Mutual Information based on Fusion of Pure Pursuit and MPC for Enhanced Path Planning Seongjun Choi et.al. 2503.06010 link
2025-03-07 THE-SEAN: A Heart Rate Variation-Inspired Temporally High-Order Event-Based Visual Odometry with Self-Supervised Spiking Event Accumulation Networks Chaoran Xiong et.al. 2503.05112 null
2025-03-07 Adaptive-LIO: Enhancing Robustness and Precision through Environmental Adaptation in LiDAR Inertial Odometry Chengwei Zhao et.al. 2503.05077 link
2025-03-06 MarsLGPR: Mars Rover Localization with Ground Penetrating Radar Anja Sheppard et.al. 2503.04944 null
2025-03-06 On the Connection Between Magnetic-Field Odometry Aided Inertial Navigation and Magnetic-Field SLAM Isaac Skog et.al. 2503.04286 null
2025-03-06 Geometry-Constrained Monocular Scale Estimation Using Semantic Segmentation for Dynamic Scenes Hui Zhang et.al. 2503.04235 null
2025-03-06 DVM-SLAM: Decentralized Visual Monocular Simultaneous Localization and Mapping for Multi-Agent Systems Joshua Bird et.al. 2503.04126 null
2025-03-05 Equivariant Filter Design for Range-only SLAM Yixiao Ge et.al. 2503.03973 null
2025-03-05 Direct Sparse Odometry with Continuous 3D Gaussian Maps for Indoor Environments Jie Deng et.al. 2503.03373 link
2025-03-05 OpenGV 2.0: Motion prior-assisted calibration and SLAM with vehicle-mounted surround-view systems Kun Huang et.al. 2503.03230 null
2025-03-05 Distributed Certifiably Correct Range-Aided SLAM Alexander Thoms et.al. 2503.03192 link
2025-03-04 Introspective Loop Closure for SLAM with 4D Imaging Radar Maximilian Hilger et.al. 2503.02383 null
2025-03-04 DQO-MAP: Dual Quadrics Multi-Object mapping with Gaussian Splatting Haoyuan Li et.al. 2503.02223 link
2025-03-03 Constraint-Based Modeling of Dynamic Entities in 3D Scene Graphs for Robust SLAM Marco Giberna et.al. 2503.02050 null
2025-03-03 vS-Graphs: Integrating Visual SLAM and Situational Graphs through Multi-level Scene Understanding Ali Tourani et.al. 2503.01783 null
2025-03-03 MUSt3R: Multi-view Network for Stereo 3D Reconstruction Yohann Cabon et.al. 2503.01661 link
2025-03-03 OpenGS-SLAM: Open-Set Dense Semantic SLAM with 3D Gaussian Splatting for Object-Level Scene Understanding Dianyi Yang et.al. 2503.01646 null
2025-03-03 MLINE-VINS: Robust Monocular Visual-Inertial SLAM With Flow Manhattan and Line Features Chao Ye et.al. 2503.01571 link
2025-03-03 AI-Driven Relocation Tracking in Dynamic Kitchen Environments Arash Nasr Esfahani et.al. 2503.01547 link
2025-03-03 Exo-ViHa: A Cross-Platform Exoskeleton System with Visual and Haptic Feedback for Efficient Dexterous Skill Learning Xintao Chao et.al. 2503.01543 null
2025-03-03 RUSSO: Robust Underwater SLAM with Sonar Optimization against Visual Degradation Shu Pan et.al. 2503.01434 null
2025-02-27 BEV-DWPVO: BEV-based Differentiable Weighted Procrustes for Low Scale-drift Monocular Visual Odometry on Ground Yufei Wei et.al. 2502.20078 null
2025-02-26 Increasing the Task Flexibility of Heavy-Duty Manipulators Using Visual 6D Pose Estimation of Objects Petri Mäkinen et.al. 2502.19169 null
2025-02-26 SLAM in the Dark: Self-Supervised Learning of Pose, Depth and Loop-Closure from Thermal Images Yangfan Xu et.al. 2502.18932 null
2025-02-25 S-Graphs 2.0 -- A Hierarchical-Semantic Optimization and Loop Closure for SLAM Hriday Bavle et.al. 2502.18044 link
2025-02-25 MegaLoc: One Retrieval to Place Them All Gabriele Berton et.al. 2502.17237 link
2025-02-24 SLABIM: A SLAM-BIM Coupled Dataset in HKUST Main Building Haoming Huang et.al. 2502.16856 link
2025-02-27 Orchestrating Joint Offloading and Scheduling for Low-Latency Edge SLAM Yao Zhang et.al. 2502.16495 null
2025-02-19 Slamming: Training a Speech Language Model on One GPU in a Day Gallil Maimon et.al. 2502.15814 link
2025-02-21 RGB-Only Gaussian Splatting SLAM for Unbounded Outdoor Scenes Sicheng Yu et.al. 2502.15633 null
2025-02-20 Hier-SLAM++: Neuro-Symbolic Semantic SLAM with a Hierarchically Categorical Gaussian Splatting Boying Li et.al. 2502.14931 null
2025-02-19 3D Gaussian Splatting aided Localization for Large and Complex Indoor-Environments Vincent Ress et.al. 2502.13803 null
2025-02-19 Active Illumination for Visual Ego-Motion Estimation in the Dark Francesco Crocetti et.al. 2502.13708 null
2025-02-17 From Gaming to Research: GTA V for Synthetic Data Generation for Robotics and Navigations Matteo Scucchia et.al. 2502.12303 null
2025-02-19 pySLAM: An Open-Source, Modular, and Extensible Framework for SLAM Luigi Freda et.al. 2502.11955 link
2025-02-17 Anti-Degeneracy Scheme for Lidar SLAM based on Particle Filter in Geometry Feature-Less Environments Yanbin Li et.al. 2502.11486 null
2025-02-16 GS-GVINS: A Tightly-integrated GNSS-Visual-Inertial Navigation System Augmented by 3D Gaussian Splatting Zelin Zhou et.al. 2502.10975 null
2025-02-19 MonoForce: Learnable Image-conditioned Physics Engine Ruslan Agishev et.al. 2502.10156 link
2025-02-13 Vision-based Geo-Localization of Future Mars Rotorcraft in Challenging Illumination Conditions Dario Pisanti et.al. 2502.09795 null
2025-02-13 DenseSplat: Densifying Gaussian Splatting SLAM with Neural Radiance Prior Mingrui Li et.al. 2502.09111 null
2025-02-12 LIR-LIVO: A Lightweight, Robust LiDAR/Vision/Inertial Odometry with Illumination-Resilient Deep Features Shujie Zhou et.al. 2502.08676 link
2025-02-10 Occupancy-SLAM: An Efficient and Robust Algorithm for Simultaneously Optimizing Robot Poses and Occupancy Map Yingyu Wang et.al. 2502.06292 link
2025-02-09 PINGS: Gaussian Splatting Meets Distance Fields within a Point-Based Implicit Neural Map Yue Pan et.al. 2502.05752 link
2025-02-07 Joint State and Noise Covariance Estimation Kasra Khosoussi et.al. 2502.04584 null
2025-02-05 GARAD-SLAM: 3D GAussian splatting for Real-time Anti Dynamic SLAM Mingrui Li et.al. 2502.03228 null
2025-02-04 SiLVR: Scalable Lidar-Visual Radiance Field Reconstruction with Uncertainty Quantification Yifu Tao et.al. 2502.02657 null
2025-02-04 HeRCULES: Heterogeneous Radar Dataset in Complex Urban Environment for Multi-session Radar SLAM Hanjun Kim et.al. 2502.01946 null
2025-02-03 Statistical enhance learning for modeling and prediction tennis matches at Grand Slam tournaments Nourah Buhamra et.al. 2502.01613 null
2025-02-03 Enhancing Feature Tracking Reliability for Visual Navigation using Real-Time Safety Filter Dabin Kim et.al. 2502.01092 null
2025-02-01 FlexCloud: Direct, Modular Georeferencing and Drift-Correction of Point Cloud Maps Maximilian Leitenstern et.al. 2502.00395 link
2025-01-31 LiDAR Loop Closure Detection using Semantic Graphs with Graph Attention Networks Liudi Yang et.al. 2501.19382 link
2025-01-31 Advancing Dense Endoscopic Reconstruction with Gaussian Splatting-driven Surface Normal-aware Tracking and Mapping Yiming Huang et.al. 2501.19319 link
2025-01-31 GO: The Great Outdoors Multimodal Dataset Peng Jiang et.al. 2501.19274 null
2025-01-30 Lifelong 3D Mapping Framework for Hand-held & Robot-mounted LiDAR Mapping Systems Liudi Yang et.al. 2501.18110 null
2025-01-28 SSF-PAN: Semantic Scene Flow-Based Perception for Autonomous Navigation in Traffic Scenarios Yinqi Chen et.al. 2501.16754 null
2025-01-27 Visual-Lidar Map Alignment for Infrastructure Inspections Jake McLaughlin et.al. 2501.14486 link
2025-01-24 Scalable Benchmarking and Robust Learning for Noise-Free Ego-Motion and 3D Reconstruction from Noisy Video Xiaohao Xu et.al. 2501.14319 link
2025-01-24 HAMMER: Heterogeneous, Multi-Robot Semantic Gaussian Splatting Javier Yu et.al. 2501.14147 null
2025-01-23 FAST-LIVO2 on Resource-Constrained Platforms: LiDAR-Inertial-Visual Odometry with Efficient Memory and Computation Bingyang Zhou et.al. 2501.13876 null
2025-01-23 VIGS SLAM: IMU-based Large-Scale 3D Gaussian Splatting SLAM Gyuhyeon Pak et.al. 2501.13402 null
2025-01-22 Grid-based Submap Joining: An Efficient Algorithm for Simultaneously Optimizing Global Occupancy Map and Local Submap Frames Yingyu Wang et.al. 2501.12764 null
2025-01-21 DynoSAM: Open-Source Smoothing and Mapping Framework for Dynamic SLAM Jesse Morris et.al. 2501.11893 link
2025-01-21 Survey on Monocular Metric Depth Estimation Jiuling Zhang et.al. 2501.11841 null
2025-01-19 OpenLiDARMap: Zero-Drift Point Cloud Mapping using Map Priors Dominik Kulmer et.al. 2501.11111 link
2025-01-19 Factor Graph-Based Active SLAM for Spacecraft Proximity Operations Lorenzo Ticozzi et.al. 2501.10950 null
2025-01-23 Mesh2SLAM in VR: A Fast Geometry-Based SLAM Framework for Rapid Prototyping in Virtual Reality Applications Carlos Augusto Pinheiro de Sousa et.al. 2501.09600 null
2025-01-16 Comparison of Various SLAM Systems for Mobile Robot in an Indoor Environment Maksim Filipenko et.al. 2501.09490 null
2025-01-15 Unified Few-shot Crack Segmentation and its Precise 3D Automatic Measurement in Concrete Structures Pengru Deng et.al. 2501.09203 null
2025-01-15 AutoLoop: Fast Visual SLAM Fine-tuning through Agentic Curriculum Learning Assaf Lahiany et.al. 2501.09160 null
2025-01-15 SLC$^2$-SLAM: Semantic-guided Loop Closure with Shared Latent Code for NeRF SLAM Yuhang Ming et.al. 2501.08880 null
2025-01-15 GS-LIVO: Real-Time LiDAR, Inertial, and Visual Multi-sensor Fused Odometry with Gaussian Mapping Sheng Hong et.al. 2501.08672 null
2025-01-16 BRIGHT-VO: Brightness-Guided Hybrid Transformer for Visual Odometry with Multi-modality Refinement Module Dongzhihan Wang et.al. 2501.08659 null
2025-01-15 Self-Organizing Edge Computing Distribution Framework for Visual SLAM Jussi Kalliola et.al. 2501.08629 null
2025-01-14 VINGS-Mono: Visual-Inertial Gaussian Splatting Monocular SLAM in Large Scenes Ke Wu et.al. 2501.08286 null
2025-01-13 Efficiently Closing Loops in LiDAR-Based SLAM Using Point Cloud Density Maps Saurabh Gupta et.al. 2501.07399 null
2025-01-13 SplatMAP: Online Dense Monocular SLAM with 3D Gaussian Splatting Yue Hu et.al. 2501.07015 null
2025-01-12 CULTURE3D: Cultural Landmarks and Terrain Dataset for 3D Applications Xinyi Zheng et.al. 2501.06927 link
2025-01-11 SP-SLAM: Neural Real-Time Dense SLAM With Scene Priors Zhen Hong et.al. 2501.06469 null
2025-01-09 Scaffold-SLAM: Structured 3D Gaussians for Simultaneous Localization and Photorealistic Mapping Wen Tianci et.al. 2501.05242 null
2025-01-07 SLAM: Towards Efficient Multilingual Reasoning via Selective Language Alignment Yuchun Fan et.al. 2501.03681 link
2025-01-06 HaWoR: World-Space Hand Motion Reconstruction from Egocentric Videos Jinglei Zhang et.al. 2501.02973 null
2025-01-09 LP-ICP: General Localizability-Aware Point Cloud Registration for Robust Localization in Extreme Unstructured Environments Haosong Yue et.al. 2501.02580 link
2025-01-04 ROLO-SLAM: Rotation-Optimized LiDAR-Only SLAM in Uneven Terrain with Ground Vehicle Yinchuan Wang et.al. 2501.02166 link
2024-12-31 PanoSLAM: Panoptic 3D Scene Reconstruction via Gaussian SLAM Runnan Chen et.al. 2501.00352 null
2024-12-30 Hierarchical Pose Estimation and Mapping with Multi-Scale Neural Feature Fields Evgenii Kruzhkov et.al. 2412.20976 null
2024-12-28 MambaVO: Deep Visual Odometry Based on Sequential Matching Refinement and Training Smoothing Shuo Wang et.al. 2412.20082 null
2024-12-27 DAS3R: Dynamics-Aware Gaussian Splatting for Static Scene Reconstruction Kai Xu et.al. 2412.19584 null
2024-12-26 MVS-GS: High-Quality 3D Gaussian Splatting Mapping via Online Multi-View Stereo Byeonggwon Lee et.al. 2412.19130 null
2024-12-23 End-to-end Generative Spatial-Temporal Ultrasonic Odometry and Mapping Framework Fuhua Jia et.al. 2412.17343 null
2024-12-23 LMD-PGN: Cross-Modal Knowledge Distillation from First-Person-View Images to Third-Person-View BEV Maps for Universal Point Goal Navigation Riku Uemura et.al. 2412.17282 null
2024-12-23 Selective Kalman Filter: When and How to Fuse Multi-Sensor Information to Overcome Degeneracy in SLAM Jie Xu et.al. 2412.17235 null
2025-01-03 Leveraging Consistent Spatio-Temporal Correspondence for Robust Visual Odometry Zhaoxing Zhang et.al. 2412.16923 link
2024-12-21 Query Quantized Neural SLAM Sijia Jiang et.al. 2412.16476 link
2024-12-20 SLAM-Omni: Timbre-Controllable Voice Interaction System with Single-Stage Training Wenxi Chen et.al. 2412.15649 link
2024-12-18 Energy-Efficient SLAM via Joint Design of Sensing, Communication, and Exploration Speed Zidong Han et.al. 2412.13912 null
2024-12-18 Immersive Human-in-the-Loop Control: Real-Time 3D Surface Meshing and Physics Simulation Sait Akturk et.al. 2412.13752 null
2024-12-18 4D Radar-Inertial Odometry based on Gaussian Modeling and Multi-Hypothesis Scan Matching Fernando Amodeo et.al. 2412.13639 link
2024-12-17 NFL-BA: Improving Endoscopic SLAM with Near-Field Light Bundle Adjustment Andrea Dunn Beltran et.al. 2412.13176 null
2024-12-18 Dyn-HaMR: Recovering 4D Interacting Hand Motion from a Dynamic Camera Zhengdi Yu et.al. 2412.12861 null
2024-12-16 Global SLAM in Visual-Inertial Systems with 5G Time-of-Arrival Integration Meisam Kabiri et.al. 2412.12406 null
2024-12-16 MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction Priors Riku Murai et.al. 2412.12392 null
2024-12-16 Sonar-based Deep Learning in Underwater Robotics: Overview, Robustness and Challenges Martin Aubard et.al. 2412.11840 null
2024-12-19 RoMeO: Robust Metric Visual Odometry Junda Cheng et.al. 2412.11530 null
2024-12-14 Affine EKF: Exploring and Utilizing Sufficient and Necessary Conditions for Observability Maintenance to Improve EKF Consistency Yang Song et.al. 2412.10809 link
2024-12-13 RP-SLAM: Real-time Photorealistic SLAM with Efficient 3D Gaussian Splatting Lizhi Bai et.al. 2412.09868 null
2024-12-12 SLAM3R: Real-Time Dense Scene Reconstruction from Monocular RGB Videos Yuzheng Liu et.al. 2412.09401 link
2024-12-12 eCARLA-scenes: A synthetically generated dataset for event-based optical flow prediction Jad Mansour et.al. 2412.09209 link
2024-12-12 Drift-free Visual SLAM using Digital Twins Roxane Merat et.al. 2412.08496 null
2024-12-10 A Real-time Degeneracy Sensing and Compensation Method for Enhanced LiDAR SLAM Zongbo Liao et.al. 2412.07513 null
2024-12-08 DiTer++: Diverse Terrain and Multi-modal Dataset for Multi-Robot SLAM in Multi-session Environments Juwon Kim et.al. 2412.05839 null
2024-12-06 MegaSaM: Accurate, Fast, and Robust Structure and Motion from Casual Dynamic Videos Zhengqi Li et.al. 2412.04463 null
2024-12-05 Multi-cam Multi-map Visual Inertial Localization: System, Validation and Dataset Fuzhang Han et.al. 2412.04287 link
2024-12-10 MOANA: Multi-Radar Dataset for Maritime Odometry and Autonomous Navigation Application Hyesu Jang et.al. 2412.03887 null
2024-12-04 Large-Scale Dense 3D Mapping Using Submaps Derived From Orthogonal Imaging Sonars John McConnell et.al. 2412.03760 null
2024-12-04 BIMCaP: BIM-based AI-supported LiDAR-Camera Pose Refinement Miguel Arturo Vega Torres et.al. 2412.03434 link
2024-12-04 NeRF and Gaussian Splatting SLAM in the Wild Fabian Schmidt et.al. 2412.03263 link
2024-12-04 MCVO: A Generic Visual Odometry for Arbitrarily Arranged Multi-Cameras Huai Yu et.al. 2412.03146 link
2024-12-04 An indoor DSO-based ceiling-vision odometry system for indoor industrial environments Abdelhak Bougouffa et.al. 2412.02950 null
2024-12-03 ROVER: A Multi-Season Dataset for Visual SLAM Fabian Schmidt et.al. 2412.02506 link
2024-12-04 RGBDS-SLAM: A RGB-D Semantic Dense SLAM Based on 3D Multi Level Pyramid Gaussian Splatting Zhenzhong Cao et.al. 2412.01217 link
2024-11-28 Visual SLAMMOT Considering Multiple Motion Models Peilin Tian et.al. 2411.19134 null
2024-11-27 ORB-SLAM3AB: Augmenting ORB-SLAM3 to Counteract Bumps with Optical Flow Inter-frame Matching Yangrui Dong et.al. 2411.18174 null
2024-11-27 HI-SLAM2: Geometry-Aware Gaussian SLAM for Fast Monocular Scene Reconstruction Wei Zhang et.al. 2411.17982 link
2024-11-26 MapEval: Towards Unified, Robust and Efficient SLAM Map Evaluation Framework Xiangcheng Hu et.al. 2411.17928 link
2024-11-29 DROID-Splat: Combining end-to-end SLAM with 3D Gaussian Splatting Christian Homeyer et.al. 2411.17660 link
2024-11-25 MAGiC-SLAM: Multi-Agent Gaussian Globally Consistent SLAM Vladimir Yugay et.al. 2411.16785 null
2024-11-24 Gaussian Scenes: Pose-Free Sparse-View Scene Reconstruction using Depth-Enhanced Diffusion Priors Soumava Paul et.al. 2411.15966 null
2024-11-24 Near-Range Environmental Perception for Inland Waterway Vessels: A Comparative Study of LiDAR and Automotive FMCW RADAR Sensors R. Herrmann et.al. 2411.15901 null
2024-11-24 PG-SLAM: Photo-realistic and Geometry-aware RGB-D SLAM in Dynamic Environments Haoang Li et.al. 2411.15800 null
2024-11-23 Gassidy: Gaussian Splatting SLAM in Dynamic Environments Long Wen et.al. 2411.15476 null
2024-11-22 OVO-SLAM: Open-Vocabulary Online Simultaneous Localization and Mapping Tomas Berriel Martins et.al. 2411.15043 link
2024-11-22 A Benchmark Dataset for Collaborative SLAM in Service Environments Harin Park et.al. 2411.14775 link
2024-11-21 InCrowd-VI: A Realistic Visual-Inertial Dataset for Evaluating SLAM in Indoor Pedestrian-Rich Spaces for Human Navigation Marziyeh Bamdad et.al. 2411.14358 link
2024-11-20 Robust Monocular Visual Odometry using Curriculum Learning Assaf Lahiany et.al. 2411.13438 null
2024-11-20 Moving Horizon Estimation for Simultaneous Localization and Mapping with Robust Estimation Error Bounds Jelena Trisovic et.al. 2411.13310 null
2024-11-19 3D Reconstruction by Looking: Instantaneous Blind Spot Detector for Indoor SLAM through Mixed Reality Hanbeom Chang et.al. 2411.12514 null
2024-11-19 LiV-GS: LiDAR-Vision Integration for 3D Gaussian Splatting SLAM in Outdoor Environments Renxiang Xiao et.al. 2411.12185 null
2024-11-18 Exploring Emerging Trends and Research Opportunities in Visual Place Recognition Antonios Gasteratos et.al. 2411.11481 null
2024-11-18 The Blue Horizontal-Branch Stars From the LAMOST Survey: Atmospheric Parameters Jie Ju et.al. 2411.11250 null
2024-11-17 A Monocular SLAM-based Multi-User Positioning System with Image Occlusion in Augmented Reality Wei-Hsiang Lien et.al. 2411.10940 null
2024-11-16 DGS-SLAM: Gaussian Splatting SLAM in Dynamic Environment Mangyu Kong et.al. 2411.10722 link
2024-11-15 The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual Localisation, Reconstruction and Radiance Field Methods Yifu Tao et.al. 2411.10546 null
2024-11-15 BEV-ODOM: Reducing Scale Drift in Monocular Visual Odometry with BEV Representation Yufei Wei et.al. 2411.10195 null
2024-11-13 DG-SLAM: Robust Dynamic Gaussian Splatting SLAM with Hybrid Pose Optimization Yueming Xu et.al. 2411.08373 null
2024-11-13 MBA-SLAM: Motion Blur Aware Dense Visual SLAM with Radiance Fields Representation Peng Wang et.al. 2411.08279 link
2024-11-12 Enhanced Monocular Visual Odometry with AR Poses and Integrated INS-GPS for Robust Localization in Urban Environments Ankit Shaw et.al. 2411.08231 null
2024-11-12 NL-SLAM for OC-VLN: Natural Language Grounded SLAM for Object-Centric VLN Sonia Raychaudhuri et.al. 2411.07848 null
2024-11-11 Lost in Tracking Translation: A Comprehensive Analysis of Visual SLAM in Human-Centered XR and IoT Ecosystems Yasra Chandio et.al. 2411.07146 null
2024-11-11 Learning from Feedback: Semantic Enhancement for Object SLAM Using Foundation Models Jungseok Hong et.al. 2411.06752 null
2024-11-11 HomoMatcher: Dense Feature Matching Results with Semi-Dense Efficiency by Homography Estimation Xiaolong Wang et.al. 2411.06700 null
2024-11-08 Development of an indoor localization and navigation system based on monocular SLAM for mobile robots Thanh Nguyen Canh et.al. 2411.05337 null
2024-11-07 Development of a Service Robot for Hospital Environments in Rehabilitation Medicine with LiDAR Based Simultaneous Localization and Mapping Sayat Ibrayev et.al. 2411.04797 null
2024-11-07 MPVO: Motion-Prior based Visual Odometry for PointGoal Navigation Sayan Paul et.al. 2411.04796 null
2024-11-09 DEIO: Deep Event Inertial Odometry Weipeng Guan et.al. 2411.03928 link
2024-11-06 Performance evaluation of SLAM-ASR: The Good, the Bad, the Ugly, and the Way Forward Shashi Kumar et.al. 2411.03866 null
2024-11-06 LCP-Fusion: A Neural Implicit SLAM with Enhanced Local Constraints and Computable Prior Jiahui Wang et.al. 2411.03610 link
2024-11-05 LVI-GS: Tightly-coupled LiDAR-Visual-Inertial SLAM using 3D Gaussian Splatting Huibin Zhao et.al. 2411.02703 null
2024-11-04 Map++: Towards User-Participatory Visual SLAM Systems with Efficient Map Expansion and Sharing Xinran Zhang et.al. 2411.02553 null
2024-11-04 Semantic Masking and Visual Feature Matching for Robust Localization Luisa Mao et.al. 2411.01804 null
2024-10-31 XRDSLAM: A Flexible and Modular Framework for Deep Learning based SLAM Xiaomeng Wang et.al. 2410.23690 link
2024-10-30 LGU-SLAM: Learnable Gaussian Uncertainty Matching with Deformable Correlation Sampling for Deep Visual SLAM Yucheng Huang et.al. 2410.23231 link
2024-10-30 ISAC Prototype System for Multi-Domain Cooperative Communication Networks Jie Yang et.al. 2410.22956 null
2024-10-30 SCRREAM: SCan, Register, REnder And Map: A Framework for Annotating Accurate and Dense 3D Indoor Scenes with a Benchmark HyunJun Jung et.al. 2410.22715 link
2024-10-29 LiVisSfM: Accurate and Robust Structure-from-Motion with LiDAR and Visual Cues Hanqing Jiang et.al. 2410.22213 null
2024-10-29 EnvoDat: A Large-Scale Multisensory Dataset for Robotic Spatial Awareness and Semantic Reasoning in Heterogeneous Environments Linus Nwankwo et.al. 2410.22200 null
2024-10-28 NYC-Event-VPR: A Large-Scale High-Resolution Event-Based Visual Place Recognition Dataset in Dense Urban Environments Taiyi Pan et.al. 2410.21615 link
2024-10-28 coVoxSLAM: GPU Accelerated Globally Consistent Dense SLAM Emiliano Höss et.al. 2410.21149 link
2024-11-01 RopeTP: Global Human Motion Recovery via Integrating Robust Pose Estimation with Diffusion Trajectory Prior Mingjiang Liang et.al. 2410.20358 null
2024-10-25 Context-Based Visual-Language Place Recognition Soojin Woo et.al. 2410.19341 link
2024-10-22 AG-SLAM: Active Gaussian Splatting SLAM Wen Jiang et.al. 2410.17422 null
2024-10-22 Impact of 3D LiDAR Resolution in Graph-based SLAM Approaches: A Comparative Study J. Jorge et.al. 2410.17171 null
2024-10-19 EndoMetric: Near-light metric scale monocular SLAM Raúl Iranzo et.al. 2410.15065 null
2024-10-17 Automatic Navigation and Voice Cloning Technology Deployment on a Humanoid Robot Dongkun Han et.al. 2410.13612 null
2024-10-17 TRLO: An Efficient LiDAR Odometry with 3D Dynamic Object Tracking and Removal Yanpeng Jia et.al. 2410.13240 null
2024-10-16 QueensCAMP: an RGB-D dataset for robust Visual SLAM Hudson M. S. Bruno et.al. 2410.12520 link
2024-10-18 PAPL-SLAM: Principal Axis-Anchored Monocular Point-Line SLAM Guanghao Li et.al. 2410.12324 null
2024-10-16 Towards Autonomous Indoor Parking: A Globally Consistent Semantic SLAM System and A Semantic Localization Subsystem Yichen Sha et.al. 2410.12169 null
2024-10-15 V3D-SLAM: Robust RGB-D SLAM in Dynamic Environments with 3D Semantic Geometry Voting Tuan Dang et.al. 2410.12068 link
2024-10-15 GSORB-SLAM: Gaussian Splatting SLAM benefits from ORB features and Transmittance information Wancai Zheng et.al. 2410.11356 null
2024-10-15 Multiview Scene Graph Juexiao Zhang et.al. 2410.11187 link
2024-10-14 MLP-SLAM: Multilayer Perceptron-Based Simultaneous Localization and Mapping With a Dynamic and Static Object Discriminator Taozhe Li et.al. 2410.10669 null
2024-10-13 Markerless Aerial-Terrestrial Co-Registration of Forest Point Clouds using a Deformable Pose Graph Benoit Casseau et.al. 2410.09896 null
2024-10-12 SLAM-AAC: Enhancing Audio Captioning with Paraphrasing Augmentation and CLAP-Refine through LLMs Wenxi Chen et.al. 2410.09503 link
2024-10-12 An Expeditious Spatial Mean Radiant Temperature Mapping Framework using Visual SLAM and Semantic Segmentation Wei Liang et.al. 2410.09443 null
2024-10-12 ESVO2: Direct Visual-Inertial Odometry with Stereo Event Cameras Junkai Niu et.al. 2410.09374 link
2024-10-11 Voxel-SLAM: A Complete, Accurate, and Versatile LiDAR-Inertial SLAM System Zheng Liu et.al. 2410.08935 link
2024-10-11 Optimizing NeRF-based SLAM with Trajectory Smoothness Constraints Yicheng He et.al. 2410.08780 null
2024-10-10 ROMAN: Open-Set Object Map Alignment for Robust View-Invariant Global Localization Mason B. Peterson et.al. 2410.08262 link
2024-10-10 IncEventGS: Pose-Free Gaussian Splatting from a Single Event Camera Jian Huang et.al. 2410.08107 link
2024-10-08 Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching Gongxin Yao et.al. 2410.06285 null
2024-10-08 Submodular Optimization for Keyframe Selection & Usage in SLAM David Thorne et.al. 2410.05576 null
2024-10-07 SharpSLAM: 3D Object-Oriented Visual SLAM with Deblurring for Agile Drones Denis Davletshin et.al. 2410.05405 null
2024-10-07 Enhanced Multi-Robot SLAM System with Cross-Validation Matching and Exponential Threshold Keyframe Selection Ang He et.al. 2410.05017 null
2024-10-05 A Framework for Reproducible Benchmarking and Performance Diagnosis of SLAM Systems Nikola Radulov et.al. 2410.04242 link
2024-10-05 High-Speed Stereo Visual SLAM for Low-Powered Computing Devices Ashish Kumar et.al. 2410.04090 link
2024-10-04 EvenNICER-SLAM: Event-based Neural Implicit Encoding SLAM Shi Chen et.al. 2410.03812 null
2024-10-04 Estimating Body and Hand Motion in an Ego-sensed World Brent Yi et.al. 2410.03665 null
2024-10-03 LiDAR Inertial Odometry And Mapping Using Learned Registration-Relevant Features Zihao Dong et.al. 2410.02961 null
2024-10-02 ReFeree: Radar-Based Lightweight and Robust Localization using Feature and Free space Hogyun Kim et.al. 2410.01325 null
2024-10-01 Under Pressure: Altimeter-Aided ICP for 3D Maps Consistency William Dubois et.al. 2410.00758 null
2024-10-02 CaRtGS: Computational Alignment for Real-Time Gaussian Splatting SLAM Dapeng Feng et.al. 2410.00486 link
2024-09-30 Additively Manufactured Open-Source Quadruped Robots for Multi-Robot SLAM Applications Zachary Fuge et.al. 2410.00122 null
2024-09-30 Direct Multipath-Based SLAM Mingchao Liang et.al. 2409.20552 null
2024-09-30 Robust Gaussian Splatting SLAM by Leveraging Loop Closure Zunjie Zhu et.al. 2409.20111 null
2024-09-30 DynORecon: Dynamic Object Reconstruction for Navigation Yiduo Wang et.al. 2409.19928 null
2024-09-29 CELLmap: Enhancing LiDAR SLAM through Elastic and Lightweight Spherical Map Representation Yifan Duan et.al. 2409.19597 null
2024-09-29 CoT-ST: Enhancing LLM-based Speech Translation with Multimodal Chain-of-Thought Yexing Du et.al. 2409.19510 link
2024-09-29 Fast-UMI: A Scalable and Hardware-Independent Universal Manipulation Interface Ziniu Wu et.al. 2409.19499 null
2024-09-27 Royal Reveals: LiDAR Mapping of Kronborg Castle, Echoes of Hamlet's Halls Leon Davies et.al. 2409.18752 null
2024-09-26 BlinkTrack: Feature Tracking over 100 FPS via Events and Images Yichen Shen et.al. 2409.17981 null
2024-09-26 Neural Implicit Representation for Highly Dynamic LiDAR Mapping and Odometry Qi Zhang et.al. 2409.17729 null
2024-09-26 Event-based Stereo Depth Estimation: A Survey Suman Ghosh et.al. 2409.17680 null
2024-09-25 Efficient Submap-based Autonomous MAV Exploration using Visual-Inertial SLAM Configurable for LiDARs or Depth Cameras Sotiris Papatheodorou et.al. 2409.16972 null
2024-09-25 Go-SLAM: Grounded Object Segmentation and Localization with Gaussian Splatting SLAM Phu Pham et.al. 2409.16944 null
2024-09-25 Inline Photometrically Calibrated Hybrid Visual SLAM Nicolas Abboud et.al. 2409.16810 link
2024-09-25 Topological SLAM in colonoscopies leveraging deep features and topological priors Javier Morlana et.al. 2409.16806 link
2024-09-25 Robo-Platform: A Robotic System for Recording Sensors and Controlling Robots Masoud Dayani Najafabadi et.al. 2409.16595 link
2024-09-25 Task-driven SLAM Benchmarking Yanwei Du et.al. 2409.16573 link
2024-09-24 SoMaSLAM: 2D Graph SLAM for Sparse Range Sensing with Soft Manhattan World Constraints Jeahn Han et.al. 2409.15736 null
2024-09-23 Spectral Graph Theoretic Methods for Enhancing Network Robustness in Robot Localization Neelkamal Somisetty et.al. 2409.15506 null
2024-09-22 SPAQ-DL-SLAM: Towards Optimizing Deep Learning-based SLAM for Resource-Constrained Embedded Platforms Niraj Pudasaini et.al. 2409.14515 null
2024-09-21 Point Cloud Structural Similarity-based Underwater Sonar Loop Detection Donghwi Jung et.al. 2409.14020 link
2024-09-20 HMD$^2$: Environment-aware Motion Generation from Single Egocentric Head-Mounted Device Vladimir Guzov et.al. 2409.13426 null
2024-09-20 Learning Visual Information Utility with PIXER Yash Turkar et.al. 2409.13151 null
2024-09-19 MGSO: Monocular Real-time Photometric SLAM with Efficient 3D Gaussian Splatting Yan Song Hu et.al. 2409.13055 null
2024-09-19 Hi-SLAM: Scaling-up Semantics in SLAM with a Hierarchically Categorical Gaussian Splatting Boying Li et.al. 2409.12518 link
2024-09-18 Bundle Adjustment in the Eager Mode Zitong Zhan et.al. 2409.12190 null
2024-09-23 Uncertainty-Aware Visual-Inertial SLAM with Volumetric Occupancy Mapping Jaehyung Jung et.al. 2409.12051 null
2024-09-18 Metric-Semantic Factor Graph Generation based on Graph Neural Networks Jose Andres Millan-Romera et.al. 2409.11972 null
2024-09-18 Physically-Based Photometric Bundle Adjustment in Non-Lambertian Environments Lei Cheng et.al. 2409.11854 null
2024-09-18 ORB-SfMLearner: ORB-Guided Self-supervised Visual Odometry with Selective Online Adaptation Yanlin Jin et.al. 2409.11692 null
2024-09-18 SLAM assisted 3D tracking system for laparoscopic surgery Jingwei Song et.al. 2409.11688 null
2024-09-17 GLC-SLAM: Gaussian Splatting SLAM with Efficient Loop Closure Ziheng Xu et.al. 2409.10982 null
2024-09-17 Label-free correlative morpho-chemical tomography of 3D kidney mesangial cells Ankit Butola et.al. 2409.10971 null
2024-09-17 Evaluating and Improving the Robustness of LiDAR-based Localization and Mapping Bo Yang et.al. 2409.10824 link
2024-09-16 P2U-SLAM: A Monocular Wide-FoV SLAM System Based on Point Uncertainty and Pose Uncertainty Yufan Zhang et.al. 2409.10143 link
2024-09-16 SHIRE: Enhancing Sample Efficiency using Human Intuition in REinforcement Learning Amogh Joshi et.al. 2409.09990 null
2024-09-16 Enhancing Visual Inertial SLAM with Magnetic Measurements Bharat Joshi et.al. 2409.09904 null
2024-09-15 Marginalizing and Conditioning Gaussians onto Linear Approximations of Smooth Manifolds with Applications in Robotics Zi Cong Guo et.al. 2409.09871 link
2024-09-15 Range-SLAM: Ultra-Wideband-Based Smoke-Resistant Real-Time Localization and Mapping Yi Liu et.al. 2409.09763 null
2024-09-15 High Definition Map Mapping and Update: A General Overview and Future Directions Benny Wijaya et.al. 2409.09726 null
2024-09-14 MAC-VO: Metrics-aware Covariance for Learning-based Stereo Visual Odometry Yuheng Qiu et.al. 2409.09479 null
2024-09-14 Distributed Invariant Kalman Filter for Object-level Multi-robot Pose SLAM Haoying Li et.al. 2409.09410 null
2024-09-14 GEVO: Memory-Efficient Monocular Visual Odometry Using Gaussians Dasong Gao et.al. 2409.09295 link
2024-09-14 Panoramic Direct LiDAR-assisted Visual Odometry Zikang Yuan et.al. 2409.09287 link
2024-09-11 Object Depth and Size Estimation using Stereo-vision and Integration with SLAM Layth Hamad et.al. 2409.07623 null
2024-09-11 Equivariant Filter for Tightly Coupled LiDAR-Inertial Odometry Anbo Tao et.al. 2409.06948 null
2024-09-10 Technical Report of Mobile Manipulator Robot for Industrial Environments Erfan Amoozad Khalili et.al. 2409.06693 null
2024-09-10 Heterogeneous LiDAR Dataset for Benchmarking Robust Localization in Diverse Degenerate Scenarios Zhiqiang Chen et.al. 2409.04961 link
2024-09-08 FLAF: Focal Line and Feature-constrained Active View Planning for Visual Teach and Repeat Changfei Fu et.al. 2409.03457 null
2024-09-03 Integration of Augmented Reality and Mobile Robot Indoor SLAM for Enhanced Spatial Awareness Michael D. Friske et.al. 2409.01915 null
2024-09-03 Explicit Second-order LiDAR Bundle Adjustment Algorithm Using Mean Squared Group Metric Tingchen Ma et.al. 2409.01856 null
2024-09-02 Saying goodbyes to rotating your phone: Magnetometer calibration during SLAM Ilari Vallivaara et.al. 2409.01242 null
2024-09-02 Online One-Dimensional Magnetic Field SLAM with Loop-Closure Detection Manon Kok et.al. 2409.01091 null
2024-09-02 Robust Vehicle Localization and Tracking in Rain using Street Maps Yu Xiang Tan et.al. 2409.01038 link
2024-08-31 UDGS-SLAM: UniDepth Assisted Gaussian Splatting for Monocular SLAM Mostafa Mansour et.al. 2409.00362 null
2024-09-04 Augmented Reality without Borders: Achieving Precise Localization Without Maps Albert Gassol Puigjaner et.al. 2408.17373 null
2024-08-30 Efficient Camera Exposure Control for Visual Odometry via Deep Reinforcement Learning Shuyang Zhang et.al. 2408.17005 link
2024-08-29 Creating a Segmented Pointcloud of Grapevines by Combining Multiple Viewpoints Through Visual Odometry Michael Adlerstein et.al. 2408.16472 null
2024-08-28 Single-Photon 3D Imaging with Equi-Depth Photon Histograms Kaustubh Sadekar et.al. 2408.16150 null
2024-08-28 BIM-SLAM: Integrating BIM Models in Multi-session SLAM for Lifelong Mapping using 3D LiDAR Miguel Arturo Vega Torres et.al. 2408.15870 link
2024-08-30 Addressing the challenges of loop detection in agricultural environments Nicolás Soncini et.al. 2408.15761 link
2024-08-28 ES-PTAM: Event-based Stereo Parallel Tracking and Mapping Suman Ghosh et.al. 2408.15605 link
2024-08-28 PointEMRay: A Novel Efficient SBR Framework on Point Based Geometry Kaiqiao Yang et.al. 2408.15583 null
2024-09-02 Active Semantic Mapping and Pose Graph Spectral Analysis for Robot Exploration Rongge Zhang et.al. 2408.14726 link
2024-08-26 A Survey on Reinforcement Learning Applications in SLAM Mohammad Dehghani Tezerjani et.al. 2408.14518 null
2024-08-28 FAST-LIVO2: Fast, Direct LiDAR-Inertial-Visual Odometry Chunran Zheng et.al. 2408.14035 link
2024-08-21 Informed, Constrained, Aligned: A Field Analysis on Degeneracy-aware Point Cloud Registration in the Wild Turcan Tuna et.al. 2408.11809 null
2024-08-21 LiFCal: Online Light Field Camera Calibration via Bundle Adjustment Aymeric Fleith et.al. 2408.11682 null
2024-08-21 Enhanced Visual SLAM for Collision-free Driving with Lightweight Autonomous Cars Zhihao Lin et.al. 2408.11582 null
2024-08-21 RaNDT SLAM: Radar SLAM Based on Intensity-Augmented Normal Distributions Transform Maximilian Hilger et.al. 2408.11576 link
2024-08-21 Reflex-Based Open-Vocabulary Navigation without Prior Knowledge Using Omnidirectional Camera and Multiple Vision-Language Models Kento Kawaharazuka et.al. 2408.11380 null
2024-08-20 LoopSplat: Loop Closure by Registering 3D Gaussian Splats Liyuan Zhu et.al. 2408.10154 link
2024-08-19 Quantitative 3D Map Accuracy Evaluation Hardware and Algorithm for LiDAR(-Inertial) SLAM Sanghyun Hahn et.al. 2408.09727 link
2024-08-17 GSLAMOT: A Tracklet and Query Graph-based Simultaneous Locating, Mapping, and Multiple Object Tracking System Shuo Wang et.al. 2408.09191 null
2024-08-15 GOReloc: Graph-based Object-Level Relocalization for Visual SLAM Yutong Wang et.al. 2408.07917 link
2024-08-14 Inverse k-visibility for RSSI-based Indoor Geometric Mapping Junseo Kim et.al. 2408.07757 null
2024-08-14 Narrowing your FOV with SOLiD: Spatially Organized and Lightweight Global Descriptor for FOV-constrained LiDAR Place Recognition Hogyun Kim et.al. 2408.07330 link
2024-08-12 CAD-Mesher: A Convenient, Accurate, Dense Mesh-based Mapping Module in SLAM for Dynamic Environments Yanpeng Jia et.al. 2408.05981 null
2024-08-21 Visual SLAM with 3D Gaussian Primitives and Depth Priors Enabling Novel View Synthesis Zhongche Qu et.al. 2408.05635 null
2024-08-10 TOSS: Real-time Tracking and Moving Object Segmentation for Static Scene Mapping Seoyeon Jang et.al. 2408.05453 null
2024-08-08 Evaluating Modern Approaches in 3D Scene Reconstruction: NeRF vs Gaussian-Based Methods Yiming Zhou et.al. 2408.04268 null
2024-08-07 Towards Real-Time Gaussian Splatting: Accelerating 3DGS through Photometric SLAM Yan Song Hu et.al. 2408.03825 null
2024-08-07 AirSLAM: An Efficient and Illumination-Robust Point-Line Visual SLAM System Kuan Xu et.al. 2408.03520 link
2024-08-06 BodySLAM: A Generalized Monocular Visual SLAM Framework for Surgical Applications G. Manni et.al. 2408.03078 link
2024-08-04 SLAMS-Propelled Electron Acceleration at High-Mach Number Astrophysical Shocks Vladimir Zeković et.al. 2408.02084 null
2024-08-03 Visual-Inertial SLAM for Agricultural Robotics: Benchmarking the Benefits and Computational Costs of Loop Closing Fabian Schmidt et.al. 2408.01716 link
2024-08-03 Deep Patch Visual SLAM Lahav Lipson et.al. 2408.01654 link
2024-08-02 Momentum Capture and Prediction System Based on Wimbledon Open2023 Tournament Data Chang Liu et.al. 2408.01544 null
2024-08-07 IG-SLAM: Instant Gaussian SLAM F. Aykut Sarikamis et.al. 2408.01126 null
2024-08-01 Collecting Large-Scale Robotic Datasets on a High-Speed Mobile Platform Yuxin Lin et.al. 2408.00545 null
2024-08-01 High-Quality, ROS Compatible Video Encoding and Decoding for High-Definition Datasets Jian Li et.al. 2408.00538 link
2024-07-31 SuperVINS: A visual-inertial SLAM framework integrated deep learning features Hongkun Luo et.al. 2407.21348 link
2024-07-30 NIS-SLAM: Neural Implicit Semantic RGB-D SLAM for 3D Consistent Scene Understanding Hongjia Zhai et.al. 2407.20853 null
2024-07-29 A flexible framework for accurate LiDAR odometry, map manipulation, and localization José Luis Blanco-Claraco et.al. 2407.20465 link
2024-07-28 Solving Short-Term Relocalization Problems In Monocular Keyframe Visual SLAM Using Spatial And Semantic Data Azmyin Md. Kamal et.al. 2407.19518 null
2024-07-26 Real-time Uncertainty-Aware Motion Planning for Magnetic-based Navigation Aditya Penumarti et.al. 2407.19046 null
2024-07-26 HERO-SLAM: Hybrid Enhanced Robust Optimization of Neural SLAM Zhe Xin et.al. 2407.18813 null
2024-07-25 CodedVO: Coded Visual Odometry Sachin Shah et.al. 2407.18240 null
2024-07-28 HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation Zhenzhi Wang et.al. 2407.17438 link
2024-07-22 Memory Management for Real-Time Appearance-Based Loop Closure Detection Mathieu Labbé et.al. 2407.15890 null
2024-07-22 Reinforcement Learning Meets Visual Odometry Nico Messikommer et.al. 2407.15626 link
2024-07-22 Online Global Loop Closure Detection for Large-Scale Multi-Session Graph-Based SLAM Mathieu Labbe et.al. 2407.15305 null
2024-07-21 Semi-Supervised Pipe Video Temporal Defect Interval Localization Zhu Huang et.al. 2407.15170 null
2024-07-21 VoxDepth: Rectification of Depth Images on Edge Devices Yashashwee Chakrabarty et.al. 2407.15067 null
2024-07-20 From Underground Mines to Offices: A Versatile and Robust Framework for Range-Inertial SLAM Lorenzo Montano-Oliván et.al. 2407.14797 null
2024-07-19 MSSP: A Versatile Multi-Scenario Adaptable Intelligent Robot Simulation Platform Based on LIDAR-Inertial Fusion Qiyan Li et.al. 2407.14102 null
2024-07-18 A New Tightly-Coupled Dual-VIO for a Mobile Manipulator With Dynamic Locomotion Jianxiang Xu et.al. 2407.13878 link
2024-07-18 Learn to Memorize and to Forget: A Continual Learning Perspective of Dynamic SLAM Baicheng Li et.al. 2407.13338 null
2024-07-18 Attenuation-Aware Weighted Optical Flow with Medium Transmission Map for Learning-based Visual Odometry in Underwater terrain Bach Nguyen Gia et.al. 2407.13159 link
2024-07-17 Is That Rain? Understanding Effects on Visual Odometry Performance for Autonomous UAVs and Efficient DNN-based Rain Classification at the Edge Andrea Albanese et.al. 2407.12663 null
2024-07-17 Towards Revisiting Visual Place Recognition for Joining Submaps in Multimap SLAM Markus Weißflog et.al. 2407.12408 null
2024-07-19 Fisheye-Calib-Adapter: An Easy Tool for Fisheye Camera Model Conversion Sangjun Lee et.al. 2407.12405 link
2024-07-17 Fusion LiDAR-Inertial-Encoder data for High-Accuracy SLAM Manh Do Duc et.al. 2407.11870 null
2024-07-17 GV-Bench: Benchmarking Local Feature Matching for Geometric Verification of Long-term Loop Closure Detection Jingwen Yu et.al. 2407.11736 link
2024-07-16 Snail-Radar: A large-scale diverse dataset for the evaluation of 4D-radar-based SLAM systems Jianzhu Huai et.al. 2407.11705 null
2024-07-16 Batch SLAM with PMBM Data Association Sampling and Graph-Based Optimization Yu Ge et.al. 2407.11643 null
2024-07-16 I$^2$-SLAM: Inverting Imaging Process for Robust Photorealistic Dense SLAM Gwangtak Bae et.al. 2407.11347 null
2024-07-16 FR-SLAM: A SLAM Improvement Method Based on Floor Plan Registration Jiantao Feng et.al. 2407.11299 null
2024-07-15 Evaluating geometric accuracy of NeRF reconstructions compared to SLAM method Adam Korycki et.al. 2407.11238 null
2024-07-12 An Adaptive Indoor Localization Approach Using WiFi RSSI Fingerprinting with SLAM-Enabled Robotic Platform and Deep Neural Networks Seyed Alireza Rahimi Azghadi et.al. 2407.09242 null
2024-07-11 SGLC: Semantic Graph-Guided Coarse-Fine-Refine Full Loop Closing for LiDAR SLAM Neng Wang et.al. 2407.08106 link
2024-07-09 Hyperion -- A fast, versatile symbolic Gaussian Belief Propagation framework for Continuous-Time SLAM David Hug et.al. 2407.07074 link
2024-07-15 A Neurosymbolic Approach to Adaptive Feature Extraction in SLAM Yasra Chandio et.al. 2407.06889 null
2024-07-08 Object-Oriented Material Classification and 3D Clustering for Improved Semantic Perception and Mapping in Mobile Robots Siva Krishna Ravipati et.al. 2407.06077 link
2024-07-10 Co-RaL: Complementary Radar-Leg Odometry with 4-DoF Optimization and Rolling Contact Sangwoo Jung et.al. 2407.05820 null
2024-07-07 Active Collaborative Visual SLAM exploiting ORB Features Muhammad Farhan Ahmed et.al. 2407.05453 null
2024-07-06 VIPS-Odom: Visual-Inertial Odometry Tightly-coupled with Parking Slots for Autonomous Parking Xuefeng Jiang et.al. 2407.05017 null
2024-07-06 Symmetric Linear Arc Monadic Datalog and Gadget Reductions Manuel Bodirsky et.al. 2407.04924 null
2024-07-03 Ultra-Lightweight Collaborative Mapping for Robot Swarms Vlad Niculescu et.al. 2407.03136 null
2024-07-01 RoDyn-SLAM: Robust Dynamic Dense RGB-D SLAM with Neural Radiance Fields Haochen Jiang et.al. 2407.01303 link
2024-07-01 Preserving Relative Localization of FoV-Limited Drone Swarm via Active Mutual Observation Lianjie Guo et.al. 2407.01292 link
2024-07-01 Collaborative Graph Exploration with Reduced Pose-SLAM Uncertainty via Submodular Optimization Ruofei Bai et.al. 2407.01013 link
2024-06-30 Ego-to-Exo: Interfacing Third Person Visuals from Egocentric Views in Real-time for Improved ROV Teleoperation Adnan Abdullah et.al. 2407.00848 null
2024-06-30 OfCaM: Global Human Mesh Recovery via Optimization-free Camera Motion Scale Calibration Fengyuan Yang et.al. 2407.00574 null
2024-06-24 Compressing Search with Language Models Thomas Mulc et.al. 2407.00085 null
2024-06-28 CLOi-Mapper: Consistent, Lightweight, Robust, and Incremental Mapper With Embedded Systems for Commercial Robot Services DongKi Noh et.al. 2406.19634 null
2024-06-25 Benchmarking SLAM Algorithms in the Cloud: The SLAM Hive System Xinzhe Liu et.al. 2406.17586 null
2024-07-02 SlideSLAM: Sparse, Lightweight, Decentralized Metric-Semantic SLAM for Multi-Robot Navigation Xu Liu et.al. 2406.17249 link
2024-06-24 From Perfect to Noisy World Simulation: Customizable Embodied Multi-modal Perturbations for SLAM Robustness Benchmarking Xiaohao Xu et.al. 2406.16850 link
2024-06-23 Imperative Learning: A Self-supervised Neural-Symbolic Learning Framework for Robot Autonomy Chen Wang et.al. 2406.16087 null
2024-06-19 Simultaneous Map and Object Reconstruction Nathaniel Chodosh et.al. 2406.13896 null
2024-06-14 Galibr: Targetless LiDAR-Camera Extrinsic Calibration Method via Ground Plane Initialization Wonho Song et.al. 2406.11599 null
2024-06-16 Self-supervised Pretraining and Finetuning for Monocular Depth and Visual Odometry Boris Chidlovskii et.al. 2406.11019 null
2024-06-15 Detection and Utilization of Reflections in LiDAR Scans Through Plane Optimization and Plane SLAM Yinjie Li et.al. 2406.10494 link
2024-06-12 From Variance to Veracity: Unbundling and Mitigating Gradient Variance in Differentiable Bundle Adjustment Layers Swaminathan Gurumurthy et.al. 2406.07785 link
2024-06-27 Notes on Kalman Filter (KF, EKF, ESKF, IEKF, IESKF) Gyubeom Im et.al. 2406.06427 null
2024-06-10 Notes on Various Errors and Jacobian Derivations for SLAM Gyubeom Im et.al. 2406.06422 null
2024-06-23 Multicam-SLAM: Non-overlapping Multi-camera SLAM for Indirect Visual Localization and Navigation Shenghao Li et.al. 2406.06374 link
2024-06-15 Visual-Inertial SLAM as Simple as A, B, VINS Nathaniel Merrill et.al. 2406.05969 null
2024-06-09 MAP-ADAPT: Real-Time Quality-Adaptive Semantic 3D Maps Jianhao Zheng et.al. 2406.05849 null
2024-06-06 Open Problem: Active Representation Learning Nikola Milosevic et.al. 2406.03845 null
2024-06-04 ProGEO: Generating Prompts through Image-Text Contrastive Learning for Visual Geo-localization Chen Mao et.al. 2406.01906 link
2024-06-03 The Empirical Impact of Forgetting and Transfer in Continual Visual Odometry Paolo Cudrano et.al. 2406.01797 null
2024-06-03 Self-Supervised Geometry-Guided Initialization for Robust Monocular Visual Odometry Takayuki Kanai et.al. 2406.00929 null
2024-06-02 Visual place recognition for aerial imagery: A survey Ivan Moskalenko et.al. 2406.00885 link
2024-05-30 Structure Gaussian SLAM with Manhattan World Hypothesis Shuhong Liu et.al. 2405.20031 null
2024-05-30 Semantic Landmark Detection & Classification Using Neural Networks For 3D In-Air Sonar Wouter Jansen et.al. 2405.19869 null
2024-05-30 SLAM-based Joint Calibration of Multiple Asynchronous Microphone Arrays and Sound Source Localization Jiang Wang et.al. 2405.19813 link
2024-05-30 TAMBRIDGE: Bridging Frame-Centered Tracking and 3D Gaussian Splatting for Enhanced SLAM Peifeng Jiang et.al. 2405.19614 null
2024-05-27 CudaSIFT-SLAM: multiple-map visual SLAM for full procedure mapping in real human endoscopy Richard Elvira et.al. 2405.16932 null
2024-05-26 Splat-SLAM: Globally Optimized RGB-only SLAM with 3D Gaussians Erik Sandström et.al. 2405.16544 link
2024-05-24 NeB-SLAM: Neural Blocks-based Salable RGB-D SLAM for Unknown Scenes Lizhi Bai et.al. 2405.15151 null
2024-05-23 ETA-INIT: Enhancing the Translation Accuracy for Stereo Visual-Inertial SLAM Initialization Han Song et.al. 2405.15082 null
2024-05-23 Synergistic Global-space Camera and Human Reconstruction from Videos Yizhou Zhao et.al. 2405.14855 null
2024-05-23 CoPeD-Advancing Multi-Robot Collaborative Perception: A Comprehensive Dataset in Real-World Environments Yang Zhou et.al. 2405.14731 link
2024-05-23 Efficient Robot Learning for Perception and Mapping Niclas Vödisch et.al. 2405.14688 null
2024-05-22 Monocular Gaussian SLAM with Language Extended Loop Closure Tian Lan et.al. 2405.13748 null
2024-05-26 NV-LIO: LiDAR-Inertial Odometry using Normal Vectors Towards Robust SLAM in Multifloor Environments Dongha Chung et.al. 2405.12563 link
2024-05-20 EdgeLoc: A Communication-Adaptive Parallel System for Real-Time Localization in Infrastructure-Assisted Autonomous Driving Boyi Liu et.al. 2405.12120 null
2024-05-24 Outlier-Robust Long-Term Robotic Mapping Leveraging Ground Segmentation Hyungtae Lim et.al. 2405.11176 null
2024-05-18 MotionGS: Compact Gaussian Splatting SLAM by Motion Filter Xinli Guo et.al. 2405.11129 link
2024-05-17 CCTNet: A Circular Convolutional Transformer Network for LiDAR-based Place Recognition Handling Movable Objects Occlusion Gang Wang et.al. 2405.10793 null
2024-05-17 Occupancy-SLAM: Simultaneously Optimizing Robot Poses and Continuous Occupancy Map Liang Zhao et.al. 2405.10743 null
2024-05-10 MGS-SLAM: Monocular Sparse Tracking and Gaussian Mapping with Depth Smooth Regularization Pengcheng Zhu et.al. 2405.06241 null
2024-05-07 Bayesian Simultaneous Localization and Multi-Lane Tracking Using Onboard Sensors and a SD Map Yuxuan Xia et.al. 2405.04290 null
2024-05-07 IMU-Aided Event-based Stereo Visual Odometry Junkai Niu et.al. 2405.04071 link
2024-04-27 An Attention-Based Deep Learning Architecture for Real-Time Monocular Visual Odometry: Applications to GPS-free Drone Navigation Olivier Brochu Dufour et.al. 2404.17745 null
2024-04-26 Camera Motion Estimation from RGB-D-Inertial Scene Flow Samuel Cerezo et.al. 2404.17251 link
2024-04-23 Multi-Session SLAM with Differentiable Wide-Baseline Pose Optimization Lahav Lipson et.al. 2404.15263 link
2024-04-18 SPOT: Point Cloud Based Stereo Visual Place Recognition for Similar and Opposing Viewpoints Spencer Carmichael et.al. 2404.12339 null
2024-04-17 VBR: A Vision Benchmark in Rome Leonardo Brizi et.al. 2404.11322 link
2024-04-14 Increasing SLAM Pose Accuracy by Ground-to-Satellite Image Registration Yanhao Zhang et.al. 2404.09169 link
2024-04-06 Salient Sparse Visual Odometry With Pose-Only Supervision Siyu Chen et.al. 2404.04677 null
2024-03-25 A Comparative Analysis of Visual Odometry in Virtual and Real-World Railways Environments Gianluca D'Amico et.al. 2403.17084 null
2024-03-19 On Designing Consistent Covariance Recovery from a Deep Learning Visual Odometry Engine Jagatpreet Singh Nir et.al. 2403.13170 null
2024-03-18 The POLAR Traverse Dataset: A Dataset of Stereo Camera Images Simulating Traverses across Lunar Polar Terrain under Extreme Lighting Conditions Margaret Hansen et.al. 2403.12194 null
2024-03-18 An Accurate and Real-time Relative Pose Estimation from Triple Point-line Images by Decoupling Rotation and Translation Zewen Xu et.al. 2403.11639 null
2024-03-16 Efficient Domain Adaptation for Endoscopic Visual Odometry Junyang Wu et.al. 2403.10860 null
2024-03-14 Visual Inertial Odometry using Focal Plane Binary Features (BIT-VIO) Matthew Lisondra et.al. 2403.09882 null
2024-03-02 Grid-based Fast and Structural Visual Odometry Zhang Zhihe et.al. 2403.01110 null
2024-02-25 VOLoc: Visual Place Recognition by Querying Compressed Lidar Map Xudong Cai et.al. 2402.15961 link
2024-02-22 Secure Navigation using Landmark-based Localization in a GPS-denied Environment Ganesh Sapkota et.al. 2402.14280 null
2024-02-19 Landmark-based Localization using Stereo Vision and Deep Learning in GPS-Denied Battlefield Environment Ganesh Sapkota et.al. 2402.12551 null
2024-02-07 Online and Certifiably Correct Visual Odometry and Mapping Devansh R Agrawal et.al. 2402.05254 null
2024-02-06 YOLOPoint Joint Keypoint and Object Detection Anton Backhaus et.al. 2402.03989 link
2024-01-19 Motion Consistency Loss for Monocular Visual Odometry with Attention-Based Deep Learning André O. Françani et.al. 2401.10857 null
2024-01-17 Event-Based Visual Odometry on Non-Holonomic Ground Vehicles Wanting Xu et.al. 2401.09331 link
2024-01-11 On State Estimation in Multi-Sensor Fusion Navigation: Optimization and Filtering Feng Zhu et.al. 2401.05836 null
2023-12-19 Loss it right: Euclidean and Riemannian Metrics in Learning-based Visual Odometry Olaya Álvarez-Tuñón et.al. 2401.05396 link
2024-01-07 Amirkabir campus dataset: Real-world challenges and scenarios of Visual Inertial Odometry (VIO) for visually impaired people Ali Samadzadeh et.al. 2401.03604 link
2024-01-03 LEAP-VO: Long-term Effective Any Point Tracking for Visual Odometry Weirong Chen et.al. 2401.01887 link
2023-12-28 SR-LIVO: LiDAR-Inertial-Visual Odometry and Mapping with Sweep Reconstruction Zikang Yuan et.al. 2312.16800 link
2023-12-20 NeRF-VO: Real-Time Sparse Visual Odometry with Neural Radiance Fields Jens Naumann et.al. 2312.13471 null
2023-12-22 Ternary-type Opacity and Hybrid Odometry for RGB-only NeRF-SLAM Junru Lin et.al. 2312.13332 null
2023-12-20 Brain-Inspired Visual Odometry: Balancing Speed and Interpretability through a System of Systems Approach Habib Boloorchi Tabrizi et.al. 2312.13162 link
2023-12-20 Trajectory Approximation of Video Based on Phase Correlation for Forward Facing Camera Abdulkadhem A. Abdulkadhem et.al. 2312.12680 null
2023-12-15 Deep Event Visual Odometry Simon Klenk et.al. 2312.09800 link
2023-12-10 SuperPrimitive: Scene Reconstruction at a Primitive Level Kirill Mazur et.al. 2312.05889 null
2023-12-04 iMatching: Imperative Correspondence Learning Zitong Zhan et.al. 2312.02141 link
2023-11-30 Event-based Visual Inertial Velometer Xiuyuan Lu et.al. 2311.18189 null
2023-11-21 CoVOR-SLAM: Cooperative SLAM using Visual Odometry and Ranges for Multi-Robot Systems Young-Hee Lee et.al. 2311.12580 null
2023-11-10 Dense Visual Odometry Using Genetic Algorithm Slimane Djema et.al. 2311.06149 null
2023-11-07 Inertial Guided Uncertainty Estimation of Feature Correspondence in Visual-Inertial Odometry/SLAM Seongwook Yoon et.al. 2311.03722 null
2023-10-23 Converting Depth Images and Point Clouds for Feature-based Pose Estimation Robert Lösch et.al. 2310.14924 link
2023-10-17 Open-Structure: a Structural Benchmark Dataset for SLAM Algorithms Yanyan Li et.al. 2310.10931 link
2023-10-12 Jointly Optimized Global-Local Visual Localization of UAVs Haoling Li et.al. 2310.08082 null
2023-10-10 l-dyno: framework to learn consistent visual features using robot's motion Kartikeya Singh et.al. 2310.06249 link
2023-10-08 XVO: Generalized Visual Odometry via Cross-Modal Self-Training Lei Lai et.al. 2309.16772 null
2023-10-22 ObVi-SLAM: Long-Term Object-Visual SLAM Amanda Adkins et.al. 2309.15268 link
2023-09-23 Tag-based Visual Odometry Estimation for Indoor UAVs Localization Massimiliano Bertoni et.al. 2309.13311 null
2023-09-22 Exposing the Unseen: Exposure Time Emulation for Offline Benchmarking of Vision Algorithms Olivier Gamache et.al. 2309.13139 link
2023-09-20 Conformalized Multimodal Uncertainty Regression and Reasoning Domenico Parente et.al. 2309.11018 null
2023-09-20 OCC-VO: Dense Mapping via 3D Occupancy-Based Visual Odometry for Autonomous Driving Heng Li et.al. 2309.11011 link
2023-09-19 LiDAR-Generated Images Derived Keypoints Assisted Point Cloud Registration Scheme in Odometry Estimation Haizhou Zhang et.al. 2309.10436 link
2023-09-21 Dive Deeper into Rectifying Homography for Stereo Camera Online Self-Calibration Hongbo Zhao et.al. 2309.10314 null
2023-09-18 End-to-End Learned Event- and Image-based Visual Odometry Roberto Pellerito et.al. 2309.09947 link
2023-09-14 An Explicit Method for Fast Monocular Depth Recovery in Corridor Environments Yehao Liu et.al. 2309.07408 null
2023-09-11 Evaluating Visual Odometry Methods for Autonomous Driving in Rain Yu Xiang Tan et.al. 2309.05249 null
2023-09-08 Robot Localization and Mapping Final Report -- Sequential Adversarial Learning for Self-Supervised Deep Visual Odometry Akankshya Kar et.al. 2309.04147 null
2023-09-04 EMR-MSF: Self-Supervised Recurrent Monocular Scene Flow Exploiting Ego-Motion Rigidity Zijie Jiang et.al. 2309.01296 null
2023-08-27 Deep Learning for Visual Localization and Mapping: A Survey Changhao Chen et.al. 2308.14039 null
2023-08-19 Enhancing State Estimation in Robots: A Data-Driven Approach with Differentiable Ensemble Kalman Filters Xiao Liu et.al. 2308.09870 link
2023-08-12 4DRVO-Net: Deep 4D Radar-Visual Odometry Using Multi-Modal and Multi-Scale Adaptive Fusion Guirong Zhuo et.al. 2308.06573 null
2023-08-10 Mono-hydra: Real-time 3D scene graph construction from monocular camera input with IMU U. V. B. L. Udugama et.al. 2308.05515 null
2023-08-02 A Small Form Factor Aerial Research Vehicle for Pick-and-Place Tasks with Onboard Real-Time Object Detection and Visual Odometry Cora A. Dimmig et.al. 2308.01398 null
2023-08-02 Stereo Visual Odometry with Deep Learning-Based Point and Line Feature Matching using an Attention Graph Neural Network Shenbagaraj Kannapiran et.al. 2308.01125 null
2023-08-02 Preliminary Design of the Dragonfly Navigation Filter Ben Schilling et.al. 2307.13513 null
2023-07-19 Optimizing the extended Fourier Mellin Transformation Algorithm Wenqing Jiang et.al. 2307.10015 link
2023-07-15 Tightly-Coupled LiDAR-Visual SLAM Based on Geometric Features for Mobile Agents Ke Cao et.al. 2307.07763 null
2023-07-26 Event-based Stereo Visual Odometry with Native Temporal Resolution via Continuous-time Gaussian Process Regression Jianeng Wang et.al. 2306.01188 null
2023-07-06 OSPC: Online Sequential Photometric Calibration Jawad Haidar et.al. 2305.17673 null
2023-05-15 Event Camera-based Visual Odometry for Dynamic Motion Tracking of a Legged Robot Using Adaptive Time Surface Shifan Zhu et.al. 2305.08962 null
2023-05-10 Transformer-based model for monocular visual odometry: a video understanding approach André O. Françani et.al. 2305.06121 link
2023-04-29 Modality-invariant Visual Odometry for Embodied Vision Marius Memmel et.al. 2305.00348 link
2023-04-21 FSNet: Redesign Self-Supervised MonoDepth for Full-Scale Depth Prediction for Autonomous Driving Yuxuan Liu et.al. 2304.10719 null
2023-07-08 Visual-LiDAR Odometry and Mapping with Monocular Scale Correction and Visual Bootstrapping Hanyu Cai et.al. 2304.08978 null
2023-04-12 SiLK -- Simple Learned Keypoints Pierre Gleize et.al. 2304.06194 link
2023-04-11 ClusterFusion: Real-time Relative Positioning and Dense Reconstruction for UAV Cluster Yifei Dong et.al. 2304.04943 null
2023-03-21 Learning a Depth Covariance Function Eric Dexheimer et.al. 2303.12157 null
2023-03-21 Online Learning of Wheel Odometry Correction for Mobile Robots with Attention-based Neural Network Alessandro Navone et.al. 2303.11725 null
2023-03-20 VR-SLAM: A Visual-Range Simultaneous Localization and Mapping System using Monocular Camera and Ultra-wideband Sensors Thien Hoang Nguyen et.al. 2303.10903 null
2023-03-17 CoVIO: Online Continual Learning for Visual-Inertial Odometry Niclas Vödisch et.al. 2303.10149 link
2023-03-15 UMS-VINS: United Monocular-Stereo Features for Visual-Inertial Tightly Coupled Odometry Chaoyang Jiang et.al. 2303.08550 null
2023-03-13 Discovering Multiple Algorithm Configurations Leonid Keselman et.al. 2303.07434 null
2023-03-09 Virtual Inverse Perspective Mapping for Simultaneous Pose and Motion Estimation Masahiro Hirano et.al. 2303.05192 null
2023-03-16 Stereo Event-based Visual-Inertial Odometry Kunfeng Wang et.al. 2303.05086 link
2023-03-07 Long Distance GNSS-Denied Visual Inertial Navigation for Autonomous Fixed Wing Unmanned Air Vehicles: SO(3) Manifold Filter based on Virtual Vision Sensor Eduardo Gallo et.al. 2303.03804 null
2023-03-03 Lightweight, Uncertainty-Aware Conformalized Visual Odometry Alex C. Stutts et.al. 2303.02207 null
2023-02-24 FLSea: Underwater Visual-Inertial and Stereo-Vision Forward-Looking Datasets Yelena Randall et.al. 2302.12772 null
2023-02-27 CP+: Camera Poses Augmentation with Large-scale LiDAR Maps Jiadi Cui et.al. 2302.12198 null
2023-02-19 EdgeVO: An Efficient and Accurate Edge-based Visual Odometry Hui Zhao et.al. 2302.09493 null
2023-01-27 HDPV-SLAM: Hybrid Depth-augmented Panoramic Visual SLAM for Mobile Mapping System with Tilted LiDAR and Panoramic Visual Camera Mostafa Ahmadi et.al. 2301.11823 null
2023-01-26 Distributed Optimization Methods for Multi-Robot Systems: Part I -- A Tutorial Ola Shorinwa et.al. 2301.11313 null
2023-01-24 Generalized Object Search Kaiyu Zheng et.al. 2301.10121 null
2023-01-22 Improving Autonomous Vehicle Mapping and Navigation in Work Zones Using Crowdsourcing Vehicle Trajectories Hanlin Chen et.al. 2301.09194 null
2023-01-21 Dense RGB SLAM with Neural Implicit Maps Heng Li et.al. 2301.08930 null
2023-01-18 Extended FastSLAM Using Cellular Multipath Component Delays and Angular Information Junshi Chen et.al. 2301.07560 null
2023-01-17 COVINS-G: A Generic Back-end for Collaborative Visual-Inertial SLAM Manthan Patel et.al. 2301.07147 link
2023-01-31 Swarm-SLAM: Sparse Decentralized Collaborative Simultaneous Localization and Mapping Framework for Multi-Robot Systems Pierre-Yves Lajoie et.al. 2301.06230 link
2023-01-13 A LiDAR-Inertial-Visual SLAM System with Loop Detection Kangcheng Liu et.al. 2301.05604 null
2023-01-11 AdaptSLAM: Edge-Assisted Adaptive SLAM with Resource Constraints via Uncertainty Minimization Ying Chen et.al. 2301.04620 link
2023-01-12 TBV Radar SLAM -- trust but verify loop candidates Daniel Adolfsson et.al. 2301.04397 link
2022-12-31 Digital Twin-Enabled Domain Adaptation for Zero-Touch UAV Networks: Survey and Challenges Maxwell McManus et.al. 2301.03359 null
2023-01-09 Motion Addition and Motion Optimization Liqun Qi et.al. 2301.03174 null
2023-01-08 Towards Open World NeRF-Based SLAM Daniil Lisus et.al. 2301.03102 null
2023-01-06 CyberLoc: Towards Accurate Long-term Visual Localization Liu Liu et.al. 2301.02403 null
2023-01-03 LunarNav: Crater-based Localization for Long-range Autonomous Lunar Rover Navigation Shreyansh Daftry et.al. 2301.01350 null
2022-12-31 4Seasons: Benchmarking Visual SLAM and Long-Term Localization for Autonomous Driving in Challenging Conditions Patrick Wenzel et.al. 2301.01147 null
2023-01-03 BS3D: Building-scale 3D Reconstruction from RGB-D Images Janne Mustaniemi et.al. 2301.01057 null
2023-01-10 An Event-based Algorithm for Simultaneous 6-DOF Camera Pose Tracking and Mapping Masoud Dayani Najafabadi et.al. 2301.00618 link
2022-12-25 A Combined Approach Toward Consistent Reconstructions of Indoor Spaces Based on 6D RGB-D Odometry and KinectFusion Nadia Figueroa et.al. 2212.14772 null
2022-12-29 An Enhanced LiDAR-Inertial SLAM System for Robotics Localization and Mapping Kangcheng Liu et.al. 2212.14209 link
2022-12-27 Clock and Orientation-Robust Simultaneous Radio Localization and Mapping at Millimeter Wave Bands Felipe Gómez-Cuba et.al. 2212.13477 link
2022-12-26 ESVIO: Event-based Stereo Visual Inertial Odometry Peiyu Chen et.al. 2212.13184 link
2022-12-24 A Comprehensive Review on Autonomous Navigation Saeid Nahavandi et.al. 2212.12808 null
2022-12-23 Radio SLAM for 6G Systems at THz Frequencies: Design and Experimental Validation Marina Lotti et.al. 2212.12388 null
2022-12-23 Implementation of a Blind navigation method in outdoors/indoors areas Mohammad Javadian Farzaneh et.al. 2212.12185 null
2022-12-22 S-Graphs+: Real-time Localization and Mapping leveraging Hierarchical Representations Hriday Bavle et.al. 2212.11770 link
2022-12-22 Active SLAM: A Review On Last Decade Muhammad Farhan Ahmed et.al. 2212.11654 null
2022-12-27 Motion, Unit Dual Quaternion and Motion Optimization Liqun Qi et.al. 2212.11593 null
2022-12-22 Vision-Based Environmental Perception for Autonomous Driving Fei Liu et.al. 2212.11453 null
2022-12-19 Mu$^{2}$SLAM: Multitask, Multilingual Speech and Language Models Yong Cheng et.al. 2212.09553 null
2022-12-16 Cartographer_glass: 2D Graph SLAM Framework using LiDAR for Glass Environments Lasitha Weerakoon et.al. 2212.08633 null
2022-12-16 rWiFiSLAM: Effective WiFi Ranging based SLAM System in Ambient Environments Bo Wei et.al. 2212.08418 null
2023-03-02 AirVO: An Illumination-Robust Point-Line Visual Odometry Kuan Xu et.al. 2212.07595 link
2022-12-14 Autonomous Vehicle Navigation with LIDAR using Path Planning Rahul M K et.al. 2212.07155 null
2022-12-14 RIS-Enabled and Access-Point-Free Simultaneous Radio Localization and Mapping Hyowon Kim et.al. 2212.07141 null
2022-12-13 Know What You Don't Know: Consistency in Sliding Window Filtering with Unobservable States Applied to Visual-Inertial SLAM (Extended Version) Daniil Lisus et.al. 2212.06923 null
2022-12-13 SST: Real-time End-to-end Monocular 3D Reconstruction via Sparse Spatial-Temporal Guidance Chenyangguang Zhang et.al. 2212.06524 null
2022-12-13 Localization and Navigation System for Indoor Mobile Robot Yanbaihui Liu et.al. 2212.06391 null
2022-12-12 Evaluation of RGB-D SLAM in Large Indoor Environments Kirill Muravyev et.al. 2212.05980 null
2022-12-19 A Light-Weight LiDAR-Inertial SLAM System with Loop Closing Kangcheng Liu et.al. 2212.05743 link
2022-12-12 An Integrated LiDAR-SLAM System for Complex Environment with Noisy Point Clouds Kangcheng Liu et.al. 2212.05705 link
2022-12-09 SLAM for Visually Impaired People: A Survey Marziyeh Bamdad et.al. 2212.04745 null
2022-12-09 Ego-Body Pose Estimation via Ego-Head Pose Estimation Jiaman Li et.al. 2212.04636 null
2022-12-06 Receding Horizon Planning with Rule Hierarchies for Autonomous Vehicles Sushant Veer et.al. 2212.03323 link
2022-12-06 PRISM: Probabilistic Real-Time Inference in Spatial World Models Atanas Mirchev et.al. 2212.02988 null
2022-12-06 RGB-L: Enhancing Indirect Visual SLAM using LiDAR-based Dense Depth Maps Florian Sauerbeck et.al. 2212.02085 link
2022-12-05 DL-SLOT: Dynamic LiDAR SLAM and object tracking based on collaborative graph optimization Xuebo Tian et.al. 2212.02077 null
2022-12-05 ObjectMatch: Robust Registration using Canonical Object Correspondences Can Gümeli et.al. 2212.01985 null
2022-12-02 Sparse SPN: Depth Completion from Sparse Keypoints Yuqun Wu et.al. 2212.00987 null
2022-12-01 maplab 2.0 -- A Modular and Multi-Modal Mapping Framework Andrei Cramariuc et.al. 2212.00654 link
2022-12-01 AstroSLAM: Autonomous Monocular Navigation in the Vicinity of a Celestial Small Body -- Theory and Experiments Mehregan Dor et.al. 2212.00350 null
2022-11-30 MVRackLay: Monocular Multi-View Layout Estimation for Warehouse Racks and Shelves Pranjali Pathre et.al. 2211.16882 null
2022-11-29 PatchMatch-Stereo-Panorama, a fast dense reconstruction from 360° video images Hartmut Surmann et.al. 2211.16266 link
2022-11-29 MmWave Mapping and SLAM for 5G and Beyond Yu Ge et.al. 2211.16024 null
2022-11-28 Safety-quantifiable Line Feature-based Monocular Visual Localization with 3D Prior Map Xi Zheng et.al. 2211.15127 null
2022-11-29 BALF: Simple and Efficient Blur Aware Local Feature Detector Zhenjun Zhao et.al. 2211.14731 null
2022-11-27 Development of a Modular Real-time Shared-control System for a Smart Wheelchair Vaishanth Ramaraj et.al. 2211.14711 null
2022-11-26 A1 SLAM: Quadruped SLAM using the A1's Onboard Sensors Jerred Chen et.al. 2211.14432 link
2022-11-23 ActiveRMAP: Radiance Field for Active Mapping And Planning Huangying Zhan et.al. 2211.12656 null
2022-11-22 Vision-based localization methods under GPS-denied conditions Zihao Lu et.al. 2211.11988 null
2022-11-21 Towards Live 3D Reconstruction from Wearable Video: An Evaluation of V-SLAM, NeRF, and Videogrammetry Techniques David Ramirez et.al. 2211.11836 null
2022-11-21 ESLAM: Efficient Dense SLAM System Based on Hybrid Representation of Signed Distance Fields Mohammad Mahdi Johari et.al. 2211.11704 null
2022-11-24 Data Fusion for Multipath-Based SLAM: Combining Information from Multiple Propagation Paths Erik Leitinger et.al. 2211.09241 null
2022-11-16 Self-supervised Egomotion and Depth Learning via Bi-directional Coarse-to-Fine Scale Recovery Hao Qu et.al. 2211.08904 null
2022-11-20 Detecting Line Segments in Motion-blurred Images with Events Huai Yu et.al. 2211.07365 link
2022-11-13 Automatic Eye-in-Hand Calibration using EKF Aditya Ramakrishnan et.al. 2211.06881 null
2022-11-12 Active View Planning for Visual SLAM in Outdoor Environments Based on Continuous Information Modeling Zhihao Wang et.al. 2211.06557 link
2022-11-11 Multi-domain Cooperative SLAM: The Enabler for Integrated Sensing and Communications Jie Yang et.al. 2211.05982 null
2022-11-10 Online Stochastic Variational Gaussian Process Mapping for Large-Scale SLAM in Real Time Ignacio Torroba et.al. 2211.05601 link
2022-11-07 When Geometry is not Enough: Using Reflector Markers in Lidar SLAM Gerhard Kurz et.al. 2211.03484 null
2022-11-07 Detecting Invalid Map Merges in Lifelong SLAM Matthias Holoch et.al. 2211.03423 null
2022-11-06 Wheel-SLAM: Simultaneous Localization and Terrain Mapping Using One Wheel-mounted IMU Yibin Wu et.al. 2211.03174 link
2022-11-07 Lidar-level localization with radar? The CFEAR approach to accurate, fast and robust large-scale radar odometry in diverse environments Daniel Adolfsson et.al. 2211.02445 link
2022-11-03 DyOb-SLAM: Dynamic Object Tracking SLAM System Rushmian Annoy Wadud et.al. 2211.01941 null
2022-11-03 Enhanced Visual Feedback with Decoupled Viewpoint Control in Immersive Humanoid Robot Teleoperation using SLAM Yang Chen et.al. 2211.01749 null
2022-11-04 $D^2$SLAM: Decentralized and Distributed Collaborative Visual-inertial SLAM System for Aerial Swarm Hao Xu et.al. 2211.01538 link
2022-11-02 Semantic SuperPoint: A Deep Semantic Descriptor Gabriel S. Gama et.al. 2211.01098 link
2022-11-02 Ambiguity-Aware Multi-Object Pose Optimization for Visually-Assisted Robot Manipulation Myung-Hwan Jeon et.al. 2211.00960 link
2022-10-31 Mapping Extended Landmarks for Radar SLAM Shuai Sun et.al. 2210.17207 null
2022-10-25 MAROAM: Map-based Radar SLAM through Two-step Feature Selection Dequan Wang et.al. 2210.13797 null
2022-10-25 S3E: A Large-scale Multimodal Dataset for Collaborative SLAM Dapeng Feng et.al. 2210.13723 link
2022-10-24 NeRF-SLAM: Real-Time Dense Monocular SLAM with Neural Radiance Fields Antoni Rosinol et.al. 2210.13641 link
2022-10-24 Compact simultaneous label-free autofluorescence multi-harmonic (SLAM) microscopy for user-friendly photodamage-monitored imaging Geng Wang et.al. 2210.13556 null
2022-10-28 VP-SLAM: A Monocular Real-time Visual SLAM with Points, Lines and Vanishing Points Andreas Georgis et.al. 2210.12756 null
2022-10-22 SLAM: Semantic Learning based Activation Map for Weakly Supervised Semantic Segmentation Junliang Chen et.al. 2210.12417 null
2022-10-21 DCL-SLAM: A Distributed Collaborative LiDAR SLAM Framework for a Robotic Swarm Shipeng Zhong et.al. 2210.11978 link
2022-10-21 Motion Primitives Based Kinodynamic RRT for Autonomous Vehicle Navigation in Complex Environments Shubham Kedia et.al. 2210.11652 null
2022-10-22 Visual SLAM: What are the Current Trends and What to Expect? Ali Tourani et.al. 2210.10491 null
2022-10-18 Split-KalmanNet: A Robust Model-Based Deep Learning Approach for SLAM Geon Choi et.al. 2210.09636 null
2022-10-16 D2SLAM: Semantic visual SLAM based on the influence of Depth for Dynamic environments Ayman Beghdadi et.al. 2210.08647 null
2022-10-16 Indoor Smartphone SLAM with Learned Echoic Location Features Wenjie Luo et.al. 2210.08493 null
2022-10-15 Self-Improving SLAM in Dynamic Environments: Learning When to Mask Adrian Bojko et.al. 2210.08350 link
2022-10-13 Design and Evaluation of a Generic Visual SLAM Framework for Multi-Camera Systems Pushyami Kaveti et.al. 2210.07315 link
2022-10-12 RING++: Roto-translation Invariant Gram for Global Localization on a Sparse Scan Map Xuecheng Xu et.al. 2210.05984 link
2022-10-11 Observability Analysis of Graph SLAM-Based Joint Calibration of Multiple Microphone Arrays and Sound Source Localization Yuanzheng He et.al. 2210.05600 null
2022-10-11 Autonomous Asteroid Characterization Through Nanosatellite Swarming Kaitlin Dennison et.al. 2210.05518 null
2022-10-11 DeepMLE: A Robust Deep Maximum Likelihood Estimator for Two-view Structure from Motion Yuxi Xiao et.al. 2210.05517 null
2022-10-11 Multi-Object Navigation with dynamically learned neural implicit representations Pierre Marza et.al. 2210.05129 link
2022-10-12 Spectral Sparsification for Communication-Efficient Collaborative Rotation and Translation Estimation Yulun Tian et.al. 2210.05020 null
2022-10-10 Using Detection, Tracking and Prediction in Visual SLAM to Achieve Real-time Semantic Mapping of Dynamic Scenarios Xingyu Chen et.al. 2210.04562 null
2022-10-09 Fusing Event-based Camera and Radar for SLAM Using Spiking Neural Networks with Continual STDP Learning Ali Safa et.al. 2210.04236 null
2022-10-06 SCORE: A Second-Order Conic Initialization for Range-Aided SLAM Alan Papalia et.al. 2210.03177 link
2022-10-06 Feature-Realistic Neural Fusion for Real-Time, Open Set Scene Understanding Kirill Mazur et.al. 2210.03043 null
2022-10-06 Feasibility on Detecting Door Slamming towards Monitoring Early Signs of Domestic Violence Osian Morgan et.al. 2210.02642 null
2022-10-05 MOTSLAM: MOT-assisted monocular dynamic SLAM using single-view depth estimation Hanwei Zhang et.al. 2210.02038 null
2022-10-04 O2S: Open-source open shuttle Nwankwo Linus et.al. 2210.01627 null
2022-10-04 Wi-Closure: Reliable and Efficient Search of Inter-robot Loop Closures Using Wireless Sensing Weiying Wang et.al. 2210.01320 null
2022-10-03 Probabilistic Volumetric Fusion for Dense Monocular SLAM Antoni Rosinol et.al. 2210.01276 null
2022-10-03 DRACo-SLAM: Distributed Robust Acoustic Communication-efficient SLAM for Imaging Sonar Equipped Underwater Robot Teams John McConnell et.al. 2210.00867 link
2022-10-03 A Benchmark for Multi-Modal Lidar SLAM with Ground Truth in GNSS-Denied Environments Ha Sier et.al. 2210.00812 link
2022-10-01 Det-SLAM: A semantic visual SLAM for highly dynamic scenes using Detectron2 Ali Eslamian et.al. 2210.00278 null
2022-09-30 PyPose: A Library for Robot Learning with Physics-based Optimization Chen Wang et.al. 2209.15428 link
2022-09-29 DirectTracker: 3D Multi-Object Tracking Using Direct Image Alignment and Photometric Bundle Adjustment Mariia Gladkova et.al. 2209.14965 null
2022-09-28 Robust Incremental Smoothing and Mapping (riSAM) Daniel McGann et.al. 2209.14359 null
2022-09-27 Orbeez-SLAM: A Real-time Monocular Visual SLAM with ORB Features and NeRF-realized Mapping Chi-Ming Chung et.al. 2209.13274 link
2022-09-24 Graph Neural Networks for Multi-Robot Active Information Acquisition Mariliza Tzes et.al. 2209.12091 null
2022-09-24 Closing the Loop: Graph Networks to Unify Semantic Objects and Visual Features for Multi-object Scenes Jonathan J. Y. Kim et.al. 2209.11894 null
2022-09-23 involve-MI: Informative Planning with High-Dimensional Non-Parametric Beliefs Gilad Rotman et.al. 2209.11591 null
2022-09-23 Automatic Sign Reading and Localization for Semantic Mapping with an Office Robot David Balaban et.al. 2209.11432 null
2022-09-22 SQ-SLAM: Monocular Semantic SLAM Based on Superquadric Object Representation Xiao Han et.al. 2209.10817 null
2022-09-22 Acoustic SLAM based on the Direction-of-Arrival and the Direct-to-Reverberant Energy Ratio Wenhao Qiu et.al. 2209.10726 null
2022-09-21 Visual Localization and Mapping in Dynamic and Changing Environments João Carlos Virgolino Soares et.al. 2209.10710 null
2022-09-20 Uncertainty-Aware Tightly-Coupled GPS Fused LIO-SLAM Sabir Hossain et.al. 2209.10047 null
2022-09-20 WGICP: Differentiable Weighted GICP-Based Lidar Odometry Sanghyun Son et.al. 2209.09777 null
2022-09-20 PADLoC: LiDAR-Based Deep Loop Closure Detection and Registration using Panoptic Attention José Arce et.al. 2209.09699 link
2022-09-19 MeSLAM: Memory Efficient SLAM based on Neural Fields Evgenii Kruzhkov et.al. 2209.09357 null
2022-09-19 LMBAO: A Landmark Map for Bundle Adjustment Odometry in LiDAR SLAM Letian Zhang et.al. 2209.08810 null
2022-09-18 HGI-SLAM: Loop Closure With Human and Geometric Importance Features Shuhul Mujoo et.al. 2209.08608 null
2022-09-18 Data-driven Loop Closure Detection in Bathymetric Point Clouds for Underwater SLAM Jiarui Tan et.al. 2209.08578 link
2022-09-17 DytanVO: Joint Refinement of Visual Odometry and Motion Segmentation in Dynamic Environments Shihao Shen et.al. 2209.08430 link
2022-09-17 OA-SLAM: Leveraging Objects for Camera Relocalization in Visual SLAM Matthieu Zins et.al. 2209.08338 null
2022-09-17 PlaneSLAM: Plane-based LiDAR SLAM for Motion Planning in Structured 3D Environments Adam Dai et.al. 2209.08248 link
2022-09-16 ViWiD: Leveraging WiFi for Robust and Resource-Efficient SLAM Aditya Arun et.al. 2209.08091 null
2022-09-16 iDF-SLAM: End-to-End RGB-D SLAM with Neural Implicit Mapping and Deep Feature Tracking Yuhang Ming et.al. 2209.07919 null
2022-09-16 TwistSLAM++: Fusing multiple modalities for accurate dynamic semantic SLAM Mathieu Gonzalez et.al. 2209.07888 null
2022-09-15 Landmark Management in the Application of Radar SLAM Shuai Sun et.al. 2209.07199 link
2022-09-15 PROB-SLAM: Real-time Visual SLAM Based on Probabilistic Graph Optimization Xianwei Meng et.al. 2209.07061 null
2022-09-14 Semantic Visual Simultaneous Localization and Mapping: A Survey Kaiqi Chen et.al. 2209.06428 null
2022-09-13 Optimizing SLAM Evaluation Footprint Through Dynamic Range Coverage Analysis of Datasets Islam Ali et.al. 2209.06316 null
2022-09-12 A Review on Visual-SLAM: Advancements from Geometric Modelling to Learning-based Semantic Scene Understanding Tin Lai et.al. 2209.05222 null
2022-09-12 Attitude-Guided Loop Closure for Cameras with Negative Plane Ze Wang et.al. 2209.05167 link
2022-09-09 General Place Recognition Survey: Towards the Real-world Autonomy Age Peng Yin et.al. 2209.04497 link
2022-09-08 ExplORB-SLAM: Active Visual SLAM Exploiting the Pose-graph Topology Julio A. Placed et.al. 2209.03693 link
2022-09-08 R$^3$LIVE++: A Robust, Real-time, Radiance reconstruction package with a tightly-coupled LiDAR-Inertial-Visual state Estimator Jiarong Lin et.al. 2209.03666 link
2022-09-06 Group-$k$ Consistent Measurement Set Maximization for Robust Outlier Detection Brendon Forsgren et.al. 2209.02658 link
2022-09-05 Neuromorphic Visual Odometry with Resonator Networks Alpha Renner et.al. 2209.02000 null
2022-09-05 MuCaSLAM: CNN-Based Frame Quality Assessment for Mobile Robot with Omnidirectional Visual SLAM Pavel Karpyshev et.al. 2209.01936 null
2022-09-05 ElasticROS: An Elastically Collaborative Robot Operation System for Fog and Cloud Robotics Boyi Liu et.al. 2209.01774 null
2022-09-04 CloudVision: DNN-based Visual Localization of Autonomous Robots using Prebuilt LiDAR Point Cloud Evgeny Yudin et.al. 2209.01605 null
2022-08-31 PFilter: Building Persistent Maps through Feature Filtering for Fast and Accurate LiDAR-based SLAM Yifan Duan et.al. 2208.14848 null
2022-08-30 BioSLAM: A Bio-inspired Lifelong Memory System for General Place Recognition Peng Yin et.al. 2208.14543 null
2022-08-27 Learning to SLAM on the Fly in Unknown Environments: A Continual Learning Approach for Drones in Visually Ambiguous Scenes Ali Safa et.al. 2208.12997 null
2022-08-25 FusionPortable: A Multi-Sensor Campus-Scene Dataset for Evaluation of Localization and Mapping Accuracy on Diverse Platforms Jianhao Jiao et.al. 2208.11865 null
2022-08-25 Lidar SLAM for Autonomous Driving Vehicles Farhad Aghili et.al. 2208.11855 null
2022-08-24 DynaVINS: A Visual-Inertial SLAM for Dynamic Environments Seungwon Song et.al. 2208.11500 link
2022-08-22 Doppler Exploitation in Bistatic mmWave Radio SLAM Yu Ge et.al. 2208.10204 null
2022-08-21 Hilti-Oxford Dataset: A Millimetre-Accurate Benchmark for Simultaneous Localization and Mapping Lintong Zhang et.al. 2208.09825 link
2022-08-26 JVLDLoc: a Joint Optimization of Visual-LiDAR Constraints and Direction Priors for Localization in Driving Scenario Longrui Dong et.al. 2208.09777 null
2022-08-15 BoW3D: Bag of Words for Real-time Loop Closing in 3D LiDAR SLAM Yunge Cui et.al. 2208.07473 link
2022-08-12 Handling Constrained Optimization in Factor Graphs for Autonomous Navigation Barbara Bazzana et.al. 2208.06325 null
2022-08-11 RelPose: Predicting Probabilistic Relative Rotation for Single Objects in the Wild Jason Y. Zhang et.al. 2208.05963 null
2022-08-08 Visual-Inertial Multi-Instance Dynamic SLAM with Object-level Relocalisation Yifei Ren et.al. 2208.04274 link
2022-08-08 SLAM-TKA: Real-time Intra-operative Measurement of Tibial Resection Plane in Conventional Total Knee Arthroplasty Shuai Zhang et.al. 2208.03945 link
2022-08-05 A Survey on Visual Map Localization Using LiDARs and Cameras Elhousni Mahdi et.al. 2208.03376 null
2022-08-04 SROS2: Usable Cyber Security Tools for ROS 2 Victor Mayoral Vilches et.al. 2208.02615 link
2022-08-03 Evaluation and comparison of eight popular Lidar and Visual SLAM algorithms Bharath Garigipati et.al. 2208.02063 null
2022-08-02 Present and Future of SLAM in Extreme Underground Environments Kamak Ebadi et.al. 2208.01787 null
2022-08-01 Visual-Inertial SLAM with Tightly-Coupled Dropout-Tolerant GPS Fusion Simon Boche et.al. 2208.00709 null
2022-07-29 Neural Density-Distance Fields Itsuki Ueda et.al. 2207.14455 link
2022-07-25 DeepFusion: Real-Time Dense 3D Reconstruction for Monocular SLAM using Single-View Depth and Gradient Predictions Tristan Laidlow et.al. 2207.12244 null
2022-07-25 Scalable Fiducial Tag Localization on a 3D Prior Map via Graph-Theoretic Global Tag-Map Registration Kenji Koide et.al. 2207.11942 null
2022-07-22 NeurAR: Neural Uncertainty for Autonomous 3D Reconstruction Yunlong Ran et.al. 2207.10985 null
2022-07-22 Dense RGB-D-Inertial SLAM with Map Deformations Tristan Laidlow et.al. 2207.10940 null
2022-07-22 PLD-SLAM: A Real-Time Visual SLAM Using Points and Line Segments in Dynamic Scenes BaoSheng Zhang et.al. 2207.10916 null
2022-07-21 Multi-Event-Camera Depth Estimation and Outlier Rejection by Refocused Events Fusion Suman Ghosh et.al. 2207.10494 link
2022-07-21 Online Localisation and Colored Mesh Reconstruction Architecture for 3D Visual Feedback in Robotic Exploration Missions Quentin Serdel et.al. 2207.10489 link
2022-07-21 On applicability of von Karman's momentum theory in predicting the water entry load of V-shaped structures with varying initial velocity Yujin Lu et.al. 2207.10413 null
2022-07-19 Hybrid Belief Pruning with Guarantees for Viewpoint-Dependent Semantic SLAM Tuvy Lemberg et.al. 2207.09103 null
2022-07-18 DeFlowSLAM: Self-Supervised Scene Motion Decomposition for Dynamic Dense SLAM Weicai Ye et.al. 2207.08794 link
2022-07-18 Revisiting PatchMatch Multi-View Stereo for Urban 3D Reconstruction Marco Orsingher et.al. 2207.08439 null
2022-07-18 ORB-based SLAM accelerator on SoC FPGA Vibhakar Vemulapati et.al. 2207.08405 null
2022-07-14 Challenges of SLAM in extremely unstructured environments: the DLR Planetary Stereo, Solid-State LiDAR, Inertial Dataset Riccardo Giubilato et.al. 2207.06815 null
2022-07-14 Semi-supervised Vector-Quantization in Visual SLAM using HGCN Amir Zarringhalam et.al. 2207.06738 null
2022-07-14 Self-supervised Vector-Quantization in Visual SLAM using Deep Convolutional Autoencoders Amir Zarringhalam et.al. 2207.06732 null
2022-07-13 SLAM: SLO-Aware Memory Optimization for Serverless Applications Gor Safaryan et.al. 2207.06183 null
2022-07-19 Structure PLP-SLAM: Efficient Sparse Mapping and Localization using Point, Line and Plane for Monocular, RGB-D and Stereo Cameras Fangwen Shu et.al. 2207.06058 link
2022-07-12 Accelerating Certifiable Estimation with Preconditioned Eigensolvers David M. Rosen et.al. 2207.05257 null
2022-07-12 Robust Key-Frame Stereo Visual SLAM with low-threshold Point and Line Features Meiyu Zhi et.al. 2207.05244 null
2022-07-14 SLAM Backends with Objects in Motion: A Unifying Framework and Tutorial Chih-Yuan Chiu et.al. 2207.05043 null
2022-07-08 BlindSpotNet: Seeing Where We Cannot See Taichi Fukuda et.al. 2207.03870 null
2022-07-08 Continuous Target-free Extrinsic Calibration of a Multi-Sensor System from a Sequence of Static Viewpoints Philipp Glira et.al. 2207.03785 null
2022-07-08 Distributed Ranging SLAM for Multiple Robots with Ultra-WideBand and Odometry Measurements Ran Liu et.al. 2207.03700 null
2022-07-07 RWT-SLAM: Robust Visual SLAM for Highly Weak-textured Environments Qihao Peng et.al. 2207.03539 null
2022-07-06 VI-SLAM2tag: Low-Effort Labeled Dataset Collection for Fingerprinting-Based Indoor Localization Marius Laska et.al. 2207.02668 null
2022-07-06 A Novel Hybrid Endoscopic Dataset for Evaluating Machine Learning-based Photometric Image Enhancement Models Axel Garcia-Vega et.al. 2207.02396 null
2022-07-04 VECtor: A Versatile Event-Centric Benchmark for Multi-Sensor SLAM Ling Gao et.al. 2207.01404 null
2022-07-04 VIP-SLAM: An Efficient Tightly-Coupled RGB-D Visual Inertial Planar SLAM Danpeng Chen et.al. 2207.01158 null
2022-07-03 Wireless Channel Prediction in Partially Observed Environments Mingsheng Yin et.al. 2207.00934 null
2022-07-01 A Survey on Active Simultaneous Localization and Mapping: State of the Art and New Frontiers Julio A. Placed et.al. 2207.00254 null
2022-07-01 Keeping Less is More: Point Sparsification for Visual SLAM Yeonsoo Park et.al. 2207.00225 null
2022-06-30 Controlled and impulsive compression of an entrapped air bubble during impact Utkarsh Jain et.al. 2206.15297 null
2022-06-30 Neural Rendering for Stereo 3D Reconstruction of Deformable Tissues in Robotic Surgery Yuehao Wang et.al. 2206.15255 link
2022-06-27 IBISCape: A Simulated Benchmark for multi-modal SLAM Systems Evaluation in Large-scale Dynamic Environments Abanob Soliman et.al. 2206.13455 link
2022-06-26 An Efficient Global Optimality Certificate for Landmark-Based SLAM Connor Holmes et.al. 2206.12961 link
2022-06-21 Object Structural Points Representation for Graph-based Semantic Monocular Localization and Mapping Davide Tateo et.al. 2206.10263 link
2022-06-20 Data Fusion for Radio Frequency SLAM with Robust Sampling Erik Leitinger et.al. 2206.09746 null
2022-06-19 RF-LIO: Removal-First Tightly-coupled Lidar Inertial Odometry in High Dynamic Environments Chenglong Qian et.al. 2206.09463 null
2022-06-17 Efficient WiFi LiDAR SLAM for Autonomous Robots in Large Environments Khairuldanial Ismail et.al. 2206.08733 null
2022-06-17 An Algorithm for the SE(3)-Transformation on Neural Implicit Maps for Remapping Functions Yijun Yuan et.al. 2206.08712 link
2022-06-13 ICP Algorithm: Theory, Practice And Its SLAM-oriented Taxonomy Hao Bai et.al. 2206.06435 null
2022-06-10 Experimental Evaluation of Visual-Inertial Odometry Systems for Arable Farming Javier Cremona et.al. 2206.05066 link
2022-06-09 SparseFormer: Attention-based Depth Completion Network Frederik Warburg et.al. 2206.04557 null
2022-06-07 Robot Self-Calibration Using Actuated 3D Sensors Arne Peters et.al. 2206.03430 null
2022-06-07 Object Scan Context: Object-centric Spatial Descriptor for Place Recognition within 3D Point Cloud Map Haodong Yuan et.al. 2206.03062 null
2022-06-05 DarkSLAM: GAN-assisted Visual SLAM for Reliable Operation in Low-light Conditions Alena Savinykh et.al. 2206.02199 null
2022-06-04 C$^3$Fusion: Consistent Contrastive Colon Fusion, Towards Deep SLAM in Colonoscopy Erez Posner et.al. 2206.01961 null
2022-06-01 PaGO-LOAM: Robust Ground-Optimized LiDAR Odometry Dong-Uk Seo et.al. 2206.00266 link
2022-05-27 A Look at Improving Robustness in Visual-inertial SLAM by Moment Matching Arno Solin et.al. 2205.13821 null
2022-05-31 LAMP 2.0: A Robust Multi-Robot SLAM System for Operation in Challenging Large-Scale Underground Environments Yun Chang et.al. 2205.13135 link
2022-05-25 Wildcat: Online Continuous-Time 3D Lidar-Inertial SLAM Milad Ramezani et.al. 2205.12595 null
2022-05-24 Loop Closure Prioritization for Efficient and Scalable Multi-Robot SLAM Christopher E. Denniston et.al. 2205.12402 link
2022-05-22 ALITA: A Large-scale Incremental Dataset for Long-term Autonomy Peng Yin et.al. 2205.10737 link
2022-05-19 FogROS 2: An Adaptive and Extensible Platform for Cloud and Fog Robotics Using ROS 2 Jeffrey Ichnowski et.al. 2205.09778 link
2022-05-17 Global Data Association for SLAM with 3D Grassmannian Manifold Objects Parker C. Lusk et.al. 2205.08556 null
2022-05-19 Cluster on Wheels Yuanyuan Yang et.al. 2205.08151 null
2022-05-12 Dynamic Dense RGB-D SLAM using Learning-based Visual Odometry Shihao Shen et.al. 2205.05916 link
2022-05-12 S3E-GNN: Sparse Spatial Scene Embedding with Graph Neural Networks for Camera Relocalization Ran Cheng et.al. 2205.05861 null
2022-05-14 Multi-modal Semantic SLAM for Complex Dynamic Environments Han Wang et.al. 2205.04300 link
2022-05-06 OROS: Orchestrating ROS-driven Collaborative Connected Robots in Mission-Critical Operations Carmen Delgado et.al. 2205.03256 null
2022-05-05 CNN-Augmented Visual-Inertial SLAM with Planar Constraints Pan Ji et.al. 2205.02940 null
2022-05-05 PMBM-based SLAM Filters in 5G mmWave Vehicular Networks Hyowon Kim et.al. 2205.02502 null
2022-05-04 BodySLAM: Joint Camera Localisation, Mapping, and Human Motion Tracking Dorian Henning et.al. 2205.02301 null
2022-05-04 A Global Asymptotic Convergent Observer for SLAM Seyed Hamed Hashemi et.al. 2205.01953 null
2022-05-04 Symmetry and Uncertainty-Aware Object SLAM for 6DoF Object Pose Estimation Nathaniel Merrill et.al. 2205.01823 link
2022-05-03 GeoRefine: Self-Supervised Online Depth Refinement for Accurate Dense Mapping Pan Ji et.al. 2205.01656 null
2022-04-29 Struct-MDC: Mesh-Refined Unsupervised Depth Completion Leveraging Structural Regularities from Visual SLAM Jinwoo Jeon et.al. 2204.13877 link
2022-04-27 The Revisiting Problem in Simultaneous Localization and Mapping: A Survey on Visual Loop Closure Detection Konstantinos A. Tsintotas et.al. 2204.12831 null
2022-04-27 Dynamic Registration: Joint Ego Motion Estimation and 3D Moving Object Detection in Dynamic Environment Wenyu Li et.al. 2204.12769 null
2022-04-29 MLO: Multi-Object Tracking and Lidar Odometry in Dynamic Environment Tingchen Ma et.al. 2204.11621 null
2022-04-23 Indoor simultaneous localization and mapping based on fringe projection profilometry Yang Zhao et.al. 2204.11020 null
2022-04-22 Enough is Enough: Towards Autonomous Uncertainty-driven Stopping Criteria Julio A. Placed et.al. 2204.10631 null
2022-04-22 Fast Autonomous Robotic Exploration Using the Underlying Graph Structure Julio A. Placed et.al. 2204.10610 null
2022-04-22 Making Parameterization and Constrains of Object Landmark Globally Consistent via SPD(3) Manifold and Improved Cost Functions Yutong Hu et.al. 2204.10552 null
2022-04-22 Implicit Object Mapping With Noisy Data Jad Abou-Chakra et.al. 2204.10516 link
2022-04-19 Photometric single-view dense 3D reconstruction in endoscopy Victor M. Batlle et.al. 2204.09083 null
2022-04-18 Pulsar skips: Understanding variations in the regular periods of rotating neutron stars Clayton Miller et.al. 2204.08449 null
2022-04-18 Tracking monocular camera pose and deformation for SLAM inside the human body Juan J. Gomez Rodriguez et.al. 2204.08309 null
2022-04-18 Mapping While Following: 2D LiDAR SLAM in Indoor Dynamic Environments with a Person Tracker Hanjing Ye et.al. 2204.08163 null
2022-04-14 ViViD++: Vision for Visibility Dataset Alex Junho Lee et.al. 2204.06183 null
2022-04-12 HiTPR: Hierarchical Transformer for Place Recognition in Point Cloud Zhixing Hou et.al. 2204.05481 null
2022-04-12 RGB-D Semantic SLAM for Surgical Robot Navigation in the Operating Room Cong Gao et.al. 2204.05467 null
2022-04-11 Optimized SC-F-LOAM: Optimized Fast LiDAR Odometry and Mapping Using Scan Context Lizhou Liao et.al. 2204.04932 link
2022-04-04 Monitoring social distancing with single image depth estimation Alessio Mingozzi et.al. 2204.01693 null
2022-04-01 Bi-directional Loop Closure for Visual SLAM Ihtisham Ali et.al. 2204.01524 null
2022-04-04 IMOT: General-Purpose, Fast and Robust Estimation for Spatial Perception Problems with Outliers Lei Sun et.al. 2204.01324 link
2022-04-03 Indoor Navigation Assistance for Visually Impaired People via Dynamic SLAM and Panoptic Segmentation with an RGB-D Sensor Wenyan Ou et.al. 2204.01154 null
2022-04-02 UrbanFly: Uncertainty-Aware Planning for Navigation Amongst High-Rises with Monocular Visual-Inertial SLAM Maps Ayyappa Swamy Thatavarthy et.al. 2204.00865 link
2022-03-31 Curiosity Driven Self-supervised Tactile Exploration of Unknown Objects Yujie Lu et.al. 2204.00035 null
2022-03-30 GTP-SLAM: Game-Theoretic Priors for Simultaneous Localization and Mapping in Multi-Agent Scenarios Chih-Yuan Chiu et.al. 2203.16690 null
2022-03-29 Indoor SLAM Using a Foot-mounted IMU and the local Magnetic Field Mostafa Osman et.al. 2203.15866 null
2022-03-29 Eventor: An Efficient Event-Based Monocular Multi-View Stereo Accelerator on FPGA Platform Mingjun Li et.al. 2203.15439 null
2022-03-29 Sparse Image based Navigation Architecture to Mitigate the need of precise Localization in Mobile Robots Pranay Mathur et.al. 2203.15272 null
2022-03-28 Are High-Resolution Event Cameras Really Needed? Daniel Gehrig et.al. 2203.14672 null
2022-03-25 Spectral Measurement Sparsification for Pose-Graph SLAM Kevin J. Doherty et.al. 2203.13897 link
2022-03-25 FD-SLAM: 3-D Reconstruction Using Features and Dense Matching Xingrui Yang et.al. 2203.13861 null
2022-03-25 Gravity-constrained point cloud registration Vladimír Kubelka et.al. 2203.13799 null
2022-03-24 MD-SLAM: Multi-cue Direct SLAM Luca Di Giammarino et.al. 2203.13237 link
2022-03-24 Unsupervised Simultaneous Learning for Camera Re-Localization and Depth Estimation from Video Shun Taguchi et.al. 2203.12804 null
2022-03-19 Hybrid Active and Passive Sensing for SLAM in Wireless Communication Systems Jie Yang et.al. 2203.10267 null
2022-03-16 Any Way You Look At It: Semantic Crossview Localization and Mapping with LiDAR Ian D. Miller et.al. 2203.08925 link
2022-03-15 Neural RF SLAM for unsupervised positioning and mapping with channel state information Shreya Kadambi et.al. 2203.08264 null
2022-03-15 Simultaneous Localisation and Mapping with Quadric Surfaces Tristan Laidlow et.al. 2203.08040 null
2022-03-14 Drift Reduced Navigation with Deep Explainable Features Mohd Omama et.al. 2203.06897 link
2022-03-11 An Efficient Accelerator for Deep Learning-based Point Cloud Registration on FPGAs Keisuke Sugiura et.al. 2203.05763 null
2022-03-10 High Definition, Inexpensive, Underwater Mapping Bharat Joshi et.al. 2203.05640 link
2022-03-10 SelfTune: Metrically Scaled Monocular Depth Estimation through Self-Supervised Learning Jaehoon Choi et.al. 2203.05332 null
2022-03-08 Tune your Place Recognition: Self-Supervised Domain Calibration via Robust SLAM Pierre-Yves Lajoie et.al. 2203.04446 link
2022-03-08 SLAM-Supported Self-Training for 6D Object Pose Estimation Ziqi Lu et.al. 2203.04424 link
2022-03-08 An Online Semantic Mapping System for Extending and Enhancing Visual SLAM Thorsten Hempel et.al. 2203.03944 null
2022-03-07 Multi-Modal Lidar Dataset for Benchmarking General-Purpose Localization and Mapping Algorithms Qingqing Li et.al. 2203.03454 link
2022-03-07 OverlapTransformer: An Efficient and Rotation-Invariant Transformer Network for LiDAR-Based Place Recognition Junyi Ma et.al. 2203.03397 link
2022-03-06 Minimum Cost Multicuts for Incorrect Landmark Edge Detection in Pose-graph SLAM Kazushi Aiba et.al. 2203.02887 null
2022-03-06 RGB-D SLAM in Indoor Planar Environments with Multiple Large Dynamic Objects Ran Long et.al. 2203.02882 null
2022-03-03 STUN: Self-Teaching Uncertainty Estimation for Place Recognition Kaiwen Cai et.al. 2203.01851 link
2022-03-03 Continual SLAM: Beyond Lifelong Simultaneous Localization and Mapping through Continual Learning Niclas Vödisch et.al. 2203.01578 link
2022-03-02 FAST-LIVO: Fast and Tightly-coupled Sparse-Direct LiDAR-Inertial-Visual Odometry Chunran Zheng et.al. 2203.00893 link
2022-03-02 Distributed Riemannian Optimization with Lazy Communication for Collaborative Geometric Estimation Yulun Tian et.al. 2203.00851 null
2022-03-01 Descriptellation: Deep Learned Constellation Descriptors for SLAM Chunwei Xing et.al. 2203.00567 null
2022-03-01 Collaborative Robot Mapping using Spectral Graph Analysis Lukas Bernreiter et.al. 2203.00308 null
2022-02-26 RL-PGO: Reinforcement Learning-based Planar Pose-Graph Optimization Nikolaos Kourtzanidis et.al. 2202.13221 link
2022-02-25 Probabilistic Data Association for Semantic SLAM at Scale Elad Michael et.al. 2202.12802 link
2022-02-24 TwistSLAM: Constrained SLAM in Dynamic Environment Mathieu Gonzalez et.al. 2202.12384 null
2022-02-24 Light Robust Monocular Depth Estimation For Outdoor Environment Via Monochrome And Color Camera Fusion Hyeonsoo Jang et.al. 2202.12108 null
2022-02-23 MITI: SLAM Benchmark for Laparoscopic Surgery Regine Hartwig et.al. 2202.11496 null
2022-02-23 DL-SLOT: Dynamic Lidar SLAM and Object Tracking Based On Graph Optimization Xuebo Tian et.al. 2202.11431 null
2022-02-23 Are We Ready for Robust and Resilient SLAM? A Framework For Quantitative Characterization of SLAM Datasets Islam Ali et.al. 2202.11312 null
2022-02-22 SAGE: SLAM with Appearance and Geometry Prior for Endoscopy Xingtong Liu et.al. 2202.09487 link
2022-02-18 OKVIS2: Realtime Scalable Visual-Inertial SLAM with Loop Closure Stefan Leutenegger et.al. 2202.09199 null
2022-02-18 MultiRes-NetVLAD: Augmenting Place Recognition Training with Low-Resolution Imagery Ahmad Khaliq et.al. 2202.09146 link
2022-02-18 An Energy-Efficient and Runtime-Reconfigurable FPGA-Based Accelerator for Robotic Localization Systems Qiang Liu et.al. 2202.08952 null
2022-02-17 Continuous-Time vs. Discrete-Time Vision-based SLAM: A Comparative Study Giovanni Cioffi et.al. 2202.08894 link
2022-02-17 LiDAR-Inertial 3D SLAM with Plane Constraint for Multi-story Building Jiashi Zhang et.al. 2202.08487 null
2022-02-16 Virtual Maps for Autonomous Exploration of Cluttered Underwater Environments Jinkun Wang et.al. 2202.08359 null
2022-02-11 Overhead Image Factors for Underwater Sonar-based SLAM John McConnell et.al. 2202.05811 null
2022-02-10 Scale Estimation with Dual Quadrics for Monocular Object SLAM Shuangfu Song et.al. 2202.04816 null
2022-02-08 A Novel Image Descriptor with Aggregated Semantic Skeleton Representation for Long-term Visual Place Recognition Nie Jiwei et.al. 2202.03677 null
2022-01-25 Autonomous Vehicles: Open-Source Technologies, Considerations, and Development Oussama Saoudi et.al. 2202.03148 null
2022-02-07 Temporal Point Cloud Completion with Pose Disturbance Jieqi Shi et.al. 2202.03084 null
2022-02-04 DYP-SLAM: A Real-time Visual SLAM Based on YOLO and Probability in Dynamic Environments Xinggang Hu et.al. 2202.01938 null
2022-02-01 A Model for Multi-View Residual Covariances based on Perspective Deformation Alejandro Fontan et.al. 2202.00765 null
2022-01-30 Joint Vehicular Localization and Reflective Mapping Based on Team Channel-SLAM Xinghe Chu et.al. 2201.12726 null
2022-01-28 RGB-D SLAM Using Attention Guided Frame Association Ali Caglayan et.al. 2201.12047 null
2022-02-04 Learning to Act with Affordance-Aware Multimodal Neural SLAM Zhiwei Jia et.al. 2201.09862 link
2022-01-22 Phase-SLAM: Phase Based Simultaneous Localization and Mapping for Mobile Structured Light Illumination Systems Xi Zheng et.al. 2201.09048 link
2022-01-17 SC-LiDAR-SLAM: a Front-end Agnostic Versatile LiDAR SLAM System Giseop Kim et.al. 2201.06423 null
2022-01-14 SRVIO: Super Robust Visual Inertial Odometry for dynamic environments and challenging Loop-closure conditions Ali Samadzadeh et.al. 2201.05386 link
2022-01-19 Multi-Hypothesis Scan Matching through Clustering Giorgio Iavicoli et.al. 2201.03814 null
2022-01-11 Performance Guarantees for Spectral Initialization in Rotation Averaging and Pose-Graph SLAM Kevin J. Doherty et.al. 2201.03773 null
2022-01-10 High-resolution Ecosystem Mapping in Repetitive Environments Using Dual Camera SLAM Brian M. Hopkinson et.al. 2201.03364 link
2022-01-10 Why-So-Deep: Towards Boosting Previously Trained Models for Visual Place Recognition M. Usman Maqbool Bhutta et.al. 2201.03212 link
2022-01-04 Formulations of Hydrodynamic Force in the Transition Stage of the Water Entry of Linear Wedges with Constant and Varying Speeds Xueliang Wen et.al. 2201.00959 null
2021-12-29 Efficient Belief Space Planning in High-Dimensional State Spaces using PIVOT: Predictive Incremental Variable Ordering Tactic Khen Elimelech et.al. 2112.14428 null
2021-12-19 M2DGR: A Multi-sensor and Multi-scenario SLAM Dataset for Ground Robots Jie Yin et.al. 2112.13659 link
2021-12-27 UV-SLAM: Unconstrained Line-based SLAM Using Vanishing Points for Structural Mapping Hyunjun Lim et.al. 2112.13515 link
2021-12-25 Simultaneous Location of Rail Vehicles and Mapping of Environment with Multiple LiDARs Yusheng Wang et.al. 2112.13224 null
2021-12-25 Edge Robotics: Edge-Computing-Accelerated Multi-Robot Simultaneous Localization and Mapping Peng Huang et.al. 2112.13222 null
2021-12-24 3D Point Cloud Reconstruction and SLAM as an Input Ziyu Li et.al. 2112.12907 null
2021-12-22 NICE-SLAM: Neural Implicit Scalable Encoding for SLAM Zihan Zhu et.al. 2112.12130 link
2021-12-18 Fast and Robust Registration of Partially Overlapping Point Clouds Eduardo Arnold et.al. 2112.09922 link
2021-12-17 Symmetry-aware Neural Architecture for Embodied Visual Navigation Shuang Liu et.al. 2112.09515 null
2021-12-27 Homography Decomposition Networks for Planar Object Tracking Xinrui Zhan et.al. 2112.07909 link
2021-12-14 Autonomous Navigation System from Simultaneous Localization and Mapping Micheal Caracciolo et.al. 2112.07723 link
2021-12-12 360-DFPE: Leveraging Monocular 360-Layouts for Direct Floor Plan Estimation Bolivar Solarte et.al. 2112.06180 link
2021-12-11 Simultaneous Localization and Mapping: Through the Lens of Nonlinear Optimization Amay Saxena et.al. 2112.05921 null
2021-12-07 Hybrid Visual SLAM for Underwater Vehicle Manipulator Systems Gideon Billings et.al. 2112.03826 link
2021-12-05 Iterated Posterior Linearization PMB Filter for 5G SLAM Yu Ge et.al. 2112.02575 null
2021-12-03 Fast Direct Stereo Visual SLAM Jiawei Mo et.al. 2112.01890 link
2021-12-02 MegBA: A High-Performance and Distributed Library for Large-Scale Bundle Adjustment Jie Ren et.al. 2112.01349 link
2021-12-01 Research on Event Accumulator Settings for Event-Based SLAM Kun Xiao et.al. 2112.00427 link
2021-11-29 An in-depth experimental study of sensor usage and visual reasoning of robots navigating in real environments Assem Sadek et.al. 2111.14666 null
2021-11-29 Deployment of Aerial Robots after a major fire of an industrial hall with hazardous substances, a report Hartmut Surmann et.al. 2111.14542 null
2021-11-24 Automatic Mapping with Obstacle Identification for Indoor Human Mobility Assessment V. Ayala-Alfaro et.al. 2111.12690 null
2021-11-24 Autonomous bot with ML-based reactive navigation for indoor environment Yash Srivastava et.al. 2111.12542 null
2021-11-22 A General Framework for Lifelong Localization and Mapping in Changing Environment Min Zhao et.al. 2111.10946 link
2021-11-17 Probabilistic Spatial Distribution Prior Based Attentional Keypoints Matching Network Xiaoming Zhao et.al. 2111.09006 null
2021-11-10 Comparing dominance of tennis' big three via multiple-output Bayesian quantile regression models Bruno Santos et.al. 2111.05631 null
2021-11-10 TomoSLAM: factor graph optimization for rotation angle refinement in microtomography Mark Griguletskii et.al. 2111.05562 null
2021-11-07 Hierarchical Segment-based Optimization for SLAM Yuxin Tian et.al. 2111.04101 null
2021-11-07 Online Mutual Adaptation of Deep Depth Prediction and Visual SLAM Shing Yan Loo et.al. 2111.04096 null
2021-11-05 MSC-VO: Exploiting Manhattan and Structural Constraints for Visual Odometry Joan P. Company-Corcoles et.al. 2111.03408 null
2021-10-31 Loop closure detection using local 3D deep descriptors Youjie Zhou et.al. 2111.00440 link
2021-10-27 Millimeter Wave Wireless Assisted Robot Navigation with Link State Classification Mingsheng Yin et.al. 2110.14789 link
2021-10-27 Efficient Placard Discovery for Semantic Mapping During Frontier Exploration David Balaban et.al. 2110.14742 null
2021-10-26 Robust Multi-view Registration of Point Sets with Laplacian Mixture Model Jin Zhang et.al. 2110.13744 null
2021-10-25 WOLF: A modular estimation framework for robotics based on factor graphs Joan Sola et.al. 2110.12919 null
2021-10-21 Real-Time Ground-Plane Refined LiDAR SLAM Fan Yang et.al. 2110.11517 null
2021-10-21 SymbioLCD: Ensemble-Based Loop Closure Detection using CNN-Extracted Objects and Visual Bag-of-Words Jonathan J. Y. Kim et.al. 2110.11491 null
2021-10-21 InterpolationSLAM: A Novel Robust Visual SLAM System in Rotational Motion Zhenkun Zhu et.al. 2110.11040 null
2021-10-20 SLAM: A Unified Encoder for Speech and Language Modeling via Speech-Text Joint Pre-Training Ankur Bapna et.al. 2110.10329 null
2021-10-18 Enhancing exploration algorithms for navigation with visual SLAM Kirill Muravyev et.al. 2110.09156 null
2021-10-18 Accurate and Robust Object-oriented SLAM with 3D Quadric Landmark Construction in Outdoor Environment Rui Tian et.al. 2110.08977 null
2021-10-16 Partial Hierarchical Pose Graph Optimization for SLAM Alexander Korovko et.al. 2110.08639 null
2021-10-14 Active SLAM over Continuous Trajectory and Control: A Covariance-Feedback Approach Shumon Koga et.al. 2110.07546 null
2021-10-13 Collaborative Radio SLAM for Multiple Robots based on WiFi Fingerprint Similarity Ran Liu et.al. 2110.06541 null
2021-10-12 Learning Efficient Multi-Agent Cooperative Visual Exploration Chao Yu et.al. 2110.05734 null
2021-10-07 Self-Supervised Depth Completion for Active Stereo Frederik Warburg et.al. 2110.03234 null
2021-10-06 InterpolationSLAM: A Novel Robust Visual SLAM System in Rotating Scenes Zhenkun Zhu et.al. 2110.02593 null
2021-10-03 AEROS: Adaptive RObust least-Squares for Graph-Based SLAM Milad Ramezani et.al. 2110.02018 null
2021-10-04 Fast Uncertainty Quantification for Active Graph SLAM Julio A. Placed et.al. 2110.01289 link
2021-10-04 Geometry-based Graph Pruning for Lifelong SLAM Gerhard Kurz et.al. 2110.01286 null
2021-10-03 Quadrotor Control on $SU(2)\times R^3$ with SLAM Integration Marcus Greiff et.al. 2110.01099 null
2021-10-02 Online Incremental Non-Gaussian Inference for SLAM Using Normalizing Flows Qiangqiang Huang et.al. 2110.00876 link

SFM

Publish Date Title Authors PDF Code
2025-12-04 Deep infant brain segmentation from multi-contrast MRI Malte Hoffmann et.al. 2512.05114 null
2025-12-04 QKAN-LSTM: Quantum-inspired Kolmogorov-Arnold Long Short-term Memory Yu-Chao Hsu et.al. 2512.05049 null
2025-12-04 Geometric Data Science Olga D Anosova et.al. 2512.05040 null
2025-12-04 Internal superfluid response and torque evolution in the giant glitch of PSR J1718-3718 Peng Liu et.al. 2512.04972 null
2025-12-04 Canonical Rough Path over Tempered Fractional Brownian Motion: Existence, Construction, and Applications Atef Lechiheb et.al. 2512.04646 null
2025-12-04 Refaçade: Editing Object with Given Reference Texture Youze Huang et.al. 2512.04534 null
2025-12-04 Development of a 15-Degree-of-Freedom Bionic Hand with Cable-Driven Transmission and Distributed Actuation Haoqi Han et.al. 2512.04399 null
2025-12-03 Gamma-from-Mono: Road-Relative, Metric, Self-Supervised Monocular Geometry for Vehicular Applications Gasser Elazab et.al. 2512.04303 null
2025-12-03 Emergent Outlier View Rejection in Visual Geometry Grounded Transformers Jisang Han et.al. 2512.04012 null
2025-12-03 DirectDrag: High-Fidelity, Mask-Free, Prompt-Free Drag-based Image Editing via Readout-Guided Feature Alignment Sheng-Hao Liao et.al. 2512.03981 null
2025-11-26 TraceGen: World Modeling in 3D Trace Space Enables Learning from Cross-Embodiment Videos Seungjae Lee et.al. 2511.21690 null
2025-11-26 UAVLight: A Benchmark for Illumination-Robust 3D Reconstruction in Unmanned Aerial Vehicle (UAV) Scenes Kang Du et.al. 2511.21565 null
2025-11-26 From Observation to Action: Latent Action-based Primitive Segmentation for VLA Pre-training in Industrial Settings Jiajie Zhang et.al. 2511.21428 null
2025-11-26 DeepRFTv2: Kernel-level Learning for Image Deblurring Xintian Mao et.al. 2511.21132 null
2025-11-25 Hund-projected Kanamori model: an effective description of Hund's metals near the Mott insulating regime Johan Carlström et.al. 2511.20788 null
2025-11-25 From Observations to Simulations: A Neural-Network Approach to Intracluster Medium Kinematics E. Gatuzz et.al. 2511.20755 null
2025-11-25 Diverse Video Generation with Determinantal Point Process-Guided Policy Optimization Tahira Kazimi et.al. 2511.20647 null
2025-11-25 Dance Style Classification using Laban-Inspired and Frequency-Domain Motion Features Ben Hamscher et.al. 2511.20469 null
2025-11-25 AMB3R: Accurate Feed-forward Metric-scale 3D Reconstruction with Backend Hengyi Wang et.al. 2511.20343 null
2025-11-25 Stochastic Dynamics of Skyrmions on a Racetrack: Impact of Equilibrium and Nonequilibrium Noise Anton V. Hlushchenko et.al. 2511.20287 null
2025-11-24 Dynamic Multi-Species Bird Soundscape Generation with Acoustic Patterning and 3D Spatialization Ellie L. Zhang et.al. 2511.19275 null
2025-11-24 A Deep-Learning-Based Framework for Focal Mechanism Determination and Its Application to the 2022 Luding Earthquake Sequence Ziye Yu et.al. 2511.19185 null
2025-11-24 MetroGS: Efficient and Stable Reconstruction of Geometrically Accurate High-Fidelity Large-Scale Scenes Kehua Chen et.al. 2511.19172 null
2025-11-24 The variability of blazars throughout the electromagnetic spectrum Claudia M. Raiteri et.al. 2511.18975 null
2025-11-24 MagicWorld: Interactive Geometry-driven Video World Exploration Guangyuan Li et.al. 2511.18886 null
2025-11-24 STCDiT: Spatio-Temporally Consistent Diffusion Transformer for High-Quality Video Super-Resolution Junyang Chen et.al. 2511.18786 null
2025-11-24 On the role of fractional Brownian motion in models of chemotaxis and stochastic gradient ascent Gustavo Cornejo-Olea et.al. 2511.18745 null
2025-11-23 C3Po: Cross-View Cross-Modality Correspondence by Pointmap Prediction Kuan Wei Huang et.al. 2511.18559 null
2025-11-23 Non-Symplectic Deformations of Geometric Quantisation Kerr Maxwell et.al. 2511.18549 null
2025-11-23 Zero-Shot Video Deraining with Video Diffusion Models Tuomas Varanka et.al. 2511.18537 null
2025-11-23 Expanding the Workspace of Electromagnetic Navigation Systems Using Dynamic Feedback for Single- and Multi-agent Control Jasan Zughaibi et.al. 2511.18486 null
2025-11-23 Escape from end-pinching in Herschel-Bulkley ligaments Shu Yang et.al. 2511.18388 null
2025-11-23 EgoVITA: Learning to Plan and Verify for Egocentric Video Reasoning Yogesh Kulkarni et.al. 2511.18242 null
2025-11-22 MotionDuet: Dual-Conditioned 3D Human Motion Generation with Video-Regularized Text Learning Yi-Yang Zhang et.al. 2511.18209 null
2025-11-22 A Unified Multi-Dynamics Framework for Perception-Oriented Modeling in Tendon-Driven Continuum Robots Ibrahim Alsarraj et.al. 2511.18088 null
2025-11-22 Plan-X: Instruct Video Generation via Semantic Planning Lun Huang et.al. 2511.17986 null
2025-11-22 Dynamic Slowdown and Spatial Correlations in Viscous Silica Melt: Perspectives from Dynamic Disorder Shubham Kumar et.al. 2511.17887 null
2025-11-21 Lane-Frame Quantum Multimodal Driving Forecasts for the Trajectory of Autonomous Vehicles Navneet Singh et.al. 2511.17675 null
2025-11-18 Unified Low-Light Traffic Image Enhancement via Multi-Stage Illumination Recovery and Adaptive Noise Suppression Siddiqua Namrah et.al. 2511.17612 null
2025-10-24 RadioMapMotion: A Dataset and Baseline for Proactive Spatio-Temporal Radio Environment Prediction Honggang Jia et.al. 2511.17526 null
2025-11-21 TRAO Survey of the Nearby Filamentary Molecular Clouds, the Universal Nursery of Stars (TRAO-FUNS). IV. Filaments and Dense Cores in the W40 and Serpens South Regions of Aquila Satyajeet Moharana et.al. 2511.16978 null
2025-11-21 One Walk is All You Need: Data-Efficient 3D RF Scene Reconstruction with Human Movements Yiheng Bian et.al. 2511.16966 null
2025-11-20 TriDiff-4D: Fast 4D Generation through Diffusion-based Triplane Re-posing Eddie Pokming Sheung et.al. 2511.16662 null
2025-11-20 Flow and Depth Assisted Video Prediction with Latent Transformer Eliyas Suleyman et.al. 2511.16484 null
2025-11-20 Two Epochs of VLBI Observations of 8 KISSR Seyfert & LINER Galaxies: Suggestions of Fast and Filamentary Outflows Preeti Kharb et.al. 2511.16159 null
2025-11-19 MambaIO: Global-Coordinate Inertial Odometry for Pedestrians via Multi-Scale Frequency-Decoupled Modeling Shanshan Zhang et.al. 2511.15645 null
2025-11-19 Covariant Measures of Non-Markovianity in Curved Spacetime Tushar Waghmare et.al. 2511.15365 null
2025-11-19 Generating Natural-Language Surgical Feedback: From Structured Representation to Domain-Grounded Evaluation Firdavs Nasriddinov et.al. 2511.15159 null
2025-11-19 SceneEdited: A City-Scale Benchmark for 3D HD Map Updating via Image-Guided Change Detection Chun-Jung Lin et.al. 2511.15153 null
2025-11-18 Gaussian See, Gaussian Do: Semantic 3D Motion Transfer from Multiview Video Yarin Bekor et.al. 2511.14848 null
2025-11-18 Blur-Robust Detection via Feature Restoration: An End-to-End Framework for Prior-Guided Infrared UAV Target Detection Xiaolin Wang et.al. 2511.14371 null
2025-11-18 Hubble Space Telescope proper motions of Large Magellanic Cloud star clusters -- II. Kinematic structure of young and intermediate-age clusters F. Niederhofer et.al. 2511.14351 null
2025-11-18 Vortex stability in pseudo-Hermitian theories R. A. Battye et.al. 2511.14300 null
2025-11-18 Model-Based Clustering of Football Event Sequences: A Marked Spatio-Temporal Point Process Mixture Approach Koffi Amezouwui et.al. 2511.14297 null
2025-11-18 Newborn jet in the symbiotic system R Aquarii T. Liimets et.al. 2511.14243 null
2025-11-18 FreeMusco: Motion-Free Learning of Latent Control for Morphology-Adaptive Locomotion in Musculoskeletal Characters Minkwan Kim et.al. 2511.14205 null
2025-11-18 AsyncVLA: Asynchronous Flow Matching for Vision-Language-Action Models Yuhua Jiang et.al. 2511.14148 null
2025-11-17 B2F: End-to-End Body-to-Face Motion Generation with Style Reference Bokyung Jang et.al. 2511.13988 null
2025-11-17 Enabling Real-Time Volumetric Imaging in Interventional Radiology Suites via a Deep Learning Framework Robust to C-arm Tilt Fawazilla Utomo et.al. 2511.13980 null
2025-11-17 Ultrafast electron diffractive imaging of the dissociation of pre-excited molecules Yanwei Xiong et.al. 2511.13479 null
2025-11-17 An Automated Framework for Analyzing Structural Evolution in On-the-fly Non-adiabatic Molecular Dynamics Using Autoencoder and Multiple Molecular Descriptors Hangxu Liu et.al. 2511.13364 null
2025-11-17 The Spontaneous Genesis of Solar Prominence Structures Driven by Supergranulation in Three-Dimensional Simulations Huanxin Chen et.al. 2511.13252 null
2025-11-17 Infrared photometry and CaT spectroscopy of the most metal-poor in-situ globular cluster VVV-CL001 W. Haro Moya et.al. 2511.13161 null
2025-11-16 Kagome metals Domenico Di Sante et.al. 2511.12731 null
2025-11-16 Examining Turbulence in Galactic Molecular Clouds - II: Continuity of Turbulence Cascading in a Portion of the Local Arm Yuehui Ma et.al. 2511.12418 null
2025-11-16 Towards Rotation-only Imaging Geometry: Rotation Estimation Xinrui Li et.al. 2511.12415 null
2025-11-14 Free3D: 3D Human Motion Emerges from Single-View 2D Supervision Sheng Liu et.al. 2511.11368 null
2025-11-14 YCB-Ev SD: Synthetic event-vision dataset for 6DoF object pose estimation Pavel Rojtberg et.al. 2511.11344 null
2025-11-14 The Spatial Evolution of Star Clusters in NGC 628 with JWST Anne S. M. Buckner et.al. 2511.11115 null
2025-11-14 Discovery of an X-ray bridge between the comma-shaped gas and the main cluster in MCXC J0157.4-0550 Chong Yang et.al. 2511.10968 null
2025-11-14 DEFT-LLM: Disentangled Expert Feature Tuning for Micro-Expression Recognition Ren Zhang et.al. 2511.10948 null
2025-11-14 A High-Precision Dynamical Model of Callisto: Incorporating Rotation Effects within Multi-Layer Internal Structure Models Kai Huang et.al. 2511.10929 null
2025-11-14 Collaborative Multi-Robot Non-Prehensile Manipulation via Flow-Matching Co-Generation Yorai Shaoul et.al. 2511.10874 null
2025-11-13 A validated lumped-element model for bioinspired acoustic flow sensing toward the performance limit Wei Sun et.al. 2511.10830 null
2025-11-13 From Attention to Frequency: Integration of Vision Transformer and FFT-ReLU for Enhanced Image Deblurring Syed Mumtahin Mahmud et.al. 2511.10806 null
2025-11-13 Towards Attribution of Generators and Emotional Manipulation in Cross-Lingual Synthetic Speech using Geometric Learning Girish et.al. 2511.10790 null
2025-11-13 The Quiescent Merging Nature of the Coma Cluster Revealed by ICM Velocity Structure E. Gatuzz et.al. 2511.10740 null
2025-11-13 From Fold to Function: Dynamic Modeling and Simulation-Driven Design of Origami Mechanisms Tianhui Han et.al. 2511.10580 null
2025-11-13 M3Scope a 3D multimode multiplane microscope for imaging nanoscale dynamics in soft matter Steven Huysecom et.al. 2511.10174 null
2025-11-13 Physics-informed Machine Learning for Static Friction Modeling in Robotic Manipulators Based on Kolmogorov-Arnold Networks Yizheng Wang et.al. 2511.10079 null
2025-11-13 Mitigating Error Accumulation in Co-Speech Motion Generation via Global Rotation Diffusion and Multi-Level Constraints Xiangyue Zhang et.al. 2511.10076 null
2025-11-13 PuffyBot: An Untethered Shape Morphing Robot for Multi-environment Locomotion Shashwat Singh et.al. 2511.09885 null
2025-11-13 AHA! Animating Human Avatars in Diverse Scenes with Gaussian Splatting Aymen Mir et.al. 2511.09827 null
2025-11-12 DreamPose3D: Hallucinative Diffusion with Prompt Learning for 3D Human Pose Estimation Jerrin Bright et.al. 2511.09502 null
2025-11-12 SPIDER: Scalable Physics-Informed Dexterous Retargeting Chaoyi Pan et.al. 2511.09484 null
2025-11-12 3D PIC simulation and theoretical modeling of RF Laser pulse in magnetized plasma for the generation of multidimensional relativistic Wakefields A. A. Molavi Choobini et.al. 2511.09079 null
2025-11-12 Group-Theoretic Structure Governing Identifiability in Inverse Problems Isshin Arai et.al. 2511.08995 null
2025-11-11 Resolving Thermospheric Vertical Wind Ambiguities and Energy Processes Jeffrey P. Thayer et.al. 2511.08830 null
2025-11-11 Analytical Description of Baryonic Matter Fluctuations Using Jeans Filtering Functions in Second-Order Cosmological Perturbation Theory Diego Fernando Fonseca et.al. 2511.08820 null
2025-11-11 3D MHD simulations of coronal loops heated via magnetic braiding I. Continuous driving Gabriele Cozzo et.al. 2511.08726 null
2025-11-11 Coordinated Space- and Ground-based Monitoring of Accretion Bursts in a Protoplanetary Disk: The Orbital and Accretion Properties of DQ Tau Hala Alqubelat et.al. 2511.08311 null
2025-11-11 Direction and speed selectivity properties for spatio-temporal receptive fields according to the generalized Gaussian derivative model for visual receptive fields Tony Lindeberg et.al. 2511.08101 null
2025-11-17 Silicon-photonic optomechanical magnetometer Fernando Gottardo et.al. 2511.07852 null
2025-11-11 Human Motion Synthesis in 3D Scenes via Unified Scene Semantic Occupancy Gong Jingyu et.al. 2511.07819 null
2025-11-10 DIMO: Diverse 3D Motion Generation for Arbitrary Objects Linzhan Mou et.al. 2511.07409 null
2025-11-10 Ultrafast Topological Transitions Driven by Permittivity Modulation in Non-Hermitian Multilayers Giuseppina Simone et.al. 2511.06963 null
2025-11-10 Ambiguity-aware Truncated Flow Matching for Ambiguous Medical Image Segmentation Fanding Li et.al. 2511.06857 null
2025-11-10 SDSS-ALMA Legacy Value Archival Gas Exploration (SALVAGE) -- I: global star formation is governed by central (not global) molecular gas Scott Wilkinson et.al. 2511.06775 null
2025-11-08 Development and testing of novel soft sleeve actuators Mohammed Abboodi et.al. 2511.06102 null
2025-11-08 Hybrid CNN-ViT Framework for Motion-Blurred Scene Text Restoration Umar Rashid et.al. 2511.06087 null
2025-11-08 Equilibrium Portfolio Selection under Utility-Variance Analysis of Log Returns in Incomplete Markets Yue Cao et.al. 2511.05861 null
2025-11-08 Supermassive Black Hole and Broad-line Region in NGC 5548: 2023 Reverberation Mapping Results Wen-Zhe Xi et.al. 2511.05851 null
2025-11-07 A dual grid geometric electromagnetic particle in cell method Katharina Kormann et.al. 2511.05032 null
2025-11-06 Kinematic and extinction analysis of a potential spiral arm beyond the Galactic bar Simran Joharle et.al. 2511.04778 null
2025-11-06 Sub-Gyr variability around the SFMS and its contribution to the scatter A. Camps-Fariña et.al. 2511.04745 null
2025-11-06 Dissecting coherent motions in extreme wall shear stress events within adverse pressure gradient turbulent boundary layers Leandro J. O. Silva et.al. 2511.04620 null
2025-11-21 Disentangled Concepts Speak Louder Than Words: Explainable Video Action Recognition Jongseo Lee et.al. 2511.03725 null
2025-11-05 Extreme-Mass-Ratio Inspirals Embedded in Dark Matter Halo I: Existence of Homoclinic Orbit and Near-Horizon Chaos Surajit Das et.al. 2511.03657 null
2025-11-04 Comparative Investigations on Active and Passive Tails of Undulating Swimmers Dev Pradeepkumar Nayak et.al. 2511.03057 null
2025-11-04 Distributions and evolution of the equatorial rotation velocities of 2937 BAF-type main-sequence stars from asteroseismology Conny Aerts et.al. 2511.02909 null
2025-11-04 Cycle-Sync: Robust Global Camera Pose Estimation through Enhanced Cycle-Consistent Synchronization Shaohan Li et.al. 2511.02329 null
2025-11-04 Characterizing the astrometric quality of AGNs in Gaia-CRF3 Shilong Liao et.al. 2511.02204 null
2025-11-03 Fractional Diffusion Bridge Models Gabriel Nobis et.al. 2511.01795 null
2025-11-03 Phason-driven temperature-dependent transport in moiré graphene Alex Boschi et.al. 2511.01691 null
2025-11-03 Apsidal motion in massive binaries Sophie Rosu et.al. 2511.01522 null
2025-11-12 Robust topological invariants of timelike circular orbits for spinning test particles in black hole spacetimes Yong Song et.al. 2511.01447 null
2025-11-04 Kinematify: Open-Vocabulary Synthesis of High-DoF Articulated Objects Jiawei Wang et.al. 2511.01294 null
2025-11-03 Speech-DRAME: A Framework for Human-Aligned Benchmarks in Speech Role-Play Jiatong Shi et.al. 2511.01261 null
2025-11-02 From Spray to Metric: The Geometric Construction of the Jacobi Metric Zonghai Li et.al. 2511.01004 null
2025-11-02 The CatWISE2020 Quasar dipole: A Reassessment of the Cosmic Dipole Anomaly Masroor Bashir et.al. 2511.00822 null
2025-11-02 Real-Time Learning of Predictive Dynamic Obstacle Models for Robotic Motion Planning Stella Kombo et.al. 2511.00814 null
2025-11-01 Oitijjo-3D: Generative AI Framework for Rapid 3D Heritage Reconstruction from Street View Imagery Momen Khandoker Ope et.al. 2511.00362 null
2025-11-17 Deep Chandra X-ray Observations of Abell 2029: the Merger History of a Relaxed, Strong Cool Core Cluster Courtney B. Watson et.al. 2511.00250 null
2025-10-30 Comparing the magnetic Rayleigh-Taylor instability dynamics in two- and three-dimensions Manohar Teja Kalluri et.al. 2510.27053 null
2025-10-30 HEIR: Learning Graph-Based Motion Hierarchies Cheng Zheng et.al. 2510.26786 null
2025-10-30 Wrinkle-Induced Hexagonal Boron Nitride Nanochannels for Biomolecule Localization and Imaging Xiliang Yang et.al. 2510.26370 null
2025-10-30 Ram pressure shaping HVC droplets -- FAST HI observations of HVC AC-III and theoretical interpretation Xunchuan Liu et.al. 2510.26077 null
2025-10-29 Spherically Symmetric Quantum-Corrected Black Holes with String Clouds: A Multi-Observable Analysis Faizuddin Ahmed et.al. 2510.25764 null
2025-10-29 Lost in Phonation: Voice Quality Variation as an Evaluation Dimension for Speech Foundation Models Harm Lameris et.al. 2510.25577 null
2025-10-29 4-Doodle: Text to 3D Sketches that Move! Hao Chen et.al. 2510.25319 null
2025-10-27 SFMS-ALR: Script-First Multilingual Speech Synthesis with Adaptive Locale Resolution Dharma Teja Donepudi et.al. 2510.25178 null
2025-10-29 Magnetic Fields in Massive Star-forming Regions (MagMaR). VI. Magnetic Field Dragging in the Filamentary High-mass Star-forming Region G35.20--0.74N due to Gravity Jihye Hwang et.al. 2510.25078 null
2025-10-28 The Binary Ballet: Mapping Local Expansion Around M81 & M82 Jenny Wagner et.al. 2510.24840 null
2025-10-29 Leveraging Scale Separation and Stochastic Closure for Data-Driven Prediction of Chaotic Dynamics Ismaël Zighed et.al. 2510.24583 null
2025-10-28 Tracking the normal modes of an overpass highway bridge using Distributed Acoustic Sensing E. Diego Mercerat et.al. 2510.24212 null
2025-10-28 High-energy droplet collisions in multi-interacting hollow cone sprays Narendra Dev et.al. 2510.24207 null
2025-10-27 Adaptive Keyframe Selection for Scalable 3D Scene Reconstruction in Dynamic Environments Raman Jha et.al. 2510.23928 null
2025-10-27 Non-Markovian quantum Mpemba effect in strongly correlated quantum dots YuanDong Wang et.al. 2510.23445 null
2025-10-27 FlowCapX: Physics-Grounded Flow Capture with Long-Term Consistency Ningxiao Tao et.al. 2510.23122 null
2025-10-27 EndoWave: Rational-Wavelet 4D Gaussian Splatting for Endoscopic Reconstruction Taoyu Wu et.al. 2510.23087 null
2025-10-27 Adapting Speech Foundation Models with Large Language Models for Unified Speech Recognition Jing-Xuan Zhang et.al. 2510.22961 null
2025-10-26 MAGIC-Talk: Motion-aware Audio-Driven Talking Face Generation with Customizable Identity Control Fatemeh Nazarieh et.al. 2510.22810 null
2025-10-26 Kinematics of Acceleration-Induced Excitations in Confined Quantum Fields Hemansh Shah et.al. 2510.22797 null
2025-10-25 Learning 3D Anisotropic Noise Distributions Improves Molecular Force Field Modeling Xixian Liu et.al. 2510.22123 null
2025-10-21 Vertex and front-tracking methods for the modeling of microstructure evolution at the solid state: a brief review Marc Bernacki et.al. 2510.21818 null
2025-10-14 Beyond mechanochromism: Programmable multimodal actuation in cholesteric liquid crystal elastomer hollow fibers Jiazhe Ma et.al. 2510.21765 null
2025-10-24 Group Inertial Poser: Multi-Person Pose and Global Translation from Sparse Inertial Sensors and Ultra-Wideband Ranging Ying Xue et.al. 2510.21654 null
2025-10-24 Magnetic Field Configuration of a Quiescent Prominence Revealed by Large-amplitude Longitudinal Oscillations in End-view Observations Jun Dai et.al. 2510.21487 null
2025-10-23 Kinetics of Peierls dimerization transition: Machine learning force-field approach Ho Jang et.al. 2510.20659 null
2025-10-23 RubbleSim: A Photorealistic Structural Collapse Simulator for Confined Space Mapping Constantine Frost et.al. 2510.20529 null
2025-10-23 A simple model for PDFs and nPDFs A. V. Kotikov et.al. 2510.20139 null
2025-10-22 Stochastic dynamics of quasiparticles in the hard rod gas Seema Chahal et.al. 2510.19693 null
2025-10-22 Probing Accretion Disk Winds of Stratified Nature with Fe XXVI Doublet in Black Hole X-ray Binaries Keigo Fukumura et.al. 2510.19539 null
2025-10-22 PRGCN: A Graph Memory Network for Cross-Sequence Pattern Reuse in 3D Human Pose Estimation Zhuoyang Xie et.al. 2510.19475 null
2025-10-22 Advances in 4D Representation: Geometry, Motion, and Interaction Mingrui Zhao et.al. 2510.19255 null
2025-10-21 The slope and scatter of the star forming main sequence at z~5: reconciling observations with simulations Claudia Di Cesare et.al. 2510.19044 null
2025-10-21 $\nabla$-SDF: Learning Euclidean Signed Distance Functions Online with Gradient-Augmented Octree Interpolation and Neural Residual Zhirui Dai et.al. 2510.18999 null
2025-10-21 Uniqueness of Angular Velocity Reconstruction in Parallel-Beam and Diffraction Tomography Peter Elbau et.al. 2510.18829 null
2025-10-21 Nonthermal electron acceleration in turbulent post-flare coronal loops Clarissa Mora et.al. 2510.18742 null
2025-10-21 Observational Tests of Regular Black Holes with Scalar Hair and their Stability P. A. González et.al. 2510.18647 null
2025-10-21 Multiscale transitional flow in anisotropic nanoparticle suspensions revealed by time-resolved x-ray scatter microscopy Kesavan Sekar et.al. 2510.18444 null
2025-10-21 MMRHP: A Miniature Mixed-Reality HIL Platform for Auditable Closed-Loop Evaluation Mingxin Li et.al. 2510.18371 null
2025-10-21 The selection function of the Gaia DR3 open cluster census Emily L. Hunt et.al. 2510.18343 null
2025-10-21 Hyperbolic Space Learning Method Leveraging Temporal Motion Priors for Human Mesh Recovery Xiang Zhang et.al. 2510.18256 null
2025-10-20 Geometric Field Theory for Elastohydrodynamics of Cosserat Rods Mingjia Yan et.al. 2510.18097 null
2025-10-20 Bifurcations of planar balanced configurations for the $n$-body problem in $\mathbb{R}^4$ Katharina Kormanna et.al. 2510.17749 null
2025-10-20 Initialize to Generalize: A Stronger Initialization Pipeline for Sparse-View 3DGS Feng Zhou et.al. 2510.17479 null
2025-10-20 Segmenting infant brains across magnetic fields: Domain randomization and annotation curation in ultra-low field MRI Vladyslav Zalevskyi et.al. 2510.17436 null
2025-10-21 Leveraging AV1 motion vectors for Fast and Dense Feature Matching Julien Zouein et.al. 2510.17434 null
2025-10-21 DeepDetect: Learning All-in-One Dense Keypoints Shaharyar Ahmed Khan Tareen et.al. 2510.17422 null
2025-10-20 Enhanced Motion Forecasting with Plug-and-Play Multimodal Large Language Models Katie Luo et.al. 2510.17274 null
2025-10-20 Kinetically-induced bound states in a frustrated Rydberg tweezer array Mu Qiao et.al. 2510.17183 null
2025-10-19 The Lorentz-Violating effects in charged particle systems E. Maciel et.al. 2510.17055 null
2025-10-18 CryoDyna: Multiscale end-to-end modeling of cryo-EM macromolecule dynamics with physics-aware neural network Chengwei Zhang et.al. 2510.16510 null
2025-10-18 HGC-Avatar: Hierarchical Gaussian Compression for Streamable Dynamic 3D Avatars Haocheng Tang et.al. 2510.16463 null
2025-10-18 LightGlueStick: a Fast and Robust Glue for Joint Point-Line Matching Aidyn Ubingazhibov et.al. 2510.16438 null
2025-10-18 Manual2Skill++: Connector-Aware General Robotic Assembly from Instruction Manuals via Vision-Language Models Chenrui Tie et.al. 2510.16344 null
2025-10-18 XRISM-Subaru views of Abell 754: an off-axis, near-line-of-sight merging cluster Nobuhiro Okabe et.al. 2510.16291 null
2025-10-17 DGME-T: Directional Grid Motion Encoding for Transformer-Based Historical Camera Movement Classification Tingyu Lin et.al. 2510.15725 null
2025-10-17 A single optically detectable tumbling spin in silicon Félix Cache et.al. 2510.15590 null
2025-10-17 Airway Mucus Rheology: Physical Insights for Navigating through Health to Pathology and Clinical Applications Zhiwei Liu et.al. 2510.15562 null
2025-10-17 ClapperText: A Benchmark for Text Recognition in Low-Resource Archival Documents Tingyu Lin et.al. 2510.15557 null
2025-10-17 MRASfM: Multi-Camera Reconstruction and Aggregation through Structure-from-Motion in Driving Scenes Lingfeng Xuan et.al. 2510.15467 null
2025-10-17 Modeling and Dynamic Simulation of a Hybrid Wind-Wave System on a Hexagonal Semi-Submersible Platform Saeid Bayat et.al. 2510.15285 null
2025-10-17 CuSfM: CUDA-Accelerated Structure-from-Motion Jingrui Yu et.al. 2510.15271 null
2025-10-16 OmniMotion: Multimodal Motion Generation with Continuous Masked Autoregression Zhe Li et.al. 2510.14954 null
2025-10-16 A Physics Prior-Guided Dual-Stream Attention Network for Motion Prediction of Elastic Bragg Breakwaters Lianzi Jiang et.al. 2510.14250 null
2025-10-15 Is Gravity Truly Balanced? A Historical-Critical Journey Through the Equivalence Principle and the Genesis of Spacetime Geometry Jaume de Haro et.al. 2510.13938 null
2025-10-15 Turbulent transport for wall shear stress fluctuations Myoungkyu Lee et.al. 2510.13758 null
2025-10-15 Orbital dynamics and precession in magnetized Kerr spacetime Karthik Iyer et.al. 2510.13569 null
2025-10-15 Learning Neural Parametric 3D Breast Shape Models for Metrical Surface Reconstruction From Monocular RGB Videos Maximilian Weiherer et.al. 2510.13540 null
2025-10-15 InstantSfM: Fully Sparse and Parallel Structure-from-Motion Jiankun Zhong et.al. 2510.13310 null
2025-10-15 Investigating Buoyant Plume Dynamics Induced by Localized Fire-Simulated Heating over Plant Canopies Using LES Ajinkya Desai et.al. 2510.13196 null
2025-11-06 Dependency of the Bar Formation Timescale On The Halo Spin Bin-Hui Chen et.al. 2510.13153 null
2025-10-15 Edit-Your-Interest: Efficient Video Editing via Feature Most-Similar Propagation Yi Zuo et.al. 2510.13084 null
2025-10-14 Mapping the Perseus Galaxy Cluster with XRISM: Gas Kinematic Features and their Implications for Turbulence Congyao Zhang et.al. 2510.12782 null
2025-10-14 PET Head Motion Estimation Using Supervised Deep Learning with Attention Zhuotong Cai et.al. 2510.12758 null
2025-10-14 Widespread Hot Molecular Gas Heated by Shear-induced Turbulence in the Galactic Center Juan Li et.al. 2510.12518 null
2025-10-14 M3D-skin: Multi-material 3D-printed Tactile Sensor with Hierarchical Infill Structures for Pressure Sensing Shunnosuke Yoshimura et.al. 2510.12419 null
2025-10-14 Scene Coordinate Reconstruction Priors Wenjing Bian et.al. 2510.12387 null
2025-10-14 Holographic Turbulence and the Fractal Dimension of the Turbulent Horizon Jia Du et.al. 2510.12198 null
2025-10-14 VIDMP3: Video Editing by Representing Motion with Pose and Position Priors Sandeep Mishra et.al. 2510.12069 null
2025-10-13 NaviGait: Navigating Dynamically Feasible Gait Libraries using Deep Reinforcement Learning Neil C. Janwani et.al. 2510.11542 null
2025-10-13 Behavior of passive polymeric tracers of different topologies in a dilute bath of active Brownian particles Ramanand Singh Yadav et.al. 2510.11337 null
2025-10-13 The chemodynamical memory of a major merger in a NIHAO-UHD Milky Way analogue I: A golden thread through time and space Sven Buder et.al. 2510.11284 null
2025-10-13 High-Resolution Spatiotemporal Modeling with Global-Local State Space Models for Video-Based Human Pose Estimation Runyang Feng et.al. 2510.11017 null
2025-10-12 Align2Act: Instruction-Tuned Models for Human-Aligned Autonomous Driving Kanishkha Jaisankar et.al. 2510.10503 null
2025-10-12 Mesh-Gait: A Unified Framework for Gait Recognition Through Multi-Modal Representation Learning from 2D Silhouettes Zhao-Yang Wang et.al. 2510.10406 null
2025-10-11 sqrtVINS: Robust and Ultrafast Square-Root Filter-based 3D Motion Tracking Yuxiang Peng et.al. 2510.10346 null
2025-10-11 Ordinal Scale Traffic Congestion Classification with Multi-Modal Vision-Language and Motion Analysis Yu-Hsuan Lin et.al. 2510.10342 null
2025-10-11 Detection of Quadruple Structure Near the ASCC 32 Region via Machine Learning Methods Mohammad Noormohammadi et.al. 2510.10296 null
2025-10-11 Are Video Models Emerging as Zero-Shot Learners and Reasoners in Medical Imaging? Yuxiang Lai et.al. 2510.10254 null
2025-10-11 BurstDeflicker: A Benchmark Dataset for Flicker Removal in Dynamic Scenes Lishen Qu et.al. 2510.09996 null
2025-10-11 A no-contact result for a plate-fluid interaction system in dimension three Mario Bukal et.al. 2510.09992 null
2025-10-13 Guiding Energy-Efficient Locomotion through Impact Mitigation Rewards Chenghao Wang et.al. 2510.09543 null
2025-10-10 Two-Stage Gaussian Splatting Optimization for Outdoor Scene Reconstruction Deborah Pintani et.al. 2510.09489 null
2025-10-10 What is the contribution of gravitational infall on the mass assembly of star-forming clouds? A case study in a numerical simulation of the interstellar medium Noé Brucy et.al. 2510.09480 null
2025-10-11 The Visual Iconicity Challenge: Evaluating Vision-Language Models on Sign Language Form-Meaning Mapping Onur Keleş et.al. 2510.08482 null
2025-10-09 Video-STAR: Reinforcing Open-Vocabulary Action Recognition with Tools Zhenlong Yuan et.al. 2510.08480 null
2025-10-09 Scalar-tensor theories in the Lyra geometry: Invariance under local transformations of length units and the Jordan-Einstein frame conundrum E. C. Valadão et.al. 2510.08433 null
2025-10-09 Beyond hospital reach: Autonomous lightweight ultrasound robot for liver sonography Zihan Li et.al. 2510.08106 null
2025-10-09 Executable Analytic Concepts as the Missing Link Between VLM Insight and Precise Manipulation Mingyang Sun et.al. 2510.07975 null
2025-10-08 XRISM/Resolve observations of Hercules X-1: vertical structure and kinematics of the disk wind Peter Kosec et.al. 2510.07615 null
2025-10-08 Curve separation in supercritical half-space last passage percolation Evgeni Dimitrov et.al. 2510.07508 null
2025-10-07 Learning from Limited Multi-Phase CT: Dual-Branch Prototype-Guided Framework for Early Recurrence Prediction in HCC Hsin-Pei Yu et.al. 2510.07347 null
2025-10-08 Dispersion and the transport of exciton-polaritons in an optical conveyor belt Xingran Xu et.al. 2510.07049 null
2025-10-08 The Star-forming Main Sequence and Bursty Star-formation Histories at $z>1.4$ in JADES and AURORA Leonardo Clarke et.al. 2510.06681 null
2025-10-08 Classical Polymerization of the Bianchi I Model with Deformed Poisson Structure Babak Vakili et.al. 2510.06628 null
2025-10-07 Text2Interact: High-Fidelity and Diverse Text-to-Two-Person Interaction Generation Qingxuan Wu et.al. 2510.06504 null
2025-10-07 The first proper motion measurement of the acceleration regions in the large-scale jets of SS 433 powering the W50 nebula Naomi Tsuji et.al. 2510.06431 null
2025-10-07 Gravitational deflection of charged massive particle around charged galactic wormhole Md Khalid Hossain et.al. 2510.06294 null
2025-10-07 Cross-Embodiment Dexterous Hand Articulation Generation via Morphology-Aware Learning Heng Zhang et.al. 2510.06068 null
2025-10-07 Midway Network: Learning Representations for Recognition and Motion from Latent Dynamics Christopher Hoang et.al. 2510.05558 null
2025-10-06 The Prevalence of Bursty Star Formation in Low-Mass Galaxies at z=1-7 from Hα-to-UV Diagnostics Marissa N. Perry et.al. 2510.05388 null
2025-10-06 StaMo: Unsupervised Learning of Generalizable Robot Motion from Compact State Representation Mingyu Liu et.al. 2510.05057 null
2025-10-06 Thermal effects in fluid structure interactions Sourav Mitra et.al. 2510.04801 null
2025-10-06 Equilibrium properties of strongly confined fluids Ana M. Montero et.al. 2510.04546 null
2025-10-05 Physics-Inspired All-Pair Interaction Learning for 3D Dynamics Modeling Kai Yang et.al. 2510.04233 null
2025-10-05 From Shadow to Light: Toward Safe and Efficient Policy Learning Across MPC, DeePC, RL, and LLM Agents Amin Vahidi-Moghaddam et.al. 2510.04076 null
2025-10-04 Dissecting Larval Zebrafish Hunting using Deep Reinforcement Learning Trained RNN Agents Raaghav Malik et.al. 2510.03699 null
2025-10-03 Bloch Oscillations and Landau-Zener Transitions in Flat-Band Lattices with Quadratic and Linear Band Touchings Chenhaoyue Wang et.al. 2510.03530 null
2025-10-03 Selective disruption of reach-related saccade timing following a middle-cerebral artery stroke Mahya Beheshti et.al. 2510.03076 null
2025-10-03 A Trajectory Generator for High-Density Traffic and Diverse Agent-Interaction Scenarios Ruining Yang et.al. 2510.02627 null
2025-10-23 DragFlow: Unleashing DiT Priors with Region Based Supervision for Drag Editing Zihan Zhou et.al. 2510.02253 null
2025-10-02 Non-Gaussian Rotational Diffusion and Swing Motion of Dumbbell Probes in Two Dimensional Colloids Jeongmin Kim et.al. 2510.01847 null
2025-10-02 Non-Rigid Structure-from-Motion via Differential Geometry with Recoverable Conformal Scale Yongbo Chen et.al. 2510.01665 null
2025-10-01 Depinning of KPZ Interfaces in Fractional Brownian Landscapes Neda Valizadeh et.al. 2510.01103 null
2025-10-01 Can World Models Benefit VLMs for World Dynamics? Kevin Zhang et.al. 2510.00855 null
2025-09-30 Learning Human Reaching Optimality Principles from Minimal Observation Inverse Reinforcement Learning Sarmad Mehrdad et.al. 2510.00329 null
2025-09-30 JADES: An Abundance of Ultra-Distant T- and Y-Dwarfs in Deep Extragalactic Data Kevin N. Hainline et.al. 2510.00111 null
2025-10-03 The warm outer layer of a Little Red Dot as the source of [Fe II] and collisional Balmer lines with scattering wings Alberto Torralba et.al. 2510.00103 null
2025-09-30 Seeing Space and Motion: Enhancing Latent Actions with Spatial and Dynamic Awareness for VLA Zhejia Cai et.al. 2509.26251 null
2025-09-30 Droplets sliding on single and multiple vertical fibers Matteo Leonard et.al. 2509.25898 null
2025-09-30 Hierarchical Diffusion Motion Planning with Task-Conditioned Uncertainty-Aware Priors Amelie Minji Kim et.al. 2509.25685 null
2025-09-30 On the shape of pancakes: catastrophe theory and Gaussian statistics in 2D Abineet Parichha et.al. 2509.25608 null
2025-10-06 CoTaP: Compliant Task Pipeline and Reinforcement Learning of Its Controller with Compliance Modulation Zewen He et.al. 2509.25443 null
2025-09-29 Data-Augmented Resolvent Analysis of Wall-Bounded High-Pressure Transcritical Flow M. Bernades et.al. 2509.25398 null
2025-09-29 Seeking Kinematic Association of Known FU Orionis Stars with Young Clusters in Cygnus Tamojeet Roychowdhury et.al. 2509.25341 null
2025-10-08 VGGT-X: When VGGT Meets Dense Novel View Synthesis Yang Liu et.al. 2509.25191 null
2025-09-29 Fast Feature Field ($\text{F}^3$): A Predictive Representation of Events Richeek Das et.al. 2509.25146 null
2025-09-29 Impact of Atomic Substitution on Core-Hole Relaxation Dynamics: A Study of Br$_2$ and IBr Nivedita Bhat et.al. 2509.24915 null
2025-09-29 Understanding Cognitive States from Head & Hand Motion Data Kaiang Wen et.al. 2509.24255 null
2025-09-28 BOSfM: A View Planning Framework for Optimal 3D Reconstruction of Agricultural Scenes Athanasios Bacharis et.al. 2509.24126 null
2025-09-28 RPG360: Robust 360 Depth Estimation with Perspective Foundation Models and Graph Optimization Dongki Jung et.al. 2509.23991 null
2025-09-28 CrashSplat: 2D to 3D Vehicle Damage Segmentation in Gaussian Splatting Dragoş-Andrei Chileban et.al. 2509.23947 null
2025-09-28 Witnessing Magnetic Reconnection in Tangled Superpenumbral Fibrils Around a Sunspot Hechao Chen et.al. 2509.23636 null
2025-09-27 Orientation-anchored Hyper-Gaussian for 4D Reconstruction from Casual Videos Junyi Wu et.al. 2509.23492 null
2025-09-27 Geometry-Aware Losses for Structure-Preserving Text-to-Sign Language Generation Zetian Wu et.al. 2509.23011 null
2025-09-26 Scallop Theorem for Swimming in Anisotropic Fluids Mojtaba Rajabi et.al. 2509.22249 null
2025-09-26 Taming Flow-based I2V Models for Creative Video Editing Xianghao Kong et.al. 2509.21917 null
2025-09-25 First results from ALPPS: a sub-Alfvénic streamer in SVS13A P. C. Cortes et.al. 2509.21701 null
2025-09-25 Multireference equation-of-motion driven similarity renormalization group for X-ray photoelectron spectra Shuhang Li et.al. 2509.21646 null
2025-09-25 Taxonomy-aware Dynamic Motion Generation on Hyperbolic Manifolds Luis Augenstein et.al. 2509.21281 null
2025-09-24 Pattern Formation in Agent-Based and PDE Models for Evolutionary Games with Payoff-Driven Motion Tianyong Yao et.al. 2509.20538 null
2025-09-24 Glassy dynamics in two-dimensional ring polymers: size versus stiffness polydispersity Rahul Nayak et.al. 2509.20066 null
2025-09-24 Modelling and Analysis of Non-Contacting Mechanical Face Seals with Axial Disturbances and Misalignment Ben S Ashby et.al. 2509.19993 null
2025-09-24 Aerial-Ground Image Feature Matching via 3D Gaussian Splatting-based Intermediate View Rendering Jiangxue Yu et.al. 2509.19898 null
2025-09-23 Probing the Origin of X-ray Flares in the Low-Hard State of GRS 1915+105 Using AstroSat and NuSTAR Shahzada Akhter et.al. 2509.19546 null
2025-10-30 Reaction/Diffusion Competition Drives Anomalous Relaxation of Vitrimers Makayla R. Branham-Ferrari et.al. 2509.19496 null
2025-09-23 Internal dynamics and structure of Cepheus OB4. The asymmetric expansion of Berkeley 59 Bruno Wiesneth et.al. 2509.19175 null
2025-09-23 DeblurSplat: SfM-free 3D Gaussian Splatting with Event Camera for Robust Deblurring Pengteng Li et.al. 2509.18898 null
2025-09-23 Kinematics of the interstellar medium using Gaia: A catalogue of 102 YSO-MC associations within 3.5 kpc from the Sun with 3D velocities Ji-Xuan Zhou et.al. 2509.18496 null
2025-09-22 Efficient Particle Acceleration in 2.5-Dimensional, Hybrid-Kinetic Simulations of Decaying, Supersonic, Plasma Turbulence Keyan Gootkin et.al. 2509.18374 null
2025-09-22 Waves drive the rise and fall of 2D flows in rotating turbulence Sébastien Gomé et.al. 2509.18323 null
2025-09-22 VideoFrom3D: 3D Scene Video Generation via Complementary Image and Video Diffusion Models Geonung Kim et.al. 2509.17985 null
2025-09-22 Tensor-Based Self-Calibration of Cameras via the TrifocalCalib Method Gregory Schroeder et.al. 2509.17620 null
2025-10-15 Energy Correlators Resolving Proton Spin Jun Gao et.al. 2509.17596 null
2025-09-22 Learning Dexterous Manipulation with Quantized Hand State Ying Feng et.al. 2509.17450 null
2025-09-21 Reference-aware SFM layers for intrusive intelligibility prediction Hanlin Yu et.al. 2509.17270 null
2025-09-21 Beat on Gaze: Learning Stylized Generation of Gaze and Head Dynamics Chengwei Shi et.al. 2509.17168 null
2025-11-19 Asymptotic Higher Spin Symmetries: Noether Realization & Algebraic Structure in Einstein-Yang-Mills Theory Nicolas Cresto et.al. 2509.17137 null
2025-09-21 Insensitivity-induced potential non-uniqueness in system identification of Bouc-Wen models Adrita Kundu et.al. 2509.17122 null
2025-09-21 Dynamics of the $N$-body system in energy-momentum squared gravity: II. Existence of a Self-Acceleration Elham Nazari et.al. 2509.17017 null
2025-09-21 VidCLearn: A Continual Learning Approach for Text-to-Video Generation Luca Zanchetta et.al. 2509.16956 null
2025-09-27 HDMI: Learning Interactive Humanoid Whole-Body Control from Human Videos Haoyang Weng et.al. 2509.16757 null
2025-09-19 On the application of refractive index matching to study the buoyancy-driven motion of spheres Jibu Tom Jose et.al. 2509.16384 null
2025-09-19 Investigating Polyglot Speech Foundation Models for Learning Collective Emotion from Crowds Orchid Chetia Phukan et.al. 2509.16329 null
2025-11-05 Modeling Elastic-Body Dynamics of Robotic Fish Using a Variational Framework Zhiheng Chen et.al. 2509.16145 null
2025-10-09 Hierarchical Reinforcement Learning with Low-Level MPC for Multi-Agent Control Max Studt et.al. 2509.15799 null
2025-09-19 Search for cosmic-ray induced gamma-ray emission from local galaxy clusters using Fermi-LAT data Judit Pérez-Romero et.al. 2509.15720 null
2025-10-24 MS-GS: Multi-Appearance Sparse-View 3D Gaussian Splatting in the Wild Deming Li et.al. 2509.15548 null
2025-10-21 SAMPO: Scale-wise Autoregression with Motion PrOmpt for generative world models Sen Wang et.al. 2509.15536 null
2025-09-18 Dynamical Analysis of the HD 169142 Planet-Forming Disk: Twelve Years of High-Contrast Polarimetry Miles Lucas et.al. 2509.15323 null
2025-09-18 Static AdS Black Holes Surrounded by Strings and Quintessence-like Field within Rastall Gravity Framework Allan. R. P. Moreira et.al. 2509.15274 null
2025-09-27 WorldForge: Unlocking Emergent 3D/4D Generation in Video Diffusion Model via Training-Free Guidance Chenxi Song et.al. 2509.15130 null
2025-09-17 Repulsive Trajectory Modification and Conflict Resolution for Efficient Multi-Manipulator Motion Planning Junhwa Hong et.al. 2509.13882 null
2025-09-18 MapAnything: Universal Feed-Forward Metric 3D Reconstruction Nikhil Keetha et.al. 2509.13414 null
2025-09-16 Optimal Annuitization with stochastic mortality: Piecewise Deterministic Mortality Force Matteo Buttarazzi et.al. 2509.13091 null
2025-09-16 Spatiotemporal graph neural process for reconstruction, extrapolation, and classification of cardiac trajectories Jaume Banus et.al. 2509.12953 null
2025-09-18 A-TDOM: Active TDOM via On-the-Fly 3DGS Yiwei Xu et.al. 2509.12759 null
2025-10-21 Neural 3D Object Reconstruction with Small-Scale Unmanned Aerial Vehicles Àlmos Veres-Vitàlyos et.al. 2509.12458 null
2025-09-15 DYNAMO: Dependency-Aware Deep Learning Framework for Articulated Assembly Motion Prediction Mayank Patel et.al. 2509.12430 null
2025-11-20 End-to-End 4D Heart Mesh Recovery Across Full-Stack and Sparse Cardiac MRI Yihong Chen et.al. 2509.12090 null
2025-11-18 Segmentation-Driven Initialization for Sparse-view 3D Gaussian Splatting Yi-Hsin Li et.al. 2509.11853 null
2025-09-15 WAFER: A new method to retrieve sun-induced fluorescence based on spectral wavelet decompositions Veronika Oehl et.al. 2509.11829 null
2025-09-14 Understanding the effect of wall elasticity in turbulent channel flows M. Koseki et.al. 2509.11142 null
2025-09-14 3DAeroRelief: The first 3D Benchmark UAV Dataset for Post-Disaster Assessment Nhut Le et.al. 2509.11097 null
2025-09-13 Space Astrometry with Gaia: Advances in Understanding our Galaxy Michael Perryman et.al. 2509.10883 null
2025-11-04 Stable Part Diffusion 4D: Multi-View RGB and Kinematic Parts Video Generation Hao Zhang et.al. 2509.10687 null
2025-09-12 Nanosculpting lateral weak link junctions in superconducting Fe(Te,Se)/Bi2Te3 with focused Si++ ions and implications on vortex pinning Debarghya Mallick et.al. 2509.10606 null
2025-09-17 DECAMP: Towards Scene-Consistent Multi-Agent Motion Prediction with Disentangled Context-Aware Pre-Training Jianxin Shi et.al. 2509.10426 null
2025-09-12 Breakdown of the critical state in the ferromagnetic superconductor EuFe$_2$(As$_{1-x}$P$_x$)$_2$ William Robert Fern et.al. 2509.10339 null
2025-09-12 A MeerKAT view of the parsec-scale jets in the black-hole X-ray binary GRS 1758-258 I. Mariani et.al. 2509.10275 null
2025-09-12 Robustness and Diagnostic Performance of Super-Resolution Fetal Brain MRI Ema Masterl et.al. 2509.10257 null
2025-09-12 Cluster Ages to Reconstruct the Milky Way Assembly (CARMA) IV. Chrono-dynamics of seven old star clusters in the Large Magellanic Cloud and the peculiar origin of NGC 1841 F. Niederhofer et.al. 2509.10144 null
2025-09-11 Initial conditions for tidal synchronisation of a planet by its moon Valeri V. Makarov et.al. 2509.09858 null
2025-09-09 Australian Supermarket Object Set (ASOS): A Benchmark Dataset of Physical Objects and 3D Models for Robotics and Computer Vision Akansel Cosgun et.al. 2509.09720 null
2025-09-11 MOFU: Development of a MOrphing Fluffy Unit with Expansion and Contraction Capabilities and Evaluation of the Animacy of Its Movements Taisei Mogi et.al. 2509.09613 null
2025-09-11 DualTrack: Sensorless 3D Ultrasound needs Local and Global Context Paul F. R. Wilson et.al. 2509.09530 null
2025-09-11 BagIt! An Adaptive Dual-Arm Manipulation of Fabric Bags for Object Bagging Peng Zhou et.al. 2509.09484 null
2025-09-11 A Hybrid Hinge-Beam Continuum Robot with Passive Safety Capping for Real-Time Fatigue Awareness Tongshun Chen et.al. 2509.09404 null
2025-09-11 Video Understanding by Design: How Datasets Shape Architectures and Insights Lei Wang et.al. 2509.09151 null
2025-09-11 Exploration on the Two-stream Instability in the Polar Cusp Under Solar Storm Disturbances and its Potential Impacts on Spacecraft Jikai Sun et.al. 2509.09126 null
2025-09-11 Propulsive transitions and scaling relations of a heaving flexible foil in a cylinder wake Guojun Li et.al. 2509.09102 null
2025-10-18 Kinetostatics and Particle-Swarm Optimization of Vehicle-Mounted Underactuated Metamorphic Loading Manipulators Nan Mao et.al. 2509.09093 null
2025-10-04 A comprehensive view of nuclear shapes, rotations and vibrations from fully quantum mechanical perspectives Takaharu Otsuka et.al. 2509.08552 null
2025-09-10 The GECKOS survey: Jeans anisotropic models of edge-on discs uncover the impact of dust and kinematic structures T. H. Rutherford et.al. 2509.08371 null
2025-08-26 Analog-based ensembles to characterize turbulent dynamics from observed data Carlos Granero-Belinchon et.al. 2509.07992 null
2025-09-09 Graph-Fused Vision-Language-Action for Policy Reasoning in Multi-Arm Robotic Manipulation Shunlei Li et.al. 2509.07957 null
2025-09-09 Mode-coupling theory of the glass transition for a liquid in a periodic potential Abolfazl Ahmadirahmat et.al. 2509.07697 null
2025-09-09 Beyond Motion Cues and Structural Sparsity: Revisiting Small Moving Target Detection Guoyi Zhang et.al. 2509.07654 null
2025-09-10 VIM-GS: Visual-Inertial Monocular Gaussian Splatting via Object-level Guidance in Large Scenes Shengkai Zhang et.al. 2509.06685 null
2025-09-08 From Skin to Skeleton: Towards Biomechanically Accurate 3D Digital Humans Marilyn Keller et.al. 2509.06607 null
2025-09-08 Nonlinear planar Hall effect from superconducting vortex motion Mio Hashimoto et.al. 2509.06313 null
2025-11-11 Limiting distribution of the chemical distance in high dimensional critical percolation Shirshendu Chatterjee et.al. 2509.06236 null
2025-09-07 Micro-Expression Recognition via Fine-Grained Dynamic Perception Zhiwen Shao et.al. 2509.06015 null
2025-09-07 Modeling Magnetoelastic Wave Interactions in Magnetic Films and Heterostructures: A finite-difference approach Peter Flauger et.al. 2509.06007 null
2025-09-07 Skyrmion manipulation and logic gate functionality in transition metal multilayers Tamali Mukherjee et.al. 2509.05951 null
2025-09-06 Depth Profiling of Oxygen Migration in Ta/HfO2 Stacks During Ionic Liquid Gating Beatrice Bednarz et.al. 2509.05748 null
2025-09-05 Resolving Tangling in Multi-Conformer Refinement via Iterative Projections Avinash Mandaiya et.al. 2509.05189 null
2025-09-04 Disentangling Multiple Gas Kinematic Drivers in the Perseus Galaxy Cluster XRISM Collaboration et.al. 2509.04421 null
2025-09-07 Hyperuniformity and conservation laws in non-equilibrium systems Raphaël Maire et.al. 2509.04242 null
2025-09-03 Exploiting correlations in multi-coincidence Coulomb explosion patterns for differentiating molecular structures using machine learning Anbu Selvam Venkatachalam et.al. 2509.03776 null
2025-09-03 Beyond the Clouds: S3 as the most distant extended Milky Way stream, not of LMC origin Ó. Jiménez-Arranz et.al. 2509.03424 null
2025-09-02 Voter Model stability with respect to conservative noises Gideon Amir et.al. 2509.02717 null
2025-09-02 Doctoral Thesis: Geometric Deep Learning For Camera Pose Prediction, Registration, Depth Estimation, and 3D Reconstruction Xueyang Kang et.al. 2509.01873 null
2025-09-01 Optimal information injection and transfer mechanisms for active matter reservoir computing Mario U. Gaimann et.al. 2509.01799 null
2025-09-01 An Accurate Comprehensive Approach to Substructure: IV. Dynamical Friction Eduard Salvador-Solé et.al. 2509.01553 null
2025-08-31 Origin and control of pseudo-rotating spiral jets Karol Wawrzak et.al. 2509.00763 null
2025-09-30 Intramolecular Singlet Fission Through a Coherently Coupled Excimer-like Intermediate Sanjoy Patra et.al. 2508.21568 null
2025-08-28 Coherent motions to predict Lagrangian trajectories Ali R Khojasteh et.al. 2508.21191 null
2025-08-28 First-Order Viscous Relativistic Hydrodynamics on the Two-Sphere Lennox S. Keeble et.al. 2508.20998 null
2025-08-28 Scaling Fabric-Based Piezoresistive Sensor Arrays for Whole-Body Tactile Sensing Curtis C. Johnson et.al. 2508.20959 null
2025-08-28 Language-Enhanced Mobile Manipulation for Efficient Object Search in Indoor Environments Liding Zhang et.al. 2508.20899 null
2025-08-28 On W-algebras and ODE/IM correspondence Matěj Kudrna et.al. 2508.20793 null
2025-08-28 AvatarBack: Back-Head Generation for Complete 3D Avatars from Front-View Images Shiqi Xin et.al. 2508.20623 null
2025-08-26 PRISM: A Framework Harnessing Unsupervised Visual Representations and Textual Prompts for Explainable MACE Survival Prediction from Cardiac Cine MRI Haoyang Su et.al. 2508.19325 null
2025-08-26 Thermoelectric evidence of the electronic structure changes from the charge-density-wave transition in FeGe Kaila Jenkins et.al. 2508.19116 null
2025-08-26 WIde Separation Planets In Time (WISPIT): A Gap-clearing Planet in a Multi-ringed Disk around the Young Solar-type Star WISPIT 2 Richelle F. van Capelleveen et.al. 2508.19053 null
2025-08-27 Striking Similarities in Dynamics and Vibrations of 2D Quasicrystals and Supercooled Liquids Edwin A. Bedolla-Montiel et.al. 2508.18856 null
2025-08-26 Locally tuned hydrodynamics of active polymer chains Lisa Sappl et.al. 2508.18789 null
2025-08-26 Chemical control of polymorphism and ferroelectricity in PbTiO3 and SrTiO3 monolayers and bilayers Shaowen Xu et.al. 2508.18777 null
2025-08-26 A New Evidence of Interplay Between Tetrahedral and Octahedral Symmetries and Symmetry Breaking: Exotic Rotational Bands in $^{152}$Sm S. Basak et.al. 2508.18686 null
2025-11-24 Warm Chat: Diffuse Emotion-aware Interactive Talking Head Avatar with Tree-Structured Guidance Haijie Yang et.al. 2508.18337 null
2025-08-25 Cellular Flow Architecture Exposes the Hidden Mechanics of Biological Matter Tianxiang Ma et.al. 2508.17974 null
2025-08-25 SAIL-Recon: Large SfM by Augmenting Scene Regression with Localization Junyuan Deng et.al. 2508.17972 null
2025-08-25 On the complexity of parametrized motion planning algorithms Navnath Daundkar et.al. 2508.17629 null
2025-10-07 MoSA: Motion-Coherent Human Video Generation via Structure-Appearance Decoupling Haoyu Wang et.al. 2508.17404 null
2025-08-24 Mimicking the Physicist's Eye: A VLM-centric Approach for Physics Formula Discovery Jiaqi Liu et.al. 2508.17380 null
2025-08-23 A fluxonium qubit-based hybrid electromechanical system Roson Nongthombam et.al. 2508.17105 null
2025-08-27 A Black Hole Solution in Kalb-Ramond Gravity with Quintessence Field: From Geodesic Dynamics to Thermal Criticality Ahmad Al-Badawi et.al. 2508.16693 null
2025-11-10 Stable black holes in lower dimensional $f(\mathbb{Q})$ non-metric gravity G. G. L. Nashed et.al. 2508.16679 null
2025-08-07 Thermal convection in huddling emperor penguins Dmitry Bratsun et.al. 2508.16586 null
2025-08-22 Seeing Clearly, Forgetting Deeply: Revisiting Fine-Tuned Video Generators for Driving Simulation Chun-Peng Chang et.al. 2508.16512 null
2025-08-25 HOSt3R: Keypoint-free Hand-Object 3D Reconstruction from RGB images Anilkumar Swamy et.al. 2508.16465 null
2025-08-26 Prompting with Sign Parameters for Low-resource Sign Language Instruction Generation Md Tariquzzaman et.al. 2508.16076 null
2025-08-22 NeuralMeshing: Complete Object Mesh Extraction from Casual Captures Floris Erich et.al. 2508.16026 null
2025-08-21 WorldWeaver: Generating Long-Horizon Video Worlds via Rich Perception Zhiheng Liu et.al. 2508.15720 null
2025-08-21 Enhancing Novel View Synthesis from extremely sparse views with SfM-free 3D Gaussian Splatting Framework Zongqi He et.al. 2508.15457 null
2025-09-21 DriveSplat: Decoupled Driving Scene Reconstruction with Geometry-enhanced Partitioned Neural Gaussians Cong Wang et.al. 2508.15376 null
2025-09-04 A Spectroscopic Hunt for Post-Red Supergiants in the Large Magellanic Cloud II: Turbulent Line Broadening in the Spectra of LMC Yellow Supergiants Trevor Z. Dorn-Wallenstein et.al. 2508.14971 null
2025-08-22 The Alma catalogue of OB stars. III. A cross-match with Gaia DR3 and an extension based on new spectral classifications M. Pantaleoni González et.al. 2508.14875 null
2025-08-20 Probing the farthest star clusters to the Small Magellanic Cloud A. E. Piatti et.al. 2508.14701 null
2025-08-20 GeMS: Efficient Gaussian Splatting for Extreme Motion Blur Gopi Raju Matta et.al. 2508.14682 null
2025-08-20 Identifying Monochromatic Signals in LISA and Taiji via Spectral Split: Gravitational Waves versus Ultralight Dark Matter Yue-Hui Yao et.al. 2508.14655 null
2025-08-20 From Slices to Structures: Unsupervised 3D Reconstruction of Female Pelvic Anatomy from Freehand Transvaginal Ultrasound Max Krähenmann et.al. 2508.14552 null
2025-08-20 Singularity of the axisymmetric stagnation-point-like solution within a cylinder of the 3D Euler incompressible fluid equations Yinshen Xu et.al. 2508.14550 null
2025-08-20 Anisotropic Neutrino Emission from Spinning, Moving, and Charged Primordial Black Holes Arnab Chaudhuri et.al. 2508.14510 null
2025-08-19 Gravitational Influence from Planets on the Measured Rates of Period Change of Pulsating White Dwarfs Ling Xuan Yao et.al. 2508.14195 null
2025-08-20 Properties of the temporal transfer matrix in integrable Floquet circuits Ilya Vilkoviskiy et.al. 2508.13883 null
2025-10-31 Smooth Flow Matching Jianbin Tan et.al. 2508.13831 null
2025-08-18 Towards Routine Condensed Phase Simulations with Delta-Learned Coupled Cluster Accuracy: Application to Liquid Water Niamh O'Neill et.al. 2508.13391 null
2025-08-18 Dynamic stall of a hydrofoil with tubercles in surface gravity waves Guillaume Ricard et.al. 2508.13329 null
2025-08-18 MaskSem: Semantic-Guided Masking for Learning 3D Hybrid High-Order Motion Representation Wei Wei et.al. 2508.12948 null
2025-10-20 Visual-Neural-Inspired Image Inpainting for Specific Objects-of-Interest Imaging Yonghao Wu et.al. 2508.12808 null
2025-08-18 Discerning and quantifying high frequency activities in EEG under normal and epileptic conditions Jyotiraj Nath et.al. 2508.12670 null
2025-08-17 HuBERT-VIC: Improving Noise-Robust Automatic Speech Recognition of Speech Foundation Model via Variance-Invariance-Covariance Regularization Hyebin Ahn et.al. 2508.12292 null
2025-08-17 What do Speech Foundation Models Learn? Analysis and Applications Ankita Pasad et.al. 2508.12255 null
2025-08-16 KP-INR: A Dual-Branch Implicit Neural Representation Model for Cardiac Cine MRI Reconstruction Donghang Lyu et.al. 2508.12147 null
2025-08-16 Applied causality to infer protein dynamics and kinetics Akashnathan Aranganathan et.al. 2508.12060 null
2025-09-15 WiseLVAM: A Novel Framework For Left Ventricle Automatic Measurements Durgesh Kumar Singh et.al. 2508.12023 null
2025-08-19 Colloidal hydrodynamic interactions in viscoelastic fluids Dae Yeon Kim et.al. 2508.11948 null
2025-08-16 Mapping feedback signatures in 3C 297: A quasar-host merger at Cosmic Noon Chetna Duggal et.al. 2508.11926 null
2025-09-08 Deformation Driven Suction Cups: A Mechanics-Based Approach to Wearable Electronics Seola Lee et.al. 2508.11838 null
2025-08-01 Multimodal Quantitative Measures for Multiparty Behaviour Evaluation Ojas Shirekar et.al. 2508.10916 null
2025-08-14 Reduction of motion artifacts from photoplethysmography signals using learned convolutional sparse coding Giulio Basso et.al. 2508.10805 null
2025-08-14 Snap-through time of arches is controlled by slenderness and imperfections William Simpkins et.al. 2508.10802 null
2025-08-14 On the Derivation of Equations of Motion from Symmetries in Quantum-Mechanical Systems via Heisenberg's Uncertainty Enrique Casanova et.al. 2508.10661 null
2025-08-14 EgoMusic-driven Human Dance Motion Estimation with Skeleton Mamba Quang Nguyen et.al. 2508.10522 null
2025-08-13 Coulomb excitation of $^{124}$Te: Emerging collectivity and persisting seniority structure in the $6_1^+$ level M. Reece et.al. 2508.09643 null
2025-08-12 A Galactic Interloper: A Study of the Cam OB1 Association's Clusters and its Visitor from the Perseus Arm Joseph Mullen et.al. 2508.09393 null
2025-08-12 CLF-RL: Control Lyapunov Function Guided Reinforcement Learning Kejun Li et.al. 2508.09354 null
2025-08-12 Quadrupolar gyration of a Brownian particle in a confining ring Iman Abdoli et.al. 2508.08792 null
2025-08-11 Weak solutions and incompressible limit of a quasi-incompressible Navier--Stokes/Cahn--Hilliard model for viscous two-phase flows Mingwen Fei et.al. 2508.08090 null
2025-08-11 Joint Transcription of Acoustic Guitar Strumming Directions and Chords Sebastian Murgul et.al. 2508.07973 null
2025-08-12 Mem4D: Decoupling Static and Dynamic Memory for Dynamic Scene Reconstruction Xudong Cai et.al. 2508.07908 null
2025-08-11 Tracking Any Point Methods for Markerless 3D Tissue Tracking in Endoscopic Stereo Images Konrad Reuter et.al. 2508.07851 null
2025-08-11 Optimization of a Nonlinear Acoustics -- Structure Interaction Model Barbara Kaltenbacher et.al. 2508.07728 null
2025-08-10 GS4Buildings: Prior-Guided Gaussian Splatting for 3D Building Reconstruction Qilin Zhang et.al. 2508.07355 null
2025-11-17 Understanding Dynamic Scenes in Ego Centric 4D Point Clouds Junsheng Huang et.al. 2508.07251 null
2025-08-27 From Imitation to Optimization: A Comparative Study of Offline Learning for Autonomous Driving Antonio Guillen-Perez et.al. 2508.07029 null
2025-08-09 Evaluating Fisheye-Compatible 3D Gaussian Splatting Methods on Real Images Beyond 180 Degree Field of View Ulas Gunes et.al. 2508.06968 null
2025-08-08 Restage4D: Reanimating Deformable 3D Reconstruction from a Single Video Jixuan He et.al. 2508.06715 null
2025-08-08 Low temperature jet spectra of (DFE)2, DFE-He, DFE-He2 and DFE in the 2210-3105 cm-1 region (DFE = 1,1 difluoroethylene) A. J. Barclay et.al. 2508.06629 null
2025-08-08 V*: An Efficient Motion Planning Algorithm for Autonomous Vehicles Abdullah Zareh Andaryan et.al. 2508.06404 null
2025-08-08 Topological edge states and amplitude-dependent delocalization in quasiperiodic elliptically geared lattices Shuaifeng Li et.al. 2508.06286 null
2025-08-07 CleanUpBench: Embodied Sweeping and Grasping Benchmark Wenbo Li et.al. 2508.05543 null
2025-08-07 F2PASeg: Feature Fusion for Pituitary Anatomy Segmentation in Endoscopic Surgery Lumin Chen et.al. 2508.05465 null
2025-08-07 Information-Theoretic Graph Fusion with Vision-Language-Action Model for Policy Reasoning and Dual Robotic Control Shunlei Li et.al. 2508.05342 null
2025-10-08 Regular black hole's impact on the gravitational waveforms from periodic orbits Mirzabek Alloqulov et.al. 2508.05245 null
2025-08-07 EndoMatcher: Generalizable Endoscopic Image Matcher via Multi-Domain Pre-training for Robot-Assisted Surgery Bingyu Yang et.al. 2508.05205 null
2025-08-07 Refining Gaussian Splatting: A Volumetric Densification Approach Mohamed Abdul Gafoor et.al. 2508.05187 null
2025-09-02 XRISM/Resolve View of Abell 2319: Turbulence, Sloshing, and ICM Dynamics XRISM Collaboration et.al. 2508.05067 null
2025-11-04 Bursting at the seams: the star-forming main sequence and its scatter at z=3-9 using NIRCam photometry from JADES C. Simmonds et.al. 2508.04410 null
2025-09-19 Variational mode decomposition analysis of the relationship between low-frequency shock-wave oscillations and buffet cells Yuya Ohmichi et.al. 2508.04250 null
2025-08-06 PIS3R: Very Large Parallax Image Stitching via Deep 3D Reconstruction Muhua Zhu et.al. 2508.04236 null
2025-08-06 SplitGaussian: Reconstructing Dynamic Scenes via Visual Geometry Decomposition Jiahui Li et.al. 2508.04224 null
2025-08-06 Probing globular clusters using modulated gravitational waves from binary black holes Jie Wu et.al. 2508.04021 null
2025-10-21 Can Large Language Models Adequately Perform Symbolic Reasoning Over Time Series? Zewen Liu et.al. 2508.03963 null
2025-09-26 Next Generation Equation-Free Multiscale Modelling of Crowd Dynamics via Machine Learning Hector Vargas Alvarez et.al. 2508.03926 null
2025-08-05 High-Resolution Dynamic Full-Field Optical Coherence Microscopy: Illuminating Intracellular Activity in Deep Tissue Erikas Tarvydas et.al. 2508.03657 null
2025-08-05 WaMo: Wavelet-Enhanced Multi-Frequency Trajectory Analysis for Fine-Grained Text-Motion Retrieval Junlong Ren et.al. 2508.03343 null
2025-08-04 A fluid--peridynamic structure model of deformation and damage of microchannels Ziyu Wang et.al. 2508.02875 null
2025-08-04 Text2Lip: Progressive Lip-Synced Talking Face Generation from Text via Viseme-Guided Rendering Xu Wang et.al. 2508.02362 null
2025-08-04 Newtons First Law Is Not a Special Case of the Second Law Indresh Yadav et.al. 2508.02246 null
2025-08-04 IMoRe: Implicit Program-Guided Reasoning for Human Motion Q&A Chen Li et.al. 2508.01984 null
2025-08-03 CVD-SfM: A Cross-View Deep Front-end Structure-from-Motion System for Sparse Localization in Multi-Altitude Scenes Yaxuan Li et.al. 2508.01936 null
2025-10-16 Orbital angular momentum of entangled photons as a probe for relativistic effects Fazilah Nothlawala et.al. 2508.01716 null
2025-08-02 Rim destabilization and re-formation upon severance from its expanding sheet M. Kharbedia et.al. 2508.01308 null
2025-10-16 UniEgoMotion: A Unified Model for Egocentric Motion Reconstruction, Forecasting, and Generation Chaitanya Patel et.al. 2508.01126 null
2025-08-01 Counting topological interface modes using simplicial characteristic classes N. Bohlsen et.al. 2508.01063 null
2025-08-01 3D Reconstruction via Incremental Structure From Motion Muhammad Zeeshan et.al. 2508.01019 null
2025-08-01 GeoMoE: Divide-and-Conquer Motion Field Modeling with Mixture-of-Experts for Two-View Geometry Jiajun Le et.al. 2508.00592 null
2025-08-01 TopoDiffuser: A Diffusion-Based Multimodal Trajectory Prediction Model with Topometric Maps Zehui Xu et.al. 2508.00303 null
2025-07-30 X-NeMo: Expressive Neural Motion Reenactment via Disentangled Latent Attention Xiaochen Zhao et.al. 2507.23143 null
2025-07-30 Eddy population based model for the wall-pressure spectrum at high Reynolds number Jonathan M. O. Massey et.al. 2507.23098 null
2025-08-01 Segment Anything for Video: A Comprehensive Review of Video Object Segmentation and Tracking from Past to Future Guoping Xu et.al. 2507.22792 null
2025-08-14 A Linear N-Point Solver for Structure and Motion from Asynchronous Tracks Hang Su et.al. 2507.22733 null
2025-07-29 Probing Turbulence, Gravity, Supernovae, and Magnetic Field Effects with the 6D Kinematics of Young Stars in Milky Way Star-Forming Regions Benjamin N. Velguth et.al. 2507.22107 null
2025-07-28 Projecting the New Body: How Body Image Evolves During Learning to Walk with a Wearable Robot I-Chieh Lee et.al. 2507.21384 null
2025-07-28 FED-PsyAU: Privacy-Preserving Micro-Expression Recognition via Psychological AU Coordination and Dynamic Facial Motion Modeling Jingting Li et.al. 2507.20557 null
2025-07-27 Model-Structured Neural Networks to Control the Steering Dynamics of Autonomous Race Cars Mattia Piccinini et.al. 2507.20427 null
2025-07-27 Motion-example-controlled Co-speech Gesture Generation Leveraging Large Language Models Bohong Chen et.al. 2507.20220 null
2025-07-27 Unveiling the Sagittarius Dwarf Spheroidal Galaxy Core with Gaia DR3 Ellie K. H. Toguchi-Tani et.al. 2507.20212 null
2025-07-27 PUMPS: Skeleton-Agnostic Point-based Universal Motion Pre-Training for Synthesis in Human Motion Tasks Clinton Ansun Mo et.al. 2507.20170 null
2025-10-04 RESCUE: Crowd Evacuation Simulation via Controlling SDM-United Characters Xiaolin Liu et.al. 2507.20117 null
2025-07-26 Nonlinear causality of Israel-Stewart theory with diffusion Ian Cordeiro et.al. 2507.20064 null
2025-07-26 TrackAny3D: Transferring Pretrained 3D Models for Category-unified 3D Point Cloud Tracking Mengmeng Wang et.al. 2507.19908 null
2025-11-08 RaGS: Unleashing 3D Gaussian Splatting from 4D Radar and Monocular Cues for 3D Object Detection Xiaokai Bai et.al. 2507.19856 null
2025-07-25 The phase spiral's origin and evolution: indications from its varying properties across the Milky Way disk Axel Widmark et.al. 2507.19579 null
2025-08-02 GS-Occ3D: Scaling Vision-only Occupancy Reconstruction with Gaussian Splatting Baijun Ye et.al. 2507.19451 null
2025-11-10 A multi-dynamic low-rank deep image prior (ML-DIP) for 3D real-time cardiovascular MRI Chong Chen et.al. 2507.19404 null
2025-07-25 NerT-CA: Efficient Dynamic Reconstruction from Sparse-view X-ray Coronary Angiography Kirsten W. H. Maas et.al. 2507.19328 null
2025-07-31 MVG4D: Image Matrix-Based Multi-View and Motion Generation for 4D Content Creation from a Single Image DongFu Yin et.al. 2507.18371 null
2025-07-23 Zero-Shot Dynamic Concept Personalization with Grid-Based LoRA Rameen Abdal et.al. 2507.17963 null
2025-07-23 MCM: Mamba-based Cardiac Motion Tracking using Sequential Images in MRI Jiahui Yin et.al. 2507.17678 null
2025-07-23 Constraints on Axion Dark Matter by Spin-Dependent Macroscopic Force Dongyi Yang et.al. 2507.17148 null
2025-10-01 A Tutorial on MRI Reconstruction: From Modern Methods to Clinical Implications Tolga Çukur et.al. 2507.16715 null
2025-07-22 Dyna3DGR: 4D Cardiac Motion Tracking with Dynamic 3D Gaussian Representation Xueming Fu et.al. 2507.16608 null
2025-07-22 Sparse-View 3D Reconstruction: Recent Advances and Open Challenges Tanveer Younis et.al. 2507.16406 null
2025-07-22 MotionShot: Adaptive Motion Transfer across Arbitrary Objects for Text-to-Video Generation Yanchen Liu et.al. 2507.16310 null
2025-07-22 Universal Wavelet Units in 3D Retinal Layer Segmentation An D. Le et.al. 2507.16119 null
2025-09-24 Interpretable Embeddings of Speech Enhance and Explain Brain Encoding Performance of Audio Models Riki Shimizu et.al. 2507.16080 null
2025-07-21 Relationship between Structure and Dynamics of an Icosahedral Quasicrystal using Unsupervised Machine Learning Edwin A. Bedolla-Montiel et.al. 2507.15731 null
2025-07-21 Hi^2-GSLoc: Dual-Hierarchical Gaussian-Specific Visual Relocalization for Remote Sensing Boni Hu et.al. 2507.15683 null
2025-08-28 Edge-effects in the turbulent flow over flexible aquatic vegetation Giulio Foggi Rota et.al. 2507.15477 null
2025-07-21 Low-Latency Event-Based Velocimetry for Quadrotor Control in a Narrow Pipe Leonard Bauersfeld et.al. 2507.15444 null
2025-07-21 Few-Shot Object Detection via Spatial-Channel State Space Model Zhimeng Xin et.al. 2507.15308 null
2025-10-11 TinyIO: Lightweight Reparameterized Inertial Odometry Shanshan Zhang et.al. 2507.15293 null
2025-10-24 An Evaluation of DUSt3R/MASt3R/VGGT 3D Reconstruction on Photogrammetric Aerial Blocks Xinyi Wu et.al. 2507.14798 null
2025-07-20 Flow Equivariant Recurrent Neural Networks T. Anderson Keller et.al. 2507.14793 null
2025-07-19 The Serpent Eating Its Own Tail: Dust Destruction in the Apep Colliding-Wind Nebula Ryan M. T. White et.al. 2507.14610 null
2025-07-19 BT-TL-DMPs: A Novel Robot TAMP Framework Combining Behavior Tree, Temporal Logic and Dynamical Movement Primitives Zezhi Liu et.al. 2507.14582 null
2025-07-19 Motion Segmentation and Egomotion Estimation from Event-Based Normal Flow Zhiyuan Hua et.al. 2507.14500 null
2025-07-18 DUSTrack: Semi-automated point tracking in ultrasound videos Praneeth Namburi et.al. 2507.14368 null
2025-07-18 Efficient Variational Dynamics of Open Quantum Bosonic Systems via Automatic Differentiation Jacopo Tosca et.al. 2507.14076 null
2025-07-29 DreamScene: 3D Gaussian-based End-to-end Text-to-3D Scene Generation Haoran Li et.al. 2507.13985 null
2025-07-18 Gaussian kernel-based motion measurement Hongyi Liu et.al. 2507.13693 null
2025-10-20 Moving Object Detection from Moving Camera Using Focus of Expansion Likelihood and Segmentation Masahiro Ogawa et.al. 2507.13628 null
2025-07-16 Enhancing In-Domain and Out-Domain EmoFake Detection via Cooperative Multilingual Speech Foundation Models Orchid Chetia Phukan et.al. 2507.12595 null
2025-07-16 BRUM: Robust 3D Vehicle Reconstruction from 360 Sparse Images Davide Di Nucci et.al. 2507.12095 null
2025-07-16 Spatial Frequency Modulation for Semantic Segmentation Linwei Chen et.al. 2507.11893 null
2025-07-14 Supporting SENĆOTEN Language Documentation Efforts with Automatic Speech Recognition Mengzhe Geng et.al. 2507.10827 null
2025-07-11 Review of Feed-forward 3D Reconstruction: From DUSt3R to VGGT Wei Zhang et.al. 2507.08448 null
2025-07-04 MGSfM: Multi-Camera Geometry Driven Global Structure-from-Motion Peilin Tao et.al. 2507.03306 null
2025-06-30 Towards Initialization-free Calibrated Bundle Adjustment Carl Olsson et.al. 2506.23808 null
2025-06-30 AttentionGS: Towards Initialization-Free 3D Gaussian Splatting via Structural Attention Ziao Liu et.al. 2506.23611 null
2025-06-27 Single-Scanline Relative Pose Estimation for Rolling Shutter Cameras Petr Hruby et.al. 2506.22069 null
2025-06-24 ICP-3DGS: SfM-free 3D Gaussian Splatting for Large-scale Unbounded Scenes Chenhao Zhang et.al. 2506.21629 null
2025-07-08 Wild refitting for black box prediction Martin J. Wainwright et.al. 2506.21460 null
2025-06-24 Experimental Assessment of Neural 3D Reconstruction for Small UAV-based Applications Genís Castillo Gómez-Raya et.al. 2506.19491 null
2025-06-23 ViDAR: Video Diffusion-Aware 4D Reconstruction From Monocular Inputs Michal Nazarczuk et.al. 2506.18792 null
2025-06-23 Room temperature spin injection into commercial VCSELs at non-resonant wavelengths Timur Almabetov et.al. 2506.18376 null
2025-06-11 OWSM-Biasing: Contextualizing Open Whisper-Style Speech Models for Automatic Speech Recognition with Dynamic Vocabulary Yui Sudo et.al. 2506.09448 null
2025-06-06 SurGSplat: Progressive Geometry-Constrained Gaussian Splatting for Surgical Scene Reconstruction Yuchao Zheng et.al. 2506.05935 null
2025-06-05 On-the-fly Reconstruction for Large-Scale Novel View Synthesis from Unposed Images Andreas Meuleman et.al. 2506.05558 null
2025-06-05 SupeRANSAC: One RANSAC to Rule Them All Daniel Barath et.al. 2506.04803 link
2025-06-04 Voyager: Long-Range and World-Consistent Video Diffusion for Explorable 3D Scene Generation Tianyu Huang et.al. 2506.04225 null
2025-06-04 Accelerating SfM-based Pose Estimation with Dominating Set Joji Joseph et.al. 2506.03667 null
2025-06-03 Nearby dwarf galaxies with extreme star formation rates: a window into dwarf-galaxy evolution in the early Universe S. Kaviraj et.al. 2506.03265 null
2025-06-02 Fast and Robust Rotation Averaging with Anisotropic Coordinate Descent Yaroslava Lochman et.al. 2506.01940 null
2025-06-03 Improving Multilingual Speech Models on ML-SUPERB 2.0: Fine-tuning with Data Augmentation and LID-Aware CTC Qingzheng Wang et.al. 2505.24200 null
2025-05-29 Rooms from Motion: Un-posed Indoor 3D Object Detection as Localization and Mapping Justin Lazarow et.al. 2505.23756 null
2025-05-30 FAMA: The First Large-Scale Open-Science Speech Foundation Model for English and Italian Sara Papi et.al. 2505.22759 link
2025-05-28 UAVPairs: A Challenging Benchmark for Match Pair Retrieval of Large-scale UAV Images Junhuan Liu et.al. 2505.22098 null
2025-05-28 Fast Feature Matching of UAV Images via Matrix Band Reduction-based GPU Data Schedule San Jiang et.al. 2505.22089 null
2025-05-30 Towards Robust Assessment of Pathological Voices via Combined Low-Level Descriptors and Foundation Model Representations Whenty Ariyanti et.al. 2505.21356 null
2025-05-27 Intern-GS: Vision Model Guided Sparse-View 3D Gaussian Splatting Xiangyu Sun et.al. 2505.20729 null
2025-05-26 Robust fine-tuning of speech recognition models via model merging: application to disordered speech Alexandre Ducorroy et.al. 2505.20477 null
2025-05-29 Sparse2DGS: Sparse-View Surface Reconstruction using 2D Gaussian Splatting with Dense Point Cloud Natsuki Takama et.al. 2505.19854 null
2025-05-25 Improving Novel view synthesis of 360$^\circ$ Scenes in Extremely Sparse Views by Jointly Training Hemisphere Sampled Synthetic Images Guangan Chen et.al. 2505.19264 link
2025-05-24 Token-Level Logits Matter: A Closer Look at Speech Foundation Models for Ambiguous Emotion Recognition Jule Valendo Halim et.al. 2505.18484 null
2025-05-22 Tracking the Flight: Exploring a Computational Framework for Analyzing Escape Responses in Plains Zebra (Equus quagga) Isla Duporge et.al. 2505.16882 link
2025-05-21 A Taxonomy of Structure from Motion Methods Federica Arrigoni et.al. 2505.15814 null
2025-05-18 Shallow Flow Matching for Coarse-to-Fine Text-to-Speech Synthesis Dong Yang et.al. 2505.12226 null
2025-05-15 Mapping Semantic Segmentation to Point Clouds Using Structure from Motion for Forest Analysis Francisco Raverta Capua et.al. 2505.10751 link
2025-05-13 Unveiling the Best Practices for Applying Speech Foundation Models to Speech Intelligibility Prediction for Hearing-Impaired People Haoshuai Zhou et.al. 2505.08215 null
2025-05-12 RDD: Robust Feature Detector and Descriptor using Deformable Transformer Gonglin Chen et.al. 2505.08013 null
2025-05-12 Geometric Prior-Guided Neural Implicit Surface Reconstruction in the Wild Lintao Xiang et.al. 2505.07373 null
2025-05-11 Symmetry in Fundamental Parameters of Galaxies on the Star-forming Main Sequence Zhicheng He et.al. 2505.06868 null
2025-05-10 TPK: Trustworthy Trajectory Prediction Integrating Prior Knowledge For Interpretability and Kinematic Feasibility Marius Baden et.al. 2505.06743 null
2025-05-08 DiffusionSfM: Predicting Structure and Motion via Ray Origin and Endpoint Diffusion Qitao Zhao et.al. 2505.05473 null
2025-05-20 FastMap: Revisiting Dense and Scalable Structure from Motion Jiahao Li et.al. 2505.04612 link
2025-05-15 Estimating the Diameter at Breast Height of Trees in a Forest With a Single 360 Camera Siming He et.al. 2505.03093 null
2025-05-03 AquaGS: Fast Underwater Scene Reconstruction with SfM-Free Gaussian Splatting Junhao Shi et.al. 2505.01799 null
2025-05-03 PosePilot: Steering Camera Pose for Generative World Models with Self-supervised Depth Bu Jin et.al. 2505.01729 null
2025-05-01 Are Minimal Radial Distortion Solvers Really Necessary for Relative Pose Estimation? Viktor Kocur et.al. 2505.00866 link
2025-04-29 Large-scale visual SLAM for in-the-wild videos Shuo Sun et.al. 2504.20496 null
2025-04-29 Sparse2DGS: Geometry-Prioritized Gaussian Splatting for Surface Reconstruction from Sparse Views Jiang Wu et.al. 2504.20378 link
2025-04-28 MP-SfM: Monocular Surface Priors for Robust Structure-from-Motion Zador Pataki et.al. 2504.20040 link
2025-04-24 Dynamic Camera Poses and Where to Find Them Chris Rockwell et.al. 2504.17788 null
2025-04-24 EdgePoint2: Compact Descriptors for Superior Efficiency and Accuracy Haodi Yao et.al. 2504.17280 null
2025-04-23 A Low-Cost Photogrammetry System for 3D Plant Modeling and Phenotyping Joe Hrzich et.al. 2504.16840 null
2025-04-23 PRaDA: Projective Radial Distortion Averaging Daniil Sinitsyn et.al. 2504.16499 null
2025-04-21 Traversing the Star-Forming Main Sequence with Molecular Gas Stacks of z~1.6 Cluster Galaxies Alex Pigarelli et.al. 2504.15381 null
2025-04-21 Towards Understanding Camera Motions in Any Video Zhiqiu Lin et.al. 2504.15376 null
2025-04-21 StableQuant: Layer Adaptive Post-Training Quantization for Speech Foundation Models Yeona Hong et.al. 2504.14915 null
2025-04-17 Volume Encoding Gaussians: Transfer Function-Agnostic 3D Gaussians for Volume Rendering Landon Dyken et.al. 2504.13339 null
2025-04-15 EDGS: Eliminating Densification for Efficient Convergence of 3DGS Dmytro Kotovenko et.al. 2504.13204 null
2025-04-15 Deep Learning-based Bathymetry Retrieval without In-situ Depths using Remote Sensing Imagery and SfM-MVS DSMs with Data Gaps Panagiotis Agrafiotis et.al. 2504.11416 link
2025-04-12 A Constrained Optimization Approach for Gaussian Splatting from Coarsely-posed Images and Noisy Lidar Point Clouds Jizong Peng et.al. 2504.09129 null
2025-04-11 Stereophotoclinometry Revisited Travis Driver et.al. 2504.08252 null
2025-04-08 Implementation of a Zed 2i Stereo Camera for High-Frequency Shoreline Change and Coastal Elevation Monitoring José A. Pilartes-Congo et.al. 2504.06464 null
2025-04-07 Decoding the variability in the star-formation histories of z ~ 0.8 galaxies Jenny T. Wan et.al. 2504.05281 null
2025-04-05 3R-GS: Best Practice in Optimizing Camera Poses Along with 3DGS Zhisheng Huang et.al. 2504.04294 null
2025-04-04 An Algebraic Geometry Approach to Viewing Graph Solvability Federica Arrigoni et.al. 2504.03637 null
2025-04-04 Endo3R: Unified Online Reconstruction from Dynamic Monocular Endoscopic Video Jiaxin Guo et.al. 2504.03198 null
2025-04-03 Adaptive Frequency Enhancement Network for Remote Sensing Image Semantic Segmentation Feng Gao et.al. 2504.02647 link
2025-04-09 FIORD: A Fisheye Indoor-Outdoor Dataset with LIDAR Ground Truth for 3D Scene Reconstruction and Benchmarking Ulas Gunes et.al. 2504.01732 null
2025-03-31 LITA-GS: Illumination-Agnostic Novel View Synthesis via Reference-Free 3D Gaussian Splatting and Physical Priors Han Zhou et.al. 2504.00219 null
2025-03-30 AnyCam: Learning to Recover Camera Poses and Intrinsics from Casual Videos Felix Wimbauer et.al. 2503.23282 link
2025-03-24 Ground Penetrating Radar-Assisted Multimodal Robot Odometry Using Subsurface Feature Matrix Haifeng Li et.al. 2503.18301 null
2025-03-22 3D Modeling: Camera Movement Estimation and path Correction for SFM Model using the Combination of Modified A-SIFT and Stereo System Usha Kumari et.al. 2503.17668 null
2025-03-25 ProtoGS: Efficient and High-Quality Rendering with 3D Gaussian Prototypes Zhengqing Gao et.al. 2503.17486 null
2025-03-21 ColabSfM: Collaborative Structure-from-Motion by Point Cloud Registration Johan Edstedt et.al. 2503.17093 link
2025-03-20 From Monocular Vision to Autonomous Action: Guiding Tumor Resection via 3D Reconstruction Ayberk Acar et.al. 2503.16263 null
2025-03-22 Euclid Quick Data Release (Q1). A first view of the star-forming main sequence in the Euclid Deep Fields Euclid Collaboration et.al. 2503.15314 null
2025-03-18 Multi-view Reconstruction via SfM-guided Monocular Depth Estimation Haoyu Guo et.al. 2503.14483 null
2025-03-18 A-SCoRe: Attention-based Scene Coordinate Regression for wide-ranging scenarios Huy-Hoang Bui et.al. 2503.13982 link
2025-03-17 Improving Geometric Consistency for 360-Degree Neural Radiance Fields in Indoor Scenarios Iryna Repinetska et.al. 2503.13710 null
2025-03-17 Gaussian On-the-Fly Splatting: A Progressive Framework for Robust Near Real-Time 3DGS Optimization Yiwei Xu et.al. 2503.13086 null
2025-03-15 SFMNet: Sparse Focal Modulation for 3D Object Detection Oren Shrout et.al. 2503.12093 null
2025-03-11 A Framework for Reducing the Complexity of Geometric Vision Problems and its Application to Two-View Triangulation with Approximation Bounds Felix Rydell et.al. 2503.08142 null
2025-03-11 DaD: Distilled Reinforcement Learning for Diverse Keypoint Detection Johan Edstedt et.al. 2503.07347 link
2025-03-18 Endo-FASt3r: Endoscopic Foundation model Adaptation for Structure from motion Mona Sheikh Zeinoddin et.al. 2503.07204 null
2025-03-10 VidBot: Learning Generalizable 3D Actions from In-the-Wild 2D Human Videos for Zero-Shot Robotic Manipulation Hanzhi Chen et.al. 2503.07135 null
2025-03-09 AxisPose: Model-Free Matching-Free Single-Shot 6D Object Pose Estimation via Axis Generation Yang Zou et.al. 2503.06660 null
2025-03-07 LiDAR-enhanced 3D Gaussian Splatting Mapping Jian Shen et.al. 2503.05425 null
2025-03-06 PLMP -- Point-Line Minimal Problems for Projective SfM Kim Kiehn et.al. 2503.04351 null
2025-03-03 MUSt3R: Multi-view Network for Stereo 3D Reconstruction Yohann Cabon et.al. 2503.01661 link
2025-03-03 ecg2o: A Seamless Extension of g2o for Equality-Constrained Factor Graph Optimization Anas Abdelkarim et.al. 2503.01311 link
2025-03-05 A Multi-Sensor Fusion Approach for Rapid Orthoimage Generation in Large-Scale UAV Mapping Jialei He et.al. 2503.01202 null
2025-03-02 MTReD: 3D Reconstruction Dataset for Fly-over Videos of Maritime Domain Rui Yi Yong et.al. 2503.00853 null
2025-03-02 PSRGS: Progressive Spectral Residual of 3D Gaussian for High-Frequency Recovery BoCheng Li et.al. 2503.00848 null
2025-03-02 Multi-Cali Anything: Dense Feature Multi-Frame Structure-from-Motion for Large-Scale Camera Array Calibration Jinjiang You et.al. 2503.00737 link
2025-02-28 The THESAN-ZOOM project: Burst, quench, repeat -- unveiling the evolution of high-redshift galaxies along the star-forming main sequence William McClymont et.al. 2503.00106 null
2025-02-27 Best Foot Forward: Robust Foot Reconstruction in-the-wild Kyle Fogarty et.al. 2502.20511 null
2025-02-26 SLAM in the Dark: Self-Supervised Learning of Pose, Depth and Loop-Closure from Thermal Images Yangfan Xu et.al. 2502.18932 null
2025-03-04 Unposed Sparse Views Room Layout Reconstruction in the Age of Pretrain Model Yaxuan Huang et.al. 2502.16779 null
2025-02-20 CDGS: Confidence-Aware Depth Regularization for 3D Gaussian Splatting Qilin Zhang et.al. 2502.14684 link
2025-02-19 Structure-from-Sherds++: Robust Incremental 3D Reassembly of Axially Symmetric Pots from Unordered and Mixed Fragment Collections Seong Jong Yoo et.al. 2502.13986 null
2025-02-19 IM360: Textured Mesh Reconstruction for Large-scale Indoor Mapping with 360$^\circ$ Cameras Dongki Jung et.al. 2502.12545 null
2025-02-12 Causal Analysis of ASR Errors for Children: Quantifying the Impact of Physiological, Cognitive, and Extrinsic Factors Vishwanath Pratap Singh et.al. 2502.08587 null
2025-02-10 FOCUS -- Multi-View Foot Reconstruction From Synthetically Trained Dense Correspondences Oliver Boyne et.al. 2502.06367 link
2025-02-09 Audio-Visual Representation Learning via Knowledge Distillation from Speech Foundation Models Jing-Xuan Zhang et.al. 2502.05766 link
2025-02-10 Building Rome with Convex Optimization Haoyu Han et.al. 2502.04640 null
2025-02-04 SiLVR: Scalable Lidar-Visual Radiance Field Reconstruction with Uncertainty Quantification Yifu Tao et.al. 2502.02657 null
2025-02-05 GP-GS: Gaussian Processes for Enhanced Gaussian Splatting Zhihao Guo et.al. 2502.02283 link
2025-02-03 XR-VIO: High-precision Visual Inertial Odometry with Fast Initialization for XR Applications Shangjin Zhai et.al. 2502.01297 null
2025-01-29 Segmentation-Aware Generative Reinforcement Network (GRN) for Tissue Layer Segmentation in 3-D Ultrasound Images for Chronic Low-back Pain (cLBP) Assessment Zixue Zeng et.al. 2501.17690 link
2025-01-28 Automatic Calibration of a Multi-Camera System with Limited Overlapping Fields of View for 3D Surgical Scene Reconstruction Tim Flückiger et.al. 2501.16221 null
2025-01-25 Towards Better Robustness: Progressively Joint Pose-3DGS Learning for Arbitrarily Long Videos Zhen-Hui Dong et.al. 2501.15096 null
2025-01-24 MATCHA: Towards Matching Anything Fei Xue et.al. 2501.14945 null
2025-01-24 Light3R-SfM: Towards Feed-forward Structure-from-Motion Sven Elflein et.al. 2501.14914 null
2025-01-24 Dense-SfM: Structure from Motion with Dense Consistent Matching JongMin Lee et.al. 2501.14277 null
2025-01-21 Theory of quantum-geometric charge and spin Josephson diode effects in strongly spin-polarized hybrid structures with noncoplanar spin textures Niklas L. Schulz et.al. 2501.12232 null
2025-01-14 Selective Attention Merging for low resource tasks: A case study of Child ASR Natarajan Balaji Shankar et.al. 2501.08468 link
2025-01-14 SplatMAP: Online Dense Monocular SLAM with 3D Gaussian Splatting Yue Hu et.al. 2501.07015 null
2025-02-02 CULTURE3D: Cultural Landmarks and Terrain Dataset for 3D Applications Xinyi Zheng et.al. 2501.06927 link
2025-01-11 Aug3D: Augmenting large scale outdoor datasets for Generalizable Novel View Synthesis Aditya Rauniyar et.al. 2501.06431 null
2025-01-09 Existence of dynamical fluctuation in AMPT generated data for Au+Au collisions at 10 AGeV Somen Gope et.al. 2501.05175 null
2025-01-06 Targetless Intrinsics and Extrinsic Calibration of Multiple LiDARs and Cameras with IMU using Continuous-Time Estimation Yuezhang Lv et.al. 2501.02821 null
2025-01-02 On Unifying Video Generation and Camera Pose Estimation Chun-Hao Paul Huang et.al. 2501.01409 null
2025-01-02 EasySplat: View-Adaptive Learning makes 3D Gaussian Splatting Easy Ao Gao et.al. 2501.01003 null
2024-12-30 KeyGS: A Keyframe-Centric Gaussian Splatting Method for Monocular Image Sequences Keng-Wei Chang et.al. 2412.20767 null
2024-12-27 Dust to Tower: Coarse-to-Fine Photo-Realistic Scene Reconstruction from Sparse Uncalibrated Images Xudong Cai et.al. 2412.19518 null
2024-12-25 Structured Speaker-Deficiency Adaptation of Foundation Models for Dysarthric and Elderly Speech Recognition Shujie Hu et.al. 2412.18832 null
2024-12-23 Reconstructing People, Places, and Cameras Lea Müller et.al. 2412.17806 link
2024-12-18 Foundation Models Meet Low-Cost Sensors: Test-Time Adaptation for Rescaling Disparity for Zero-Shot Metric Depth Estimation Rémi Marsal et.al. 2412.14103 null
2024-12-16 Speech Foundation Models and Crowdsourcing for Efficient, High-Quality Data Collection Beomseok Lee et.al. 2412.11978 null
2024-12-18 SplineGS: Robust Motion-Adaptive Spline for Real-Time Dynamic 3D Gaussians from Monocular Video Jongmin Park et.al. 2412.09982 null
2024-12-12 CoDTS: Enhancing Sparsely Supervised Collaborative Perception with a Dual Teacher-Student Framework Yushan Han et.al. 2412.08344 null
2024-12-10 Deep Non-rigid Structure-from-Motion Revisited: Canonicalization and Sequence Modeling Hui Deng et.al. 2412.07230 null
2024-12-08 Unveiling True Talent: The Soccer Factor Model for Skill Evaluation Alexandre Andorra et.al. 2412.05911 null
2024-12-08 Doppelgangers++: Improved Visual Disambiguation with Geometric 3D Features Yuanbo Xiangli et.al. 2412.05826 null
2024-12-06 MegaSaM: Accurate, Fast, and Robust Structure and Motion from Casual Dynamic Videos Zhengqi Li et.al. 2412.04463 null
2024-12-03 ASANet: Asymmetric Semantic Aligning Network for RGB and SAR image land cover classification Pan Zhang et.al. 2412.02044 link
2024-12-02 SfM-Free 3D Gaussian Splatting via Hierarchical Training Bo Ji et.al. 2412.01553 link
2024-12-02 MVImgNet2.0: A Larger-scale Dataset of Multi-view Images Xiaoguang Han et.al. 2412.01430 null
2024-12-02 TAS-TsC: A Data-Driven Framework for Estimating Time of Arrival Using Temporal-Attribute-Spatial Tri-space Coordination of Truck Trajectories Mengran Li et.al. 2412.01122 null
2024-12-02 Look Ma, No Ground Truth! Ground-Truth-Free Tuning of Structure from Motion and Visual SLAM Alejandro Fontan et.al. 2412.01116 null
2024-11-27 RoMo: Robust Motion Segmentation Improves Structure from Motion Lily Goli et.al. 2411.18650 null
2024-11-26 The MAGPI Survey: radial trends in star formation across different cosmological simulations in comparison with observations at $z \sim$ 0.3 Marcie Mun et.al. 2411.17882 null
2024-11-25 Characterizing Stellar and Gas Properties in NGC 628: Spatial Distributions, Radial Gradients, and Resolved Scaling Relations Peng Wei et.al. 2411.16150 null
2024-11-24 ZeroGS: Training 3D Gaussian Splatting from Unposed Images Yu Chen et.al. 2411.15779 null
2024-11-20 DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild Weicai Ye et.al. 2411.13291 null
2024-11-15 SPARS3R: Semantic Prior Alignment and Regularization for Sparse 3D Reconstruction Yutao Tang et.al. 2411.12592 link
2024-11-15 The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual Localisation, Reconstruction and Radiance Field Methods Yifu Tao et.al. 2411.10546 null
2024-11-13 4D Gaussian Splatting in the Wild with Uncertainty-Aware Regularization Mijeong Kim et.al. 2411.08879 null
2024-11-13 Biomass phenotyping of oilseed rape through UAV multi-view oblique imaging with 3DGS and SAM model Yutao Shen et.al. 2411.08453 null
2024-11-08 From Transparent to Opaque: Rethinking Neural Implicit Surfaces with $α$-NeuS Haoran Zhang et.al. 2411.05362 link
2024-10-29 A Cascade Approach for APT Campaign Attribution in System Event Logs: Technique Hunting and Subgraph Matching Yi-Ting Huang et.al. 2410.22602 null
2024-10-29 LiVisSfM: Accurate and Robust Structure-from-Motion with LiDAR and Visual Cues Hanqing Jiang et.al. 2410.22213 null
2024-10-17 Stochastic Flow Matching for Resolving Small-Scale Physics Stathi Fotiadis et.al. 2410.19814 null
2024-10-25 A Robust and Efficient Visual-Inertial Initialization with Probabilistic Normal Epipolar Constraint Changshi Mu et.al. 2410.19473 link
2024-10-30 Large Spatial Model: End-to-end Unposed Images to Semantic 3D Zhiwen Fan et.al. 2410.18956 link
2024-10-23 CO-CAVITY project: Molecular gas and star formation in void galaxies M. I. Rodríguez et.al. 2410.18078 null
2024-10-23 PLGS: Robust Panoptic Lifting with 3D Gaussian Splatting Yu Wang et.al. 2410.17505 null
2024-10-20 Neural Active Structure-from-Motion in Dark and Textureless Environment Kazuto Ichimaru et.al. 2410.15378 null
2024-10-17 SemSim: Revisiting Weak-to-Strong Consistency from a Semantic Similarity Perspective for Semi-supervised Medical Image Segmentation Shiao Xie et.al. 2410.13486 null
2024-10-16 Multi-View Multi-Task Modeling with Speech Foundation Models for Speech Forensic Tasks Orchid Chetia Phukan et.al. 2410.12947 null
2024-10-16 Gravity-aligned Rotation Averaging with Circular Regression Linfei Pan et.al. 2410.12763 link
2024-10-16 Beyond Speech and More: Investigating the Emergent Ability of Speech Foundation Models for Classifying Physiological Time-Series Signals Orchid Chetia Phukan et.al. 2410.12645 null
2024-10-15 SplatPose+: Real-time Image-Based Pose-Agnostic 3D Anomaly Detection Yizhe Liu et.al. 2410.12080 link
2024-10-15 LoGS: Visual Localization via Gaussian Splatting with Fewer Training Images Yuzhou Cheng et.al. 2410.11505 null
2024-10-15 Multiview Scene Graph Juexiao Zhang et.al. 2410.11187 link
2024-10-12 Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence Felipe Cadar et.al. 2410.09533 link
2024-10-09 Surgical Depth Anything: Depth Estimation for Surgical Scenes using Foundation Models Ange Lou et.al. 2410.07434 null
2024-10-09 Deep HI Mapping of M 106 Group with FAST Yao Liu et.al. 2410.07038 null
2024-10-09 MaD-Scientist: AI-based Scientist solving Convection-Diffusion-Reaction Equations Using Massive PINN-Based Prior Data Mingu Kang et.al. 2410.06442 null
2024-10-08 Are Minimal Radial Distortion Solvers Necessary for Relative Pose Estimation? Charalambos Tzamos et.al. 2410.05984 link
2024-10-04 Refinement of Monocular Depth Maps via Multi-View Differentiable Rendering Laura Fink et.al. 2410.03861 link
2024-10-01 MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages Marco Gaido et.al. 2410.01036 link
2024-10-01 Seamless Augmented Reality Integration in Arthroscopy: A Pipeline for Articular Reconstruction and Guidance Hongchao Shu et.al. 2410.00386 null
2024-09-29 Robust Incremental Structure-from-Motion with Hybrid Features Shaohui Liu et.al. 2409.19811 null
2024-09-27 MASt3R-SfM: a Fully-Integrated Solution for Unconstrained Structure-from-Motion Bardienus Duisterhof et.al. 2409.19152 null
2024-09-27 Exploiting Motion Prior for Accurate Pose Estimation of Dashboard Cameras Yipeng Lu et.al. 2409.18673 null
2024-09-26 BlinkTrack: Feature Tracking over 100 FPS via Events and Images Yichen Shen et.al. 2409.17981 null
2024-09-25 How to Connect Speech Foundation Models and Large Language Models? What Matters and What Does Not Francesco Verdini et.al. 2409.17044 null
2024-09-24 Frequency-based View Selection in Gaussian Splatting Reconstruction Monica M. Q. Li et.al. 2409.16470 null
2024-10-07 Initialization of Monocular Visual Navigation for Autonomous Agents Using Modified Structure from Small Motion Juan-Diego Florez et.al. 2409.16465 null
2024-09-24 Exploring the potential of collaborative UAV 3D mapping in Kenyan savanna for wildlife research Vandita Shukla et.al. 2409.15914 null
2024-09-23 Assessment of Submillimeter Precision via Structure from Motion Technique in Close-Range Capture Environments Francisco Roza de Moraes et.al. 2409.15602 null
2024-09-23 Evaluating Robot Influence on Pedestrian Behavior Models for Crowd Simulation and Benchmarking Subham Agrawal et.al. 2409.14844 null
2024-09-21 Are Music Foundation Models Better at Singing Voice Deepfake Detection? Far-Better Fuse them with Speech Foundation Models Orchid Chetia Phukan et.al. 2409.14131 null
2024-09-17 GS-Net: Generalizable Plug-and-Play 3D Gaussian Splatting Module Yichen Zhang et.al. 2409.11307 null
2024-09-13 Dense Point Clouds Matter: Dust-GS for Scene Reconstruction from Sparse Viewpoints Shan Chen et.al. 2409.08613 null
2024-09-09 KRONC: Keypoint-based Robust Camera Optimization for 3D Car Reconstruction Davide Di Nucci et.al. 2409.05407 null
2024-09-06 The Arizona Molecular ISM Survey with the SMT: Variations in the CO(2-1)/CO(1-0) Line Ratio Across the Galaxy Population Ryan P. Keenan et.al. 2409.03963 null
2024-09-05 Active Galactic Nuclei in the Green Valley at z $\sim$ 0.7 Charity Woodrum et.al. 2409.03197 null
2024-09-04 Object Gaussian for Monocular 6D Pose Estimation from Sparse Views Luqing Luo et.al. 2409.02581 null
2024-09-11 Geometry-aware Feature Matching for Large-Scale Structure from Motion Gonglin Chen et.al. 2409.02310 null
2024-09-04 The study of strongly intensive observables for $π^{\pm,0}$ in $pp$ collisions at LHC energy in the framework of PYTHIA model Tumpa Biswas et.al. 2409.00525 null
2024-09-04 Augmented Reality without Borders: Achieving Precise Localization Without Maps Albert Gassol Puigjaner et.al. 2408.17373 null
2024-09-15 Mismatched: Evaluating the Limits of Image Matching Approaches and Benchmarks Sierra Bonilla et.al. 2408.16445 link
2024-08-21 Visual Localization in 3D Maps: Comparing Point Cloud, Mesh, and NeRF Representations Lintong Zhang et.al. 2408.11966 null
2024-08-20 TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks Jinjie Mai et.al. 2408.10739 null
2024-08-16 Correspondence-Guided SfM-Free 3D Gaussian Splatting for NVS Wei Sun et.al. 2408.08723 null
2024-08-15 CorrAdaptor: Adaptive Local Context Learning for Correspondence Pruning Wei Zhu et.al. 2408.08134 link
2024-08-13 A Miniature Vision-Based Localization System for Indoor Blimps Shicong Ma et.al. 2408.06648 null
2024-08-07 Towards Real-Time Gaussian Splatting: Accelerating 3DGS through Photometric SLAM Yan Song Hu et.al. 2408.03825 null
2024-08-05 Context-aware Mamba-based Reinforcement Learning for social robot navigation Syed Muhammad Mustafa et.al. 2408.02661 null
2024-08-04 Birational geometry of critical loci in Algebraic Vision Marina Bertolini et.al. 2408.02067 null
2024-08-04 PanicleNeRF: low-cost, high-precision in-field phenotyping of rice panicles with smartphone Xin Yang et.al. 2408.02053 null
2024-08-02 Structure from Motion-based Motion Estimation and 3D Reconstruction of Unknown Shaped Space Debris Kentaro Uno et.al. 2408.01035 null
2024-08-01 LoopSparseGS: Loop Based Sparse-View Friendly Gaussian Splatting Zhenyu Bao et.al. 2408.00254 null
2024-07-29 Global Structure-from-Motion Revisited Linfei Pan et.al. 2407.20219 link
2024-08-06 Revisit Self-supervised Depth Estimation with Local Structure-from-Motion Shengjie Zhu et.al. 2407.19166 null
2024-07-23 The Hidden Variables: Harnessing Half-Shell Potentials for Enhanced Precision in Nuclear Reaction Calculations Hao Liu et.al. 2407.16452 null
2024-07-22 Enhancement of 3D Gaussian Splatting using Raw Mesh for Photorealistic Recreation of Architectures Ruizhe Wang et.al. 2407.15435 null
2024-07-16 NeuSurfEmb: A Complete Pipeline for Dense Correspondence-based 6D Object Pose Estimation without CAD Models Francesco Milano et.al. 2407.12207 link
2024-07-15 LVCP: LiDAR-Vision Tightly Coupled Collaborative Real-time Relative Positioning Zhuozhu Jian et.al. 2407.10782 null
2024-07-15 Towards Scale-Aware Full Surround Monodepth with Transformers Yuchen Yang et.al. 2407.10406 null
2024-07-14 3DEgo: 3D Editing on the Go! Umar Khalid et.al. 2407.10102 null
2024-07-10 Hybrid Structure-from-Motion and Camera Relocalization for Enhanced Egocentric Localization Jinjie Mai et.al. 2407.08023 link
2024-07-10 Euclid preparation. Forecasting the recovery of galaxy physical properties and their relations with template-fitting and machine-learning methods Euclid Collaboration et.al. 2407.07940 null
2024-07-10 Controlling Space and Time with Diffusion Models Daniel Watson et.al. 2407.07860 null
2024-07-09 Computer vision tasks for intelligent aerospace missions: An overview Huilin Chen et.al. 2407.06513 null
2024-07-08 Enhancing Neural Radiance Fields with Depth and Normal Completion Priors from Sparse Views Jiawei Guo et.al. 2407.05666 null
2024-07-05 Efficient Detection of Long Consistent Cycles and its Application to Distributed Synchronization Shaohan Li et.al. 2407.04260 null
2024-07-15 SfM on-the-fly: Get better 3D from What You Capture Zongqian Zhan et.al. 2407.03939 null
2024-07-03 Free-SurGS: SfM-Free 3D Gaussian Splatting for Surgical Scene Reconstruction Jiaxin Guo et.al. 2407.02918 link
2024-07-02 Indoor 3D Reconstruction with an Unknown Camera-Projector Pair Zhaoshuai Qi et.al. 2407.01945 null
2024-06-27 SALVe: Semantic Alignment Verification for Floorplan Reconstruction from Sparse Panoramas John Lambert et.al. 2406.19390 link
2024-06-27 STAL3D: Unsupervised Domain Adaptation for 3D Object Detection via Collaborating Self-Training and Adversarial Learning Yanan Zhang et.al. 2406.19362 null
2024-06-26 VDG: Vision-Only Dynamic Gaussian for Driving Simulation Hao Li et.al. 2406.18198 null
2024-06-25 Consensus Learning with Deep Sets for Essential Matrix Estimation Dror Moran et.al. 2406.17414 link
2024-06-24 Crowd-Sourced NeRF: Collecting Data from Production Vehicles for 3D Street View Reconstruction Tong Qin et.al. 2406.16289 null
2024-06-21 The importance of stochasticity in determining galaxy emissivities and UV LFs during cosmic dawn and reionization Ivan Nikolić et.al. 2406.15237 link
2024-06-19 MVSBoost: An Efficient Point Cloud-based 3D Reconstruction Umair Haroon et.al. 2406.13515 null
2024-06-17 MegaScenes: Scene-Level View Synthesis at Scale Joseph Tung et.al. 2406.11819 link
2024-06-15 Benchmarking Children's ASR with Supervised and Self-supervised Speech Foundation Models Ruchao Fan et.al. 2406.10507 link
2024-06-14 On the Evaluation of Speech Foundation Models for Spoken Language Understanding Siddhant Arora et.al. 2406.10083 null
2024-06-12 Self-supervised Learning of Neural Implicit Feature Fields for Camera Pose Refinement Maxime Pietrantoni et.al. 2406.08463 null
2024-06-12 SVSNet+: Enhancing Speaker Voice Similarity Assessment Models with Representations from Speech Foundation Models Chun Yin et.al. 2406.08445 null
2024-06-10 Lighting Every Darkness with 3DGS: Fast Training and Real-Time Rendering for HDR View Synthesis Xin Jin et.al. 2406.06216 link
2024-06-07 The Star-Forming Main Sequence in JADES and CEERS at $z>1.4$: Investigating the Burstiness of Star Formation Leonardo Clarke et.al. 2406.05178 null
2024-06-13 Gaussian Splatting with Localized Points Management Haosen Yang et.al. 2406.04251 null
2024-06-05 L-PR: Exploiting LiDAR Fiducial Marker for Unordered Low Overlap Multiview Point Cloud Registration Yibo Liu et.al. 2406.03298 link
2024-06-04 CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation Dejia Xu et.al. 2406.02509 null
2024-05-29 Neural Radiance Fields for Novel View Synthesis in Monocular Gastroscopy Zijie Jiang et.al. 2405.18863 null
2024-05-29 3D Reconstruction with Fast Dipole Sums Hanyu Chen et.al. 2405.16788 null
2024-05-26 MCGMapper: Light-Weight Incremental Structure from Motion and Visual Localization With Planar Markers and Camera Groups Yusen Xie et.al. 2405.16599 null
2024-05-26 Categorical Flow Matching on Statistical Manifolds Chaoran Cheng et.al. 2405.16441 link
2024-05-22 Exploring Galaxy Properties of eCALIFA with Contrastive Learning G. Martínez-Solaeche et.al. 2405.13471 null
2024-05-23 Switched Flow Matching: Eliminating Singularities via Switching ODEs Qunxi Zhu et.al. 2405.11605 null
2024-05-28 NeRO: Neural Road Surface Reconstruction Ruibo Wang et.al. 2405.10554 link
2024-05-15 Three Dimensional Spatial Cognition: Bees and Bats Robert Worden et.al. 2405.09413 null
2024-05-09 Similarity Guided Multimodal Fusion Transformer for Semantic Location Prediction in Social Media Zhizhen Zhang et.al. 2405.05760 null
2024-05-09 Power Variable Projection for Initialization-Free Large-Scale Bundle Adjustment Simon Weber et.al. 2405.05079 link
2024-05-07 Novel View Synthesis with Neural Radiance Fields for Industrial Robot Applications Markus Hillemann et.al. 2405.04345 null
2024-05-07 Non-rigid Structure-from-Motion: Temporally-smooth Procrustean Alignment and Spatially-variant Deformation Modeling Jiawei Shi et.al. 2405.04309 null
2024-05-06 Transformer-based RGB-T Tracking with Channel and Spatial Feature Fusion Yunfeng Li et.al. 2405.03177 link
2024-05-03 HoloGS: Instant Depth-based 3D Gaussian Splatting with Microsoft HoloLens 2 Miriam Jäger et.al. 2405.02005 null
2024-04-25 The MAGPI Survey: Evolution of radial trends in star formation activity across cosmic time Marcie Mun et.al. 2404.16319 null
2024-04-22 Scene Coordinate Reconstruction: Posing of Image Collections via Incremental Learning of a Relocalizer Eric Brachmann et.al. 2404.14351 null
2024-04-22 RESFM: Robust Equivariant Multiview Structure from Motion Fadi Khatib et.al. 2404.14280 null
2024-04-22 Does Gaussian Splatting need SFM Initialization? Yalda Foroutan et.al. 2404.12547 null
2024-05-07 A Subspace-Constrained Tyler's Estimator and its Applications to Structure from Motion Feng Yu et.al. 2404.11590 link
2024-04-18 DeblurGS: Gaussian Splatting for Camera Motion Blur Jeongtaek Oh et.al. 2404.11358 null
2024-05-21 LetsGo: Large-Scale Garage Modeling and Rendering via LiDAR-Assisted Gaussian Primitives Jiadi Cui et.al. 2404.09748 null
2024-04-12 MonoPatchNeRF: Improving Neural Radiance Fields with Patch-based Monocular Guidance Yuqun Wu et.al. 2404.08252 null
2024-04-11 Boosting Self-Supervision for Single-View Scene Completion via Knowledge Distillation Keonhee Han et.al. 2404.07933 null
2024-04-07 NeRF2Points: Large-Scale Point Cloud Generation From Street Views' Radiance Field Optimization Peng Tu et.al. 2404.04875 null
2024-04-04 GaSpCT: Gaussian Splatting for Novel CT Projection View Synthesis Emmanouil Nikolakakis et.al. 2404.03126 null
2024-03-29 InstantSplat: Unbounded Sparse-view Pose-free Gaussian Splatting in 40 Seconds Zhiwen Fan et.al. 2403.20309 link
2024-03-29 HO-Gaussian: Hybrid Optimization of 3D Gaussian Splatting for Urban Scenes Zhuopeng Li et.al. 2403.20032 null
2024-03-26 NeRF-HuGS: Improved Neural Radiance Fields in Non-static Scenes Using Heuristics-Guided Segmentation Jiahao Chen et.al. 2403.17537 null
2024-03-25 INPC: Implicit Neural Point Clouds for Radiance Field Rendering Florian Hahlbohm et.al. 2403.16862 null
2024-03-18 An Accurate and Real-time Relative Pose Estimation from Triple Point-line Images by Decoupling Rotation and Translation Zewen Xu et.al. 2403.11639 null
2024-03-14 Relaxing Accurate Initialization Constraint for 3D Gaussian Splatting Jaewoo Jung et.al. 2403.09413 link
2024-03-13 Refractive COLMAP: Refractive Structure-from-Motion Revisited Mengkun She et.al. 2403.08640 null
2024-03-13 NeRF-Supervised Feature Point Detection and Description Ali Youssef et.al. 2403.08156 link
2024-03-11 SiLVR: Scalable Lidar-Visual Reconstruction with Neural Radiance Fields for Robotic Inspection Yifu Tao et.al. 2403.06877 null
2024-03-24 BAGS: Blur Agnostic Gaussian Splatting through Multi-Scale Kernel Modeling Cheng Peng et.al. 2403.04926 link
2024-02-22 GaussianPro: 3D Gaussian Splatting with Progressive Propagation Kai Cheng et.al. 2402.14650 null
2024-02-25 A Robust Error-Resistant View Selection Method for 3D Reconstruction Shaojie Zhang et.al. 2402.11431 null
2024-02-17 Dense Matchers for Dense Tracking Tomáš Jelínek et.al. 2402.11287 null
2024-03-11 Local Feature Matching Using Deep Learning: A Survey Shibiao Xu et.al. 2401.17592 link
2024-01-22 HG3-NeRF: Hierarchical Geometric, Semantic, and Photometric Guided Neural Radiance Fields for Sparse View Inputs Zelin Gao et.al. 2401.11711 null
2024-01-19 SCENES: Subpixel Correspondence Estimation With Epipolar Supervision Dominik A. Kloepfer et.al. 2401.10886 null
2024-01-15 3DMASC: Accessible, explainable 3D point clouds classification. Application to Bi-spectral Topo-bathymetric lidar data Mathilde Letard et.al. 2401.09481 link
2024-01-17 3D Scene Geometry Estimation from 360$^\circ$ Imagery: A Survey Thiago Lopes Trugillo da Silveira et.al. 2401.09252 null
2024-01-17 ICON: Incremental CONfidence for Joint Pose and Radiance Field Optimization Weiyao Wang et.al. 2401.08937 null
2024-01-16 Cross-Modal Semi-Dense 6-DoF Tracking of an Event Camera in Challenging Conditions Yi-Fan Zuo et.al. 2401.08043 link
2024-01-10 Structure from Duplicates: Neural Inverse Graphics from a Pile of Objects Tianhang Cheng et.al. 2401.05236 link
2024-01-07 A Classification of Critical Configurations for any Number of Projective Views Martin Bråtelund et.al. 2401.03450 link
2023-12-24 Residual Learning for Image Point Descriptors Rashik Shrestha et.al. 2312.15471 null
2023-12-16 Transformers in Unsupervised Structure-from-Motion Hemang Chawla et.al. 2312.10529 link
2023-12-14 HeadRecon: High-Fidelity 3D Head Reconstruction from Monocular Video Xueying Wang et.al. 2312.08863 null
2023-12-14 CF-NeRF: Camera Parameter Free Neural Radiance Fields with Incremental Learning Qingsong Yan et.al. 2312.08760 null
2023-12-11 Keypoint-based Stereophotoclinometry for Characterizing and Navigating Small Bodies: A Factor Graph Approach Travis Driver et.al. 2312.06865 link
2023-12-11 Gaussian Splatting SLAM Hidenobu Matsuki et.al. 2312.06741 null
2023-12-10 SuperPrimitive: Scene Reconstruction at a Primitive Level Kirill Mazur et.al. 2312.05889 null
2023-12-07 Visual Geometry Grounded Deep Structure From Motion Jianyuan Wang et.al. 2312.04563 null
2023-11-30 Distributed Global Structure-from-Motion with a Deep Front-End Ayush Baid et.al. 2311.18801 link
2023-11-21 Robot Hand-Eye Calibration using Structure-from-Motion Nicolas Andreff et.al. 2311.11808 null
2023-11-18 LOSTU: Fast, Scalable, and Uncertainty-Aware Triangulation Sébastien Henry et.al. 2311.11171 null
2023-11-10 MonoProb: Self-Supervised Monocular Depth Estimation with Interpretable Uncertainty Rémi Marsal et.al. 2311.06137 link
2023-11-08 VET: Visual Error Tomography for Point Cloud Completion and High-Quality Neural Rendering Linus Franke et.al. 2311.04634 link
2023-10-22 A Quantitative Evaluation of Dense 3D Reconstruction of Sinus Anatomy from Monocular Endoscopic Video Jan Emily Mangulabnan et.al. 2310.14364 null
2023-10-20 FMRT: Learning Accurate Feature Matching with Reconciliatory Transformer Xinyu Zhang et.al. 2310.13605 null
2023-10-09 Colmap-PCD: An Open-source Tool for Fine Image-to-point cloud Registration Chunge Bai et.al. 2310.05504 link
2023-10-08 LocoNeRF: A NeRF-based Approach for Local Structure from Motion for Precise Localization Artem Nenashev et.al. 2310.05134 null
2023-11-29 Pose-Free Generalizable Rendering Transformer Zhiwen Fan et.al. 2310.03704 link
2023-10-02 Leveraging Cutting Edge Deep Learning Based Image Matching for Reconstructing a Large Scene from Sparse Images Georg Bökman et.al. 2310.01092 null
2023-10-01 Propagating Semantic Labels in Video Data David Balaban et.al. 2310.00783 null
2023-09-22 Scalable Semantic 3D Mapping of Coral Reefs with Deep Learning Jonathan Sauder et.al. 2309.12804 null
2023-09-21 On-the-Fly SfM: What you capture is What you get Zongqian Zhan et.al. 2309.11883 link
2023-09-19 Using an Uncrewed Surface Vehicle to Create a Volumetric Model of Non-Navigable Rivers and Other Shallow Bodies of Water Jayesh Tripathi et.al. 2309.10269 null
2023-09-16 DynaMoN: Motion-Aware Fast And Robust Camera Localization for Dynamic NeRF Mert Asim Karaoglu et.al. 2309.08927 link
2023-09-08 Robot Localization and Mapping Final Report -- Sequential Adversarial Learning for Self-Supervised Deep Visual Odometry Akankshya Kar et.al. 2309.04147 null
2023-09-01 SQLdepth: Generalizable Self-Supervised Fine-Structured Monocular Depth Estimation Youhong Wang et.al. 2309.00526 null
2023-09-01 Dense Voxel 3D Reconstruction Using a Monocular Event Camera Haodong Chen et.al. 2309.00385 null
2023-08-30 Learning Structure-from-Motion with Graph Attention Networks Lucas Brynte et.al. 2308.15984 link
2023-08-26 Disjoint Pose and Shape for 3D Face Reconstruction Raja Kumar et.al. 2308.13903 null
2023-08-30 CamP: Camera Preconditioning for Neural Radiance Fields Keunhong Park et.al. 2308.10902 null
2023-08-18 Unsupervised 3D Pose Estimation with Non-Rigid Structure-from-Motion Modeling Haorui Ji et.al. 2308.10705 null
2023-08-14 Large-scale environment mapping and immersive human-robot interaction for agricultural mobile robot teleoperation Tao Liu et.al. 2308.07231 link
2023-08-11 Efficient Large-scale AUV-based Visual Seafloor Mapping Mengkun She et.al. 2308.06147 null
2023-08-04 EDI: ESKF-based Disjoint Initialization for Visual-Inertial SLAM Systems Weihan Wang et.al. 2308.02670 null
2023-08-15 Tirtha -- An Automated Platform to Crowdsource Images and Create 3D Models of Heritage Sites Jyotirmaya Shivottam et.al. 2308.01246 link
2023-08-02 Stereo Visual Odometry with Deep Learning-Based Point and Line Feature Matching using an Attention Graph Neural Network Shenbagaraj Kannapiran et.al. 2308.01125 null
2023-07-27 PointOdyssey: A Large-Scale Synthetic Dataset for Long-Term Point Tracking Yang Zheng et.al. 2307.15055 link
2023-07-28 SACReg: Scene-Agnostic Coordinate Regression for Visual Localization Jerome Revaud et.al. 2307.11702 null
2023-07-19 Lazy Visual Localization via Motion Averaging Siyan Dong et.al. 2307.09981 null
2023-07-10 Efficient Match Pair Retrieval for Large-scale UAV Images via Graph Indexed Global Descriptor San Jiang et.al. 2307.04520 null
2023-07-07 RGB-D Mapping and Tracking in a Plenoxel Radiance Field Andreas L. Teigen et.al. 2307.03404 link
2023-06-29 The Drunkard's Odometry: Estimating Camera Motion in Deforming Scenes David Recasens et.al. 2306.16917 link
2023-06-27 Detector-Free Structure from Motion Xingyi He et.al. 2306.15669 link
2023-06-28 PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment Jianyuan Wang et.al. 2306.15667 null
2023-06-24 3D Reconstruction of Spherical Images based on Incremental Structure from Motion San Jiang et.al. 2306.12770 link
2023-06-15 NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations Varun Jampani et.al. 2306.09109 link
2023-06-15 Yes, we CANN: Constrained Approximate Nearest Neighbors for local feature-based visual localization Dror Aiger et.al. 2306.09012 link
2023-06-10 3D reconstruction using Structure for Motion Kshitij Karnawat et.al. 2306.06360 link
2023-06-02 Self-supervised Interest Point Detection and Description for Fisheye and Perspective Images Marcela Mera-Trujillo et.al. 2306.01938 null
2023-05-31 FlowCam: Training Generalizable 3D Radiance Fields without Camera Poses via Pixel-Aligned Scene Flow Cameron Smith et.al. 2306.00180 null
2023-05-19 SIDAR: Synthetic Image Dataset for Alignment & Restoration Monika Kwiatkowski et.al. 2305.12036 link
2023-05-09 Eiffel Tower: A Deep-Sea Underwater Dataset for Long-Term Visual Localization Clémentin Boittiaux et.al. 2305.05301 link
2023-05-09 Rotation Synchronization via Deep Matrix Factorization Gk Tejus et.al. 2305.05268 link
2023-04-20 A Comparative Neural Radiance Field (NeRF) 3D Analysis of Camera Poses from HoloLens Trajectories and Structure from Motion Miriam Jäger et.al. 2304.10664 null
2023-04-14 Fusing Structure from Motion and Simulation-Augmented Pose Regression from Optical Flow for Challenging Indoor Environments Felix Ott et.al. 2304.07250 null
2023-04-12 Visual Localization using Imperfect 3D Models from the Internet Vojtech Panek et.al. 2304.05947 link
2023-04-08 Photometric Correction for Infrared Sensors Jincheng Zhang et.al. 2304.03930 null
2023-04-07 DualRefine: Self-Supervised Depth and Pose Estimation Through Iterative Epipolar Sampling and Refinement Toward Equilibrium Antyanta Bangunharcana et.al. 2304.03560 link
2023-04-05 Semantic Validation in Structure from Motion Joseph Rowell et.al. 2304.02420 link
2023-03-31 Learning Internal Representations of 3D Transformations from 2D Projected Inputs Marissa Connor et.al. 2303.17776 null
2023-03-30 3D Line Mapping Revisited Shaohui Liu et.al. 2303.17504 link
2023-03-27 TMO: Textured Mesh Acquisition of Objects with a Mobile Device by using Differentiable Rendering Jaehoon Choi et.al. 2303.15060 null
2023-03-26 On the Importance of Accurate Geometry Data for Dense 3D Vision Tasks HyunJun Jung et.al. 2303.14840 link
2023-03-24 Seeing Through the Glass: Neural 3D Reconstruction of Object Inside a Transparent Container Jinguang Tong et.al. 2303.13805 link
2023-03-24 Progressively Optimized Local Radiance Fields for Robust View Synthesis Andreas Meuleman et.al. 2303.13791 null
2023-03-15 RefiNeRF: Modelling dynamic neural radiance fields with inconsistent or missing camera parameters Shuja Khalid et.al. 2303.08695 null
2023-03-09 Revisiting Rotation Averaging: Uncertainties and Robust Losses Ganlin Zhang et.al. 2303.05195 link
2023-02-28 Nonlinear Intensity, Scale and Rotation Invariant Matching for Multimodal Images Zhongli Fan et.al. 2302.14239 link
2023-03-25 BLiRF: Bandlimited Radiance Fields for Dynamic Scene Modeling Sameera Ramasinghe et.al. 2302.13543 null
2023-02-21 EC-SfM: Efficient Covisibility-based Structure-from-Motion for Both Sequential and Unordered Images Zhichao Ye et.al. 2302.10544 link
2023-02-18 Bridge Damage Cause Estimation Using Multiple Images Based on Visual Question Answering Tatsuro Yamane et.al. 2302.09208 null
2023-02-12 Uncertainty-Driven Dense Two-View Structure from Motion Weirong Chen et.al. 2302.00523 null
2023-01-28 AdaSfM: From Coarse Global to Fine Incremental Adaptive Structure from Motion Yu Chen et.al. 2301.12135 null
2023-01-20 A vision-based autonomous UAV inspection framework for unknown tunnel construction sites with dynamic obstacles Zhefan Xu et.al. 2301.08422 link
2023-03-21 Robust Dynamic Radiance Fields Yu-Lun Liu et.al. 2301.02239 link
2022-12-24 Polarimetric Multi-View Inverse Rendering Jinyu Zhao et.al. 2212.12721 null
2022-12-13 Accidental Turntables: Learning 3D Pose by Watching Objects Turn Zezhou Cheng et.al. 2212.06300 null
2022-12-04 3D Object Aided Self-Supervised Monocular Depth Estimation Songlin Wei et.al. 2212.01768 null
2022-12-02 High-Res Facial Appearance Capture from Polarized Smartphone Images Dejan Azinović et.al. 2212.01160 null
2022-11-28 FeatureBooster: Boosting Feature Descriptors with a Lightweight Neural Network Xinjiang Wang et.al. 2211.15069 link
2022-11-24 JigsawPlan: Room Layout Jigsaw Puzzle Extreme Structure from Motion using Diffusion Models Sepidehsadat Hosseini et.al. 2211.13785 null
2022-11-24 SfM-TTR: Using Structure from Motion for Test-Time Refinement of Single-View Depth Networks Sergio Izquierdo et.al. 2211.13551 link
2022-11-22 Level-S$^2$fM: Structure from Motion on Neural Level Set of Implicit Surfaces Yuxi Xiao et.al. 2211.12018 link
2022-11-21 Towards Live 3D Reconstruction from Wearable Video: An Evaluation of V-SLAM, NeRF, and Videogrammetry Techniques David Ramirez et.al. 2211.11836 null
2022-11-14 Controllable GAN Synthesis Using Non-Rigid Structure-from-Motion René Haas et.al. 2211.07195 null
2022-10-13 Quantifying and analyzing rock trait distributions of rocky fault scarps using a deep learning approach Zhiang Chen et.al. 2210.07349 null
2022-10-11 DeepMLE: A Robust Deep Maximum Likelihood Estimator for Two-view Structure from Motion Yuxi Xiao et.al. 2210.05517 null
2022-10-07 Leveraging Structure from Motion to Localize Inaccessible Bus Stops Indu Panigrahi et.al. 2210.03646 link
2022-10-01 Structure-Aware NeRF without Posed Camera via Epipolar Constraint Shu Chen et.al. 2210.00183 link
2022-10-05 FAST-LIO, Then Bayesian ICP, Then GTSFM Jerred Chen et.al. 2210.00146 null
2022-09-20 BuFF: Burst Feature Finder for Light-Constrained 3D Reconstruction Ahalya Ravendran et.al. 2209.09470 null
2022-09-19 A Hybrid Cable-Driven Robot for Non-Destructive Leafy Plant Monitoring and Mass Estimation using Structure from Motion Gerry Chen et.al. 2209.08690 null
2022-09-14 End-to-End Multi-View Structure-from-Motion with Hypercorrelation Volumes Qiao Chen et.al. 2209.06926 null
2022-09-07 Deployment of Aerial Robots during the Flood Disaster in Erftstadt / Blessem in July 2021 Hartmut Surmann et.al. 2209.03084 null
2022-08-27 Weakly and Semi-Supervised Detection, Segmentation and Tracking of Table Grapes with Limited and Noisy Data Thomas A. Ciarfuglia et.al. 2208.13001 null
2022-08-12 Handling Constrained Optimization in Factor Graphs for Autonomous Navigation Barbara Bazzana et.al. 2208.06325 null
2022-08-04 Globally Consistent Video Depth and Pose Estimation with Efficient Test-Time Training Yao-Chih Lee et.al. 2208.02709 link
2022-07-31 One Object at a Time: Accurate and Robust Structure From Motion for Robots Aravind Battaje et.al. 2208.00487 null
2022-07-23 Detection and Initial Assessment of Lunar Landing Sites Using Neural Networks Daniel Posada et.al. 2207.11413 null
2022-07-25 MeshLoc: Mesh-Based Visual Localization Vojtech Panek et.al. 2207.10762 link
2022-07-19 ParticleSfM: Exploiting Dense Point Trajectories for Localizing Moving Cameras in the Wild Wang Zhao et.al. 2207.09137 link
2022-07-16 Organic Priors in Non-Rigid Structure from Motion Suryansh Kumar et.al. 2207.06262 null
2022-07-06 A Novel Hybrid Endoscopic Dataset for Evaluating Machine Learning-based Photometric Image Enhancement Models Axel Garcia-Vega et.al. 2207.02396 null
2022-06-24 Parallel Structure from Motion for UAV Images via Weighted Connected Dominating Set San Jiang et.al. 2206.11499 null
2022-06-13 TC-SfM: Robust Track-Community-Based Structure-from-Motion Lei Wang et.al. 2206.05866 null
2022-06-10 EigenFairing: 3D Model Fairing using Image Coherence Pragyana Mishra et.al. 2206.05309 null
2022-06-01 Semantic Room Wireframe Detection from a Single View David Gillsjö et.al. 2206.00491 link
2022-05-31 Geo-Neus: Geometry-Consistent Neural Implicit Surfaces Learning for Multi-view Reconstruction Qiancheng Fu et.al. 2205.15848 null
2022-05-09 Is my Depth Ground-Truth Good Enough? HAMMER -- Highly Accurate Multi-Modal Dataset for DEnse 3D Scene Regression HyunJun Jung et.al. 2205.04565 null
2022-05-07 Optimizing Terrain Mapping and Landing Site Detection for Autonomous UAVs Pedro F. Proença et.al. 2205.03522 null
2022-05-06 EVIMO2: An Event Camera Dataset for Motion Segmentation, Optical Flow, Structure from Motion, and Visual Inertial Odometry in Indoor Scenes with Monocular or Stereo Algorithms Levi Burner et.al. 2205.03467 null
2022-04-20 Learned Monocular Depth Priors in Visual-Inertial Initialization Yunwen Zhou et.al. 2204.09171 null
2022-04-10 Deep Non-rigid Structure-from-Motion: A Sequence-to-Sequence Translation Perspective Hui Deng et.al. 2204.04730 null
2022-04-08 Constrained Bundle Adjustment for Structure From Motion Using Uncalibrated Multi-Camera Systems Debao Huang et.al. 2204.04145 null
2022-04-07 SurroundDepth: Entangling Surrounding Views for Self-Supervised Multi-Camera Depth Estimation Yi Wei et.al. 2204.03636 link
2022-04-06 Georeferencing of Photovoltaic Modules from Aerial Infrared Videos using Structure-from-Motion Lukas Bommes et.al. 2204.02733 link
2022-04-05 Depth-Guided Sparse Structure-from-Motion for Movies and TV Shows Sheng Liu et.al. 2204.02509 link
2022-03-31 Fast, Accurate and Memory-Efficient Partial Permutation Synchronization Shaohan Li et.al. 2203.16505 null
2022-03-28 Visual Odometry for RGB-D Cameras Afonso Fontes et.al. 2203.15119 null
2022-03-28 Optimizing Elimination Templates by Greedy Parameter Search Evgeniy Martyushev et.al. 2203.14901 link
2022-03-23 Event-Based Dense Reconstruction Pipeline Kun Xiao et.al. 2203.12270 null
2022-03-21 DiffPoseNet: Direct Differentiable Camera Pose Estimation Chethan M. Parameshwara et.al. 2203.11174 null
2022-03-02 Asynchronous Optimisation for Event-based Visual Odometry Daqi Liu et.al. 2203.01037 null
2022-03-02 Distributed Riemannian Optimization with Lazy Communication for Collaborative Geometric Estimation Yulun Tian et.al. 2203.00851 null
2022-02-18 MultiRes-NetVLAD: Augmenting Place Recognition Training with Low-Resolution Imagery Ahmad Khaliq et.al. 2202.09146 link
2022-01-20 GeoFill: Reference-Based Image Inpainting of Scenes with Complex Geometry Yunhan Zhao et.al. 2201.08131 null
2022-01-13 Scalable Cluster-Consistency Statistics for Robust Multi-Object Matching Yunpeng Shi et.al. 2201.04797 link
2022-01-10 High-resolution Ecosystem Mapping in Repetitive Environments Using Dual Camera SLAM Brian M. Hopkinson et.al. 2201.03364 link
2022-01-06 De-rendering 3D Objects in the Wild Felix Wimbauer et.al. 2201.02279 link
2021-12-29 On the Instability of Relative Pose Estimation and RANSAC's Role Hongyi Fan et.al. 2112.14651 null
2021-12-16 Road-aware Monocular Structure from Motion and Homography Estimation Wei Sui et.al. 2112.08635 null
2021-12-10 Critical configurations for three projective views Martin Bråtelund et.al. 2112.05478 null
2021-12-09 Critical configurations for two projective views, a new approach Martin Bråtelund et.al. 2112.05074 null
2021-12-06 Dense Depth Priors for Neural Radiance Fields from Sparse Input Views Barbara Roessle et.al. 2112.03288 link
2021-12-10 MegBA: A High-Performance and Distributed Library for Large-Scale Bundle Adjustment Jie Ren et.al. 2112.01349 link
2021-11-11 Multi-Resolution Elevation Mapping and Safe Landing Site Detection with Applications to Planetary Rotorcraft Pascal Schoppmann et.al. 2111.06271 null
2021-11-10 Damage Estimation and Localization from Sparse Aerial Imagery Rene Garcia Franceschini et.al. 2111.03708 null
2021-11-03 Event and Activity Recognition in Video Surveillance for Cyber-Physical Systems Swarnabja Bhaumik et.al. 2111.02064 null
2021-10-14 Modeling dynamic target deformation in camera calibration Annika Hagemann et.al. 2110.07322 null
2021-10-13 Hyperspectral 3D Mapping of Underwater Environments Maxime Ferrera et.al. 2110.06571 null
2021-09-24 Automatic Map Update Using Dashcam Videos Aziza Zhanabatyrova et.al. 2109.12131 null
2021-09-16 Rotation Averaging in a Split Second: A Primal-Dual Method and a Closed-Form for Cycle Graphs Gabriel Moreira et.al. 2109.08046 link
2021-09-06 Single-Camera 3D Head Fitting for Mixed Reality Clinical Applications Tejas Mane et.al. 2109.02740 null
2021-09-02 Dynamic Scene Novel View Synthesis via Deferred Spatio-temporal Consistency Beatrix-Emőke Fülöp-Balogh et.al. 2109.01018 null
2021-09-01 On the Limits of Pseudo Ground Truth in Visual Camera Re-localisation Eric Brachmann et.al. 2109.00524 link
2021-08-31 DensePose 3D: Lifting Canonical Surface Maps of Articulated Objects to the Third Dimension Roman Shapovalov et.al. 2109.00033 null
2021-08-29 Solving Viewing Graph Optimization for Simultaneous Position and Rotation Registration Seyed-Mahdi Nasiri et.al. 2108.12876 null
2021-08-23 Burst Imaging for Light-Constrained Structure-From-Motion Ahalya Ravendran et.al. 2108.09895 null

(back to top)

Visual Localization

Publish Date Title Authors PDF Code
2025-12-04 ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning Shengyuan Ding et.al. 2512.05111 null
2025-12-04 Visual Reasoning Tracer: Object-Level Grounded Reasoning Benchmark Haobo Yuan et.al. 2512.05091 null
2025-12-04 Semantic-Guided Two-Stage GAN for Face Inpainting with Hybrid Perceptual Encoding Abhigyan Bhattacharya et.al. 2512.05039 null
2025-12-04 Revealing stimulus-dependent dynamics through statistical complexity Edson V. de Paula et.al. 2512.05007 null
2025-12-04 Influence of Object Affordance on Action Language Understanding: Evidence from Dynamic Causal Modeling Analysis Supriya Bordoloi et.al. 2512.04989 null
2025-12-04 LiteVGGT: Boosting Vanilla VGGT via Geometry-aware Cached Token Merging Zhijian Shu et.al. 2512.04939 null
2025-12-04 Terahertz Fourier Ptychographic Imaging Pitambar Mukherjee et.al. 2512.04783 null
2025-12-04 TEMPO-VINE: A Multi-Temporal Sensor Fusion Dataset for Localization and Mapping in Vineyards Mauro Martini et.al. 2512.04772 null
2025-12-04 MemLoRA: Distilling Expert Adapters for On-Device Memory Systems Massimo Bini et.al. 2512.04763 null
2025-12-04 Spectral micro-CT for quantitative analysis of calcification in fibrocartilage Vittoria Mazzini et.al. 2512.04662 null
2025-11-26 Attention-Guided Patch-Wise Sparse Adversarial Attacks on Vision-Language-Action Models Naifu Zhang et.al. 2511.21663 null
2025-11-26 Fast 3D Ultrasound Localization Microscopy via Projection-based Processing Framework Jingke Zhang et.al. 2511.21647 null
2025-11-26 Qwen3-VL Technical Report Shuai Bai et.al. 2511.21631 null
2025-11-26 Harmony: Harmonizing Audio and Video Generation through Cross-Task Synergy Teng Hu et.al. 2511.21579 null
2025-11-26 FITRep: Attention-Guided Item Representation via MLLMs Guoxiao Zhang et.al. 2511.21389 null
2025-11-26 Thinking With Bounding Boxes: Enhancing Spatio-Temporal Video Grounding via Reinforcement Fine-Tuning Xin Gu et.al. 2511.21375 null
2025-11-26 HTTM: Head-wise Temporal Token Merging for Faster VGGT Weitian Wang et.al. 2511.21317 null
2025-11-26 Low-dose Chemically Specific Bioimaging via Deep-UV Lensless Holographic Microscopy on a Standard Camera Piotr Arcab et.al. 2511.21311 null
2025-11-26 Adaptive Lighting Control in Visible Light Systems: An Integrated Sensing, Communication, and Illumination Framework Xinyan Xie et.al. 2511.21271 null
2025-11-26 Towards an Effective Action-Region Tracking Framework for Fine-grained Video Action Recognition Baoli Sun et.al. 2511.21202 null
2025-11-24 Wigner and Gabor phase-space analysis of propagators for evolution equations Elena Cordero et.al. 2511.19400 null
2025-11-24 Real-Time Object Tracking with On-Device Deep Learning for Adaptive Beamforming in Dynamic Acoustic Environments Jorge Ortigoso-Narro et.al. 2511.19396 null
2025-11-24 In-vivo imaging with a low-cost MRI scanner and cloud data processing in low-resource settings Teresa Guallart-Naval et.al. 2511.19226 null
2025-11-24 Can Modern Vision Models Understand the Difference Between an Object and a Look-alike? Itay Cohen et.al. 2511.19200 null
2025-11-24 From Pixels to Posts: Retrieval-Augmented Fashion Captioning and Hashtag Generation Moazzam Umer Gondal et.al. 2511.19149 null
2025-11-24 Graph-based 3D Human Pose Estimation using WiFi Signals Jichao Chen et.al. 2511.19105 null
2025-11-24 Towards Generalizable Deepfake Detection via Forgery-aware Audio-Visual Adaptation: A Variational Bayesian Approach Fan Nie et.al. 2511.19080 null
2025-11-24 LAA3D: A Benchmark of Detecting and Tracking Low-Altitude Aircraft in 3D Space Hai Wu et.al. 2511.19057 null
2025-11-24 Multi-Agent Monocular Dense SLAM With 3D Reconstruction Priors Haihang Wu et.al. 2511.19031 null
2025-11-24 Dynamic Granularity Matters: Rethinking Vision Transformers Beyond Fixed Patch Splitting Qiyang Yu et.al. 2511.19021 null
2025-11-24 AuViRe: Audio-visual Speech Representation Reconstruction for Deepfake Temporal Localization Christos Koutlis et.al. 2511.18993 null
2025-11-24 Zero-shot segmentation of skin tumors in whole-slide images with vision-language foundation models Santiago Moreno et.al. 2511.18978 null
2025-11-24 MagicWorld: Interactive Geometry-driven Video World Exploration Guangyuan Li et.al. 2511.18886 null
2025-11-24 SP-VINS: A Hybrid Stereo Visual Inertial Navigation System based on Implicit Environmental Map Xueyu Du et.al. 2511.18756 null
2025-11-24 Seeing What Matters: Visual Preference Policy Optimization for Visual Generation Ziqi Ni et.al. 2511.18719 null
2025-11-24 CNN-Based Camera Pose Estimation and Localisation of Scan Images for Aircraft Visual Inspection Xueyan Oh et.al. 2511.18702 null
2025-11-24 Stable Multi-Drone GNSS Tracking System for Marine Robots Shuo Wen et.al. 2511.18694 null
2025-11-23 Shape-Adapting Gated Experts: Dynamic Expert Routing for Colonoscopic Lesion Segmentation Gia Huy Thai et.al. 2511.18493 null
2025-11-23 Gaze Beyond the Frame: Forecasting Egocentric 3D Visual Span Heeseung Yun et.al. 2511.18470 null
2025-11-23 LungX: A Hybrid EfficientNet-Vision Transformer Architecture with Multi-Scale Attention for Accurate Pneumonia Detection Mansur Yerzhanuly et.al. 2511.18425 null
2025-11-23 4D-VGGT: A General Foundation Model with SpatioTemporal Awareness for Dynamic Scene Geometry Estimation Haonan Wang et.al. 2511.18416 null
2025-11-23 NSTR: Neural Spectral Transport Representation for Space-Varying Frequency Fields Plein Versace et.al. 2511.18384 null
2025-11-23 Learning Visually Interpretable Oscillator Networks for Soft Continuum Robots from Video Henrik Krauss et.al. 2511.18322 null
2025-11-23 Table Comprehension in Building Codes using Vision Language Models and Domain-Specific Fine-Tuning Mohammad Aqib et.al. 2511.18306 null
2025-11-23 AIA-UltraNeRF: Acoustic-Impedance-Aware Neural Radiance Field with Hash Encodings for Robotic Ultrasound Reconstruction and Localization Shuai Zhang et.al. 2511.18293 null
2025-11-23 SwiftVGGT: A Scalable Visual Geometry Grounded Transformer for Large-Scale Scenes Jungho Lee et.al. 2511.18290 null
2025-11-22 AFT: Appearance-Based Feature Tracking for Markerless and Training-Free Shape Reconstruction of Soft Robots Shangyuan Yuan et.al. 2511.18215 null
2025-11-22 ProHD: Projection-Based Hausdorff Distance Approximation Jiuzhou Fu et.al. 2511.18207 null
2025-11-22 ARIAL: An Agentic Framework for Document VQA with Precise Answer Localization Ahmad Mohammadshirazi et.al. 2511.18192 null
2025-11-22 Bias Is a Subspace, Not a Coordinate: A Geometric Rethinking of Post-hoc Debiasing in Vision-Language Models Dachuan Zhao et.al. 2511.18123 null
2025-11-22 PromptMoE: Generalizable Zero-Shot Anomaly Detection via Visually-Guided Prompt Mixtures Yuheng Shao et.al. 2511.18116 null
2025-11-22 Spotlight: Identifying and Localizing Video Generation Errors Using VLMs Aditya Chinchure et.al. 2511.18102 null
2025-11-22 VK-Det: Visual Knowledge Guided Prototype Learning for Open-Vocabulary Aerial Object Detection Jianhang Yao et.al. 2511.18075 null
2025-11-22 HyM-UNet: Synergizing Local Texture and Global Context via Hybrid CNN-Mamba Architecture for Medical Image Segmentation Haodong Chen et.al. 2511.17988 null
2025-11-22 Signal: Selective Interaction and Global-local Alignment for Multi-Modal Object Re-Identification Yangyang Liu et.al. 2511.17965 null
2025-11-22 MambaTAD: When State-Space Models Meet Long-Range Temporal Action Detection Hui Lu et.al. 2511.17929 null
2025-11-22 MGA-VQA: Secure and Interpretable Graph-Augmented Visual Question Answering with Memory-Guided Protection Against Unauthorized Knowledge Use Ahmad Mohammadshirazi et.al. 2511.17881 null
2025-11-21 AEGIS: Preserving privacy of 3D Facial Avatars with Adversarial Perturbations Dawid Wolkiewicz et.al. 2511.17747 null
2025-11-21 Dual-Path Knowledge-Augmented Contrastive Alignment Network for Spatially Resolved Transcriptomics Wei Zhang et.al. 2511.17685 null
2025-11-18 Unified Low-Light Traffic Image Enhancement via Multi-Stage Illumination Recovery and Adaptive Noise Suppression Siddiqua Namrah et.al. 2511.17612 null
2025-11-18 3D Ground Truth Reconstruction from Multi-Camera Annotations Using UKF Linh Van Ma et.al. 2511.17609 null
2025-11-21 REMSA: An LLM Agent for Foundation Model Selection in Remote Sensing Binger Chen et.al. 2511.17442 null
2025-11-21 IndustryNav: Exploring Spatial Reasoning of Embodied Agents in Dynamic Industrial Navigation Yifan Li et.al. 2511.17384 null
2025-11-21 SVRecon: Sparse Voxel Rasterization for Surface Reconstruction Seunghun Oh et.al. 2511.17364 null
2025-11-21 NoPe-NeRF++: Local-to-Global Optimization of NeRF with No Pose Prior Dongbo Shi et.al. 2511.17322 null
2025-11-21 MolSight: Optical Chemical Structure Recognition with SMILES Pretraining, Multi-Granularity Learning and Reinforcement Learning Wenrui Zhang et.al. 2511.17300 null
2025-11-21 Where Culture Fades: Revealing the Cultural Gap in Text-to-Image Generation Chuancheng Shi et.al. 2511.17282 null
2025-11-21 A Little More Like This: Text-to-Image Retrieval with Vision-Language Models Using Relevance Feedback Bulat Khaertdinov et.al. 2511.17255 null
2025-11-21 Mixed Reality Scenic Live Streaming for Cultural Heritage: Visual Interactions in a Historic Landscape Zeyu Huang et.al. 2511.17246 null
2025-11-21 SING3R-SLAM: Submap-based Indoor Monocular Gaussian SLAM with 3D Reconstruction Priors Kunyi Li et.al. 2511.17207 null
2025-11-21 Navigating in the Dark: A Multimodal Framework and Dataset for Nighttime Traffic Sign Recognition Aditya Mishra et.al. 2511.17183 null
2025-11-21 Reflection-Based Relative Localization for Cooperative UAV Teams Using Active Markers Tim Lakemann et.al. 2511.17166 null
2025-11-21 Progress-Think: Semantic Progress Reasoning for Vision-Language Navigation Shuo Wang et.al. 2511.17097 null
2025-11-21 Spanning Tree Autoregressive Visual Generation Sangkyu Lee et.al. 2511.17089 null
2025-11-24 ReBrain: Brain MRI Reconstruction from Sparse CT Slice via Retrieval-Augmented Diffusion Junming Liu et.al. 2511.17068 null
2025-11-21 Stable Offline Hand-Eye Calibration for any Robot with Just One Mark Sicheng Xie et.al. 2511.17001 null
2025-11-21 VLM-Augmented Degradation Modeling for Image Restoration Under Adverse Weather Conditions Qianyi Shao et.al. 2511.16998 null
2025-11-21 DReX: Pure Vision Fusion of Self-Supervised and Convolutional Representations for Image Complexity Prediction Jonathan Skaza et.al. 2511.16991 null
2025-11-21 The Finer the Better: Towards Granular-aware Open-set Domain Generalization Yunyun Wang et.al. 2511.16979 null
2025-11-21 Single-Axis Ptychographic Coherent Diffractive Imaging for Spectroscopic and Wavefront Retrieval Qijun You et.al. 2511.16950 null
2025-11-20 SAM 3: Segment Anything with Concepts Nicolas Carion et.al. 2511.16719 null
2025-11-24 PairHuman: A High-Fidelity Photographic Dataset for Customized Dual-Person Generation Ting Pan et.al. 2511.16712 null
2025-11-20 Thinking-while-Generating: Interleaving Textual Reasoning throughout Visual Generation Ziyu Guo et.al. 2511.16671 null
2025-11-23 Comparison of Text-Based and Image-Based Retrieval in Multimodal Retrieval Augmented Generation Large Language Model Systems Elias Lumer et.al. 2511.16654 null
2025-11-20 SurvAgent: Hierarchical CoT-Enhanced Case Banking and Dichotomy-Based Multi-Agent System for Multimodal Survival Prediction Guolin Huang et.al. 2511.16635 null
2025-11-21 POMA-3D: The Point Map Way to 3D Scene Understanding Ye Mao et.al. 2511.16567 null
2025-11-20 NutriScreener: Retrieval-Augmented Multi-Pose Graph Attention Network for Malnourishment Screening Misaal Khan et.al. 2511.16566 null
2025-11-20 Contrastive vision-language learning with paraphrasing and negation Kwun Ho Ngan et.al. 2511.16527 null
2025-11-20 BoxingVI: A Multi-Modal Benchmark for Boxing Action Recognition and Localization Rahul Kumar et.al. 2511.16524 null
2025-11-20 YOWO: You Only Walk Once to Jointly Map An Indoor Scene and Register Ceiling-mounted Cameras Fan Yang et.al. 2511.16521 null
2025-11-20 TOFA: Training-Free One-Shot Federated Adaptation for Vision-Language Models Li Zhang et.al. 2511.16423 null
2025-11-20 CRISTAL: Real-time Camera Registration in Static LiDAR Scans using Neural Rendering Joni Vanherck et.al. 2511.16349 null
2025-11-20 Real-Time Inference for Distributed Multimodal Systems under Communication Delay Uncertainty Victor Croisfelt et.al. 2511.16225 null
2025-11-20 Rad-GS: Radar-Vision Integration for 3D Gaussian Splatting SLAM in Outdoor Environments Renxiang Xiao et.al. 2511.16091 null
2025-11-20 AMS-KV: Adaptive KV Caching in Multi-Scale Visual Autoregressive Transformers Boxun Xu et.al. 2511.16047 null
2025-11-19 EfficientSAM3: Progressive Hierarchical Distillation for Video Concept Segmentation from SAM1, 2, and 3 Chengxi Zeng et.al. 2511.15833 null
2025-11-19 IMACT-CXR - An Interactive Multi-Agent Conversational Tutoring System for Chest X-Ray Interpretation Tuan-Anh Le et.al. 2511.15825 null
2025-11-19 Multidimensional scaling of two-mode three-way asymmetric dissimilarities: finding archetypal profiles and clustering Aleix Alcacer et.al. 2511.15813 null
2025-11-19 GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization Yikun Wang et.al. 2511.15705 null
2025-11-19 First Frame Is the Place to Go for Video Content Customization Jingxi Chen et.al. 2511.15700 null
2025-11-19 Hierarchical Semantic Tree Anchoring for CLIP-Based Class-Incremental Learning Tao Hu et.al. 2511.15633 null
2025-11-19 Multi-Text Guided Few-Shot Semantic Segmentation Qiang Jiao et.al. 2511.15515 null
2025-11-19 SIGMMA: Hierarchical Graph-Based Multi-Scale Multi-modal Contrastive Alignment of Histopathology Image and Spatial Transcriptome Dabin Jeong et.al. 2511.15464 null
2025-11-19 HV-Attack: Hierarchical Visual Attack for Multimodal Retrieval Augmented Generation Linyin Luo et.al. 2511.15435 null
2025-11-19 The Empowerment of Science of Science by Large Language Models: New Tools and Methods Guoqiang Liang et.al. 2511.15370 null
2025-11-19 C2F-Space: Coarse-to-Fine Space Grounding for Spatial Instructions using Vision-Language Models Nayoung Oh et.al. 2511.15333 null
2025-11-19 Towards Unbiased Cross-Modal Representation Learning for Food Image-to-Recipe Retrieval Qing Wang et.al. 2511.15201 null
2025-11-19 Unbiased Semantic Decoding with Vision Foundation Models for Few-shot Segmentation Jin Wang et.al. 2511.15118 null
2025-11-19 BBox DocVQA: A Large Scale Bounding Box Grounded Dataset for Enhancing Reasoning in Document Visual Question Answer Wenhan Yu et.al. 2511.15090 null
2025-11-18 FarSLIP: Discovering Effective CLIP Adaptation for Fine-Grained Remote Sensing Understanding Zhenshi Li et.al. 2511.14901 null
2025-11-18 Quantum Transport Spectroscopy of Pseudomagnetic Field in Graphene Divya Sahani et.al. 2511.14888 null
2025-09-16 Image-Seeking Intent Prediction for Cross-Device Product Search Mariya Hendriksen et.al. 2511.14764 null
2025-11-18 FreeSwim: Revisiting Sliding-Window Attention Mechanisms for Training-Free Ultra-High-Resolution Video Generation Yunfeng Wu et.al. 2511.14712 null
2025-11-18 Overcoming global sensitivity limitations: using active subspaces to explore discrepancies between global and local parameter sensitivities Huiyan Zou et.al. 2511.14687 null
2025-11-18 A Specialized Large Language Model for Clinical Reasoning and Diagnosis in Rare Diseases Tao Yang et.al. 2511.14638 null
2025-11-18 Mind the Gaps: Measuring Visual Artifacts in Dimensionality Reduction Jaume Ros et.al. 2511.14544 null
2025-11-18 D-PerceptCT: Deep Perceptual Enhancement for Low-Dose CT Images Taifour Yousra Nabila et.al. 2511.14518 null
2025-11-18 Aerial Assistance System for Automated Firefighting during Turntable Ladder Operations Jan Quenzel et.al. 2511.14504 null
2025-11-18 DIR-TIR: Dialog-Iterative Refinement for Text-to-Image Retrieval Zongwei Zhen et.al. 2511.14449 null
2025-11-18 Agentic Video Intelligence: A Flexible Framework for Advanced Video Exploration and Understanding Hong Gao et.al. 2511.14446 null
2025-11-19 Cheating Stereo Matching in Full-scale: Physical Adversarial Attack against Binocular Depth Estimation in Autonomous Driving Kangqiao Zhao et.al. 2511.14386 null
2025-11-18 O3SLM: Open Weight, Open Data, and Open Vocabulary Sketch-Language Model Rishi Gupta et.al. 2511.14368 null
2025-11-23 Simultaneous Localization and 3D-Semi Dense Mapping for Micro Drones Using Monocular Camera and Inertial Sensors Jeryes Danial et.al. 2511.14335 null
2025-11-18 Statistically controllable microstructure reconstruction framework for heterogeneous materials using sliced-Wasserstein metric and neural networks Zhenchuan Ma et.al. 2511.14268 null
2025-11-18 EBind: a practical approach to space binding Jim Broadbent et.al. 2511.14229 null
2025-11-18 LLM-Aligned Geographic Item Tokenization for Local-Life Recommendation Hao Jiang et.al. 2511.14221 null
2025-11-19 Orion: A Unified Visual Agent for Multimodal Perception, Advanced Visual Reasoning and Execution N Dinesh Reddy et.al. 2511.14210 null
2025-11-19 PAVE: An End-to-End Dataset for Production Autonomous Vehicle Evaluation Xiangyu Li et.al. 2511.14185 null
2025-11-18 SMART: Shot-Aware Multimodal Video Moment Retrieval with Audio-Enhanced MLLM An Yu et.al. 2511.14143 null
2025-11-18 $A^2$GC: $A$symmetric $A$ggregation with Geometric Constraints for Locally Aggregated Descriptors Zhenyu Li et.al. 2511.14109 null
2025-11-18 SMGeo: Cross-View Object Geo-Localization with Grid-Level Mixture-of-Experts Fan Zhang et.al. 2511.14093 null
2025-11-18 HiEAG: Evidence-Augmented Generation for Out-of-Context Misinformation Detection Junjie Wu et.al. 2511.14027 null
2025-11-17 EchoAgent: Guideline-Centric Reasoning Agent for Echocardiography Measurement and Interpretation Matin Daghyani et.al. 2511.13948 null
2025-11-17 Start Small, Think Big: Curriculum-based Relative Policy Optimization for Visual Grounding Qingyang Yan et.al. 2511.13924 null
2025-11-17 GRLoc: Geometric Representation Regression for Visual Localization Changyang Li et.al. 2511.13864 null
2025-11-17 Hierarchical Prompt Learning for Image- and Text-Based Person Re-Identification Linhan Zhou et.al. 2511.13575 null
2025-11-17 Language-Guided Invariance Probing of Vision-Language Models Jae Joong Lee et.al. 2511.13494 null
2025-11-17 Attention Grounded Enhancement for Visual Document Retrieval Wanqing Cui et.al. 2511.13415 null
2025-11-17 Stray Light Correction for the Helioseismic and Magnetic Imager A. A. Norton et.al. 2511.13348 null
2025-11-17 Uncovering and Mitigating Transient Blindness in Multimodal Model Editing Xiaoqi Han et.al. 2511.13243 null
2025-11-17 GaRLILEO: Gravity-aligned Radar-Leg-Inertial Enhanced Odometry Chiyun Noh et.al. 2511.13216 null
2025-11-17 Large Language Models Meet Extreme Multi-label Classification: Scaling and Multi-modal Framework Diego Ortego et.al. 2511.13189 null
2025-11-17 THIR: Topological Histopathological Image Retrieval Zahra Tabatabaei et.al. 2511.13170 null
2025-11-17 SOMA: Feature Gradient Enhanced Affine-Flow Matching for SAR-Optical Registration Haodong Wang et.al. 2511.13168 null
2025-11-17 MM-Telco: Benchmarks and Multimodal Large Language Models for Telecom Applications Gagan Raj Gupta et.al. 2511.13131 null
2025-11-17 Region-Point Joint Representation for Effective Trajectory Similarity Learning Hao Long et.al. 2511.13125 null
2025-11-17 Semantics and Content Matter: Towards Multi-Prior Hierarchical Mamba for Image Deraining Zhaocheng Yu et.al. 2511.13113 null
2025-11-17 uCLIP: Parameter-Efficient Multilingual Extension of Vision-Language Models with Unpaired Data Dahyun Chung et.al. 2511.13036 null
2025-11-17 Angular Gradient Sign Method: Uncovering Vulnerabilities in Hyperbolic Networks Minsoo Jo et.al. 2511.12985 null
2025-11-17 MCAQ-YOLO: Morphological Complexity-Aware Quantization for Efficient Object Detection with Curriculum Learning Yoonjae Seo et.al. 2511.12976 null
2025-11-16 Enhancing Neuro-Oncology Through Self-Assessing Deep Learning Models for Brain Tumor Unified Model for MRI Segmentation Andrew Zhou et.al. 2511.12801 null
2025-11-16 Predicting upcoming visual features during eye movements yields scene representations aligned with human visual cortex Sushrut Thorat et.al. 2511.12715 null
2025-11-16 FLClear: Visually Verifiable Multi-Client Watermarking for Federated Learning Chen Gu et.al. 2511.12663 null
2025-11-16 D$^{2}$-VPR: A Parameter-efficient Visual-foundation-model-based Visual Place Recognition Method via Knowledge Distillation and Deformable Aggregation Zheyuan Zhang et.al. 2511.12528 null
2025-11-16 Visible Structure Retrieval for Lightweight Image-Based Relocalisation Fereidoon Zangeneh et.al. 2511.12503 null
2025-11-16 CoTBox-TTT: Grounding Medical VQA with Visual Chain-of-Thought Boxes During Test-time Training Jiahe Qian et.al. 2511.12446 null
2025-11-15 Calibrated Decomposition of Aleatoric and Epistemic Uncertainty in Deep Features for Inference-Time Adaptation Divake Kumar et.al. 2511.12389 null
2025-11-15 SpaceVLM: Sub-Space Modeling of Negation in Vision-Language Models Sepehr Kazemi Ranjbar et.al. 2511.12331 null
2025-11-15 A Disease-Aware Dual-Stage Framework for Chest X-ray Report Generation Puzhen Wu et.al. 2511.12259 null
2025-11-21 Model Inversion Attack Against Deep Hashing Dongdong Zhao et.al. 2511.12233 null
2025-11-15 FaNe: Towards Fine-Grained Cross-Modal Contrast with False-Negative Reduction and Text-Conditioned Sparse Attention Peng Zhang et.al. 2511.12215 null
2025-11-18 OmniSparse: Training-Aware Fine-Grained Sparse Attention for Long-Video MLLMs Feng Chen et.al. 2511.12201 null
2025-11-15 MAVIS: A Benchmark for Multimodal Source Attribution in Long-form Visual Question Answering Seokwon Song et.al. 2511.12142 null
2025-11-15 Look As You Think: Unifying Reasoning and Visual Evidence Attribution for Verifiable Document RAG via Reinforcement Learning Shuochen Liu et.al. 2511.12003 null
2025-11-21 Seeing the Forest and the Trees: Query-Aware Tokenizer for Long-Video Multimodal Language Models Siyou Li et.al. 2511.11910 null
2025-11-14 TopoPerception: A Shortcut-Free Evaluation of Global Visual Perception in Large Vision-Language Models Wenhao Zhou et.al. 2511.11831 null
2025-11-14 Lessons Learned from Developing a Privacy-Preserving Multimodal Wearable for Local Voice-and-Vision Inference Yonatan Tussa et.al. 2511.11811 null
2025-11-12 Task-Aware 3D Affordance Segmentation via 2D Guidance and Geometric Refinement Lian He et.al. 2511.11702 null
2025-11-12 Doubly Debiased Test-Time Prompt Tuning for Vision-Language Models Fei Song et.al. 2511.11690 null
2025-11-10 A Deep Learning Model to Predicting Changes in Consumer Attributes for New Line-extended Products Li Yinxing et.al. 2511.11646 null
2025-11-14 DocLens: A Tool-Augmented Multi-Agent Framework for Long Visual Document Understanding Dawei Zhu et.al. 2511.11552 null
2025-11-14 STEM EBIC as a Quantitative Probe of Semiconductor Devices Sebastian Schneider et.al. 2511.11528 null
2025-11-14 Bridging Hidden States in Vision-Language Models Benjamin Fein-Ashley et.al. 2511.11526 null
2025-11-14 Comprehension of Multilingual Expressions Referring to Target Objects in Visual Inputs Francisco Nogueira et.al. 2511.11427 null
2025-11-14 Shrinking the Teacher: An Adaptive Teaching Paradigm for Asymmetric EEG-Vision Alignment Lukun Wu et.al. 2511.11422 null
2025-11-14 Bidimensional measurements of photon statistics within a multimodal temporal framework C. Hainaut et.al. 2511.11403 null
2025-11-18 GRANITE: High-Resolution Imaging and Electrical Qualification of Large-Area TPC Electrodes Shumit A. Mitra et.al. 2511.11401 null
2025-11-14 StochEP: Stochastic Equilibrium Propagation for Spiking Convergent Recurrent Neural Networks Jiaqi Lin et.al. 2511.11320 null
2025-11-21 DocSLM: A Small Vision-Language Model for Long Multimodal Document Understanding Tanveer Hannan et.al. 2511.11313 null
2025-11-18 MOON Embedding: Multimodal Representation Learning for E-commerce Search Advertising Chenghan Fu et.al. 2511.11305 null
2025-11-14 3D Stokes polarimetric imaging at nanoscales Isael Herrera et.al. 2511.11222 null
2025-11-14 Positional Bias in Multimodal Embedding Models: Do They Favor the Beginning, the Middle, or the End? Kebin Wu et.al. 2511.11216 null
2025-11-21 Draft and Refine with Visual Experts Sungheon Jeong et.al. 2511.11005 null
2025-11-14 ERMoE: Eigen-Reparameterized Mixture-of-Experts for Stable Routing and Interpretable Specialization Anzhe Cheng et.al. 2511.10971 null
2025-11-13 From Attention to Frequency: Integration of Vision Transformer and FFT-ReLU for Enhanced Image Deblurring Syed Mumtahin Mahmud et.al. 2511.10806 null
2025-11-13 Semantic Property Maps for Driving Applications Marcus Greiff et.al. 2511.10798 null
2025-11-13 Fast Data Attribution for Text-to-Image Models Sheng-Yu Wang et.al. 2511.10721 null
2025-11-18 CARScenes: Semantic VLM Dataset for Safe Autonomous Driving Yuankai He et.al. 2511.10701 null
2025-11-12 DualVision ArthroNav: Investigating Opportunities to Enhance Localization and Reconstruction in Image-based Arthroscopy Navigation via External Cameras Hongchao Shu et.al. 2511.10699 null
2025-11-12 $π$-Attention: Periodic Sparse Transformers for Efficient Long-Context Modeling Dong Liu et.al. 2511.10696 null
2025-11-13 Mined Prompting and Metadata-Guided Generation for Wound Care Visual Question Answering Bavana Durgapraveen et.al. 2511.10591 null
2025-11-13 SemanticVLA: Semantic-Aligned Sparsification and Enhancement for Efficient Robotic Manipulation Wei Li et.al. 2511.10518 null
2025-11-13 Domain Adaptation for Camera-Specific Image Characteristics using Shallow Discriminators Maximiliane Gruber et.al. 2511.10424 null
2025-11-16 MonkeyOCR v1.5 Technical Report: Unlocking Robust Document Parsing for Complex Patterns Jiarui Zhang et.al. 2511.10390 null
2025-11-17 Physics informed Transformer-VAE for biophysical parameter estimation: PROSAIL model inversion in Sentinel-2 imagery Prince Mensah et.al. 2511.10387 null
2025-11-13 Rethinking Visual Information Processing in Multimodal LLMs Dongwan Kim et.al. 2511.10301 null
2025-11-13 H3Former: Hypergraph-based Semantic-Aware Aggregation via Hyperbolic Hierarchical Contrastive Loss for Fine-Grained Visual Classification Yongji Zhang et.al. 2511.10260 null
2025-11-20 TubeRMC: Tube-conditioned Reconstruction with Mutual Constraints for Weakly-supervised Spatio-Temporal Video Grounding Jinxuan Li et.al. 2511.10241 null
2025-11-13 Next-Frame Feature Prediction for Multimodal Deepfake Detection and Temporal Localization Ashutosh Anshul et.al. 2511.10212 null
2025-11-13 Beyond the Black Box: Demystifying Multi-Turn LLM Reasoning with VISTA Yiran Zhang et.al. 2511.10182 null
2025-11-13 GEA: Generation-Enhanced Alignment for Text-to-Image Person Retrieval Hao Zou et.al. 2511.10154 null
2025-11-13 Explicit Temporal-Semantic Modeling for Dense Video Captioning via Context-Aware Cross-Modal Interaction Mingda Jia et.al. 2511.10134 null
2025-11-13 GridPrune: From "Where to Look" to "What to Select" in Visual Token Pruning for MLLMs Yuxiang Duan et.al. 2511.10081 null
2025-11-13 Radiology Workflow-Guided Hierarchical Reinforcement Fine-Tuning for Medical Report Generation Bodong Du et.al. 2511.10065 null
2025-11-13 Trapped by Their Own Light: Deployable and Stealth Retroreflective Patch Attacks on Traffic Sign Recognition Systems Go Tsuruoka et.al. 2511.10050 null
2025-11-13 AffordBot: 3D Fine-grained Embodied Reasoning via Multimodal Large Language Models Xinyi Wang et.al. 2511.10017 null
2025-11-13 Learning phase diversity for solving ill-posed inverse problems in imaging Jasleen Birdi et.al. 2511.09952 null
2025-11-13 MosaicDoc: A Large-Scale Bilingual Benchmark for Visually Rich Document Understanding Ketong Chen et.al. 2511.09919 null
2025-11-12 From Street to Orbit: Training-Free Cross-View Retrieval via Location Semantics and LLM Guidance Jeongho Min et.al. 2511.09820 null
2025-11-12 PALMS+: Modular Image-Based Floor Plan Localization Leveraging Depth Foundation Model Yunqian Cheng et.al. 2511.09724 null
2025-11-12 SliderEdit: Continuous Image Editing with Fine-Grained Instruction Control Arman Zarei et.al. 2511.09715 null
2025-11-12 IFG: Internet-Scale Guidance for Functional Grasping Generation Ray Muxin Liu et.al. 2511.09558 null
2025-11-12 Warped Disk Galaxies: Statistical Properties from DESI Legacy Imaging Surveys DR8 Yiheng Wang et.al. 2511.09518 null
2025-11-12 A general framework for adaptive nonparametric dimensionality reduction Antonio Di Noia et.al. 2511.09486 null
2025-11-12 BronchOpt: Vision-Based Pose Optimization with Fine-Tuned Foundation Models for Accurate Bronchoscopy Navigation Hongchao Shu et.al. 2511.09443 null
2025-11-12 NeuroCLIP: Brain-Inspired Prompt Tuning for EEG-to-Image Multimodal Contrastive Learning Jiyuan Wang et.al. 2511.09250 null
2025-11-12 SMF-VO: Direct Ego-Motion Estimation via Sparse Motion Fields Sangheon Yang et.al. 2511.09072 null
2025-11-12 ROI-based Deep Image Compression with Implicit Bit Allocation Kai Hu et.al. 2511.08918 null
2025-11-12 Negative Entity Suppression for Zero-Shot Captioning with Synthetic Images Zimao Lu et.al. 2511.08909 null
2025-11-13 LLM-Guided Probabilistic Fusion for Label-Efficient Document Layout Analysis Ibne Farabi Shihab et.al. 2511.08903 null
2025-11-11 SIFT-Graph: Benchmarking Multimodal Defense Against Image Adversarial Attacks With Robust Feature Graph Jingjie He et.al. 2511.08810 null
2025-11-11 Decoupling Composition and Band Gap in $κ$-Ga$_2$O$_3$ Heterostructures via STEM-EELS Annett Thøgersen et.al. 2511.08728 null
2025-11-11 Spatio-Temporal Cluster-Triggered Encoding for Spiking Neural Networks Lingyun Ke et.al. 2511.08469 null
2025-11-11 Isolated massive star candidates in NGC 4242 with GULP Pietro Facchini et.al. 2511.08447 null
2025-11-11 Text-based Aerial-Ground Person Retrieval Xinyu Zhou et.al. 2511.08369 null
2025-11-11 VLMDiff: Leveraging Vision-Language Models for Multi-Class Anomaly Detection with Diffusion Samet Hicsonmez et.al. 2511.08173 null
2025-11-11 Multi-Granularity Mutual Refinement Network for Zero-Shot Learning Ning Wang et.al. 2511.08163 null
2025-11-11 Direction and speed selectivity properties for spatio-temporal receptive fields according to the generalized Gaussian derivative model for visual receptive fields Tony Lindeberg et.al. 2511.08101 null
2025-11-11 Multi-modal Deepfake Detection and Localization with FPN-Transformer Chende Zheng et.al. 2511.08031 null
2025-11-12 EAGLE: Episodic Appearance- and Geometry-aware Memory for Unified 2D-3D Visual Query Localization in Egocentric Vision Yifei Cao et.al. 2511.08007 null
2025-11-11 Towards Fine-Grained Interpretability: Counterfactual Explanations for Misclassification with Saliency Partition Lintong Zhang et.al. 2511.07974 null
2025-11-11 Exploring the Underwater World Segmentation without Extra Training Bingyu Li et.al. 2511.07923 null
2025-11-11 Visual Bridge: Universal Visual Perception Representations Generating Yilin Gao et.al. 2511.07877 null
2025-11-11 MonoCLUE: Object-Aware Clustering Enhances Monocular 3D Object Detection Sunghun Yang et.al. 2511.07862 null
2025-11-11 Semantic-Consistent Bidirectional Contrastive Hashing for Noisy Multi-Label Cross-Modal Retrieval Likang Peng et.al. 2511.07780 null
2025-11-14 Multistep Quasimetric Learning for Scalable Goal-conditioned Reinforcement Learning Bill Chunyuan Zheng et.al. 2511.07730 null
2025-11-11 Operational machine learning for remote spectroscopic detection of CH$_{4}$ point sources Vít Růžička et.al. 2511.07719 null
2025-11-19 Cross Modal Fine-Grained Alignment via Granularity-Aware and Region-Uncertain Modeling Jiale Liu et.al. 2511.07710 null
2025-11-10 Designing and Evaluating Malinowski's Lens: An AI-Native Educational Game for Ethnographic Learning Michael Hoffmann et.al. 2511.07682 null
2025-11-10 CAVER: Curious Audiovisual Exploring Robot Luca Macesanu et.al. 2511.07619 null
2025-11-08 Multivariate Variational Autoencoder Mehmet Can Yavuz et.al. 2511.07472 null
2025-11-20 AudAgent: Automated Auditing of Privacy Policy Compliance in AI Agents Ye Zheng et.al. 2511.07441 null
2025-11-10 TwinOR: Photorealistic Digital Twins of Dynamic Operating Rooms for Embodied AI Research Han Zhang et.al. 2511.07412 null
2025-11-10 YoNoSplat: You Only Need One Model for Feedforward 3D Gaussian Splatting Botao Ye et.al. 2511.07321 null
2025-11-10 VADER: Towards Causal Video Anomaly Understanding with Relation-Aware Large Language Models Ying Cheng et.al. 2511.07299 null
2025-11-10 Direct imaging of magnetotransport at graphene-metal interfaces with a single-spin quantum sensor C. Ding et.al. 2511.07181 null
2025-11-10 LeCoT: revisiting network architecture for two-view correspondence pruning Luanyuan Dai et.al. 2511.07078 null
2025-11-10 Integration of Visual SLAM into Consumer-Grade Automotive Localization Luis Diener et.al. 2511.06919 null
2025-11-10 Mono3DVG-EnSD: Enhanced Spatial-aware and Dimension-decoupled Text Encoding for Monocular 3D Visual Grounding Yuzhen Li et.al. 2511.06908 null
2025-11-10 NeuroBridge: Bio-Inspired Self-Supervised EEG-to-Image Decoding via Cognitive Priors and Bidirectional Semantic Alignment Wenjiang Zhang et.al. 2511.06836 null
2025-11-10 Semi-distributed Cross-modal Air-Ground Relative Localization Weining Lu et.al. 2511.06749 null
2025-11-10 AnoStyler: Text-Driven Localized Anomaly Generation via Lightweight Style Transfer Yulim So et.al. 2511.06687 null
2025-11-10 HiMo-CLIP: Modeling Semantic Hierarchy and Monotonicity in Vision-Language Alignment Ruijia Wu et.al. 2511.06653 null
2025-11-09 DiffusionUavLoc: Visually Prompted Diffusion for Cross-View UAV Localization Tao Liu et.al. 2511.06422 null
2025-11-09 A generalization bound for exit wave reconstruction via deep unfolding Moussa Atwi et.al. 2511.06413 null
2025-11-09 CINEMAE: Leveraging Frozen Masked Autoencoders for Cross-Generator AI Image Detection Minsuk Jang et.al. 2511.06325 null
2025-11-09 ALIGN: A Vision-Language Framework for High-Accuracy Accident Location Inference through Geo-Spatial Neural Reasoning MD Thamed Bin Zaman Chowdhury et.al. 2511.06316 null
2025-11-11 Robust Nearest Neighbour Retrieval Using Targeted Manifold Manipulation B. Ghosh et.al. 2511.06261 null
2025-11-09 ExpReS-VLA: Specializing Vision-Language-Action Models Through Experience Replay and Retrieval Shahram Najam Syed et.al. 2511.06202 null
2025-11-08 Real-Time Bundle Adjustment for Ultra-High-Resolution UAV Imagery Using Adaptive Patch-Based Feature Tracking Selim Ahmet Iz et.al. 2511.06152 null
2025-11-11 When Object-Centric World Models Meet Policy Learning: From Pixels to Policies, and Where It Breaks Stefano Ferraro et.al. 2511.06136 null
2025-11-08 Hybrid CNN-ViT Framework for Motion-Blurred Scene Text Restoration Umar Rashid et.al. 2511.06087 null
2025-11-08 Visual Exploration of Feature Relationships in Sparse Autoencoders with Curated Concepts Xinyuan Yan et.al. 2511.06048 null
2025-11-08 S2ML: Spatio-Spectral Mutual Learning for Depth Completion Zihui Zhao et.al. 2511.06033 null
2025-11-08 Towards Implicit Aggregation: Robust Image Representation for Place Recognition in the Transformer Era Feng Lu et.al. 2511.06024 null
2025-11-08 Dissecting the Perseus-Pisces supercluster observed with CFHT-MegaCam: Investigating environmental effects on galaxy morphology M. Mondelin et.al. 2511.05925 null
2025-11-08 Open-World 3D Scene Graph Generation for Retrieval-Augmented Reasoning Fei Yu et.al. 2511.05894 null
2025-11-08 HAPS Communication Networks: A Tutorial-cum-Survey on Integration with Optical Atmospheric Sensing Ali Elkhazraji et.al. 2511.05877 null
2025-11-07 SARCH: Multimodal Search for Archaeological Archives Nivedita Sinha et.al. 2511.05667 null
2025-11-05 Beyond Softmax: Dual-Branch Sigmoid Architecture for Accurate Class Activation Maps Yoojin Oh et.al. 2511.05590 null
2025-11-07 Multi-modal Loop Closure Detection with Foundation Models in Severely Unstructured Environments Laura Alejandra Encinar Gonzalez et.al. 2511.05404 null
2025-11-07 PreResQ-R1: Towards Fine-Grained Rank-and-Score Reinforcement Learning for Visual Quality Assessment via Preference-Response Disentangled Policy Optimization Zehui Feng et.al. 2511.05393 null
2025-11-07 Turning Adversaries into Allies: Reversing Typographic Attacks for Multimodal E-Commerce Product Retrieval Janet Jenq et.al. 2511.05325 null
2025-11-07 On the possibility of using decayless kink oscillations of coronal loops to forecast powerful solar flares and coronal mass ejections A. B. Nechaeva et.al. 2511.05175 null
2025-11-07 Real-World Adverse Weather Image Restoration via Dual-Level Reinforcement Learning with High-Quality Cold Start Fuyang Liu et.al. 2511.05095 null
2025-11-07 Dynamic Residual Encoding with Slide-Level Contrastive Learning for End-to-End Whole Slide Image Representation Jing Jin et.al. 2511.05034 null
2025-11-07 DAFM: Dynamic Adaptive Fusion for Multi-Model Collaboration in Composed Image Retrieval Yawei Cai et.al. 2511.05020 null
2025-11-07 Nuclear Ptychoscopy: A Ptychographic Framework for Nuclear Spectroscopy Ziyang Yuan et.al. 2511.04924 null
2025-11-06 Learning to reason about rare diseases through retrieval-augmented agents Ha Young Kim et.al. 2511.04720 null
2025-11-06 PixCLIP: Achieving Fine-grained Visual Language Understanding via Any-granularity Pixel-Text Alignment Learning Yicheng Xiao et.al. 2511.04601 null
2025-11-06 Multi-Task Learning for Visually Grounded Reasoning in Gastrointestinal VQA Itbaan Safwan et.al. 2511.04384 null
2025-11-06 High-Resolution Forest Mapping from L-Band Interferometric SAR Time Series using Deep Learning over Northern Spain Chiara Telli et.al. 2511.04362 null
2025-11-06 Probing the Probes: Methods and Metrics for Concept Alignment Jacob Lysnæs-Larsen et.al. 2511.04312 null
2025-11-06 DINOv2 Driven Gait Representation Learning for Video-Based Visible-Infrared Person Re-identification Yujie Yang et.al. 2511.04281 null
2025-11-07 On the Brittleness of CLIP Text Encoders Allie Tran et.al. 2511.04247 null
2025-11-06 An Efficient Algorithm for Learning-Based Visual Localization Jindi Zhong et.al. 2511.04232 null
2025-11-06 GraspView: Active Perception Scoring and Best-View Optimization for Robotic Grasping in Cluttered Environments Shenglin Wang et.al. 2511.04199 null
2025-11-06 Learning to Land Anywhere: Transferable Generative Models for Aircraft Trajectories Olav Finne Praesteng Larsen et.al. 2511.04155 null
2025-11-06 Learning from Online Videos at Inference Time for Computer-Use Agents Yujian Liu et.al. 2511.04137 null
2025-11-06 SpatialLock: Precise Spatial Control in Text-to-Image Synthesis Biao Liu et.al. 2511.04112 null
2025-11-06 Caption Injection for Optimization in Generative Search Engine Xiaolu Chen et.al. 2511.04080 null
2025-11-06 CaRF: Enhancing Multi-View Consistency in Referring 3D Gaussian Splatting Segmentation Yuwen Tao et.al. 2511.03992 null
2025-11-05 SILVI: Simple Interface for Labeling Video Interactions Ozan Kanbertay et.al. 2511.03819 null
2025-11-05 Expert Evaluation of LLM World Models: A High-$T_c$ Superconductivity Case Study Haoyu Guo et.al. 2511.03782 null
2025-11-05 The OpenHands Software Agent SDK: A Composable and Extensible Foundation for Production Agents Xingyao Wang et.al. 2511.03690 null
2025-11-10 Coherent Differential Imaging of high-contrast extended sources with VLT/SPHERE Axel Potier et.al. 2511.03518 null
2025-11-05 Performance Evaluation of a Position-Sensitive SiPM-based Gamma Camera for Intraoperative Imaging Aramis Raiola et.al. 2511.03493 null
2025-11-05 Lightwave Power Transfer-Enabled Underwater Optical ISAC Systems under Ship Attitude Variation Kapila W. S. Palitharathna et.al. 2511.03366 null
2025-11-05 Accelerating Physical Property Reasoning for Augmented Visual Cognition Hongbo Lan et.al. 2511.03126 null
2025-11-04 The Curved Spacetime of Transformer Architectures Riccardo Di Sipio et.al. 2511.03060 null
2025-11-04 SLIP: Structural-aware Language-Image Pretraining for Vision-Language Alignment Wenbo Lu et.al. 2511.03019 null
2025-11-04 Unsupervised Learning for Industrial Defect Detection: A Case Study on Shearographic Data Jessica Plassmann et.al. 2511.02541 null
2025-11-04 Object Detection as an Optional Basis: A Graph Matching Network for Cross-View UAV Localization Tao Liu et.al. 2511.02489 null
2025-11-04 LUMA-RAG: Lifelong Multimodal Agents with Provably Stable Streaming Alignment Rohan Wandre et.al. 2511.02371 null
2025-11-04 Learning Spatial Awareness for Laparoscopic Surgery with AI Assisted Visual Feedback Songyang Liu et.al. 2511.02233 null
2025-11-03 AlloyLens: A Visual Analytics Tool for High-throughput Alloy Screening and Inverse Design Suyang Li et.al. 2511.02133 null
2025-11-10 Enhancing Multimodal Recommendations with Vision-Language Models and Information-Aware Fusion Hai-Dang Kieu et.al. 2511.02113 null
2025-11-03 TurboMap: GPU-Accelerated Local Mapping for Visual SLAM Parsa Hosseininejad et.al. 2511.02036 null
2025-11-03 Topological Expansion of Boehm's Brushes via Structured Light Dmitry A. Pushin et.al. 2511.01841 null
2025-11-05 TIR-Bench: A Comprehensive Benchmark for Agentic Thinking-with-Images Reasoning Ming Li et.al. 2511.01833 null
2025-11-03 3EED: Ground Everything Everywhere in 3D Rong Li et.al. 2511.01755 null
2025-11-03 Progressive Translation of H&E to IHC with Enhanced Structural Fidelity Yuhang Kang et.al. 2511.01698 null
2025-11-03 Vote-in-Context: Turning VLMs into Zero-Shot Rank Fusers Mohamed Eltahir et.al. 2511.01617 null
2025-11-03 Wave-Particle (Continuous-Discrete) Dualistic Visual Tokenization for Unified Understanding and Generation Yizhu Chen et.al. 2511.01593 null
2025-11-03 Floor Plan-Guided Visual Navigation Incorporating Depth and Directional Cues Wei Huang et.al. 2511.01493 null
2025-11-03 UniSOT: A Unified Framework for Multi-Modality Single Object Tracking Yinchao Ma et.al. 2511.01427 null
2025-11-03 Semantic BIM enrichment for firefighting assets: Fire-ART dataset and panoramic image-based 3D reconstruction Ya Wen et.al. 2511.01399 null
2025-11-03 SEPS: Semantic-enhanced Patch Slimming Framework for fine-grained cross-modal alignment Xinyu Mao et.al. 2511.01390 null
2025-11-03 MIQ-SAM3D: From Single-Point Prompt to Multi-Instance Segmentation via Competitive Query Refinement Jierui Qu et.al. 2511.01345 null
2025-11-03 Direct Mapping of Intrinsic Topology of Bound States in the Continuum via Nonlinear Emission Shuzheng Chen et.al. 2511.01337 null
2025-11-03 MotionStream: Real-Time Video Generation with Interactive Motion Controls Joonghyuk Shin et.al. 2511.01266 null
2025-11-03 A Saddle Point Remedy: Power of Variable Elimination in Non-convex Optimization Min Gan et.al. 2511.01234 null
2025-11-02 Efficient Test-Time Retrieval Augmented Generation Hailong Yin et.al. 2511.01059 null
2025-11-02 Integrating Visual and X-Ray Machine Learning Features in the Study of Paintings by Goya Hassan Ugail et.al. 2511.01000 null
2025-11-02 Dynamic Multi-level Weighted Alignment Network for Zero-shot Sketch-based Image Retrieval Hanwen Su et.al. 2511.00925 null
2025-11-02 GraphGeo: Multi-Agent Debate Framework for Visual Geo-localization with Heterogeneous Graph Neural Networks Heng Zheng et.al. 2511.00908 null
2025-11-02 Enhancing Adversarial Transferability in Visual-Language Pre-training Models via Local Shuffle and Sample-based Attack Xin Liu et.al. 2511.00831 null
2025-11-01 Applying Medical Imaging Tractography Techniques to Painterly Rendering of Images Alberto Di Biase et.al. 2511.00702 null
2025-11-01 Multi-Mapcher: Loop Closure Detection-Free Heterogeneous LiDAR Multi-Session SLAM Leveraging Outlier-Robust Registration for Autonomous Vehicles Hyungtae Lim et.al. 2511.00635 null
2025-11-05 Text-guided Fine-Grained Video Anomaly Detection Jihao Gu et.al. 2511.00524 null
2025-11-01 OmniTrack++: Omnidirectional Multi-Object Tracking by Learning Large-FoV Trajectory Feedback Kai Luo et.al. 2511.00510 null
2025-11-09 VinDr-CXR-VQA: A Visual Question Answering Dataset for Explainable Chest X-Ray Analysis with Multi-Task Learning Dang H. Nguyen et.al. 2511.00504 null
2025-11-01 FedMGP: Personalized Federated Learning with Multi-Group Text-Visual Prompts Weihao Bo et.al. 2511.00480 null
2025-11-20 Weakly Supervised Pneumonia Localization from Chest X-Rays Using Deep Neural Network and Grad-CAM Explanations Kiran Shahi et.al. 2511.00456 null
2025-11-01 ToxicTextCLIP: Text-Based Poisoning and Backdoor Attacks on CLIP Pre-training Xin Yao et.al. 2511.00446 null
2025-11-01 Leveraging Hierarchical Image-Text Misalignment for Universal Fake Image Detection Daichi Zhang et.al. 2511.00427 null
2025-11-01 VinciCoder: Unifying Multimodal Code Generation via Coarse-to-fine Visual Reinforcement Learning Xuanle Zhao et.al. 2511.00391 null
2025-11-19 Spot The Ball: A Benchmark for Visual Social Inference Neha Balamurugan et.al. 2511.00261 null
2025-10-31 Generative Modeling Enables Molecular Structure Retrieval from Coulomb Explosion Imaging Xiang Li et.al. 2511.00179 null
2025-10-31 Integrating ConvNeXt and Vision Transformers for Enhancing Facial Age Estimation Gaby Maroun et.al. 2511.00123 null
2025-11-03 Image Hashing via Cross-View Code Alignment in the Age of Foundation Models Ilyass Moummad et.al. 2510.27584 null
2025-10-31 DP-FedPGN: Finding Global Flat Minima for Differentially Private Federated Learning via Penalizing Gradient Norm Junkang Liu et.al. 2510.27504 null
2025-10-31 ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use Mengjie Deng et.al. 2510.27363 null
2025-10-31 RzenEmbed: Towards Comprehensive Multimodal Retrieval Weijian Jian et.al. 2510.27350 null
2025-11-24 FOCUS: Efficient Keyframe Selection for Long Video Understanding Zirui Zhu et.al. 2510.27280 null
2025-10-31 Approximate Diverse $k$-nearest Neighbor Search in Vector Database Jiachen Zhao et.al. 2510.27243 null
2025-11-04 Dual-level Progressive Hardness-Aware Reweighting for Cross-View Geo-Localization Guozheng Zheng et.al. 2510.27181 null
2025-10-31 M^3Detection: Multi-Frame Multi-Level Feature Fusion for Multi-Modal 3D Object Detection with Camera and 4D Imaging Radar Xiaozhi Li et.al. 2510.27166 null
2025-10-31 AFM-Net: Advanced Fusing Hierarchical CNN Visual Priors with Global Sequence Modeling for Remote Sensing Image Scene Classification Yuanhao Tang et.al. 2510.27155 null
2025-10-31 WildfireX-SLAM: A Large-scale Low-altitude RGB-D Dataset for Wildfire SLAM and Beyond Zhicong Sun et.al. 2510.27133 null
2025-11-04 NaviTrace: Evaluating Embodied Navigation of Vision-Language Models Tim Windecker et.al. 2510.26909 null
2025-10-30 Do Vision-Language Models Measure Up? Benchmarking Visual Measurement Reading with MeasureBench Fenfen Lin et.al. 2510.26865 null
2025-11-03 Evaluating Perspectival Biases in Cross-Modal Retrieval Teerapol Saengsukhiran et.al. 2510.26861 null
2025-10-29 Audio-Visual Speech Enhancement In Complex Scenarios With Separation And Dereverberation Joint Modeling Jiarong Du et.al. 2510.26825 null
2025-10-30 Are Video Models Ready as Zero-Shot Reasoners? An Empirical Study with the MME-CoF Benchmark Ziyu Guo et.al. 2510.26802 null
2025-10-30 Scaling Image Geo-Localization to Continent Level Philipp Lindenberger et.al. 2510.26795 null
2025-11-03 ChartAB: A Benchmark for Chart Grounding & Dense Alignment Aniruddh Bansal et.al. 2510.26781 null
2025-10-30 STaMP: Sequence Transformation and Mixed Precision for Low-Precision Activation Quantization Marco Federici et.al. 2510.26771 null
2025-10-30 Fire Behavior Monitoring using MeteoSat Third Generation, FCI-FireDyn algorithm: Rate Of Spread and Burnt Area Dynamics for large fire event Ronan Paugam et.al. 2510.26677 null
2025-10-30 Towards Fine-Grained Vision-Language Alignment for Few-Shot Anomaly Detection Yuanting Fan et.al. 2510.26464 null
2025-10-30 CorVS: Person Identification via Video Trajectory-Sensor Correspondence in a Real-World Warehouse Kazuma Kano et.al. 2510.26369 null
2025-10-30 Weak-Lensing Detection of Intercluster Filaments in Three Nearby Cluster Systems Rahul Shinde et.al. 2510.26318 null
2025-10-30 Self-localization on a 3D map by fusing global and local features from a monocular camera Satoshi Kikuch et.al. 2510.26170 null
2025-10-30 CRAG-MM: Multi-modal Multi-turn Comprehensive RAG Benchmark Jiaqi Wang et.al. 2510.26160 null
2025-10-30 Exploring Object-Aware Attention Guided Frame Association for RGB-D SLAM Ali Caglayan et.al. 2510.26131 null
2025-10-30 Josephson effect with periodic order parameter Klaus Ziegler et.al. 2510.26128 null
2025-10-30 OracleAgent: A Multimodal Reasoning Agent for Oracle Bone Script Research Caoshuo Li et.al. 2510.26114 null
2025-10-29 RADRON: Cooperative Localization of Ionizing Radiation Sources by MAVs with Compton Cameras Petr Stibinger et.al. 2510.26018 null
2025-10-29 DARTS: A Drone-Based AI-Powered Real-Time Traffic Incident Detection System Bai Li et.al. 2510.26004 null
2025-10-31 Larger Hausdorff Dimension in Scanning Pattern Facilitates Mamba-Based Methods in Low-Light Image Enhancement Xinhua Wang et.al. 2510.26001 null
2025-10-29 Brain-IT: Image Reconstruction from fMRI via Brain-Interaction Transformer Roman Beliy et.al. 2510.25976 null
2025-10-26 Towards Piece-by-Piece Explanations for Chess Positions with SHAP Francesco Spinnato et.al. 2510.25775 null
2025-10-29 Retrieval-Augmented Search for Large-Scale Map Collections with ColPali Jamie Mahowald et.al. 2510.25718 null
2025-10-29 Instance-Level Composed Image Retrieval Bill Psomas et.al. 2510.25387 null
2025-10-29 Prompt Estimation from Prototypes for Federated Prompt Tuning of Vision Transformers M Yashwanth et.al. 2510.25372 null
2025-10-29 Development of a new phase-retrieval algorithm from a single-shot image for X-ray schlieren microscopy Ryutaro Nishimura et.al. 2510.25264 null
2025-10-29 Spectral analysis of the stiffness matrix sequence in the approximated Stokes equation Samuele Ferri et.al. 2510.25252 null
2025-10-29 Hybrid Vision Servoing with Deep Alignment and GRU-Based Occlusion Recovery Jee Won Lee et.al. 2510.25233 null
2025-10-29 MMM-Fact: A Multimodal, Multi-Domain Fact-Checking Dataset with Multi-Level Retrieval Difficulty Wenyan Xu et.al. 2510.25120 null
2025-10-29 Visual Diversity and Region-aware Prompt Learning for Zero-shot HOI Detection Chanhyeong Yang et.al. 2510.25094 null
2025-10-28 Defect Mitigation for Robot Arm-based Additive Manufacturing Utilizing Intelligent Control and IOT Matsive Ali et.al. 2510.24994 null
2025-10-28 DualCap: Enhancing Lightweight Image Captioning via Dual Retrieval with Similar Scenes Visual Prompts Binbin Li et.al. 2510.24813 null
2025-10-28 Generative AI for Healthcare: Fundamentals, Challenges, and Perspectives Gang Chen et.al. 2510.24551 null
2025-10-28 GeVI-SLAM: Gravity-Enhanced Stereo Visual Inertial SLAM for Underwater Robots Yuan Shen et.al. 2510.24533 null
2025-10-28 Fast and accurate neural reflectance transformation imaging through knowledge distillation Tinsae G. Dulecha et.al. 2510.24486 null
2025-10-28 Deeply-Conditioned Image Compression via Self-Generated Priors Zhineng Zhao et.al. 2510.24437 null
2025-10-28 Half-Light Radius Measurements of Andromeda Dwarf Satellites from the Isaac Newton Telescope Survey Using Exponential, Plummer, and Sérsic Fits Hedieh Abdollahi et.al. 2510.24377 null
2025-10-28 Decoupling What to Count and Where to See for Referring Expression Counting Yuda Zou et.al. 2510.24374 null
2025-10-28 Sound Source Localization for Spatial Mapping of Surgical Actions in Dynamic Scenes Jonas Hein et.al. 2510.24332 null
2025-10-28 CLFSeg: A Fuzzy-Logic based Solution for Boundary Clarity and Uncertainty Reduction in Medical Image Segmentation Anshul Kaushal et.al. 2510.24202 null
2025-10-28 LagMemo: Language 3D Gaussian Splatting Memory for Multi-modal Open-vocabulary Multi-goal Visual Navigation Haotian Zhou et.al. 2510.24118 null
2025-10-27 Explainable Detection of AI-Generated Images with Artifact Localization Using Faster-Than-Lies and Vision-Language Models for Edge Devices Aryan Mathur et.al. 2510.23775 null
2025-10-27 EgoThinker: Unveiling Egocentric Reasoning with Spatio-Temporal CoT Baoqi Pei et.al. 2510.23569 null
2025-10-27 MDReID: Modality-Decoupled Learning for Any-to-Any Multi-Modal Object Re-Identification Yingying Feng et.al. 2510.23301 null
2025-10-27 Learning from Frustration: Torsor CNNs on Graphs Daiyuan Li et.al. 2510.23288 null
2025-10-27 Moderating Role of Presence in EEG Responses to Visuo-haptic Prediction Error in Virtual Reality Lukas Gehrke et.al. 2510.23262 null
2025-10-27 Accurate and Scalable Multimodal Pathology Retrieval via Attentive Vision-Language Alignment Hongyi Wang et.al. 2510.23224 null
2025-10-27 The Sun as an X-ray star V.: A new method to retrieve coronal filling factors Wilhelmina Maryann Joseph et.al. 2510.23161 null
2025-10-27 Reliable Robotic Task Execution in the Face of Anomalies Bharath Santhanam et.al. 2510.23121 null
2025-10-27 Multi-Stage Field Extraction of Financial Documents with OCR and Compact Vision-Language Models Yichao Jin et.al. 2510.23066 null
2025-10-26 Seeing the Unseen: Towards Zero-Shot Inspection for Wind Turbine Blades using Knowledge-Augmented Vision Language Models Yang Zhang et.al. 2510.22868 null
2025-10-26 Semantic Surgery: Zero-Shot Concept Erasure in Diffusion Models Lexiang Xiong et.al. 2510.22851 null
2025-10-26 Analytical Swarm Chemistry: Characterization and Analysis of Emergent Swarm Behaviors Ricardo Vega et.al. 2510.22821 null
2025-10-26 VEHME: A Vision-Language Model For Evaluating Handwritten Mathematics Expressions Thu Phuong Nguyen et.al. 2510.22798 null
2025-11-01 Jarvis: Towards Personalized AI Assistant via Personal KV-Cache Retrieval Binxiao Xu et.al. 2510.22765 null
2025-10-26 TWC-SLAM: Multi-Agent Cooperative SLAM with Text Semantics and WiFi Features Integration for Similar Indoor Environments Chunyu Li et.al. 2510.22754 null
2025-10-30 Cross-view Localization and Synthesis -- Datasets, Challenges and Opportunities Ningli Xu et.al. 2510.22736 null
2025-10-26 S-Chain: Structured Visual Chain-of-Thought For Medicine Khai Le-Duc et.al. 2510.22728 null
2025-10-26 SpoofTrackBench: Interpretable AI for Spoof-Aware UAV Tracking and Benchmarking Van Le et.al. 2510.22726 null
2025-10-26 LRW-Persian: Lip-reading in the Wild Dataset for Persian Language Zahra Taghizadeh et.al. 2510.22716 null
2025-10-26 SARCLIP: A Vision Language Foundation Model for Semantic Understanding and Target Recognition in SAR Imagery Qiwei Ma et.al. 2510.22665 null
2025-10-26 CLIN-LLM: A Safety-Constrained Hybrid Framework for Clinical Diagnosis and Treatment Generation Md. Mehedi Hasan et.al. 2510.22609 null
2025-10-26 SWAN: Self-supervised Wavelet Neural Network for Hyperspectral Image Unmixing Yassh Ramchandani et.al. 2510.22607 null
2025-10-26 RoGER-SLAM: A Robust Gaussian Splatting SLAM System for Noisy and Low-light Environment Resilience Huilin Yin et.al. 2510.22600 null
2025-10-26 STATUS Bench: A Rigorous Benchmark for Evaluating Object State Understanding in Vision-Language Models Mahiro Ukai et.al. 2510.22571 null
2025-10-26 Structure Aware Image Downscaling G B Kevin Arjun et.al. 2510.22551 null
2025-10-26 Low-Light Image Enhancement Using Gamma Learning And Attention-Enabled Encoder-Decoder Networks Bibhabasu Debnath et.al. 2510.22547 null
2025-10-26 Bag-of-Word-Groups (BoWG): A Robust and Efficient Loop Closure Detection Method Under Perceptual Aliasing Xiang Fei et.al. 2510.22529 null
2025-10-26 Open Multimodal Retrieval-Augmented Factual Image Generation Yang Tian et.al. 2510.22521 null
2025-10-25 Moving Beyond Diffusion: Hierarchy-to-Hierarchy Autoregression for fMRI-to-Image Reconstruction Xu Zhang et.al. 2510.22335 null
2025-10-25 From Slides to Chatbots: Enhancing Large Language Models with University Course Materials Tu Anh Dinh et.al. 2510.22272 null
2025-10-25 Scaling Non-Parametric Sampling with Representation Vincent Lu et.al. 2510.22196 null
2025-10-24 Earth Analogs in Reflected Light: Insights from Early Spectral Characterization in Unconstrained Orbits Arnaud Salvador et.al. 2510.21973 null
2025-10-23 TernaryCLIP: Efficiently Compressing Vision-Language Models with Ternary Weights and Distilled Knowledge Shu-Hao Zhang et.al. 2510.21879 null
2025-10-22 SCoPE VLM: Selective Context Processing for Efficient Document Navigation in Vision-Language Models Gyubeum Lim et.al. 2510.21850 null
2025-10-24 Modest-Align: Data-Efficient Alignment for Vision-Language Models Jiaxiang Liu et.al. 2510.21606 null
2025-10-23 GranViT: A Fine-Grained Vision Model With Autoregressive Perception For MLLMs Guanghao Zheng et.al. 2510.21501 null
2025-10-24 MUVR: A Multi-Modal Untrimmed Video Retrieval Benchmark with Multi-Level Visual Correspondence Yue Feng et.al. 2510.21406 null
2025-10-24 Dynamic Semantic-Aware Correlation Modeling for UAV Tracking Xinyu Zhou et.al. 2510.21351 null
2025-10-24 CT-CLIP: A Multi-modal Fusion Framework for Robust Apple Leaf Disease Recognition in Complex Environments Lemin Liu et.al. 2510.21346 null
2025-10-24 FineRS: Fine-grained Reasoning and Segmentation of Small Objects with Reinforcement Learning Lu Zhang et.al. 2510.21311 null
2025-10-24 Underwater Visual-Inertial-Acoustic-Depth SLAM with DVL Preintegration for Degraded Environments Shuoshuo Ding et.al. 2510.21215 null
2025-10-24 A visual big data system for the prediction of weather-related variables: Jordan-Spain case study Shadi Aljawarneh et.al. 2510.21176 null
2025-10-24 MedAlign: A Synergistic Framework of Multimodal Preference Optimization and Federated Meta-Cognitive Reasoning Siyong Chen et.al. 2510.21093 null
2025-10-27 LayerComposer: Interactive Personalized T2I via Spatially-Aware Layered Canvas Guocheng Gordon Qian et.al. 2510.20820 null
2025-10-23 Small Drafts, Big Verdict: Information-Intensive Visual Reasoning via Speculation Yuhan Liu et.al. 2510.20812 null
2025-10-23 Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence Jiahao Meng et.al. 2510.20579 null
2025-10-23 Deep Learning-Powered Visual SLAM Aimed at Assisting Visually Impaired Navigation Marziyeh Bamdad et.al. 2510.20549 null
2025-10-24 Robust Preference Alignment via Directional Neighborhood Consensus Ruochen Mao et.al. 2510.20498 null
2025-10-23 Degradation-Aware Cooperative Multi-Modal GNSS-Denied Localization Leveraging LiDAR-Based Robot Detections Václav Pritzl et.al. 2510.20480 null
2025-11-20 Conan: Progressive Learning to Reason Like a Detective over Multi-Scale Visual Evidence Kun Ouyang et.al. 2510.20470 null
2025-10-23 Mitigating Cross-modal Representation Bias for Multicultural Image-to-Recipe Retrieval Qing Wang et.al. 2510.20393 null
2025-10-25 DB-FGA-Net: Dual Backbone Frequency Gated Attention Network for Multi-Class Brain Tumor Classification with Grad-CAM Interpretability Saraf Anzum Shreya et.al. 2510.20299 null
2025-10-23 A Parameter-Efficient Mixture-of-Experts Framework for Cross-Modal Geo-Localization LinFeng Li et.al. 2510.20291 null
2025-10-23 Multimedia-Aware Question Answering: A Review of Retrieval and Cross-Modal Reasoning Architectures Rahul Raja et.al. 2510.20193 null
2025-10-23 PathFormer: A Transformer with 3D Grid Constraints for Digital Twin Robot-Arm Trajectory Generation Ahmed Alanazi et.al. 2510.20161 null
2025-10-27 "Learning Together": AI-Mediated Support for Parental Involvement in Everyday Learning Yao Li et.al. 2510.20123 null
2025-10-24 BioCAP: Exploiting Synthetic Captions Beyond Labels in Biological Foundation Models Ziheng Zhang et.al. 2510.20095 null
2025-10-22 Exposing Blindspots: Cultural Bias Evaluation in Generative Image Models Huichan Seo et.al. 2510.20042 null
2025-10-22 Automating Iconclass: LLMs and RAG for Large-Scale Classification of Religious Woodcuts Drew B. Thomas et.al. 2510.19986 null
2025-10-22 Compressing Biology: Evaluating the Stable Diffusion VAE for Phenotypic Drug Discovery Télio Cropsal et.al. 2510.19887 null
2025-10-22 Multilayer Perceptron Neural Network Model: A Novel Approach for LFP Contrast Sensitivity Tuning Sahar Maleki et.al. 2510.19636 null
2025-10-22 XBench: A Comprehensive Benchmark for Visual-Language Explanations in Chest Radiography Haozhe Luo et.al. 2510.19599 null
2025-10-22 Decomposed Attention Fusion in MLLMs for Training-Free Video Reasoning Segmentation Su Ho Han et.al. 2510.19592 null
2025-10-22 AegisRF: Adversarial Perturbations Guided with Sensitivity for Protecting Intellectual Property of Neural Radiance Fields Woo Jae Kim et.al. 2510.19371 null
2025-10-22 Exploring Scale Shift in Crowd Localization under the Context of Domain Generalization Juncheng Wang et.al. 2510.19330 null
2025-10-22 Step-Aware Residual-Guided Diffusion for EEG Spatial Super-Resolution Hongjun Liu et.al. 2510.19166 null
2025-10-21 UniHPR: Unified Human Pose Representation via Singular Value Contrastive Learning Zhongyu Jiang et.al. 2510.19078 null
2025-10-21 Macroscopic EEG Reveals Discriminative Low-Frequency Oscillations in Plan-to-Grasp Visuomotor Tasks Anna Cetera et.al. 2510.19057 null
2025-10-21 Visually Comparing Graph Vertex Ordering Algorithms through Geometrical and Topological Approaches Karelia Salinas et.al. 2510.19009 null
2025-10-21 Underwater Dense Mapping with the First Compact 3D Sonar Chinmay Burgul et.al. 2510.18991 null
2025-10-18 Small Language Models Offer Significant Potential for Science Community Jian Zhang et.al. 2510.18890 null
2025-10-21 FedDEAP: Adaptive Dual-Prompt Tuning for Multi-Domain Federated Learning Yubin Zheng et.al. 2510.18837 null
2025-10-21 UltraGen: High-Resolution Video Generation with Hierarchical Attention Teng Hu et.al. 2510.18775 null
2025-10-21 Topoformer: brain-like topographic organization in Transformer language models through spatial querying and reweighting Taha Binhuraib et.al. 2510.18745 null
2025-10-21 SSD: Spatial-Semantic Head Decoupling for Efficient Autoregressive Image Generation Siyong Jian et.al. 2510.18716 null
2025-10-21 Exploring a Unified Vision-Centric Contrastive Alternatives on Multi-Modal Web Documents Yiqi Lin et.al. 2510.18703 null
2025-10-21 CovMatch: Cross-Covariance Guided Multimodal Dataset Distillation with Trainable Text Encoder Yongmin Lee et.al. 2510.18583 null
2025-11-12 Large deviations in the many-body localization transition: The case of the random-field XXZ chain Greivin Alfaro Miranda et.al. 2510.18545 null
2025-10-21 RayPose: Ray Bundling Diffusion for Template Views in Unseen 6D Object Pose Estimation Junwen Huang et.al. 2510.18521 null
2025-10-21 Zero-Shot Vehicle Model Recognition via Text-Based Retrieval-Augmented Generation Wei-Chia Chang et.al. 2510.18502 null
2025-10-21 Beyond Single Images: Retrieval Self-Augmented Unsupervised Camouflaged Object Detection Ji Du et.al. 2510.18437 null
2025-10-21 ImageGem: In-the-wild Generative Image Interaction Dataset for Generative Model Personalization Yuanhe Guo et.al. 2510.18433 null
2025-10-21 Med-VRAgent: A Framework for Medical Visual Reasoning-Enhanced Agents Guangfu Guo et.al. 2510.18424 null
2025-10-21 Proactive Reasoning-with-Retrieval Framework for Medical Multimodal Large Language Models Lehan Wang et.al. 2510.18303 null
2025-10-22 Text or Pixels? It Takes Half: On the Token Efficiency of Visual Text Inputs in Multimodal LLMs Yanhong Li et.al. 2510.18279 null
2025-10-21 TreeFedDG: Alleviating Global Drift in Federated Domain Generalization for Medical Image Segmentation Yucheng Song et.al. 2510.18268 null
2025-10-21 UWBench: A Comprehensive Vision-Language Benchmark for Underwater Understanding Da Zhang et.al. 2510.18262 null
2025-10-21 DualHash: A Stochastic Primal-Dual Algorithm with Theoretical Guarantee for Deep Hashing Luxuan Li et.al. 2510.18218 null
2025-10-20 AION-1: Omnimodal Foundation Model for Astronomical Sciences Liam Parker et.al. 2510.17960 null
2025-10-13 Pre to Post-Treatment Glioblastoma MRI Prediction using a Latent Diffusion Model Alexandre G. Leclercq et.al. 2510.17851 null
2025-09-30 Micromechanical characterisation of osteoarthritic subchondral bone by micropillar compression Samuel McPhee et.al. 2510.17824 null
2025-10-20 SparseVILA: Decoupling Visual Sparsity for Efficient VLM Inference Samir Khaki et.al. 2510.17777 null
2025-10-20 Seeing but Not Believing: Probing the Disconnect Between Visual Attention and Answer Correctness in VLMs Zhining Liu et.al. 2510.17771 null
2025-10-20 Joint Multi-Condition Representation Modelling via Matrix Factorisation for Visual Place Recognition Timur Ismagilov et.al. 2510.17739 null
2025-10-20 Multilingual Text-to-Image Person Retrieval via Bidirectional Relation Reasoning and Aligning Min Cao et.al. 2510.17685 null
2025-10-20 MIRAGE: Agentic Framework for Multimodal Misinformation Detection with Web-Grounded Reasoning Mir Nafis Sharear Shopnil et.al. 2510.17590 null
2025-10-20 BenCao: An Instruction-Tuned Large Language Model for Traditional Chinese Medicine Jiacheng Xie et.al. 2510.17415 null
2025-10-20 Model Metamers Reveal Invariances in Graph Neural Networks Wei Xu et.al. 2510.17378 null
2025-10-20 Towards Mixed-Modal Retrieval for Universal Retrieval-Augmented Generation Chenghao Zhang et.al. 2510.17354 null
2025-10-21 LongInsightBench: A Comprehensive Benchmark for Evaluating Omni-Modal Models on Human-Centric Long-Video Understanding ZhaoYang Han et.al. 2510.17305 null
2025-10-20 Performance Evaluation of an Integrated System for Visible Light Communication and Positioning Using an Event Camera Ryota Soga et.al. 2510.17203 null
2025-10-20 Generation then Reconstruction: Accelerating Masked Autoregressive Models via Two-Stage Sampling Feihong Yan et.al. 2510.17171 null
2025-10-22 OmniVIC: A Self-Improving Variable Impedance Controller with Vision-Language In-Context Learning for Safe Robotic Manipulation Heng Zhang et.al. 2510.17150 null
2025-10-19 Person Re-Identification via Generalized Class Prototypes Md Ahmed Al Muzaddid et.al. 2510.17043 null
2025-10-19 A Low-Complexity View Synthesis Distortion Estimation Method for 3D Video with Large Baseline Considerations Chongyuan Bi et.al. 2510.17037 null
2025-10-19 SAKE: Towards Editing Auditory Attribute Knowledge of Large Audio-Language Models Chih-Kai Yang et.al. 2510.16917 null
2025-10-19 ArmFormer: Lightweight Transformer Architecture for Real-Time Multi-Class Weapon Segmentation and Classification Akhila Kambhatla et.al. 2510.16854 null
2025-11-24 ReefNet: A Large scale, Taxonomically Enriched Dataset and Benchmark for Hard Coral Classification Yahia Battach et.al. 2510.16822 null
2025-10-19 An Efficient Framework for Whole-Page Reranking via Single-Modal Supervision Zishuai Zhang et.al. 2510.16803 null
2025-10-19 Region in Context: Text-condition Image editing with Human-like semantic reasoning Thuy Phuong Vu et.al. 2510.16772 null
2025-10-19 See or Say Graphs: Agent-Driven Scalable Graph Understanding with Vision-Language Models Shuo Han et.al. 2510.16769 null
2025-10-19 Exact Nearest-Neighbor Search on Energy-Efficient FPGA Devices Patrizio Dazzi et.al. 2510.16736 null
2025-10-27 UKANFormer: Noise-Robust Semantic Segmentation for Coral Reef Mapping via a Kolmogorov-Arnold Network-Transformer Hybrid Tianyang Dou et.al. 2510.16730 null
2025-10-18 Safire: Similarity Framework for Visualization Retrieval Huyen N. Nguyen et.al. 2510.16662 null
2025-10-18 A Deep Learning Framework for Real-Time Image Processing in Medical Diagnostics: Enhancing Accuracy and Speed in Clinical Applications Melika Filvantorkaman et.al. 2510.16611 null
2025-10-18 Image Categorization and Search via a GAT Autoencoder and Representative Models Duygu Sap et.al. 2510.16514 null
2025-10-18 RefAtomNet++: Advancing Referring Atomic Video Action Recognition using Semantic Retrieval based Multi-Trajectory Mamba Kunyu Peng et.al. 2510.16444 null
2025-10-18 RL makes MLLMs see better than SFT Junha Song et.al. 2510.16333 null
2025-10-17 Out-of-Equilibrium Dynamics in a U(1) Lattice Gauge Theory via Local Information Flows: Scattering and String Breaking Claudia Artiaco et.al. 2510.16101 null
2025-10-14 Frequency domain laser ultrasound microscopy for nanometric layer thickness imaging with GHz elastic plate resonances Martin Ryzy et.al. 2510.16000 null
2025-10-27 ESCA: Contextualizing Embodied Agents via Scene-Graph Generation Jiani Huang et.al. 2510.15963 null
2025-10-17 Memory-SAM: Human-Prompt-Free Tongue Segmentation via Retrieval-to-Prompt Joongwon Chae et.al. 2510.15849 null
2025-10-17 FlexiReID: Adaptive Mixture of Expert for Multi-Modal Person Re-Identification Zhen Sun et.al. 2510.15595 null
2025-10-17 MCA: Modality Composition Awareness for Robust Composed Multimodal Retrieval Qiyu Wu et.al. 2510.15543 null
2025-10-17 DPTrack: Directional Kernel-Guided Prompt Learning for Robust Nighttime Aerial Tracking Zhiqiang Zhu et.al. 2510.15449 null
2025-10-17 Select Less, Reason More: Prioritizing Evidence Purity for Video Reasoning Xuchen Li et.al. 2510.15440 null
2025-10-17 Semantic4Safety: Causal Insights from Zero-shot Street View Imagery Segmentation for Urban Road Safety Huan Chen et.al. 2510.15434 null
2025-11-07 Fine-Tuning MedGemma for Clinical Captioning to Enhance Multimodal RAG over Malaysia CPGs Lee Qi Zun et.al. 2510.15418 null
2025-10-17 PFGS: Pose-Fused 3D Gaussian Splatting for Complete Multi-Pose Object Reconstruction Ting-Yu Yen et.al. 2510.15386 null
2025-10-17 WebGen-V Bench: Structured Representation for Enhancing Visual Design in LLM-based Web Generation and Evaluation Kuang-Da Wang et.al. 2510.15306 null
2025-10-17 Post-Processing Methods for Improving Accuracy in MRI Inpainting Nishad Kulkarni et.al. 2510.15282 null
2025-10-17 CuSfM: CUDA-Accelerated Structure-from-Motion Jingrui Yu et.al. 2510.15271 null
2025-11-02 Experience-Driven Exploration for Efficient API-Free AI Agents Chenwei Tang et.al. 2510.15259 null
2025-10-17 LVI-Q: Robust LiDAR-Visual-Inertial-Kinematic Odometry for Quadruped Robots Using Tightly-Coupled and Efficient Alternating Optimization Kevin Christiansen Marsim et.al. 2510.15220 null
2025-10-16 TGT: Text-Grounded Trajectories for Locally Controlled Video Generation Guofeng Zhang et.al. 2510.15104 null
2025-10-16 Comprehensive language-image pre-training for 3D medical image understanding Tassilo Wald et.al. 2510.15042 null
2025-10-16 NANO3D: A Training-Free Approach for Efficient 3D Editing Without Masks Junliang Ye et.al. 2510.15019 null
2025-10-16 ChangingGrounding: 3D Visual Grounding in Changing Scenes Miao Hu et.al. 2510.14965 null
2025-10-16 RainDiff: End-to-end Precipitation Nowcasting Via Token-wise Attention Diffusion Thao Nguyen et.al. 2510.14962 null
2025-10-16 CoT-PL: Visual Chain-of-Thought Reasoning Meets Pseudo-Labeling for Open-Vocabulary Object Detection Hojun Choi et.al. 2510.14792 null
2025-10-16 Improving Cybercrime Detection and Digital Forensics Investigations with Artificial Intelligence Silvia Lucia Sanna et.al. 2510.14638 null
2025-10-16 Multimodal RAG for Unstructured Data: Leveraging Modality-Aware Knowledge Graphs with Hybrid Retrieval Rashmi R et.al. 2510.14592 null
2025-10-16 Talking Points: Describing and Localizing Pixels Matan Rusanovsky et.al. 2510.14583 null
2025-10-16 Acquisition of interpretable domain information during brain MR image harmonization for content-based image retrieval Keima Abe et.al. 2510.14535 null
2025-11-24 Structured Random Models for Phase Retrieval with Optical Diffusers Zhiyuan Hu et.al. 2510.14490 null
2025-10-16 Spatial Preference Rewarding for MLLMs Spatial Understanding Han Qiu et.al. 2510.14374 null
2025-10-14 K-frames: Scene-Driven Any-k Keyframe Selection for long video understanding Yifeng Yao et.al. 2510.13891 null
2025-10-12 Multimodal Retrieval-Augmented Generation with Large Language Models for Medical VQA A H M Rezaul Karim et.al. 2510.13856 null
2025-09-19 GQVis: A Dataset of Genomics Data Questions and Visualizations for Generative AI Skylar Sargent Walters et.al. 2510.13816 null
2025-10-15 Adaptive Visual Conditioning for Semantic Consistency in Diffusion-Based Story Continuation Seyed Mohammad Mousavi et.al. 2510.13787 null
2025-10-16 NExT-OMNI: Towards Any-to-Any Omnimodal Foundation Models with Discrete Flow Matching Run Luo et.al. 2510.13721 null
2025-10-15 Jacobian-Based Interpretation of Nonlinear Neural Encoding Model Xiaohui Gao et.al. 2510.13688 null
2025-11-11 AVAR-Net: A Lightweight Audio-Visual Anomaly Recognition Framework with a Benchmark Dataset Amjid Ali et.al. 2510.13630 null
2025-10-15 Characterizing Lidar Point-Cloud Adversities Using a Vector Field Visualization Daniel Choate et.al. 2510.13619 null
2025-10-15 Accelerated Feature Detectors for Visual SLAM: A Comparative Study of FPGA vs GPU Ruiqi Ye et.al. 2510.13546 null
2025-10-15 Through the Lens of Doubt: Robust and Efficient Uncertainty Estimation for Visual Place Recognition Emily Miller et.al. 2510.13464 null
2025-10-15 Improving Visual Recommendation on E-commerce Platforms Using Vision-Language Models Yuki Yada et.al. 2510.13359 null
2025-10-15 UniVector: Unified Vector Extraction via Instance-Geometry Interaction Yinglong Yan et.al. 2510.13234 null
2025-10-15 OS-HGAdapter: Open Semantic Hypergraph Adapter for Large Language Models Assisted Entropy-Enhanced Image-Text Alignment Rongjun Chen et.al. 2510.13131 null
2025-10-23 Epistemic-aware Vision-Language Foundation Model for Fetal Ultrasound Interpretation Xiao He et.al. 2510.12953 null
2025-10-14 DeepMMSearch-R1: Empowering Multimodal LLMs in Multimodal Web Search Kartik Narayan et.al. 2510.12801 null
2025-10-14 SRUM: Fine-Grained Self-Rewarding for Unified Multimodal Models Weiyang Jin et.al. 2510.12784 null
2025-10-24 E-MoFlow: Learning Egomotion and Optical Flow from Event Data via Implicit Regularization Wenpu Li et.al. 2510.12753 null
2025-10-14 A Text-Image Fusion Method with Data Augmentation Capabilities for Referring Medical Image Segmentation Shurong Chai et.al. 2510.12482 null
2025-10-14 SMEC: Rethinking Matryoshka Representation Learning for Retrieval Embedding Compression Biao Zhang et.al. 2510.12474 null
2025-10-14 SpineBench: Benchmarking Multimodal LLMs for Spinal Pathology Analysis Chenghanyu Zhang et.al. 2510.12267 null
2025-10-14 Local Background Features Matter in Out-of-Distribution Detection Jinlun Ye et.al. 2510.12259 null
2025-10-14 SDGraph: Multi-Level Sketch Representation Learning by Sparse-Dense Graph Architecture Xi Cheng et.al. 2510.12192 null
2025-10-14 ImageSentinel: Protecting Visual Datasets from Unauthorized Retrieval-Augmented Image Generation Ziyuan Luo et.al. 2510.12119 null
2025-10-13 Embedding the Teacher: Distilling vLLM Preferences for Scalable Image Retrieval Eric He et.al. 2510.12014 null
2025-10-11 Benefits and Limitations of Using GenAI for Political Education and Municipal Elections Raphael Fischer et.al. 2510.11749 null
2025-10-13 High-resolution Photo Enhancement in Real-time: A Laplacian Pyramid Network Feng Zhang et.al. 2510.11613 null
2025-10-14 Massive Activations are the Key to Local Detail Synthesis in Diffusion Transformers Chaofan Gan et.al. 2510.11538 null
2025-10-13 A Modular AIoT Framework for Low-Latency Real-Time Robotic Teleoperation in Smart Cities Shih-Chieh Sun et.al. 2510.11421 null
2025-10-13 MMAP: A Multi-Magnification and Prototype-Aware Architecture for Predicting Spatial Gene Expression Hai Dang Nguyen et.al. 2510.11344 null
2025-10-13 A Large-Language-Model Assisted Automated Scale Bar Detection and Extraction Framework for Scanning Electron Microscopic Images Yuxuan Chen et.al. 2510.11260 null
2025-10-13 PhysHSI: Towards a Real-World Generalizable and Natural Humanoid-Scene Interaction System Huayi Wang et.al. 2510.11072 null
2025-10-13 Impact of elastic inhomogeneity on collective dynamical properties investigated by field theoretical description in real space Cunyuan Jiang et.al. 2510.10928 null
2025-10-13 SceneTextStylizer: A Training-Free Scene Text Style Transfer Framework with Diffusion Model Honghui Yuan et.al. 2510.10910 null
2025-10-13 Spatial Correlation of Superconducting and Pseudogap Dynamics in a Bi-based Cuprate T. Shimizu et.al. 2510.10906 null
2025-10-13 Where on Earth? A Vision-Language Benchmark for Probing Model Geolocation Skills Across Scales Zhaofang Qian et.al. 2510.10880 null
2025-10-12 OmniVideoBench: Towards Audio-Visual Understanding Evaluation for Omni MLLMs Caorui Li et.al. 2510.10689 null
2025-10-12 A Simple and Better Baseline for Visual Grounding Jingchao Wang et.al. 2510.10587 null
2025-10-12 BitMar: Low-Bit Multimodal Fusion with Episodic Memory for Edge Devices Euhid Aman et.al. 2510.10560 null
2025-10-12 Taming a Retrieval Framework to Read Images in Humanlike Manner for Augmenting Generation of MLLMs Suyang Xi et.al. 2510.10426 null
2025-10-11 B2N3D: Progressive Learning from Binary to N-ary Relationships for 3D Object Grounding Feng Xiao et.al. 2510.10194 null
2025-10-11 TCMA: Text-Conditioned Multi-granularity Alignment for Drone Cross-Modal Text-Video Retrieval Zixu Zhao et.al. 2510.10180 null
2025-10-11 ViConEx-Med: Visual Concept Explainability via Multi-Concept Token Transformer for Medical Image Analysis Cristiano Patrício et.al. 2510.10174 null
2025-10-11 Cooperative Pseudo Labeling for Unsupervised Federated Classification Kuangpu Guo et.al. 2510.10100 null
2025-10-11 Think Twice to See More: Iterative Visual Reasoning in Medical VLMs Kaitao Chen et.al. 2510.10052 null
2025-10-11 Complementary and Contrastive Learning for Audio-Visual Segmentation Sitong Gong et.al. 2510.10051 null
2025-10-11 Scaling Traffic Insights with AI and Language Model-Powered Camera Systems for Data-Driven Transportation Decision Making Fan Zuo et.al. 2510.09981 null
2025-10-14 J-RAS: Enhancing Medical Image Segmentation via Retrieval-Augmented Joint Training Salma J. Ahmed et.al. 2510.09953 null
2025-10-15 Egocentric Visual Navigation through Hippocampal Sequences Xiao-Xiong Lin et.al. 2510.09951 null
2025-10-10 The Geometry of Reasoning: Flowing Logics in Representation Space Yufa Zhou et.al. 2510.09782 null
2025-10-10 VisRAG 2.0: Evidence-Guided Multi-Image Reasoning in Visual Retrieval-Augmented Generation Yubo Sun et.al. 2510.09733 null
2025-10-07 Semantic-Cohesive Knowledge Distillation for Deep Cross-modal Hashing Changchang Sun et.al. 2510.09664 null
2025-10-10 MRMR: A Realistic and Expert-Level Multidisciplinary Benchmark for Reasoning-Intensive Multimodal Retrieval Siyue Zhang et.al. 2510.09510 null
2025-10-10 Diagonal Artifacts in Samsung Images: PRNU Challenges and Solutions David Vázquez-Padín et.al. 2510.09509 null
2025-10-10 Dynamic Weight-based Temporal Aggregation for Low-light Video Enhancement Ruirui Lin et.al. 2510.09450 null
2025-10-10 Mono4DEditor: Text-Driven 4D Scene Editing from Monocular Video via Point-Level Localization of Language-Embedded Gaussians Jin-Chuan Shi et.al. 2510.09438 null
2025-10-10 Sub-Diffraction Chromatin Domains: Architecture, Regulation, and Functional Roles in Nuclear Organization Vinayak Vinayak et.al. 2510.09375 null
2025-10-10 Hybrid-grained Feature Aggregation with Coarse-to-fine Language Guidance for Self-supervised Monocular Depth Estimation Wenyao Zhang et.al. 2510.09320 null
2025-10-10 Instance-Level Generation for Representation Learning Yankun Wu et.al. 2510.09171 null
2025-10-10 Robust Visual Teach-and-Repeat Navigation with Flexible Topo-metric Graph Map Representation Jikai Wang et.al. 2510.09089 null
2025-10-10 Visual Anomaly Detection for Reliable Robotic Implantation of Flexible Microelectrode Array Yitong Chen et.al. 2510.09071 null
2025-10-10 HandEval: Taking the First Step Towards Hand Quality Evaluation in Generated Images Zichuan Wang et.al. 2510.08978 null
2025-10-10 Hierarchical Scheduling for Multi-Vector Image Retrieval Maoliang Li et.al. 2510.08976 null
2025-11-19 FATHOMS-RAG: A Framework for the Assessment of Thinking and Observation in Multimodal Systems that use Retrieval Augmented Generation Samuel Hildebrand et.al. 2510.08945 null
2025-10-09 Identifying Video Game Debugging Bottlenecks: An Industry Perspective Carlos Pinto Gomez et.al. 2510.08834 null
2025-10-09 Whole Body Model Predictive Control for Spin-Aware Quadrupedal Table Tennis David Nguyen et.al. 2510.08754 null
2025-10-08 Into the Rabbit Hull: From Task-Relevant Concepts in DINO to Minkowski Geometry Thomas Fel et.al. 2510.08638 null
2025-10-11 MultiCOIN: Multi-Modal COntrollable Video INbetweening Maham Tanveer et.al. 2510.08561 null
2025-10-09 X2Video: Adapting Diffusion Models for Multimodal Controllable Neural Video Rendering Zhitong Huang et.al. 2510.08530 null
2025-10-09 Observation of electromagnons in a monolayer multiferroic Mohammad Amini et.al. 2510.08253 null
2025-10-09 DarkHash: A Data-Free Backdoor Attack Against Deep Hashing Ziqi Zhou et.al. 2510.08094 null
2025-10-09 CIR-CoT: Towards Interpretable Composed Image Retrieval via End-to-End Chain-of-Thought Reasoning Weihuang Lin et.al. 2510.08003 null
2025-10-09 MARC: Memory-Augmented RL Token Compression for Efficient Video Understanding Peiran Wu et.al. 2510.07915 null
2025-10-09 RePainter: Empowering E-commerce Object Removal via Spatial-matting Reinforcement Learning Zipeng Guo et.al. 2510.07721 null
2025-10-09 Multimodal Safety Evaluation in Generative Agent Social Simulations Alhim Vera et.al. 2510.07709 null
2025-10-09 Mutual Learning for Hashing: Unlocking Strong Hash Functions from Weak Supervision Xiaoxu Ma et.al. 2510.07703 null
2025-10-16 Ctrl-VI: Controllable Video Synthesis via Variational Inference Haoyi Duan et.al. 2510.07670 null
2025-10-08 SpecGuard: Spectral Projection-based Advanced Invisible Watermarking Inzamamul Alam et.al. 2510.07302 null
2025-10-10 DPL: Depth-only Perceptive Humanoid Locomotion via Realistic Depth Synthesis and Cross-Attention Terrain Reconstruction Jingkai Sun et.al. 2510.07152 null
2025-10-08 ELMUR: External Layer Memory with Update/Rewrite for Long-Horizon RL Egor Cherepanov et.al. 2510.07151 null
2025-11-14 Concept Retrieval -- What and How? Ori Nizan et.al. 2510.07058 null
2025-10-08 High-Performance Imaging in a Dilution Refrigerator Timo Eikelmann et.al. 2510.07054 null
2025-10-08 Introspection in Learned Semantic Scene Graph Localisation Manshika Charvi Bissessur et.al. 2510.07053 null
2025-10-08 IAR2: Improving Autoregressive Visual Generation with Semantic-Detail Associated Token Prediction Ran Yi et.al. 2510.06928 null
2025-10-08 M3Retrieve: Benchmarking Multimodal Retrieval for Medicine Arkadeep Acharya et.al. 2510.06888 null
2025-10-08 Versatile 3D reconstruction framework for hard X-ray grazing incidence imaging of nanostructures Luke Besley et.al. 2510.06877 null
2025-10-08 Multi-hop Deep Joint Source-Channel Coding with Deep Hash Distillation for Semantically Aligned Image Retrieval Didrik Bergström et.al. 2510.06868 null
2025-10-08 Efficient Discriminative Joint Encoders for Large Scale Vision-Language Reranking Mitchell Keren Taraday et.al. 2510.06820 null
2025-10-08 Capture and Interact: Rapid 3D Object Acquisition and Rendering with Gaussian Splatting in Unity Islomjon Shukhratov et.al. 2510.06802 null
2025-10-08 DeRainMamba: A Frequency-Aware State Space Model with Detail Enhancement for Image Deraining Zhiliang Zhu et.al. 2510.06746 null
2025-10-08 ToolMem: Enhancing Multimodal Agents with Learnable Tool Capability Memory Yunzhong Xiao et.al. 2510.06664 null
2025-11-15 Implicit-Knowledge Visual Question Answering with Structured Reasoning Traces Zhihao Wen et.al. 2510.06638 null
2025-10-07 TDiff: Thermal Plug-And-Play Prior with Patch-Based Diffusion Piyush Dashpute et.al. 2510.06460 null
2025-10-07 Vi-TacMan: Articulated Object Manipulation via Vision and Touch Leiyao Cui et.al. 2510.06339 null
2025-10-05 A Mixed-Methods Analysis of Repression and Mobilization in Bangladesh's July Revolution Using Machine Learning and Statistical Modeling Md. Saiful Bari Siddiqui et.al. 2510.06264 null
2025-10-09 A Multimodal GUI Architecture for Interfacing with LLM-Based Conversational Assistants Hans G. W. van Dam et.al. 2510.06223 null
2025-10-07 Human3R: Everyone Everywhere All at Once Yue Chen et.al. 2510.06219 null
2025-10-07 DYMO-Hair: Generalizable Volumetric Dynamics Modeling for Robot Hair Manipulation Chengyang Zhao et.al. 2510.06199 null
**2025-10-0
