Usage instructions: here
Table of Contents
| Publish Date | Title | Authors | Code | |
|---|---|---|---|---|
| 2025-12-04 | TEMPO-VINE: A Multi-Temporal Sensor Fusion Dataset for Localization and Mapping in Vineyards | Mauro Martini et.al. | 2512.04772 | null |
| 2025-12-03 | What Is The Best 3D Scene Representation for Robotics? From Geometric to Foundation Models | Tianchen Deng et.al. | 2512.03422 | null |
| 2025-12-02 | VIGS-SLAM: Visual Inertial Gaussian Splatting SLAM | Zihan Zhu et.al. | 2512.02293 | null |
| 2025-12-01 | KM-ViPE: Online Tightly Coupled Vision-Language-Geometry Fusion for Open-Vocabulary Semantic SLAM | Zaid Nasser et.al. | 2512.01889 | null |
| 2025-12-01 | Register Any Point: Scaling 3D Point Cloud Registration by Flow Matching | Yue Pan et.al. | 2512.01850 | null |
| 2025-12-01 | AgriLiRa4D: A Multi-Sensor UAV Dataset for Robust SLAM in Challenging Agricultural Fields | Zhihao Zhan et.al. | 2512.01753 | null |
| 2025-12-01 | EGG-Fusion: Efficient 3D Reconstruction with Geometry-aware Gaussian Surfel on the Fly | Xiaokun Pan et.al. | 2512.01296 | null |
| 2025-11-30 | Integration of UWB Radar on Mobile Robots for Continuous Obstacle and Environment Mapping | Adelina Giurea et.al. | 2512.01018 | null |
| 2025-11-30 | EAG3R: Event-Augmented 3D Geometry Estimation for Dynamic and Extreme-Lighting Scenes | Xiaoshan Wu et.al. | 2512.00771 | null |
| 2025-11-29 | Odometry Without Correspondence from Inertially Constrained Ruled Surfaces | Chenqi Zhu et.al. | 2512.00327 | null |
| 2025-11-26 | Dual-Agent Reinforcement Learning for Adaptive and Cost-Aware Visual-Inertial Odometry | Feiyang Pan et.al. | 2511.21083 | null |
| 2025-11-25 | Estimating Fog Parameters from a Sequence of Stereo Images | Yining Ding et.al. | 2511.20865 | null |
| 2025-11-25 | The origin of B-type runaway stars based on kinematics | Yanjun Guo et.al. | 2511.20566 | null |
| 2025-11-25 | Metric, inertially aligned monocular state estimation via kinetodynamic priors | Jiaxin Liu et.al. | 2511.20496 | null |
| 2025-11-25 | AMB3R: Accurate Feed-forward Metric-scale 3D Reconstruction with Backend | Hengyi Wang et.al. | 2511.20343 | null |
| 2025-11-25 | Stellar Parameters of BOSS M dwarfs in SDSS-V DR19 | Dan Qiu et.al. | 2511.20005 | null |
| 2025-11-26 | Multi-Agent Monocular Dense SLAM With 3D Reconstruction Priors | Yuchen Zhou et.al. | 2511.19031 | null |
| 2025-11-24 | AutoOdom: Learning Auto-regressive Proprioceptive Odometry for Legged Locomotion | Changsheng Luo et.al. | 2511.18857 | null |
| 2025-11-24 | SP-VINS: A Hybrid Stereo Visual Inertial Navigation System based on Implicit Environmental Map | Xueyu Du et.al. | 2511.18756 | null |
| 2025-11-24 | Splatonic: Architecture Support for 3D Gaussian Splatting SLAM via Sparse Processing | Xiaotong Huang et.al. | 2511.18755 | null |
| 2025-11-24 | Stable Multi-Drone GNSS Tracking System for Marine Robots | Shuo Wen et.al. | 2511.18694 | null |
| 2025-11-23 | Gaze Beyond the Frame: Forecasting Egocentric 3D Visual Span | Heeseung Yun et.al. | 2511.18470 | null |
| 2025-11-22 | Unobservable Subspace Evolution and Alignment for Consistent Visual-Inertial Navigation | Chungeng Tian et.al. | 2511.17992 | null |
| 2025-11-21 | Target-Bench: Can World Models Achieve Mapless Path Planning with Semantic Targets? | Dingrui Wang et.al. | 2511.17792 | null |
| 2025-11-21 | IndustryNav: Exploring Spatial Reasoning of Embodied Agents in Dynamic Industrial Navigation | Yifan Li et.al. | 2511.17384 | null |
| 2025-11-21 | MonoSpheres: Large-Scale Monocular SLAM-Based UAV Exploration through Perception-Coupled Mapping and Planning | Tomáš Musil et.al. | 2511.17299 | null |
| 2025-11-21 | SING3R-SLAM: Submap-based Indoor Monocular Gaussian SLAM with 3D Reconstruction Priors | Kunyi Li et.al. | 2511.17207 | null |
| 2025-11-20 | CRISTAL: Real-time Camera Registration in Static LiDAR Scans using Neural Rendering | Joni Vanherck et.al. | 2511.16349 | null |
| 2025-11-20 | Building temporally coherent 3D maps with VGGT for memory-efficient Semantic SLAM | Gergely Dinya et.al. | 2511.16282 | null |
| 2025-11-20 | LEGO-SLAM: Language-Embedded Gaussian Optimization SLAM | Sibaek Lee et.al. | 2511.16144 | null |
| 2025-11-20 | Rad-GS: Radar-Vision Integration for 3D Gaussian Splatting SLAM in Outdoor Environments | Renxiang Xiao et.al. | 2511.16091 | null |
| 2025-11-20 | Semantic Glitch: Agency and Artistry in an Autonomous Pixel Cloud | Qing Zhang et.al. | 2511.16048 | null |
| 2025-11-11 | Real-time Point Cloud Data Transmission via L4S for 5G-Edge-Assisted Robotics | Gerasimos Damigos et.al. | 2511.15677 | null |
| 2025-11-19 | Learning from Mistakes: Loss-Aware Memory Enhanced Continual Learning for LiDAR Place Recognition | Xufei Wang et.al. | 2511.15597 | null |
| 2025-11-18 | A visual study of ICP variants for Lidar Odometry | Sebastian Dingler et.al. | 2511.14919 | null |
| 2025-11-18 | SLAM-AGS: Slide-Label Aware Multi-Task Pretraining Using Adaptive Gradient Surgery in Computational Cytology | Marco Acerbis et.al. | 2511.14639 | null |
| 2025-11-23 | Simultaneous Localization and 3D-Semi Dense Mapping for Micro Drones Using Monocular Camera and Inertial Sensors | Jeryes Danial et.al. | 2511.14335 | null |
| 2025-11-18 | MA-SLAM: Active SLAM in Large-Scale Unknown Environment using Map Aware Deep Reinforcement Learning | Yizhen Yin et.al. | 2511.14330 | null |
| 2025-11-18 | iGaussian: Real-Time Camera Pose Estimation via Feed-Forward 3D Gaussian Splatting Inversion | Hao Wang et.al. | 2511.14149 | null |
| 2025-11-17 | GaRLILEO: Gravity-aligned Radar-Leg-Inertial Enhanced Odometry | Chiyun Noh et.al. | 2511.13216 | null |
| 2025-11-16 | DPVO-QAT++: Heterogeneous QAT and CUDA Kernel Fusion for High-Performance Deep Patch Visual Odometry | Cheng Liao et.al. | 2511.12653 | null |
| 2025-11-14 | Autonomous Underwater Cognitive System for Adaptive Navigation: A SLAM-Integrated Cognitive Architecture | K. A. I. N Jayarathne et.al. | 2511.11845 | null |
| 2025-11-12 | DualVision ArthroNav: Investigating Opportunities to Enhance Localization and Reconstruction in Image-based Arthroscopy Navigation via External Cameras | Hongchao Shu et.al. | 2511.10699 | null |
| 2025-11-12 | Generation-Agnostic Zero-Energy Devices for Sustainable Connectivity, Sensing, and Localization | Navid Amani et.al. | 2511.09372 | null |
| 2025-11-12 | UMIGen: A Unified Framework for Egocentric Point Cloud Generation and Cross-Embodiment Robotic Imitation Learning | Yan Huang et.al. | 2511.09302 | null |
| 2025-11-12 | SMF-VO: Direct Ego-Motion Estimation via Sparse Motion Fields | Sangheon Yang et.al. | 2511.09072 | null |
| 2025-11-10 | Integration of Visual SLAM into Consumer-Grade Automotive Localization | Luis Diener et.al. | 2511.06919 | null |
| 2025-11-10 | Robust and High-Fidelity 3D Gaussian Splatting: Fusing Pose Priors and Geometry Constraints for Texture-Deficient Outdoor Scenes | Meijun Guo et.al. | 2511.06765 | null |
| 2025-11-10 | Semi-distributed Cross-modal Air-Ground Relative Localization | Weining Lu et.al. | 2511.06749 | null |
| 2025-11-08 | ViTaMIn-B: A Reliable and Efficient Visuo-Tactile Bimanual Manipulation Interface | Chuanyu Li et.al. | 2511.05858 | null |
| 2025-11-08 | 3D Mapping Using a Lightweight and Low-Power Monocular Camera Embedded inside a Gripper of Limbed Climbing Robots | Taku Okawara et.al. | 2511.05816 | null |
| 2025-11-07 | Multi-modal Loop Closure Detection with Foundation Models in Severely Unstructured Environments | Laura Alejandra Encinar Gonzalez et.al. | 2511.05404 | null |
| 2025-11-06 | Synchronous Observer Design for Landmark-Inertial SLAM with Almost-Global Convergence | Arkadeep Saha et.al. | 2511.04531 | null |
| 2025-11-06 | PUL-SLAM: Path-Uncertainty Co-Optimization with Lightweight Stagnation Detection for Efficient Robotic Exploration | Yizhen Yin et.al. | 2511.04180 | null |
| 2025-11-04 | Analytical modelling of a stop-less modular bus service with an application to charging strategies comparison | Haoran Zhao et.al. | 2511.03754 | null |
| 2025-11-04 | Self-Supervised Moving Object Segmentation of Sparse and Noisy Radar Point Clouds | Leon Schwarzer et.al. | 2511.02395 | null |
| 2025-11-03 | TurboMap: GPU-Accelerated Local Mapping for Visual SLAM | Parsa Hosseininejad et.al. | 2511.02036 | null |
| 2025-11-03 | CM-LIUW-Odometry: Robust and High-Precision LiDAR-Inertial-UWB-Wheel Odometry for Extreme Degradation Coal Mine Tunnels | Kun Hu et.al. | 2511.01379 | null |
| 2025-11-11 | Tackling the Kidnapped Robot Problem via Sparse Feasible Hypothesis Sampling and Reliable Batched Multi-Stage Inference | Muhua Zhang et.al. | 2511.01219 | null |
| 2025-11-03 | LiDAR-VGGT: Cross-Modal Coarse-to-Fine Fusion for Globally Consistent and Metric-Scale Dense Mapping | Lijie Wang et.al. | 2511.01186 | null |
| 2025-11-01 | Multi-Mapcher: Loop Closure Detection-Free Heterogeneous LiDAR Multi-Session SLAM Leveraging Outlier-Robust Registration for Autonomous Vehicles | Hyungtae Lim et.al. | 2511.00635 | null |
| 2025-10-31 | WildfireX-SLAM: A Large-scale Low-altitude RGB-D Dataset for Wildfire SLAM and Beyond | Zhicong Sun et.al. | 2510.27133 | null |
| 2025-10-30 | AgriGS-SLAM: Orchard Mapping Across Seasons via Multi-View Gaussian Splatting SLAM | Mirko Usuelli et.al. | 2510.26358 | null |
| 2025-10-30 | Exploring Object-Aware Attention Guided Frame Association for RGB-D SLAM | Ali Caglayan et.al. | 2510.26131 | null |
| 2025-10-29 | EA3D: Online Open-World 3D Object Extraction from Streaming Videos | Xiaoyu Zhou et.al. | 2510.25146 | null |
| 2025-10-28 | Spatiotemporal Calibration of Doppler Velocity Logs for Underwater Robots | Hongxu Zhao et.al. | 2510.24571 | null |
| 2025-10-28 | GeVI-SLAM: Gravity-Enhanced Stereo Visua Inertial SLAM for Underwater Robots | Yuan Shen et.al. | 2510.24533 | null |
| 2025-10-28 | A Survey on Collaborative SLAM with 3D Gaussian Splatting | Phuc Nguyen Xuan et.al. | 2510.23988 | null |
| 2025-10-26 | TWC-SLAM: Multi-Agent Cooperative SLAM with Text Semantics and WiFi Features Integration for Similar Indoor Environments | Chunyu Li et.al. | 2510.22754 | null |
| 2025-10-26 | Policies over Poses: Reinforcement Learning based Distributed Pose-Graph Optimization for Multi-Robot SLAM | Sai Krishna Ghanta et.al. | 2510.22740 | null |
| 2025-10-26 | LVD-GS: Gaussian Splatting SLAM for Dynamic Scenes via Hierarchical Explicit-Implicit Representation Collaboration Rendering | Wenkai Zhu et.al. | 2510.22669 | null |
| 2025-10-26 | RoGER-SLAM: A Robust Gaussian Splatting SLAM System for Noisy and Low-light Environment Resilience | Huilin Yin et.al. | 2510.22600 | null |
| 2025-10-26 | UltraVoice: Scaling Fine-Grained Style-Controlled Speech Conversations for Spoken Dialogue Models | Wenming Tu et.al. | 2510.22588 | null |
| 2025-10-26 | Bag-of-Word-Groups (BoWG): A Robust and Efficient Loop Closure Detection Method Under Perceptual Aliasing | Xiang Fei et.al. | 2510.22529 | null |
| 2025-10-24 | Underwater Visual-Inertial-Acoustic-Depth SLAM with DVL Preintegration for Degraded Environments | Shuoshuo Ding et.al. | 2510.21215 | null |
| 2025-10-23 | Deep Learning-Powered Visual SLAM Aimed at Assisting Visually Impaired Navigation | Marziyeh Bamdad et.al. | 2510.20549 | null |
| 2025-10-23 | Degradation-Aware Cooperative Multi-Modal GNSS-Denied Localization Leveraging LiDAR-Based Robot Detections | Václav Pritzl et.al. | 2510.20480 | null |
| 2025-10-21 | Underwater Dense Mapping with the First Compact 3D Sonar | Chinmay Burgul et.al. | 2510.18991 | null |
| 2025-10-21 | DeepDetect: Learning All-in-One Dense Keypoints | Shaharyar Ahmed Khan Tareen et.al. | 2510.17422 | null |
| 2025-10-18 | LightGlueStick: a Fast and Robust Glue for Joint Point-Line Matching | Aidyn Ubingazhibov et.al. | 2510.16438 | null |
| 2025-10-17 | VAR-SLAM: Visual Adaptive and Robust SLAM for Dynamic Environments | João Carlos Virgolino Soares et.al. | 2510.16205 | null |
| 2025-10-17 | Dynamic Recalibration in LiDAR SLAM: Integrating AI and Geometric Methods with Real-Time Feedback Using INAF Fusion | Zahra Arjmandi et.al. | 2510.15803 | null |
| 2025-10-17 | LVI-Q: Robust LiDAR-Visual-Inertial-Kinematic Odometry for Quadruped Robots Using Tightly-Coupled and Efficient Alternating Optimization | Kevin Christiansen Marsim et.al. | 2510.15220 | null |
| 2025-10-16 | 3D Scene Prompting for Scene-Consistent Camera-Controllable Video Generation | JoungBin Lee et.al. | 2510.14945 | null |
| 2025-10-15 | Accelerated Feature Detectors for Visual SLAM: A Comparative Study of FPGA vs GPU | Ruiqi Ye et.al. | 2510.13546 | null |
| 2025-10-15 | Through the Lens of Doubt: Robust and Efficient Uncertainty Estimation for Visual Place Recognition | Emily Miller et.al. | 2510.13464 | null |
| 2025-10-15 | DAMM-LOAM: Degeneracy Aware Multi-Metric LiDAR Odometry and Mapping | Nishant Chandna et.al. | 2510.13287 | null |
| 2025-10-14 | SPORTS: Simultaneous Panoptic Odometry, Rendering, Tracking and Segmentation for Urban Scenes Understanding | Zhiliu Yang et.al. | 2510.12749 | null |
| 2025-10-14 | PolygMap: A Perceptive Locomotion Framework for Humanoid Robot Stair Climbing | Bingquan Li et.al. | 2510.12346 | null |
| 2025-10-09 | ARTDECO: Towards Efficient and High-Fidelity On-the-Fly 3D Reconstruction with Structured Scene Representation | Guanghao Li et.al. | 2510.08551 | null |
| 2025-10-09 | RTGS: Real-Time 3D Gaussian Splatting SLAM via Multi-Level Redundancy Reduction | Leshu Li et.al. | 2510.06644 | null |
| 2025-10-07 | Human3R: Everyone Everywhere All at Once | Yue Chen et.al. | 2510.06219 | null |
| 2025-11-02 | Dropping the D: RGB-D SLAM Without the Depth Sensor | Mert Kiray et.al. | 2510.06216 | null |
| 2025-10-07 | Coordinate-Consistent Localization via Continuous-Time Calibration and Fusion of UWB and SLAM Observations | Tien-Dat Nguyen et.al. | 2510.05992 | null |
| 2025-10-06 | OKVIS2-X: Open Keyframe-based Visual-Inertial SLAM Configurable with Dense Depth or LiDAR, and GNSS | Simon Boche et.al. | 2510.04612 | null |
| 2025-10-04 | TCB-VIO: Tightly-Coupled Focal-Plane Binary-Enhanced Visual Inertial Odometry | Matthew Lisondra et.al. | 2510.03919 | null |
| 2025-11-19 | Visual Odometry with Transformers | Vlardimir Yugay et.al. | 2510.03348 | null |
| 2025-10-02 | RSV-SLAM: Toward Real-Time Semantic Visual SLAM in Indoor Dynamic Environments | Mobin Habibpour et.al. | 2510.02616 | null |
| 2025-10-02 | EC3R-SLAM: Efficient and Consistent Monocular Dense SLAM with Feed-Forward 3D Reconstruction | Lingxiang Hu et.al. | 2510.02080 | null |
| 2025-10-02 | Non-Rigid Structure-from-Motion via Differential Geometry with Recoverable Conformal Scale | Yongbo Chen et.al. | 2510.01665 | null |
| 2025-10-02 | Statistical Uncertainty Learning for Robust Visual-Inertial State Estimation | Seungwon Choi et.al. | 2510.01648 | null |
| 2025-10-01 | Instant4D: 4D Gaussian Splatting in Minutes | Zhanpeng Luo et.al. | 2510.01119 | null |
| 2025-10-01 | Semantic Visual Simultaneous Localization and Mapping: A Survey on State of the Art, Challenges, and Future Directions | Thanh Nguyen Canh et.al. | 2510.00783 | null |
| 2025-09-30 | Benchmarking Egocentric Visual-Inertial SLAM at City Scale | Anusha Krishnan et.al. | 2509.26639 | null |
| 2025-09-30 | Graphite: A GPU-Accelerated Mixed-Precision Graph Optimization Framework | Shishir Gopinath et.al. | 2509.26581 | null |
| 2025-09-30 | Radio-based Multi-Robot Odometry and Relative Localization | Andrés Martínez-Silva et.al. | 2509.26558 | null |
| 2025-09-30 | DEPTHOR++: Robust Depth Enhancement from a Real-World Lightweight dToF and RGB Guidance | Jijun Xiang et.al. | 2509.26498 | null |
| 2025-09-30 | Side Scan Sonar-based SLAM for Autonomous Algae Farm Monitoring | Julian Valdez et.al. | 2509.26121 | null |
| 2025-09-30 | User-Centric Communication Service Provision for Edge-Assisted Mobile Augmented Reality | Conghao Zhou et.al. | 2509.25905 | null |
| 2025-09-29 | PROFusion: Robust and Accurate Dense Reconstruction via Camera Pose Regression and Optimization | Siyan Dong et.al. | 2509.24236 | null |
| 2025-09-28 | GRS-SLAM3R: Real-Time Dense SLAM with Gated Recurrent State | Guole Shen et.al. | 2509.23737 | null |
| 2025-09-28 | From Fields to Splats: A Cross-Domain Survey of Real-Time Neural Scene Representations | Javed Ahmad et.al. | 2509.23555 | null |
| 2025-09-27 | EKF-Based Fusion of Wi-Fi/LiDAR/IMU for Indoor Localization and Navigation | Zeyi Li et.al. | 2509.23118 | null |
| 2025-09-26 | Good Weights: Proactive, Adaptive Dead Reckoning Fusion for Continuous and Robust Visual SLAM | Yanwei Du et.al. | 2509.22910 | null |
| 2025-09-26 | IMU-Preintegrated Radar Factors for Asynchronous Radar-LiDAR-Inertial SLAM | Johan Hatleskog et.al. | 2509.22288 | null |
| 2025-09-25 | Real-Time Indoor Object SLAM with LLM-Enhanced Priors | Yang Jiao et.al. | 2509.21602 | null |
| 2025-09-25 | PL-VIWO2: A Lightweight, Fast and Robust Visual-Inertial-Wheel Odometry Using Points and Lines | Zhixin Zhang et.al. | 2509.21563 | null |
| 2025-09-25 | AnywhereVLA: Language-Conditioned Exploration and Mobile Manipulation | Konstantin Gubernatorov et.al. | 2509.21006 | null |
| 2025-11-16 | MASt3R-Fusion: Integrating Feed-Forward Visual Model with IMU, GNSS for High-Functionality SLAM | Yuxuan Zhou et.al. | 2509.20757 | null |
| 2025-09-25 | SLAM-Free Visual Navigation with Hierarchical Vision-Language Perception and Coarse-to-Fine Semantic Topological Planning | Guoyang Zhao et.al. | 2509.20739 | null |
| 2025-09-24 | Optical Ocean Recipes: Creating Realistic Datasets to Facilitate Underwater Vision Research | Patricia Schöntag et.al. | 2509.20171 | null |
| 2025-09-23 | Bioinspired SLAM Approach for Unmanned Surface Vehicle | Fabio Coelho et.al. | 2509.19522 | null |
| 2025-09-23 | CU-Multi: A Dataset for Multi-Robot Collaborative Perception | Doncey Albin et.al. | 2509.19463 | null |
| 2025-09-23 | Towards Robust LiDAR Localization: Deep Learning-based Uncertainty Estimation | Minoo Dolatabadi et.al. | 2509.18954 | null |
| 2025-09-23 | An Extended Kalman Filter for Systems with Infinite-Dimensional Measurements | Maxwell M. Varley et.al. | 2509.18749 | null |
| 2025-09-22 | Semantic-Aware Particle Filter for Reliable Vineyard Robot Localisation | Rajitha de Silva et.al. | 2509.18342 | null |
| 2025-09-22 | ProDyG: Progressive Dynamic Scene Reconstruction via Gaussian Splatting from Monocular Videos | Shi Chen et.al. | 2509.17864 | null |
| 2025-09-21 | SLAM-Former: Putting SLAM into One Transformer | Yijun Yuan et.al. | 2509.16909 | null |
| 2025-09-21 | ConfidentSplat: Confidence-Weighted Depth Fusion for Accurate 3D Gaussian Splatting SLAM | Amanuel T. Dufera et.al. | 2509.16863 | null |
| 2025-09-19 | SLaM-DiMM: Shared Latent Modeling for Diffusion Based Missing Modality Synthesis in MRI | Bhavesh Sandbhor et.al. | 2509.16019 | null |
| 2025-09-19 | Omni-LIVO: Robust RGB-Colored Multi-Camera Visual-Inertial-LiDAR Odometry via Photometric Migration and ESIKF Fusion | Yinong Cao et.al. | 2509.15673 | null |
| 2025-09-19 | STARC: See-Through-Wall Augmented Reality Framework for Human-Robot Collaboration in Emergency Response | Shenghai Yuan et.al. | 2509.15507 | null |
| 2025-09-18 | Human Interaction for Collaborative Semantic SLAM using Extended Reality | Laura Ribeiro et.al. | 2509.14949 | null |
| 2025-09-18 | BEV-ODOM2: Enhanced BEV-based Monocular Visual Odometry with PV-BEV Fusion and Dense Flow Supervision for Ground Robots | Yufei Wei et.al. | 2509.14636 | null |
| 2025-09-18 | Event-LAB: Towards Standardized Evaluation of Neuromorphic Localization Methods | Adam D. Hines et.al. | 2509.14516 | null |
| 2025-10-03 | MCGS-SLAM: A Multi-Camera SLAM Framework Using Gaussian Splatting for High-Fidelity Mapping | Zhihao Cao et.al. | 2509.14191 | null |
| 2025-10-08 | BIM Informed Visual SLAM for Construction Monitoring | Asier Bikandi-Noya et.al. | 2509.13972 | null |
| 2025-09-17 | UM-Depth : Uncertainty Masked Self-Supervised Monocular Depth Estimation with Visual Odometry | Tae-Wook Um et.al. | 2509.13713 | null |
| 2025-09-17 | Barometer-Aided Attitude Estimation | Méloné Nyoba Tchonkeu et.al. | 2509.13649 | null |
| 2025-09-16 | Semantic 3D Reconstructions with SLAM for Central Airway Obstruction | Ayberk Acar et.al. | 2509.13541 | null |
| 2025-09-16 | MemGS: Memory-Efficient Gaussian Splatting for Real-Time SLAM | Yinlong Bai et.al. | 2509.13536 | null |
| 2025-09-18 | MATTER: Multiscale Attention for Registration Error Regression | Shipeng Liu et.al. | 2509.12924 | null |
| 2025-09-16 | Match Chat: Real Time Generative AI and Generative Computing for Tennis | Aaron Baughman et.al. | 2509.12592 | null |
| 2025-09-15 | See What I Mean? Mobile Eye-Perspective Rendering for Optical See-through Head-mounted Displays | Gerlinde Emsenhuber et.al. | 2509.11653 | null |
| 2025-09-15 | Gaussian-Plus-SDF SLAM: High-fidelity 3D Reconstruction at 150+ fps | Zhexi Peng et.al. | 2509.11574 | null |
| 2025-09-28 | Autonomous Close-Proximity Photovoltaic Panel Coating Using a Quadcopter | Dimitri Jacquemont et.al. | 2509.10979 | null |
| 2025-09-13 | FastTrack: GPU-Accelerated Tracking for Visual SLAM | Kimia Khabiri et.al. | 2509.10757 | null |
| 2025-09-12 | Robust Localization in Modern Cellular Networks using Global Map Features | Junshi Chen et.al. | 2509.10433 | null |
| 2025-09-12 | Efficient and Accurate Downfacing Visual Inertial Odometry | Jonas Kühne et.al. | 2509.10021 | null |
| 2025-10-10 | SMapper: A Multi-Modal Data Acquisition Platform for SLAM Benchmarking | Pedro Miguel Bastos Soares et.al. | 2509.09509 | null |
| 2025-09-11 | S-BEVLoc: BEV-based Self-supervised Framework for Large-scale LiDAR Global Localization | Chenghao Zhang et.al. | 2509.09110 | null |
| 2025-09-10 | Good Deep Features to Track: Self-Supervised Feature Extraction and Tracking in Visual Odometry | Sai Puneeth Reddy Gottam et.al. | 2509.08333 | null |
| 2025-09-10 | Behaviorally Heterogeneous Multi-Agent Exploration Using Distributed Task Allocation | Nirabhra Mandal et.al. | 2509.08242 | null |
| 2025-09-10 | Deep Visual Odometry for Stereo Event Cameras | Sheng Zhong et.al. | 2509.08235 | null |
| 2025-09-10 | Online Dynamic SLAM with Incremental Smoothing and Mapping | Jesse Morris et.al. | 2509.08197 | null |
| 2025-09-09 | Sensing with Mobile Devices through Radio SLAM: Models, Methods, Opportunities, and Challenges | Yu Ge et.al. | 2509.07775 | null |
| 2025-11-04 | Radar-Based Odometry for Low-Speed Driving | Luis Diener et.al. | 2509.07683 | null |
| 2025-09-09 | Aerial-ground Cross-modal Localization: Dataset, Ground-truth, and Benchmark | Yandi Yang et.al. | 2509.07362 | null |
| 2025-09-08 | Detection and Recovery of Adversarial Slow-Pose Drift in Offloaded Visual-Inertial Odometry | Soruya Saha et.al. | 2509.07130 | null |
| 2025-09-08 | Co-Located VR with Hybrid SLAM-based HMD Tracking and Motion Capture Synchronization | Carlos A. Pinheiro de Sousa et.al. | 2509.06582 | null |
| 2025-09-15 | Real-time Photorealistic Mapping for Situational Awareness in Robot Teleoperation | Ian Page et.al. | 2509.06433 | null |
| 2025-09-07 | DVLO4D: Deep Visual-Lidar Odometry with Sparse Spatial-temporal Fusion | Mengmeng Liu et.al. | 2509.06023 | null |
| 2025-09-06 | Multi-LVI-SAM: A Robust LiDAR-Visual-Inertial Odometry for Multiple Fisheye Cameras | Xinyu Zhang et.al. | 2509.05740 | null |
| 2025-09-30 | LiDAR-BIND-T: Improved and Temporally Consistent Sensor Modality Translation and Fusion for Robotic Applications | Niels Balemans et.al. | 2509.05728 | null |
| 2025-09-04 | Stitching the Story: Creating Panoramic Incident Summaries from Body-Worn Footage | Dor Cohen et.al. | 2509.04370 | null |
| 2025-09-04 | Odometry Calibration and Pose Estimation of a 4WIS4WID Mobile Wall Climbing Robot | Branimir Ćaran et.al. | 2509.04016 | null |
| 2025-09-03 | IL-SLAM: Intelligent Line-assisted SLAM Based on Feature Awareness for Dynamic Environments | Haolan Zhang et.al. | 2509.02972 | null |
| 2025-09-02 | Coral: A Unifying Abstraction Layer for Composable Robotics Software | Steven Swanbeck et.al. | 2509.02453 | null |
| 2025-09-02 | Doctoral Thesis: Geometric Deep Learning For Camera Pose Prediction, Registration, Depth Estimation, and 3D Reconstruction | Xueyang Kang et.al. | 2509.01873 | null |
| 2025-09-01 | ViSTA-SLAM: Visual SLAM with Symmetric Two-view Association | Ganlin Zhang et.al. | 2509.01584 | null |
| 2025-09-01 | FGO-SLAM: Enhancing Gaussian SLAM with Globally Consistent Opacity Radiance Field | Fan Zhu et.al. | 2509.01547 | null |
| 2025-09-01 | SR-SLAM: Scene-reliability Based RGB-D SLAM in Diverse Environments | Haolan Zhang et.al. | 2509.01111 | null |
| 2025-08-31 | DyPho-SLAM : Real-time Photorealistic SLAM in Dynamic Environments | Yi Liu et.al. | 2509.00741 | null |
| 2025-08-30 | AGS: Accelerating 3D Gaussian Splatting SLAM via CODEC-Assisted Frame Covisibility Detection | Houshu He et.al. | 2509.00433 | null |
| 2025-08-29 | The Rosario Dataset v2: Multimodal Dataset for Agricultural Robotics | Nicolas Soncini et.al. | 2508.21635 | null |
| 2025-08-28 | Observer Design for Optical Flow-Based Visual-Inertial Odometry with Almost-Global Convergence | Tarek Bouazza et.al. | 2508.21163 | null |
| 2025-08-28 | Adam SLAM - the last mile of camera calibration with 3DGS | Matthieu Gendrin et.al. | 2508.20526 | null |
| 2025-08-24 | SEER-VAR: Semantic Egocentric Environment Reasoner for Vehicle Augmented Reality | Yuzhi Lai et.al. | 2508.17255 | null |
| 2025-08-24 | VROOM - Visual Reconstruction over Onboard Multiview | Yajat Yadav et.al. | 2508.17172 | null |
| 2025-08-23 | DualReg: Dual-Space Filtering and Reinforcement for Rigid Registration | Jiayi Li et.al. | 2508.17034 | null |
| 2025-08-23 | A Workflow for Map Creation in Autonomous Vehicle Simulations | Zubair Islam et.al. | 2508.16856 | null |
| 2025-09-12 | COSMO-Bench: A Benchmark for Collaborative SLAM Optimization | Daniel McGann et.al. | 2508.16731 | null |
| 2025-08-22 | GPL-SLAM: A Laser SLAM Framework with Gaussian Process Based Extended Landmarks | Ali Emre Balcı et.al. | 2508.16459 | null |
| 2025-08-21 | GelSLAM: A Real-time, High-Fidelity, and Robust 3D Tactile SLAM System | Hung-Jui Huang et.al. | 2508.15990 | null |
| 2025-08-19 | SLAM-based Safe Indoor Exploration Strategy | Omar Mostafa et.al. | 2508.14235 | null |
| 2025-09-05 | Online 3D Gaussian Splatting Modeling with Novel View Selection | Byeonggwon Lee et.al. | 2508.14014 | null |
| 2025-08-19 | ROVER: Robust Loop Closure Verification with Trajectory Prior in Repetitive Environments | Jingwen Yu et.al. | 2508.13488 | null |
| 2025-08-18 | XR-NPE: High-Throughput Mixed-precision SIMD Neural Processing Engine for Extended Reality Perception Workloads | Tejas Chaudhari et.al. | 2508.13049 | null |
| 2025-08-16 | DynamicPose: Real-time and Robust 6D Object Pose Tracking for Fast-Moving Cameras and Objects | Tingbang Liang et.al. | 2508.11950 | null |
| 2025-08-14 | CVIRO: A Consistent and Tightly-Coupled Visual-Inertial-Ranging Odometry on Lie Groups | Yizhi Zhou et.al. | 2508.10867 | null |
| 2025-08-14 | Super LiDAR Reflectance for Robotic Perception | Wei Gao et.al. | 2508.10398 | null |
| 2025-08-12 | Transient Noise Removal via Diffusion-based Speech Inpainting | Mordehay Moradi et.al. | 2508.08890 | null |
| 2025-08-09 | EGS-SLAM: RGB-D Gaussian Splatting SLAM with Events | Siyu Chen et.al. | 2508.07003 | null |
| 2025-08-07 | A Multi-view Landmark Representation Approach with Application to GNSS-Visual-Inertial Odometry | Tong Hua et.al. | 2508.05368 | null |
| 2025-08-07 | Speech LLMs in Low-Resource Scenarios: Data Volume Requirements and the Impact of Pretraining on High-Resource Languages | Seraphina Fong et.al. | 2508.05149 | null |
| 2025-08-06 | Pseudo Depth Meets Gaussian: A Feed-forward RGB SLAM Baseline | Linqing Zhao et.al. | 2508.04597 | null |
| 2025-10-15 | Inland-LOAM: Voxel-Based Structural Semantic LiDAR Odometry and Mapping for Inland Waterway Navigation | Zhongbi Luo et.al. | 2508.03672 | null |
| 2025-08-04 | A Moment Matching-Based Method for Sparse and Noisy Point Cloud Registration | Xingyi Li et.al. | 2508.02187 | null |
| 2025-08-04 | AID4AD: Aerial Image Data for Automated Driving Perception | Daniel Lengerer et.al. | 2508.02140 | null |
| 2025-08-01 | CoProU-VO: Combining Projected Uncertainty for End-to-End Unsupervised Monocular Visual Odometry | Jingchao Xie et.al. | 2508.00568 | null |
| 2025-07-31 | The Monado SLAM Dataset for Egocentric Visual-Inertial Tracking | Mateo de Mayo et.al. | 2508.00088 | null |
| 2025-07-31 | Stereo 3D Gaussian Splatting SLAM for Outdoor Urban Scenes | Xiaohan Li et.al. | 2507.23677 | null |
| 2025-07-31 | DRACo-SLAM2: Distributed Robust Acoustic Communication-efficient SLAM for Imaging Sonar EquippedUnderwater Robot Teams with Object Graph Matching | Yewei Huang et.al. | 2507.23629 | null |
| 2025-07-31 | GSFusion:Globally Optimized LiDAR-Inertial-Visual Mapping for Gaussian Splatting | Jaeseok Park et.al. | 2507.23273 | null |
| 2025-07-30 | Modality-Aware Feature Matching: A Comprehensive Review of Single- and Cross-Modality Techniques | Weide Liu et.al. | 2507.22791 | null |
| 2025-07-30 | UAVScenes: A Multi-Modal Dataset for UAVs | Sijie Wang et.al. | 2507.22412 | null |
| 2025-07-29 | Impact of Underwater Image Enhancement on Feature Matching | Jason M. Summers et.al. | 2507.21715 | null |
| 2025-07-29 | Adaptive Prior Scene-Object SLAM for Dynamic Environments | Haolan Zhang et.al. | 2507.21709 | null |
| 2025-08-01 | Multi-robot LiDAR SLAM: a practical case study in underground tunnel environments | Federica Di Lauro et.al. | 2507.21553 | null |
| 2025-07-28 | Ruoyu Fan et.al. | 2507.20854 | null | |
| 2025-07-28 | Large-Scale LiDAR-Inertial Dataset for Degradation-Robust High-Precision Mapping | Xiaofeng Jin et.al. | 2507.20516 | null |
| 2025-07-26 | DOA: A Degeneracy Optimization Agent with Adaptive Pose Compensation Capability based on Deep Reinforcement Learning | Yanbin Li et.al. | 2507.19742 | null |
| 2025-07-25 | DINO-SLAM: DINO-informed RGB-D SLAM for Neural Implicit and Explicit Representations | Ziren Gong et.al. | 2507.19474 | null |
| 2025-07-25 | The Eloquence team submission for task 1 of MLC-SLM challenge | Lorenzo Concina et.al. | 2507.19308 | null |
| 2025-07-31 | SmartPNT-MSF: A Multi-Sensor Fusion Dataset for Positioning and Navigation Research | Feng Zhu et.al. | 2507.19079 | null |
| 2025-07-25 | A Fast and Light-weight Non-Iterative Visual Odometry with RGB-D Cameras | Zheng Yang et.al. | 2507.18886 | null |
| 2025-07-24 | G2S-ICP SLAM: Geometry-aware Gaussian Splatting ICP SLAM | Gyuhyeon Pak et.al. | 2507.18344 | null |
| 2025-07-23 | Physics-based Human Pose Estimation from a Single Moving RGB Camera | Ayce Idil Aytekin et.al. | 2507.17406 | null |
| 2025-08-01 | CasP: Improving Semi-Dense Feature Matching Pipeline Leveraging Cascaded Correspondence Priors for Guidance | Peiqi Chen et.al. | 2507.17312 | null |
| 2025-07-21 | DiffPF: Differentiable Particle Filtering with Generative Sampling via Conditional Diffusion Models | Ziyu Wan et.al. | 2507.15716 | null |
| 2025-07-21 | Dense-depth map guided deep Lidar-Visual Odometry with Sparse Point Clouds and Images | JunYing Huang et.al. | 2507.15496 | null |
| 2025-07-21 | All-UWB SLAM Using UWB Radar and UWB AOA | Charith Premachandra et.al. | 2507.15474 | null |
| 2025-07-21 | BenchDepth: Are We on the Right Way to Evaluate Depth Foundation Models? | Zhenyu Li et.al. | 2507.15321 | null |
| 2025-07-20 | LoopNet: A Multitasking Few-Shot Learning Approach for Loop Closure in Large Scale SLAM | Mohammad-Maher Nakshbandi et.al. | 2507.15109 | null |
| 2025-11-04 | Advances in Feed-Forward 3D Reconstruction and View Synthesis: A Survey | Jiahui Zhang et.al. | 2507.14501 | null |
| 2025-07-18 | SaWa-ML: Structure-Aware Pose Correction and Weight Adaptation-Based Robust Multi-Robot Localization | Junho Choi et.al. | 2507.13702 | null |
| 2025-07-17 | DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model | Maulana Bisyir Azhari et.al. | 2507.13145 | null |
| 2025-07-17 | MoCap2GT: A High-Precision Ground Truth Estimator for SLAM Benchmarking Based on Motion Capture and IMU Fusion | Zichao Shu et.al. | 2507.12920 | null |
| 2025-07-17 | Next-Gen Museum Guides: Autonomous Navigation and Visitor Interaction with an Agentic Robot | Luca Garello et.al. | 2507.12273 | null |
| 2025-07-16 | Tree-SLAM: semantic object SLAM for efficient mapping of individual trees in orchards | David Rapado-Rincon et.al. | 2507.12093 | null |
| 2025-07-11 | Towards Robust Sensor-Fusion Ground SLAM: A Comprehensive Benchmark and A Resilient Framework | Deteng Zhang et.al. | 2507.08364 | null |
| 2025-07-10 | Hardware-Aware Feature Extraction Quantisation for Real-Time Visual Odometry on FPGA Platforms | Mateusz Wasala et.al. | 2507.07903 | null |
| 2025-07-10 | IRAF-SLAM: An Illumination-Robust and Adaptive Feature-Culling Front-End for Visual SLAM in Challenging Environments | Thanh Nguyen Canh et.al. | 2507.07752 | null |
| 2025-07-09 | g2o vs. Ceres: Optimizing Scan Matching in Cartographer SLAM | Quanjie Qiu et.al. | 2507.07142 | null |
| 2025-07-08 | Mapping the Catacombs: An Underwater Cave Segment of the Devil's Eye System | Michalis Chatzispyrou et.al. | 2507.06397 | null |
| 2025-07-08 | Cooperative Mapping, Localization, and Beam Management via Multi-Modal SLAM in ISAC Systems | Hang Que et.al. | 2507.05718 | null |
| 2025-07-07 | Simultaneous Localization and Mapping Using Active mmWave Sensing in 5G NR | Tao Du et.al. | 2507.04662 | null |
| 2025-07-06 | Lidar Variability: A Novel Dataset and Comparative Study of Solid-State and Spinning Lidars | Doumegna Mawuto Koudjo Felix et.al. | 2507.04321 | null |
| 2025-07-09 | Gaussian-LIC2: LiDAR-Inertial-Camera Gaussian Splatting SLAM | Xiaolei Lang et.al. | 2507.04004 | null |
| 2025-07-04 | Outdoor Monocular SLAM with Global Scale-Consistent 3D Gaussian Pointmaps | Chong Cheng et.al. | 2507.03737 | null |
| 2025-07-01 | RaGNNarok: A Light-Weight Graph Neural Network for Enhancing Radar Point Clouds on Unmanned Ground Vehicles | David Hunt et.al. | 2507.00937 | null |
| 2025-07-01 | Generation of Indoor Open Street Maps for Robot Navigation from CAD Files | Jiajie Zhang et.al. | 2507.00552 | null |
| 2025-06-30 | VOCAL: Visual Odometry via ContrAstive Learning | Chi-Yao Huang et.al. | 2507.00243 | null |
| 2025-06-29 | TVG-SLAM: Robust Gaussian Splatting SLAM with Tri-view Geometric Constraints | Zhen Tan et.al. | 2506.23207 | null |
| 2025-06-29 | Event-based Stereo Visual-Inertial Odometry with Voxel Map | Zhaoxing Zhang et.al. | 2506.23078 | null |
| 2025-06-26 | Adaptive Multipath-Based SLAM for Distributed MIMO Systems | Xuhong Li et.al. | 2506.21798 | null |
| 2025-06-24 | Ark: An Open-source Python-based Framework for Robot Learning | Magnus Dierking et.al. | 2506.21628 | null |
| 2025-06-26 | EndoFlow-SLAM: Real-Time Endoscopic SLAM with Flow-Constrained Gaussian Splatting | Taoyu Wu et.al. | 2506.21420 | null |
| 2025-06-26 | CURL-SLAM: Continuous and Compact LiDAR Mapping | Kaicheng Zhang et.al. | 2506.21077 | null |
| 2025-06-25 | SPARK: Graph-Based Online Semantic Integration System for Robot Task Planning | Mimo Shirasaka et.al. | 2506.20394 | null |
| 2025-06-25 | Real-Time Obstacle Avoidance Algorithms for Unmanned Aerial and Ground Vehicles | Jingwen Wei et.al. | 2506.20311 | null |
| 2025-06-24 | Posterior Cramér-Rao Bounds on Localization and Mapping Errors in Distributed MIMO SLAM | Benjamin J. B. Deutschmann et.al. | 2506.19957 | null |
| 2025-06-23 | GRAND-SLAM: Local Optimization for Globally Consistent Large-Scale Multi-Agent Gaussian SLAM | Annika Thomas et.al. | 2506.18885 | null |
| 2025-06-23 | MCN-SLAM: Multi-Agent Collaborative Neural SLAM with Hybrid Implicit Neural Scene Representation | Tianchen Deng et.al. | 2506.18678 | null |
| 2025-06-24 | Multimodal Fusion SLAM with Fourier Attention | Youjie Zhou et.al. | 2506.18204 | null |
| 2025-06-22 | ADA-DPM: A Neural Descriptors-based Adaptive Noise Point Filtering Strategy for SLAM | Yongxin Shao et.al. | 2506.18016 | null |
| 2025-06-21 | Optimizing Exploration with a New Uncertainty Framework for Active SLAM Systems | Sebastian Sansoni et.al. | 2506.17775 | null |
| 2025-06-18 | MCOO-SLAM: A Multi-Camera Omnidirectional Object SLAM System | Miaoxin Pan et.al. | 2506.15402 | null |
| 2025-06-24 | RA-NeRF: Robust Neural Radiance Field Reconstruction with Accurate Camera Pose Estimation under Complex Trajectories | Qingsong Yan et.al. | 2506.15242 | null |
| 2025-06-18 | SHeRLoc: Synchronized Heterogeneous Radar Place Recognition for Cross-Modal Localization | Hanjun Kim et.al. | 2506.15175 | null |
| 2025-06-18 | VIMS: A Visual-Inertial-Magnetic-Sonar SLAM System in Underwater Environments | Bingbing Zhang et.al. | 2506.15126 | null |
| 2025-06-16 | Slanted light-sheet array microscopy for large volume imaging at rates exceeding 100 Hz | Kai Long et.al. | 2506.13664 | null |
| 2025-06-16 | Cognitive Synergy Architecture: SEGO for Human-Centric Collaborative Robots | Jaehong Oh et.al. | 2506.13149 | null |
| 2025-06-16 | A Novel ViDAR Device With Visual Inertial Encoder Odometry and Reinforcement Learning-Based Active SLAM Method | Zhanhua Xin et.al. | 2506.13100 | null |
| 2025-06-16 | SuperPoint-SLAM3: Augmenting ORB-SLAM3 with Deep Features, Adaptive NMS, and Learning-Based Loop Closure | Shahram Najam Syed et.al. | 2506.13089 | link |
| 2025-06-12 | LRSLAM: Low-rank Representation of Signed Distance Fields in Dense Visual SLAM System | Hongbeen Park et.al. | 2506.10567 | null |
| 2025-06-11 | VAULT: A Mobile Mapping System for ROS 2-based Autonomous Robots | Miguel Á. González-Santamarta et.al. | 2506.09583 | null |
| 2025-06-10 | UFM: A Simple Path towards Unified Dense Correspondence with Flow | Yuchen Zhang et.al. | 2506.09278 | null |
| 2025-06-10 | Princeton365: A Diverse Dataset with Accurate Camera Pose | Karhan Kayan et.al. | 2506.09035 | null |
| 2025-06-10 | Planar Collisionless Shock Simulations with Semi-Implicit Particle-in-Cell Model FLEKS | Hongyang Zhou et.al. | 2506.08384 | null |
| 2025-06-09 | ZeroVO: Visual Odometry with Minimal Assumptions | Lei Lai et.al. | 2506.08005 | null |
| 2025-06-08 | Faster than Fast: Accelerating Oriented FAST Feature Detection on Low-end Embedded GPUs | Qiong Chang et.al. | 2506.07164 | null |
| 2025-06-08 | UNO: Unified Self-Supervised Monocular Odometry for Platform-Agnostic Deployment | Wentao Zhao et.al. | 2506.07013 | null |
| 2025-06-06 | GS4: Generalizable Sparse Splatting Semantic SLAM | Mingqi Jiang et.al. | 2506.06517 | null |
| 2025-06-06 | Enhancing Situational Awareness in Underwater Robotics with Multi-modal Spatial Perception | Pushyami Kaveti et.al. | 2506.06476 | null |
| 2025-06-04 | Seeing in the Dark: Benchmarking Egocentric 3D Vision with the Oxford Day-and-Night Dataset | Zirui Wang et.al. | 2506.04224 | null |
| 2025-06-03 | LEG-SLAM: Real-Time Language-Enhanced Gaussian Splatting for SLAM | Roman Titkov et.al. | 2506.03073 | null |
| 2025-06-03 | Online Performance Assessment of Multi-Source-Localization for Autonomous Driving Systems Using Subjective Logic | Stefan Orf et.al. | 2506.02932 | null |
| 2025-06-03 | VTGaussian-SLAM: RGBD SLAM for Large Scale Scenes with Splatting View-Tied 3D Gaussians | Pengchong Hu et.al. | 2506.02741 | null |
| 2025-06-03 | GeneA-SLAM2: Dynamic SLAM with AutoEncoder-Preprocessed Genetic Keypoints Resampling and Depth Variance-Guided Dynamic Region Removal | Shufan Qing et.al. | 2506.02736 | link |
| 2025-06-03 | Olfactory Inertial Odometry: Methodology for Effective Robot Navigation by Scent | Kordel K. France et.al. | 2506.02373 | null |
| 2025-06-01 | Globally Consistent RGB-D SLAM with 2D Gaussian Splatting | Xingguang Zhong et.al. | 2506.00970 | link |
| 2025-05-30 | Black-box Adversarial Attacks on CNN-based SLAM Algorithms | Maria Rafaela Gkeka et.al. | 2505.24654 | null |
| 2025-05-28 | Semantic Exploration and Dense Mapping of Complex Environments using Ground Robots Equipped with LiDAR and Panoramic Camera | Xiaoyang Zhan et.al. | 2505.22880 | null |
| 2025-05-28 | 4DTAM: Non-Rigid Tracking and Mapping via Dynamic Surface Gaussians | Hidenobu Matsuki et.al. | 2505.22859 | null |
| 2025-05-28 | UP-SLAM: Adaptively Structured Gaussian SLAM with Uncertainty Prediction in Dynamic Environments | Wancai Zheng et.al. | 2505.22335 | null |
| 2025-05-27 | HS-SLAM: A Fast and Hybrid Strategy-Based SLAM Approach for Low-Speed Autonomous Driving | Bingxiang Kang et.al. | 2505.20906 | null |
| 2025-05-27 | ProBA: Probabilistic Bundle Adjustment with the Bhattacharyya Coefficient | Jason Chui et.al. | 2505.20858 | null |
| 2025-05-26 | ADD-SLAM: Adaptive Dynamic Dense SLAM with Gaussian Splatting | Wenhua Wu et.al. | 2505.19420 | null |
| 2025-05-25 | VPGS-SLAM: Voxel-based Progressive 3D Gaussian SLAM in Large-Scale Scenes | Tianchen Deng et.al. | 2505.18992 | link |
| 2025-05-23 | CU-Multi: A Dataset for Multi-Robot Data Association | Doncey Albin et.al. | 2505.17576 | null |
| 2025-05-22 | TAT-VPR: Ternary Adaptive Transformer for Dynamic and Efficient Visual Place Recognition | Oliver Grainge et.al. | 2505.16447 | null |
| 2025-05-20 | A Methodological Framework for Measuring Spatial Labeling Similarity | Yihang Du et.al. | 2505.14128 | link |
| 2025-05-22 | Place Recognition: A Comprehensive Review, Current Challenges and Future Directions | Zhenyu Li et.al. | 2505.14068 | link |
| 2025-05-19 | eStonefish-scenes: A synthetically generated dataset for underwater event-based optical flow prediction tasks | Jad Mansour et.al. | 2505.13309 | null |
| 2025-05-23 | VGGT-SLAM: Dense RGB SLAM Optimized on the SL(4) Manifold | Dominic Maggio et.al. | 2505.12549 | null |
| 2025-05-18 | Is Semantic SLAM Ready for Embedded Systems ? A Comparative Survey | Calvin Galagain et.al. | 2505.12384 | null |
| 2025-05-18 | Structureless VIO | Junlin Song et.al. | 2505.12337 | null |
| 2025-05-16 | EgoDex: Learning Dexterous Manipulation from Large-Scale Egocentric Video | Ryan Hoque et.al. | 2505.11709 | null |
| 2025-05-16 | Improved Bag-of-Words Image Retrieval with Geometric Constraints for Ground Texture Localization | Aaron Wilhelm et.al. | 2505.11620 | null |
| 2025-05-16 | Robust 2D lidar-based SLAM in arboreal environments without IMU/GNSS | Paola Nazate-Burgos et.al. | 2505.10847 | null |
| 2025-05-15 | TartanGround: A Large-Scale Dataset for Ground Robot Perception and Navigation | Manthan Patel et.al. | 2505.10696 | null |
| 2025-05-15 | A hybrid SLAM-Payne framework for atmospheric parameter and abundance determination of early-type Stars from LAMOST DR9 low-resolution Spectra | Weijia Sun et.al. | 2505.10310 | null |
| 2025-05-15 | Large-Scale Gaussian Splatting SLAM | Zhe Xin et.al. | 2505.09915 | null |
| 2025-05-13 | Automated Meta Prompt Engineering for Alignment with the Theory of Mind | Aaron Baughman et.al. | 2505.09024 | null |
| 2025-05-13 | MDF: Multi-Modal Data Fusion with CNN-Based Object Detection for Enhanced Indoor Localization Using LiDAR-SLAM | Saqi Hussain Kalan et.al. | 2505.08388 | null |
| 2025-05-13 | SKiD-SLAM: Robust, Lightweight, and Distributed Multi-Robot LiDAR SLAM in Resource-Constrained Field Environments | Hogyun Kim et.al. | 2505.08230 | null |
| 2025-05-12 | RDD: Robust Feature Detector and Descriptor using Deformable Transformer | Gonglin Chen et.al. | 2505.08013 | null |
| 2025-05-12 | Ranking-aware Continual Learning for LiDAR Place Recognition | Xufei Wang et.al. | 2505.07198 | null |
| 2025-05-07 | Scalable Aerial GNSS Localization for Marine Robots | Shuo Wen et.al. | 2505.04095 | link |
| 2025-05-06 | Thermal-LiDAR Fusion for Robust Tunnel Localization in GNSS-Denied and Low-Visibility Conditions | Lukas Schichler et.al. | 2505.03565 | null |
| 2025-05-06 | AquaticVision: Benchmarking Visual SLAM in Underwater Environment with Events and Frames | Yifan Peng et.al. | 2505.03448 | null |
| 2025-05-06 | LiftFeat: 3D Geometry-Aware Local Feature Matching | Yepeng Liu et.al. | 2505.03422 | link |
| 2025-05-05 | LiDAR-Inertial SLAM-Based Navigation and Safety-Oriented AI-Driven Control System for Skid-Steer Robots | Mehdi Heydari Shahna et.al. | 2505.02598 | null |
| 2025-05-04 | Robust Localization, Mapping, and Navigation for Quadruped Robots | Dyuman Aditya et.al. | 2505.02272 | null |
| 2025-05-04 | SafeNav: Safe Path Navigation using Landmark Based Localization in a GPS-denied Environment | Ganesh Sapkota et.al. | 2505.01956 | null |
| 2025-05-03 | GauS-SLAM: Dense RGB-D SLAM with Gaussian Surfels | Yongxin Su et.al. | 2505.01934 | null |
| 2025-05-02 | Tightly Coupled Range Inertial Odometry and Mapping with Exact Point Cloud Downsampling | Kenji Koide et.al. | 2505.01017 | null |
| 2025-04-30 | An Underwater, Fault-Tolerant, Laser-Aided Robotic Multi-Modal Dense SLAM System for Continuous Underwater In-Situ Observation | Yaming Ou et.al. | 2504.21826 | null |
| 2025-04-30 | eNCApsulate: NCA for Precision Diagnosis on Capsule Endoscopes | Henry John Krumb et.al. | 2504.21562 | null |
| 2025-04-29 | Large-scale visual SLAM for in-the-wild videos | Shuo Sun et.al. | 2504.20496 | null |
| 2025-04-28 | Transformation & Translation Occupancy Grid Mapping: 2-Dimensional Deep Learning Refined SLAM | Leon Davies et.al. | 2504.19654 | null |
| 2025-04-28 | GAN-SLAM: Real-Time GAN Aided Floor Plan Creation Through SLAM | Leon Davies et.al. | 2504.19653 | null |
| 2025-04-28 | GSFF-SLAM: 3D Semantic Gaussian Splatting SLAM via Feature Field | Zuxing Lu et.al. | 2504.19409 | null |
| 2025-04-27 | Beyond Physical Reach: Comparing Head- and Cane-Mounted Cameras for Last-Mile Navigation by Blind Users | Apurv Varshney et.al. | 2504.19345 | null |
| 2025-04-27 | NANO-SLAM : Natural Gradient Gaussian Approximation for Vehicle SLAM | Tianyi Zhang et.al. | 2504.19195 | null |
| 2025-04-27 | MISO: Multiresolution Submap Optimization for Efficient Globally Consistent Neural Implicit Reconstruction | Yulun Tian et.al. | 2504.19104 | null |
| 2025-04-25 | Certifiably-Correct Mapping for Safe Navigation Despite Odometry Drift | Devansh R. Agrawal et.al. | 2504.18713 | null |
| 2025-04-25 | Range-based 6-DoF Monte Carlo SLAM with Gradient-guided Particle Filter on GPU | Takumi Nakao et.al. | 2504.18056 | null |
| 2025-04-24 | Autonomous Navigation Of Quadrupeds Using Coverage Path Planning | Alexander James Becoy et.al. | 2504.17880 | null |
| 2025-04-24 | BIM-Constrained Optimization for Accurate Localization and Deviation Correction in Construction Monitoring | Asier Bikandi et.al. | 2504.17693 | null |
| 2025-04-24 | Occlusion-Aware Self-Supervised Monocular Depth Estimation for Weak-Texture Endoscopic Images | Zebo Huang et.al. | 2504.17582 | null |
| 2025-04-24 | Bias-Eliminated PnP for Stereo Visual Odometry: Provably Consistent and Large-Scale Localization | Guangyang Zeng et.al. | 2504.17410 | null |
| 2025-04-24 | EdgePoint2: Compact Descriptors for Superior Efficiency and Accuracy | Haodi Yao et.al. | 2504.17280 | null |
| 2025-04-23 | ToF-Splatting: Dense SLAM using Sparse Time-of-Flight Depth and Multi-Frame Integration | Andrea Conti et.al. | 2504.16545 | null |
| 2025-04-22 | DERD-Net: Learning Depth from Event-based Ray Densities | Diego de Oliveira Hitzges et.al. | 2504.15863 | null |
| 2025-04-23 | SLAM-Based Navigation and Fault Resilience in a Surveillance Quadcopter with Embedded Vision Systems | Abhishek Tyagi et.al. | 2504.15305 | null |
| 2025-04-20 | Back on Track: Bundle Adjustment for Dynamic Scene Reconstruction | Weirong Chen et.al. | 2504.14516 | null |
| 2025-04-20 | SG-Reg: Generalizable and Efficient Scene Graph Registration | Chuhao Liu et.al. | 2504.14440 | link |
| 2025-04-19 | Unreal Robotics Lab: A High-Fidelity Robotics Simulator with Advanced Physics and Rendering | Jonathan Embley-Riches et.al. | 2504.14135 | null |
| 2025-04-16 | An Online Adaptation Method for Robust Depth Estimation and Visual Odometry in the Open World | Xingwu Ji et.al. | 2504.11698 | link |
| 2025-04-18 | Doppler-SLAM: Doppler-Aided Radar-Inertial and LiDAR-Inertial Simultaneous Localization and Mapping | Dong Wang et.al. | 2504.11634 | link |
| 2025-04-14 | Region Based SLAM-Aware Exploration: Efficient and Robust Autonomous Mapping Strategy That Can Scale | Megha Maheshwari et.al. | 2504.10416 | null |
| 2025-04-14 | RoboCup Rescue 2025 Team Description Paper UruBots | Kevin Farias et.al. | 2504.09778 | null |
| 2025-04-11 | FindAnything: Open-Vocabulary and Object-Centric Mapping for Robot Exploration in Any Environment | Sebastián Barbas Laina et.al. | 2504.08603 | null |
| 2025-04-11 | PNE-SGAN: Probabilistic NDT-Enhanced Semantic Graph Attention Network for LiDAR Loop Closure Detection | Xiong Li et.al. | 2504.08280 | null |
| 2025-04-11 | II-NVM: Enhancing Map Accuracy and Consistency with Normal Vector-Assisted Mapping | Chengwei Zhao et.al. | 2504.08204 | link |
| 2025-04-10 | UWB Anchor Based Localization of a Planetary Rover | Andreas Nüchter et.al. | 2504.07658 | null |
| 2025-04-10 | Event Signal Filtering via Probability Flux Estimation | Jinze Chen et.al. | 2504.07503 | null |
| 2025-04-07 | Embracing Dynamics: Dynamics-aware 4D Gaussian Splatting SLAM | Zhicong Sun et.al. | 2504.04844 | link |
| 2025-04-06 | SELC: Self-Supervised Efficient Local Correspondence Learning for Low Quality Images | Yuqing Wang et.al. | 2504.04497 | null |
| 2025-04-06 | VSLAM-LAB: A Comprehensive Framework for Visual SLAM Methods and Datasets | Alejandro Fontan et.al. | 2504.04457 | link |
| 2025-04-05 | Nonlinear Observer Design for Landmark-Inertial Simultaneous Localization and Mapping | Mouaad Boughellaba et.al. | 2504.04239 | null |
| 2025-04-04 | WildGS-SLAM: Monocular Gaussian Splatting SLAM in Dynamic Environments | Jianhao Zheng et.al. | 2504.03886 | null |
| 2025-04-03 | SLACK: Attacking LiDAR-based SLAM with Adversarial Point Injections | Prashant Kumar et.al. | 2504.03089 | null |
| 2025-04-03 | Multimodal Fusion and Vision-Language Models: A Survey for Robot Vision | Xiaofeng Han et.al. | 2504.02477 | null |
| 2025-04-03 | MonoGS++: Fast and Accurate Monocular RGB Gaussian SLAM | Renwu Li et.al. | 2504.02437 | null |
| 2025-04-02 | A Chefs KISS -- Utilizing semantic information in both ICP and SLAM framework | Sven Ochs et.al. | 2504.02086 | null |
| 2025-04-01 | Semantic SLAM with Rolling-Shutter Cameras and Low-Precision INS in Outdoor Environments | Yuchen Zhang et.al. | 2504.01997 | null |
| 2025-04-02 | Strengthening Multi-Robot Systems for SAR: Co-Designing Robotics and Communication Towards 6G | Juan Bravo-Arrabal et.al. | 2504.01940 | null |
| 2025-04-02 | Dynamic Initialization for LiDAR-inertial SLAM | Jie Xu et.al. | 2504.01451 | link |
| 2025-04-02 | ForestVO: Enhancing Visual Odometry in Forest Environments through ForestGlue | Thomas Pritchard et.al. | 2504.01261 | link |
| 2025-03-31 | SuperEvent: Cross-Modal Learning of Event-based Keypoint Detection | Yannick Burkhardt et.al. | 2504.00139 | null |
| 2025-03-30 | A Visual-Inertial Motion Prior SLAM for Dynamic Environments | Weilong Sun et.al. | 2503.23429 | null |
| 2025-03-30 | AnyCam: Learning to Recover Camera Poses and Intrinsics from Casual Videos | Felix Wimbauer et.al. | 2503.23282 | link |
| 2025-03-27 | HS-SLAM: Hybrid Representation with Structural Supervision for Improved Dense SLAM | Ziren Gong et.al. | 2503.21778 | null |
| 2025-03-27 | STAMICS: Splat, Track And Map with Integrated Consistency and Semantics for Dense RGB-D SLAM | Yongxu Wang et.al. | 2503.21425 | null |
| 2025-03-25 | Scene-agnostic Pose Regression for Visual Localization | Junwei Zheng et.al. | 2503.19543 | null |
| 2025-03-25 | First Results on UAV-aided User Localization Using ToA and OpenAirInterface in 5G NR | Omid Esrafilian et.al. | 2503.19529 | null |
| 2025-03-25 | MM-LINS: a Multi-Map LiDAR-Inertial System for Over-Degenerate Environments | Yongxin Ma et.al. | 2503.19506 | link |
| 2025-03-24 | Cooperative Control of Multi-Quadrotors for Transporting Cable-Suspended Payloads: Obstacle-Aware Planning and Event-Based Nonlinear Model Predictive Control | Tohid Kargar Tasooji et.al. | 2503.19135 | null |
| 2025-03-24 | GI-SLAM: Gaussian-Inertial SLAM | Xulang Liu et.al. | 2503.18275 | null |
| 2025-03-22 | LightLoc: Learning Outdoor LiDAR Localization at Light Speed | Wen Li et.al. | 2503.17814 | link |
| 2025-03-21 | Autonomous Exploration-Based Precise Mapping for Mobile Robots through Stepwise and Consistent Motions | Muhua Zhang et.al. | 2503.17005 | null |
| 2025-03-20 | 4D Gaussian Splatting SLAM | Yanyan Li et.al. | 2503.16710 | null |
| 2025-03-20 | Speeding up design and making to reduce time-to-project and time-to-market: an AI-Enhanced approach in engineering education | Giovanni Adorni et.al. | 2503.16307 | null |
| 2025-03-20 | Loop Closure from Two Views: Revisiting PGO for Scalable Trajectory Estimation through Monocular Priors | Tian Yi Lim et.al. | 2503.16275 | null |
| 2025-03-19 | A Sigma Point-based Low Complexity Algorithm for Multipath-based SLAM in MIMO Systems | Anna Masiero et.al. | 2503.15286 | null |
| 2025-03-19 | ChatStitch: Visualizing Through Structures via Surround-View Unsupervised Deep Image Stitching with Collaborative LLM-Agents | Hao Liang et.al. | 2503.14948 | null |
| 2025-03-18 | 3D Densification for Multi-Map Monocular VSLAM in Endoscopy | X. Anadón et.al. | 2503.14346 | null |
| 2025-03-18 | GeoFlow-SLAM: A Robust Tightly-Coupled RGBD-Inertial Fusion SLAM for Dynamic Legged Robotics | Tingyang Xiao et.al. | 2503.14247 | link |
| 2025-03-18 | A-SCoRe: Attention-based Scene Coordinate Regression for wide-ranging scenarios | Huy-Hoang Bui et.al. | 2503.13982 | link |
| 2025-03-17 | Digital Beamforming Enhanced Radar Odometry | Jingqi Jiang et.al. | 2503.13252 | link |
| 2025-03-17 | Dynamic-Dark SLAM: RGB-Thermal Cooperative Robot Vision Strategy for Multi-Person Tracking in Both Well-Lit and Low-Light Scenes | Tatsuro Sakai et.al. | 2503.12768 | null |
| 2025-03-16 | KISS-SLAM: A Simple, Robust, and Accurate 3D LiDAR SLAM System With Enhanced Generalization Capabilities | Tiziano Guadagnino et.al. | 2503.12660 | null |
| 2025-03-16 | Deblur Gaussian Splatting SLAM | Francesco Girlanda et.al. | 2503.12572 | null |
| 2025-03-16 | M2UD: A Multi-model, Multi-scenario, Uneven-terrain Dataset for Ground Robot with Localization and Mapping Evaluation | Yanpeng Jia et.al. | 2503.12387 | null |
| 2025-03-13 | OSMa-Bench: Evaluating Open Semantic Mapping Under Varying Lighting Conditions | Maxim Popov et.al. | 2503.10331 | null |
| 2025-03-12 | Online Language Splatting | Saimouli Katragadda et.al. | 2503.09447 | null |
| 2025-03-12 | MonoSLAM: Robust Monocular SLAM with Global Structure Optimization | Bingzheng Jiang et.al. | 2503.09296 | null |
| 2025-03-11 | Keypoint Detection and Description for Raw Bayer Images | Jiakai Lin et.al. | 2503.08673 | null |
| 2025-03-11 | GigaSLAM: Large-Scale Monocular SLAM with Hierachical Gaussian Splats | Kai Deng et.al. | 2503.08071 | link |
| 2025-03-10 | POp-GS: Next Best View in 3D-Gaussian Splatting with P-Optimality | Joey Wilson et.al. | 2503.07819 | null |
| 2025-03-08 | HIPPO-MAT: Decentralized Task Allocation Using GraphSAGE and Multi-Agent Deep Reinforcement Learning | Lavanya Ratnabala et.al. | 2503.07662 | null |
| 2025-03-10 | AirSwarm: Enabling Cost-Effective Multi-UAV Research with COTS drones | Xiaowei Li et.al. | 2503.06890 | link |
| 2025-03-08 | InfoFusion Controller: Informed TRRT Star with Mutual Information based on Fusion of Pure Pursuit and MPC for Enhanced Path Planning | Seongjun Choi et.al. | 2503.06010 | link |
| 2025-03-07 | THE-SEAN: A Heart Rate Variation-Inspired Temporally High-Order Event-Based Visual Odometry with Self-Supervised Spiking Event Accumulation Networks | Chaoran Xiong et.al. | 2503.05112 | null |
| 2025-03-07 | Adaptive-LIO: Enhancing Robustness and Precision through Environmental Adaptation in LiDAR Inertial Odometry | Chengwei Zhao et.al. | 2503.05077 | link |
| 2025-03-06 | MarsLGPR: Mars Rover Localization with Ground Penetrating Radar | Anja Sheppard et.al. | 2503.04944 | null |
| 2025-03-06 | On the Connection Between Magnetic-Field Odometry Aided Inertial Navigation and Magnetic-Field SLAM | Isaac Skog et.al. | 2503.04286 | null |
| 2025-03-06 | Geometry-Constrained Monocular Scale Estimation Using Semantic Segmentation for Dynamic Scenes | Hui Zhang et.al. | 2503.04235 | null |
| 2025-03-06 | DVM-SLAM: Decentralized Visual Monocular Simultaneous Localization and Mapping for Multi-Agent Systems | Joshua Bird et.al. | 2503.04126 | null |
| 2025-03-05 | Equivariant Filter Design for Range-only SLAM | Yixiao Ge et.al. | 2503.03973 | null |
| 2025-03-05 | Direct Sparse Odometry with Continuous 3D Gaussian Maps for Indoor Environments | Jie Deng et.al. | 2503.03373 | link |
| 2025-03-05 | OpenGV 2.0: Motion prior-assisted calibration and SLAM with vehicle-mounted surround-view systems | Kun Huang et.al. | 2503.03230 | null |
| 2025-03-05 | Distributed Certifiably Correct Range-Aided SLAM | Alexander Thoms et.al. | 2503.03192 | link |
| 2025-03-04 | Introspective Loop Closure for SLAM with 4D Imaging Radar | Maximilian Hilger et.al. | 2503.02383 | null |
| 2025-03-04 | DQO-MAP: Dual Quadrics Multi-Object mapping with Gaussian Splatting | Haoyuan Li et.al. | 2503.02223 | link |
| 2025-03-03 | Constraint-Based Modeling of Dynamic Entities in 3D Scene Graphs for Robust SLAM | Marco Giberna et.al. | 2503.02050 | null |
| 2025-03-03 | vS-Graphs: Integrating Visual SLAM and Situational Graphs through Multi-level Scene Understanding | Ali Tourani et.al. | 2503.01783 | null |
| 2025-03-03 | MUSt3R: Multi-view Network for Stereo 3D Reconstruction | Yohann Cabon et.al. | 2503.01661 | link |
| 2025-03-03 | OpenGS-SLAM: Open-Set Dense Semantic SLAM with 3D Gaussian Splatting for Object-Level Scene Understanding | Dianyi Yang et.al. | 2503.01646 | null |
| 2025-03-03 | MLINE-VINS: Robust Monocular Visual-Inertial SLAM With Flow Manhattan and Line Features | Chao Ye et.al. | 2503.01571 | link |
| 2025-03-03 | AI-Driven Relocation Tracking in Dynamic Kitchen Environments | Arash Nasr Esfahani et.al. | 2503.01547 | link |
| 2025-03-03 | Exo-ViHa: A Cross-Platform Exoskeleton System with Visual and Haptic Feedback for Efficient Dexterous Skill Learning | Xintao Chao et.al. | 2503.01543 | null |
| 2025-03-03 | RUSSO: Robust Underwater SLAM with Sonar Optimization against Visual Degradation | Shu Pan et.al. | 2503.01434 | null |
| 2025-02-27 | BEV-DWPVO: BEV-based Differentiable Weighted Procrustes for Low Scale-drift Monocular Visual Odometry on Ground | Yufei Wei et.al. | 2502.20078 | null |
| 2025-02-26 | Increasing the Task Flexibility of Heavy-Duty Manipulators Using Visual 6D Pose Estimation of Objects | Petri Mäkinen et.al. | 2502.19169 | null |
| 2025-02-26 | SLAM in the Dark: Self-Supervised Learning of Pose, Depth and Loop-Closure from Thermal Images | Yangfan Xu et.al. | 2502.18932 | null |
| 2025-02-25 | S-Graphs 2.0 -- A Hierarchical-Semantic Optimization and Loop Closure for SLAM | Hriday Bavle et.al. | 2502.18044 | link |
| 2025-02-25 | MegaLoc: One Retrieval to Place Them All | Gabriele Berton et.al. | 2502.17237 | link |
| 2025-02-24 | SLABIM: A SLAM-BIM Coupled Dataset in HKUST Main Building | Haoming Huang et.al. | 2502.16856 | link |
| 2025-02-27 | Orchestrating Joint Offloading and Scheduling for Low-Latency Edge SLAM | Yao Zhang et.al. | 2502.16495 | null |
| 2025-02-19 | Slamming: Training a Speech Language Model on One GPU in a Day | Gallil Maimon et.al. | 2502.15814 | link |
| 2025-02-21 | RGB-Only Gaussian Splatting SLAM for Unbounded Outdoor Scenes | Sicheng Yu et.al. | 2502.15633 | null |
| 2025-02-20 | Hier-SLAM++: Neuro-Symbolic Semantic SLAM with a Hierarchically Categorical Gaussian Splatting | Boying Li et.al. | 2502.14931 | null |
| 2025-02-19 | 3D Gaussian Splatting aided Localization for Large and Complex Indoor-Environments | Vincent Ress et.al. | 2502.13803 | null |
| 2025-02-19 | Active Illumination for Visual Ego-Motion Estimation in the Dark | Francesco Crocetti et.al. | 2502.13708 | null |
| 2025-02-17 | From Gaming to Research: GTA V for Synthetic Data Generation for Robotics and Navigations | Matteo Scucchia et.al. | 2502.12303 | null |
| 2025-02-19 | pySLAM: An Open-Source, Modular, and Extensible Framework for SLAM | Luigi Freda et.al. | 2502.11955 | link |
| 2025-02-17 | Anti-Degeneracy Scheme for Lidar SLAM based on Particle Filter in Geometry Feature-Less Environments | Yanbin Li et.al. | 2502.11486 | null |
| 2025-02-16 | GS-GVINS: A Tightly-integrated GNSS-Visual-Inertial Navigation System Augmented by 3D Gaussian Splatting | Zelin Zhou et.al. | 2502.10975 | null |
| 2025-02-19 | MonoForce: Learnable Image-conditioned Physics Engine | Ruslan Agishev et.al. | 2502.10156 | link |
| 2025-02-13 | Vision-based Geo-Localization of Future Mars Rotorcraft in Challenging Illumination Conditions | Dario Pisanti et.al. | 2502.09795 | null |
| 2025-02-13 | DenseSplat: Densifying Gaussian Splatting SLAM with Neural Radiance Prior | Mingrui Li et.al. | 2502.09111 | null |
| 2025-02-12 | LIR-LIVO: A Lightweight,Robust LiDAR/Vision/Inertial Odometry with Illumination-Resilient Deep Features | Shujie Zhou et.al. | 2502.08676 | link |
| 2025-02-10 | Occupancy-SLAM: An Efficient and Robust Algorithm for Simultaneously Optimizing Robot Poses and Occupancy Map | Yingyu Wang et.al. | 2502.06292 | link |
| 2025-02-09 | PINGS: Gaussian Splatting Meets Distance Fields within a Point-Based Implicit Neural Map | Yue Pan et.al. | 2502.05752 | link |
| 2025-02-07 | Joint State and Noise Covariance Estimation | Kasra Khosoussi et.al. | 2502.04584 | null |
| 2025-02-05 | GARAD-SLAM: 3D GAussian splatting for Real-time Anti Dynamic SLAM | Mingrui Li et.al. | 2502.03228 | null |
| 2025-02-04 | SiLVR: Scalable Lidar-Visual Radiance Field Reconstruction with Uncertainty Quantification | Yifu Tao et.al. | 2502.02657 | null |
| 2025-02-04 | HeRCULES: Heterogeneous Radar Dataset in Complex Urban Environment for Multi-session Radar SLAM | Hanjun Kim et.al. | 2502.01946 | null |
| 2025-02-03 | Statistical enhance learning for modeling and prediction tennis matches at Grand Slam tournaments | Nourah Buhamra et.al. | 2502.01613 | null |
| 2025-02-03 | Enhancing Feature Tracking Reliability for Visual Navigation using Real-Time Safety Filter | Dabin Kim et.al. | 2502.01092 | null |
| 2025-02-01 | FlexCloud: Direct, Modular Georeferencing and Drift-Correction of Point Cloud Maps | Maximilian Leitenstern et.al. | 2502.00395 | link |
| 2025-01-31 | LiDAR Loop Closure Detection using Semantic Graphs with Graph Attention Networks | Liudi Yang et.al. | 2501.19382 | link |
| 2025-01-31 | Advancing Dense Endoscopic Reconstruction with Gaussian Splatting-driven Surface Normal-aware Tracking and Mapping | Yiming Huang et.al. | 2501.19319 | link |
| 2025-01-31 | GO: The Great Outdoors Multimodal Dataset | Peng Jiang et.al. | 2501.19274 | null |
| 2025-01-30 | Lifelong 3D Mapping Framework for Hand-held & Robot-mounted LiDAR Mapping Systems | Liudi Yang et.al. | 2501.18110 | null |
| 2025-01-28 | SSF-PAN: Semantic Scene Flow-Based Perception for Autonomous Navigation in Traffic Scenarios | Yinqi Chen et.al. | 2501.16754 | null |
| 2025-01-27 | Visual-Lidar Map Alignment for Infrastructure Inspections | Jake McLaughlin et.al. | 2501.14486 | link |
| 2025-01-24 | Scalable Benchmarking and Robust Learning for Noise-Free Ego-Motion and 3D Reconstruction from Noisy Video | Xiaohao Xu et.al. | 2501.14319 | link |
| 2025-01-24 | HAMMER: Heterogeneous, Multi-Robot Semantic Gaussian Splatting | Javier Yu et.al. | 2501.14147 | null |
| 2025-01-23 | FAST-LIVO2 on Resource-Constrained Platforms: LiDAR-Inertial-Visual Odometry with Efficient Memory and Computation | Bingyang Zhou et.al. | 2501.13876 | null |
| 2025-01-23 | VIGS SLAM: IMU-based Large-Scale 3D Gaussian Splatting SLAM | Gyuhyeon Pak et.al. | 2501.13402 | null |
| 2025-01-22 | Grid-based Submap Joining: An Efficient Algorithm for Simultaneously Optimizing Global Occupancy Map and Local Submap Frames | Yingyu Wang et.al. | 2501.12764 | null |
| 2025-01-21 | DynoSAM: Open-Source Smoothing and Mapping Framework for Dynamic SLAM | Jesse Morris et.al. | 2501.11893 | link |
| 2025-01-21 | Survey on Monocular Metric Depth Estimation | Jiuling Zhang et.al. | 2501.11841 | null |
| 2025-01-19 | OpenLiDARMap: Zero-Drift Point Cloud Mapping using Map Priors | Dominik Kulmer et.al. | 2501.11111 | link |
| 2025-01-19 | Factor Graph-Based Active SLAM for Spacecraft Proximity Operations | Lorenzo Ticozzi et.al. | 2501.10950 | null |
| 2025-01-23 | Mesh2SLAM in VR: A Fast Geometry-Based SLAM Framework for Rapid Prototyping in Virtual Reality Applications | Carlos Augusto Pinheiro de Sousa et.al. | 2501.09600 | null |
| 2025-01-16 | Comparison of Various SLAM Systems for Mobile Robot in an Indoor Environment | Maksim Filipenko et.al. | 2501.09490 | null |
| 2025-01-15 | Unified Few-shot Crack Segmentation and its Precise 3D Automatic Measurement in Concrete Structures | Pengru Deng et.al. | 2501.09203 | null |
| 2025-01-15 | AutoLoop: Fast Visual SLAM Fine-tuning through Agentic Curriculum Learning | Assaf Lahiany et.al. | 2501.09160 | null |
| 2025-01-15 | SLC |
Yuhang Ming et.al. | 2501.08880 | null |
| 2025-01-15 | GS-LIVO: Real-Time LiDAR, Inertial, and Visual Multi-sensor Fused Odometry with Gaussian Mapping | Sheng Hong et.al. | 2501.08672 | null |
| 2025-01-16 | BRIGHT-VO: Brightness-Guided Hybrid Transformer for Visual Odometry with Multi-modality Refinement Module | Dongzhihan Wang et.al. | 2501.08659 | null |
| 2025-01-15 | Self-Organizing Edge Computing Distribution Framework for Visual SLAM | Jussi Kalliola et.al. | 2501.08629 | null |
| 2025-01-14 | VINGS-Mono: Visual-Inertial Gaussian Splatting Monocular SLAM in Large Scenes | Ke Wu et.al. | 2501.08286 | null |
| 2025-01-13 | Efficiently Closing Loops in LiDAR-Based SLAM Using Point Cloud Density Maps | Saurabh Gupta et.al. | 2501.07399 | null |
| 2025-01-13 | SplatMAP: Online Dense Monocular SLAM with 3D Gaussian Splatting | Yue Hu et.al. | 2501.07015 | null |
| 2025-01-12 | CULTURE3D: Cultural Landmarks and Terrain Dataset for 3D Applications | Xinyi Zheng et.al. | 2501.06927 | link |
| 2025-01-11 | SP-SLAM: Neural Real-Time Dense SLAM With Scene Priors | Zhen Hong et.al. | 2501.06469 | null |
| 2025-01-09 | Scaffold-SLAM: Structured 3D Gaussians for Simultaneous Localization and Photorealistic Mapping | Wen Tianci et.al. | 2501.05242 | null |
| 2025-01-07 | SLAM: Towards Efficient Multilingual Reasoning via Selective Language Alignment | Yuchun Fan et.al. | 2501.03681 | link |
| 2025-01-06 | HaWoR: World-Space Hand Motion Reconstruction from Egocentric Videos | Jinglei Zhang et.al. | 2501.02973 | null |
| 2025-01-09 | LP-ICP: General Localizability-Aware Point Cloud Registration for Robust Localization in Extreme Unstructured Environments | Haosong Yue et.al. | 2501.02580 | link |
| 2025-01-04 | ROLO-SLAM: Rotation-Optimized LiDAR-Only SLAM in Uneven Terrain with Ground Vehicle | Yinchuan Wang et.al. | 2501.02166 | link |
| 2024-12-31 | PanoSLAM: Panoptic 3D Scene Reconstruction via Gaussian SLAM | Runnan Chen et.al. | 2501.00352 | null |
| 2024-12-30 | Hierarchical Pose Estimation and Mapping with Multi-Scale Neural Feature Fields | Evgenii Kruzhkov et.al. | 2412.20976 | null |
| 2024-12-28 | MambaVO: Deep Visual Odometry Based on Sequential Matching Refinement and Training Smoothing | Shuo Wang et.al. | 2412.20082 | null |
| 2024-12-27 | DAS3R: Dynamics-Aware Gaussian Splatting for Static Scene Reconstruction | Kai Xu et.al. | 2412.19584 | null |
| 2024-12-26 | MVS-GS: High-Quality 3D Gaussian Splatting Mapping via Online Multi-View Stereo | Byeonggwon Lee et.al. | 2412.19130 | null |
| 2024-12-23 | End-to-end Generative Spatial-Temporal Ultrasonic Odometry and Mapping Framework | Fuhua Jia et.al. | 2412.17343 | null |
| 2024-12-23 | LMD-PGN: Cross-Modal Knowledge Distillation from First-Person-View Images to Third-Person-View BEV Maps for Universal Point Goal Navigation | Riku Uemura et.al. | 2412.17282 | null |
| 2024-12-23 | Selective Kalman Filter: When and How to Fuse Multi-Sensor Information to Overcome Degeneracy in SLAM | Jie Xu et.al. | 2412.17235 | null |
| 2025-01-03 | Leveraging Consistent Spatio-Temporal Correspondence for Robust Visual Odometry | Zhaoxing Zhang et.al. | 2412.16923 | link |
| 2024-12-21 | Query Quantized Neural SLAM | Sijia Jiang et.al. | 2412.16476 | link |
| 2024-12-20 | SLAM-Omni: Timbre-Controllable Voice Interaction System with Single-Stage Training | Wenxi Chen et.al. | 2412.15649 | link |
| 2024-12-18 | Energy-Efficient SLAM via Joint Design of Sensing, Communication, and Exploration Speed | Zidong Han et.al. | 2412.13912 | null |
| 2024-12-18 | Immersive Human-in-the-Loop Control: Real-Time 3D Surface Meshing and Physics Simulation | Sait Akturk et.al. | 2412.13752 | null |
| 2024-12-18 | 4D Radar-Inertial Odometry based on Gaussian Modeling and Multi-Hypothesis Scan Matching | Fernando Amodeo et.al. | 2412.13639 | link |
| 2024-12-17 | NFL-BA: Improving Endoscopic SLAM with Near-Field Light Bundle Adjustment | Andrea Dunn Beltran et.al. | 2412.13176 | null |
| 2024-12-18 | Dyn-HaMR: Recovering 4D Interacting Hand Motion from a Dynamic Camera | Zhengdi Yu et.al. | 2412.12861 | null |
| 2024-12-16 | Global SLAM in Visual-Inertial Systems with 5G Time-of-Arrival Integration | Meisam Kabiri et.al. | 2412.12406 | null |
| 2024-12-16 | MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction Priors | Riku Murai et.al. | 2412.12392 | null |
| 2024-12-16 | Sonar-based Deep Learning in Underwater Robotics: Overview, Robustness and Challenges | Martin Aubard et.al. | 2412.11840 | null |
| 2024-12-19 | RoMeO: Robust Metric Visual Odometry | Junda Cheng et.al. | 2412.11530 | null |
| 2024-12-14 | Affine EKF: Exploring and Utilizing Sufficient and Necessary Conditions for Observability Maintenance to Improve EKF Consistency | Yang Song et.al. | 2412.10809 | link |
| 2024-12-13 | RP-SLAM: Real-time Photorealistic SLAM with Efficient 3D Gaussian Splatting | Lizhi Bai et.al. | 2412.09868 | null |
| 2024-12-12 | SLAM3R: Real-Time Dense Scene Reconstruction from Monocular RGB Videos | Yuzheng Liu et.al. | 2412.09401 | link |
| 2024-12-12 | eCARLA-scenes: A synthetically generated dataset for event-based optical flow prediction | Jad Mansour et.al. | 2412.09209 | link |
| 2024-12-12 | Drift-free Visual SLAM using Digital Twins | Roxane Merat et.al. | 2412.08496 | null |
| 2024-12-10 | A Real-time Degeneracy Sensing and Compensation Method for Enhanced LiDAR SLAM | Zongbo Liao et.al. | 2412.07513 | null |
| 2024-12-08 | DiTer++: Diverse Terrain and Multi-modal Dataset for Multi-Robot SLAM in Multi-session Environments | Juwon Kim et.al. | 2412.05839 | null |
| 2024-12-06 | MegaSaM: Accurate, Fast, and Robust Structure and Motion from Casual Dynamic Videos | Zhengqi Li et.al. | 2412.04463 | null |
| 2024-12-05 | Multi-cam Multi-map Visual Inertial Localization: System, Validation and Dataset | Fuzhang Han et.al. | 2412.04287 | link |
| 2024-12-10 | MOANA: Multi-Radar Dataset for Maritime Odometry and Autonomous Navigation Application | Hyesu Jang et.al. | 2412.03887 | null |
| 2024-12-04 | Large-Scale Dense 3D Mapping Using Submaps Derived From Orthogonal Imaging Sonars | John McConnell et.al. | 2412.03760 | null |
| 2024-12-04 | BIMCaP: BIM-based AI-supported LiDAR-Camera Pose Refinement | Miguel Arturo Vega Torres et.al. | 2412.03434 | link |
| 2024-12-04 | NeRF and Gaussian Splatting SLAM in the Wild | Fabian Schmidt et.al. | 2412.03263 | link |
| 2024-12-04 | MCVO: A Generic Visual Odometry for Arbitrarily Arranged Multi-Cameras | Huai Yu et.al. | 2412.03146 | link |
| 2024-12-04 | An indoor DSO-based ceiling-vision odometry system for indoor industrial environments | Abdelhak Bougouffa et.al. | 2412.02950 | null |
| 2024-12-03 | ROVER: A Multi-Season Dataset for Visual SLAM | Fabian Schmidt et.al. | 2412.02506 | link |
| 2024-12-04 | RGBDS-SLAM: A RGB-D Semantic Dense SLAM Based on 3D Multi Level Pyramid Gaussian Splatting | Zhenzhong Cao et.al. | 2412.01217 | link |
| 2024-11-28 | Visual SLAMMOT Considering Multiple Motion Models | Peilin Tian et.al. | 2411.19134 | null |
| 2024-11-27 | ORB-SLAM3AB: Augmenting ORB-SLAM3 to Counteract Bumps with Optical Flow Inter-frame Matching | Yangrui Dong et.al. | 2411.18174 | null |
| 2024-11-27 | HI-SLAM2: Geometry-Aware Gaussian SLAM for Fast Monocular Scene Reconstruction | Wei Zhang et.al. | 2411.17982 | link |
| 2024-11-26 | MapEval: Towards Unified, Robust and Efficient SLAM Map Evaluation Framework | Xiangcheng Hu et.al. | 2411.17928 | link |
| 2024-11-29 | DROID-Splat: Combining end-to-end SLAM with 3D Gaussian Splatting | Christian Homeyer et.al. | 2411.17660 | link |
| 2024-11-25 | MAGiC-SLAM: Multi-Agent Gaussian Globally Consistent SLAM | Vladimir Yugay et.al. | 2411.16785 | null |
| 2024-11-24 | Gaussian Scenes: Pose-Free Sparse-View Scene Reconstruction using Depth-Enhanced Diffusion Priors | Soumava Paul et.al. | 2411.15966 | null |
| 2024-11-24 | Near-Range Environmental Perception for Inland Waterway Vessels: A Comparative Study of LiDAR and Automotive FMCW RADAR Sensors | R. Herrmann et.al. | 2411.15901 | null |
| 2024-11-24 | PG-SLAM: Photo-realistic and Geometry-aware RGB-D SLAM in Dynamic Environments | Haoang Li et.al. | 2411.15800 | null |
| 2024-11-23 | Gassidy: Gaussian Splatting SLAM in Dynamic Environments | Long Wen et.al. | 2411.15476 | null |
| 2024-11-22 | OVO-SLAM: Open-Vocabulary Online Simultaneous Localization and Mapping | Tomas Berriel Martins et.al. | 2411.15043 | link |
| 2024-11-22 | A Benchmark Dataset for Collaborative SLAM in Service Environments | Harin Park et.al. | 2411.14775 | link |
| 2024-11-21 | InCrowd-VI: A Realistic Visual-Inertial Dataset for Evaluating SLAM in Indoor Pedestrian-Rich Spaces for Human Navigation | Marziyeh Bamdad et.al. | 2411.14358 | link |
| 2024-11-20 | Robust Monocular Visual Odometry using Curriculum Learning | Assaf Lahiany et.al. | 2411.13438 | null |
| 2024-11-20 | Moving Horizon Estimation for Simultaneous Localization and Mapping with Robust Estimation Error Bounds | Jelena Trisovic et.al. | 2411.13310 | null |
| 2024-11-19 | 3D Reconstruction by Looking: Instantaneous Blind Spot Detector for Indoor SLAM through Mixed Reality | Hanbeom Chang et.al. | 2411.12514 | null |
| 2024-11-19 | LiV-GS: LiDAR-Vision Integration for 3D Gaussian Splatting SLAM in Outdoor Environments | Renxiang Xiao et.al. | 2411.12185 | null |
| 2024-11-18 | Exploring Emerging Trends and Research Opportunities in Visual Place Recognition | Antonios Gasteratos et.al. | 2411.11481 | null |
| 2024-11-18 | The Blue Horizontal-Branch Stars From the LAMOST Survey: Atmospheric Parameters | Jie Ju et.al. | 2411.11250 | null |
| 2024-11-17 | A Monocular SLAM-based Multi-User Positioning System with Image Occlusion in Augmented Reality | Wei-Hsiang Lien et.al. | 2411.10940 | null |
| 2024-11-16 | DGS-SLAM: Gaussian Splatting SLAM in Dynamic Environment | Mangyu Kong et.al. | 2411.10722 | link |
| 2024-11-15 | The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual Localisation, Reconstruction and Radiance Field Methods | Yifu Tao et.al. | 2411.10546 | null |
| 2024-11-15 | BEV-ODOM: Reducing Scale Drift in Monocular Visual Odometry with BEV Representation | Yufei Wei et.al. | 2411.10195 | null |
| 2024-11-13 | DG-SLAM: Robust Dynamic Gaussian Splatting SLAM with Hybrid Pose Optimization | Yueming Xu et.al. | 2411.08373 | null |
| 2024-11-13 | MBA-SLAM: Motion Blur Aware Dense Visual SLAM with Radiance Fields Representation | Peng Wang et.al. | 2411.08279 | link |
| 2024-11-12 | Enhanced Monocular Visual Odometry with AR Poses and Integrated INS-GPS for Robust Localization in Urban Environments | Ankit Shaw et.al. | 2411.08231 | null |
| 2024-11-12 | NL-SLAM for OC-VLN: Natural Language Grounded SLAM for Object-Centric VLN | Sonia Raychaudhuri et.al. | 2411.07848 | null |
| 2024-11-11 | Lost in Tracking Translation: A Comprehensive Analysis of Visual SLAM in Human-Centered XR and IoT Ecosystems | Yasra Chandio et.al. | 2411.07146 | null |
| 2024-11-11 | Learning from Feedback: Semantic Enhancement for Object SLAM Using Foundation Models | Jungseok Hong et.al. | 2411.06752 | null |
| 2024-11-11 | HomoMatcher: Dense Feature Matching Results with Semi-Dense Efficiency by Homography Estimation | Xiaolong Wang et.al. | 2411.06700 | null |
| 2024-11-08 | Development of an indoor localization and navigation system based on monocular SLAM for mobile robots | Thanh Nguyen Canh et.al. | 2411.05337 | null |
| 2024-11-07 | Development of a Service Robot for Hospital Environments in Rehabilitation Medicine with LiDAR Based Simultaneous Localization and Mapping | Sayat Ibrayev et.al. | 2411.04797 | null |
| 2024-11-07 | MPVO: Motion-Prior based Visual Odometry for PointGoal Navigation | Sayan Paul et.al. | 2411.04796 | null |
| 2024-11-09 | DEIO: Deep Event Inertial Odometry | Weipeng Guan et.al. | 2411.03928 | link |
| 2024-11-06 | Performance evaluation of SLAM-ASR: The Good, the Bad, the Ugly, and the Way Forward | Shashi Kumar et.al. | 2411.03866 | null |
| 2024-11-06 | LCP-Fusion: A Neural Implicit SLAM with Enhanced Local Constraints and Computable Prior | Jiahui Wang et.al. | 2411.03610 | link |
| 2024-11-05 | LVI-GS: Tightly-coupled LiDAR-Visual-Inertial SLAM using 3D Gaussian Splatting | Huibin Zhao et.al. | 2411.02703 | null |
| 2024-11-04 | Map++: Towards User-Participatory Visual SLAM Systems with Efficient Map Expansion and Sharing | Xinran Zhang et.al. | 2411.02553 | null |
| 2024-11-04 | Semantic Masking and Visual Feature Matching for Robust Localization | Luisa Mao et.al. | 2411.01804 | null |
| 2024-10-31 | XRDSLAM: A Flexible and Modular Framework for Deep Learning based SLAM | Xiaomeng Wang et.al. | 2410.23690 | link |
| 2024-10-30 | LGU-SLAM: Learnable Gaussian Uncertainty Matching with Deformable Correlation Sampling for Deep Visual SLAM | Yucheng Huang et.al. | 2410.23231 | link |
| 2024-10-30 | ISAC Prototype System for Multi-Domain Cooperative Communication Networks | Jie Yang et.al. | 2410.22956 | null |
| 2024-10-30 | SCRREAM : SCan, Register, REnder And Map:A Framework for Annotating Accurate and Dense 3D Indoor Scenes with a Benchmark | HyunJun Jung et.al. | 2410.22715 | link |
| 2024-10-29 | LiVisSfM: Accurate and Robust Structure-from-Motion with LiDAR and Visual Cues | Hanqing Jiang et.al. | 2410.22213 | null |
| 2024-10-29 | EnvoDat: A Large-Scale Multisensory Dataset for Robotic Spatial Awareness and Semantic Reasoning in Heterogeneous Environments | Linus Nwankwo et.al. | 2410.22200 | null |
| 2024-10-28 | NYC-Event-VPR: A Large-Scale High-Resolution Event-Based Visual Place Recognition Dataset in Dense Urban Environments | Taiyi Pan et.al. | 2410.21615 | link |
| 2024-10-28 | coVoxSLAM: GPU Accelerated Globally Consistent Dense SLAM | Emiliano Höss et.al. | 2410.21149 | link |
| 2024-11-01 | RopeTP: Global Human Motion Recovery via Integrating Robust Pose Estimation with Diffusion Trajectory Prior | Mingjiang Liang et.al. | 2410.20358 | null |
| 2024-10-25 | Context-Based Visual-Language Place Recognition | Soojin Woo et.al. | 2410.19341 | link |
| 2024-10-22 | AG-SLAM: Active Gaussian Splatting SLAM | Wen Jiang et.al. | 2410.17422 | null |
| 2024-10-22 | Impact of 3D LiDAR Resolution in Graph-based SLAM Approaches: A Comparative Study | J. Jorge et.al. | 2410.17171 | null |
| 2024-10-19 | EndoMetric: Near-light metric scale monocular SLAM | Raúl Iranzo et.al. | 2410.15065 | null |
| 2024-10-17 | Automatic Navigation and Voice Cloning Technology Deployment on a Humanoid Robot | Dongkun Han et.al. | 2410.13612 | null |
| 2024-10-17 | TRLO: An Efficient LiDAR Odometry with 3D Dynamic Object Tracking and Removal | Yanpeng Jia et.al. | 2410.13240 | null |
| 2024-10-16 | QueensCAMP: an RGB-D dataset for robust Visual SLAM | Hudson M. S. Bruno et.al. | 2410.12520 | link |
| 2024-10-18 | PAPL-SLAM: Principal Axis-Anchored Monocular Point-Line SLAM | Guanghao Li et.al. | 2410.12324 | null |
| 2024-10-16 | Towards Autonomous Indoor Parking: A Globally Consistent Semantic SLAM System and A Semantic Localization Subsystem | Yichen Sha et.al. | 2410.12169 | null |
| 2024-10-15 | V3D-SLAM: Robust RGB-D SLAM in Dynamic Environments with 3D Semantic Geometry Voting | Tuan Dang et.al. | 2410.12068 | link |
| 2024-10-15 | GSORB-SLAM: Gaussian Splatting SLAM benefits from ORB features and Transmittance information | Wancai Zheng et.al. | 2410.11356 | null |
| 2024-10-15 | Multiview Scene Graph | Juexiao Zhang et.al. | 2410.11187 | link |
| 2024-10-14 | MLP-SLAM: Multilayer Perceptron-Based Simultaneous Localization and Mapping With a Dynamic and Static Object Discriminator | Taozhe Li et.al. | 2410.10669 | null |
| 2024-10-13 | Markerless Aerial-Terrestrial Co-Registration of Forest Point Clouds using a Deformable Pose Graph | Benoit Casseau et.al. | 2410.09896 | null |
| 2024-10-12 | SLAM-AAC: Enhancing Audio Captioning with Paraphrasing Augmentation and CLAP-Refine through LLMs | Wenxi Chen et.al. | 2410.09503 | link |
| 2024-10-12 | An Expeditious Spatial Mean Radiant Temperature Mapping Framework using Visual SLAM and Semantic Segmentation | Wei Liang et.al. | 2410.09443 | null |
| 2024-10-12 | ESVO2: Direct Visual-Inertial Odometry with Stereo Event Cameras | Junkai Niu et.al. | 2410.09374 | link |
| 2024-10-11 | Voxel-SLAM: A Complete, Accurate, and Versatile LiDAR-Inertial SLAM System | Zheng Liu et.al. | 2410.08935 | link |
| 2024-10-11 | Optimizing NeRF-based SLAM with Trajectory Smoothness Constraints | Yicheng He et.al. | 2410.08780 | null |
| 2024-10-10 | ROMAN: Open-Set Object Map Alignment for Robust View-Invariant Global Localization | Mason B. Peterson et.al. | 2410.08262 | link |
| 2024-10-10 | IncEventGS: Pose-Free Gaussian Splatting from a Single Event Camera | Jian Huang et.al. | 2410.08107 | link |
| 2024-10-08 | Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching | Gongxin Yao et.al. | 2410.06285 | null |
| 2024-10-08 | Submodular Optimization for Keyframe Selection & Usage in SLAM | David Thorne et.al. | 2410.05576 | null |
| 2024-10-07 | SharpSLAM: 3D Object-Oriented Visual SLAM with Deblurring for Agile Drones | Denis Davletshin et.al. | 2410.05405 | null |
| 2024-10-07 | Enhanced Multi-Robot SLAM System with Cross-Validation Matching and Exponential Threshold Keyframe Selection | Ang He et.al. | 2410.05017 | null |
| 2024-10-05 | A Framework for Reproducible Benchmarking and Performance Diagnosis of SLAM Systems | Nikola Radulov et.al. | 2410.04242 | link |
| 2024-10-05 | High-Speed Stereo Visual SLAM for Low-Powered Computing Devices | Ashish Kumar et.al. | 2410.04090 | link |
| 2024-10-04 | EvenNICER-SLAM: Event-based Neural Implicit Encoding SLAM | Shi Chen et.al. | 2410.03812 | null |
| 2024-10-04 | Estimating Body and Hand Motion in an Ego-sensed World | Brent Yi et.al. | 2410.03665 | null |
| 2024-10-03 | LiDAR Inertial Odometry And Mapping Using Learned Registration-Relevant Features | Zihao Dong et.al. | 2410.02961 | null |
| 2024-10-02 | ReFeree: Radar-Based Lightweight and Robust Localization using Feature and Free space | Hogyun Kim et.al. | 2410.01325 | null |
| 2024-10-01 | Under Pressure: Altimeter-Aided ICP for 3D Maps Consistency | William Dubois et.al. | 2410.00758 | null |
| 2024-10-02 | CaRtGS: Computational Alignment for Real-Time Gaussian Splatting SLAM | Dapeng Feng et.al. | 2410.00486 | link |
| 2024-09-30 | Additively Manufactured Open-Source Quadruped Robots for Multi-Robot SLAM Applications | Zachary Fuge et.al. | 2410.00122 | null |
| 2024-09-30 | Direct Multipath-Based SLAM | Mingchao Liang et.al. | 2409.20552 | null |
| 2024-09-30 | Robust Gaussian Splatting SLAM by Leveraging Loop Closure | Zunjie Zhu et.al. | 2409.20111 | null |
| 2024-09-30 | DynORecon: Dynamic Object Reconstruction for Navigation | Yiduo Wang et.al. | 2409.19928 | null |
| 2024-09-29 | CELLmap: Enhancing LiDAR SLAM through Elastic and Lightweight Spherical Map Representation | Yifan Duan et.al. | 2409.19597 | null |
| 2024-09-29 | CoT-ST: Enhancing LLM-based Speech Translation with Multimodal Chain-of-Thought | Yexing Du et.al. | 2409.19510 | link |
| 2024-09-29 | Fast-UMI: A Scalable and Hardware-Independent Universal Manipulation Interface | Ziniu Wu et.al. | 2409.19499 | null |
| 2024-09-27 | Royal Reveals: LiDAR Mapping of Kronborg Castle, Echoes of Hamlet's Halls | Leon Davies et.al. | 2409.18752 | null |
| 2024-09-26 | BlinkTrack: Feature Tracking over 100 FPS via Events and Images | Yichen Shen et.al. | 2409.17981 | null |
| 2024-09-26 | Neural Implicit Representation for Highly Dynamic LiDAR Mapping and Odometry | Qi Zhang et.al. | 2409.17729 | null |
| 2024-09-26 | Event-based Stereo Depth Estimation: A Survey | Suman Ghosh et.al. | 2409.17680 | null |
| 2024-09-25 | Efficient Submap-based Autonomous MAV Exploration using Visual-Inertial SLAM Configurable for LiDARs or Depth Cameras | Sotiris Papatheodorou et.al. | 2409.16972 | null |
| 2024-09-25 | Go-SLAM: Grounded Object Segmentation and Localization with Gaussian Splatting SLAM | Phu Pham et.al. | 2409.16944 | null |
| 2024-09-25 | Inline Photometrically Calibrated Hybrid Visual SLAM | Nicolas Abboud et.al. | 2409.16810 | link |
| 2024-09-25 | Topological SLAM in colonoscopies leveraging deep features and topological priors | Javier Morlana et.al. | 2409.16806 | link |
| 2024-09-25 | Robo-Platform: A Robotic System for Recording Sensors and Controlling Robots | Masoud Dayani Najafabadi et.al. | 2409.16595 | link |
| 2024-09-25 | Task-driven SLAM Benchmarking | Yanwei Du et.al. | 2409.16573 | link |
| 2024-09-24 | SoMaSLAM: 2D Graph SLAM for Sparse Range Sensing with Soft Manhattan World Constraints | Jeahn Han et.al. | 2409.15736 | null |
| 2024-09-23 | Spectral Graph Theoretic Methods for Enhancing Network Robustness in Robot Localization | Neelkamal Somisetty et.al. | 2409.15506 | null |
| 2024-09-22 | SPAQ-DL-SLAM: Towards Optimizing Deep Learning-based SLAM for Resource-Constrained Embedded Platforms | Niraj Pudasaini et.al. | 2409.14515 | null |
| 2024-09-21 | Point Cloud Structural Similarity-based Underwater Sonar Loop Detection | Donghwi Jung et.al. | 2409.14020 | link |
| 2024-09-20 | HMD |
Vladimir Guzov et.al. | 2409.13426 | null |
| 2024-09-20 | Learning Visual Information Utility with PIXER | Yash Turkar et.al. | 2409.13151 | null |
| 2024-09-19 | MGSO: Monocular Real-time Photometric SLAM with Efficient 3D Gaussian Splatting | Yan Song Hu et.al. | 2409.13055 | null |
| 2024-09-19 | Hi-SLAM: Scaling-up Semantics in SLAM with a Hierarchically Categorical Gaussian Splatting | Boying Li et.al. | 2409.12518 | link |
| 2024-09-18 | Bundle Adjustment in the Eager Mode | Zitong Zhan et.al. | 2409.12190 | null |
| 2024-09-23 | Uncertainty-Aware Visual-Inertial SLAM with Volumetric Occupancy Mapping | Jaehyung Jung et.al. | 2409.12051 | null |
| 2024-09-18 | Metric-Semantic Factor Graph Generation based on Graph Neural Networks | Jose Andres Millan-Romera et.al. | 2409.11972 | null |
| 2024-09-18 | Physically-Based Photometric Bundle Adjustment in Non-Lambertian Environments | Lei Cheng et.al. | 2409.11854 | null |
| 2024-09-18 | ORB-SfMLearner: ORB-Guided Self-supervised Visual Odometry with Selective Online Adaptation | Yanlin Jin et.al. | 2409.11692 | null |
| 2024-09-18 | SLAM assisted 3D tracking system for laparoscopic surgery | Jingwei Song et.al. | 2409.11688 | null |
| 2024-09-17 | GLC-SLAM: Gaussian Splatting SLAM with Efficient Loop Closure | Ziheng Xu et.al. | 2409.10982 | null |
| 2024-09-17 | Label-free correlative morpho-chemical tomography of 3D kidney mesangial cells | Ankit Butola et.al. | 2409.10971 | null |
| 2024-09-17 | Evaluating and Improving the Robustness of LiDAR-based Localization and Mapping | Bo Yang et.al. | 2409.10824 | link |
| 2024-09-16 | P2U-SLAM: A Monocular Wide-FoV SLAM System Based on Point Uncertainty and Pose Uncertainty | Yufan Zhang et.al. | 2409.10143 | link |
| 2024-09-16 | SHIRE: Enhancing Sample Efficiency using Human Intuition in REinforcement Learning | Amogh Joshi et.al. | 2409.09990 | null |
| 2024-09-16 | Enhancing Visual Inertial SLAM with Magnetic Measurements | Bharat Joshi et.al. | 2409.09904 | null |
| 2024-09-15 | Marginalizing and Conditioning Gaussians onto Linear Approximations of Smooth Manifolds with Applications in Robotics | Zi Cong Guo et.al. | 2409.09871 | link |
| 2024-09-15 | Range-SLAM: Ultra-Wideband-Based Smoke-Resistant Real-Time Localization and Mapping | Yi Liu et.al. | 2409.09763 | null |
| 2024-09-15 | High Definition Map Mapping and Update: A General Overview and Future Directions | Benny Wijaya et.al. | 2409.09726 | null |
| 2024-09-14 | MAC-VO: Metrics-aware Covariance for Learning-based Stereo Visual Odometry | Yuheng Qiu et.al. | 2409.09479 | null |
| 2024-09-14 | Distributed Invariant Kalman Filter for Object-level Multi-robot Pose SLAM | Haoying Li et.al. | 2409.09410 | null |
| 2024-09-14 | GEVO: Memory-Efficient Monocular Visual Odometry Using Gaussians | Dasong Gao et.al. | 2409.09295 | link |
| 2024-09-14 | Panoramic Direct LiDAR-assisted Visual Odometry | Zikang Yuan et.al. | 2409.09287 | link |
| 2024-09-11 | Object Depth and Size Estimation using Stereo-vision and Integration with SLAM | Layth Hamad et.al. | 2409.07623 | null |
| 2024-09-11 | Equivariant Filter for Tightly Coupled LiDAR-Inertial Odometry | Anbo Tao et.al. | 2409.06948 | null |
| 2024-09-10 | Technical Report of Mobile Manipulator Robot for Industrial Environments | Erfan Amoozad Khalili et.al. | 2409.06693 | null |
| 2024-09-10 | Heterogeneous LiDAR Dataset for Benchmarking Robust Localization in Diverse Degenerate Scenarios | Zhiqiang Chen et.al. | 2409.04961 | link |
| 2024-09-08 | FLAF: Focal Line and Feature-constrained Active View Planning for Visual Teach and Repeat | Changfei Fu et.al. | 2409.03457 | null |
| 2024-09-03 | Integration of Augmented Reality and Mobile Robot Indoor SLAM for Enhanced Spatial Awareness | Michael D. Friske et.al. | 2409.01915 | null |
| 2024-09-03 | Explicit Second-order LiDAR Bundle Adjustment Algorithm Using Mean Squared Group Metric | Tingchen Ma et.al. | 2409.01856 | null |
| 2024-09-02 | Saying goodbyes to rotating your phone: Magnetometer calibration during SLAM | Ilari Vallivaara et.al. | 2409.01242 | null |
| 2024-09-02 | Online One-Dimensional Magnetic Field SLAM with Loop-Closure Detection | Manon Kok et.al. | 2409.01091 | null |
| 2024-09-02 | Robust Vehicle Localization and Tracking in Rain using Street Maps | Yu Xiang Tan et.al. | 2409.01038 | link |
| 2024-08-31 | UDGS-SLAM : UniDepth Assisted Gaussian Splatting for Monocular SLAM | Mostafa Mansour et.al. | 2409.00362 | null |
| 2024-09-04 | Augmented Reality without Borders: Achieving Precise Localization Without Maps | Albert Gassol Puigjaner et.al. | 2408.17373 | null |
| 2024-08-30 | Efficient Camera Exposure Control for Visual Odometry via Deep Reinforcement Learning | Shuyang Zhang et.al. | 2408.17005 | link |
| 2024-08-29 | Creating a Segmented Pointcloud of Grapevines by Combining Multiple Viewpoints Through Visual Odometry | Michael Adlerstein et.al. | 2408.16472 | null |
| 2024-08-28 | Single-Photon 3D Imaging with Equi-Depth Photon Histograms | Kaustubh Sadekar et.al. | 2408.16150 | null |
| 2024-08-28 | BIM-SLAM: Integrating BIM Models in Multi-session SLAM for Lifelong Mapping using 3D LiDAR | Miguel Arturo Vega Torres et.al. | 2408.15870 | link |
| 2024-08-30 | Addressing the challenges of loop detection in agricultural environments | Nicolás Soncini et.al. | 2408.15761 | link |
| 2024-08-28 | ES-PTAM: Event-based Stereo Parallel Tracking and Mapping | Suman Ghosh et.al. | 2408.15605 | link |
| 2024-08-28 | PointEMRay: A Novel Efficient SBR Framework on Point Based Geometry | Kaiqiao Yang et.al. | 2408.15583 | null |
| 2024-09-02 | Active Semantic Mapping and Pose Graph Spectral Analysis for Robot Exploration | Rongge Zhang et.al. | 2408.14726 | link |
| 2024-08-26 | A Survey on Reinforcement Learning Applications in SLAM | Mohammad Dehghani Tezerjani et.al. | 2408.14518 | null |
| 2024-08-28 | FAST-LIVO2: Fast, Direct LiDAR-Inertial-Visual Odometry | Chunran Zheng et.al. | 2408.14035 | link |
| 2024-08-21 | Informed, Constrained, Aligned: A Field Analysis on Degeneracy-aware Point Cloud Registration in the Wild | Turcan Tuna et.al. | 2408.11809 | null |
| 2024-08-21 | LiFCal: Online Light Field Camera Calibration via Bundle Adjustment | Aymeric Fleith et.al. | 2408.11682 | null |
| 2024-08-21 | Enhanced Visual SLAM for Collision-free Driving with Lightweight Autonomous Cars | Zhihao Lin et.al. | 2408.11582 | null |
| 2024-08-21 | RaNDT SLAM: Radar SLAM Based on Intensity-Augmented Normal Distributions Transform | Maximilian Hilger et.al. | 2408.11576 | link |
| 2024-08-21 | Reflex-Based Open-Vocabulary Navigation without Prior Knowledge Using Omnidirectional Camera and Multiple Vision-Language Models | Kento Kawaharazuka et.al. | 2408.11380 | null |
| 2024-08-20 | LoopSplat: Loop Closure by Registering 3D Gaussian Splats | Liyuan Zhu et.al. | 2408.10154 | link |
| 2024-08-19 | Quantitative 3D Map Accuracy Evaluation Hardware and Algorithm for LiDAR(-Inertial) SLAM | Sanghyun Hahn et.al. | 2408.09727 | link |
| 2024-08-17 | GSLAMOT: A Tracklet and Query Graph-based Simultaneous Locating, Mapping, and Multiple Object Tracking System | Shuo Wang et.al. | 2408.09191 | null |
| 2024-08-15 | GOReloc: Graph-based Object-Level Relocalization for Visual SLAM | Yutong Wang et.al. | 2408.07917 | link |
| 2024-08-14 | Inverse k-visibility for RSSI-based Indoor Geometric Mapping | Junseo Kim et.al. | 2408.07757 | null |
| 2024-08-14 | Narrowing your FOV with SOLiD: Spatially Organized and Lightweight Global Descriptor for FOV-constrained LiDAR Place Recognition | Hogyun Kim et.al. | 2408.07330 | link |
| 2024-08-12 | CAD-Mesher: A Convenient, Accurate, Dense Mesh-based Mapping Module in SLAM for Dynamic Environments | Yanpeng Jia et.al. | 2408.05981 | null |
| 2024-08-21 | Visual SLAM with 3D Gaussian Primitives and Depth Priors Enabling Novel View Synthesis | Zhongche Qu et.al. | 2408.05635 | null |
| 2024-08-10 | TOSS: Real-time Tracking and Moving Object Segmentation for Static Scene Mapping | Seoyeon Jang et.al. | 2408.05453 | null |
| 2024-08-08 | Evaluating Modern Approaches in 3D Scene Reconstruction: NeRF vs Gaussian-Based Methods | Yiming Zhou et.al. | 2408.04268 | null |
| 2024-08-07 | Towards Real-Time Gaussian Splatting: Accelerating 3DGS through Photometric SLAM | Yan Song Hu et.al. | 2408.03825 | null |
| 2024-08-07 | AirSLAM: An Efficient and Illumination-Robust Point-Line Visual SLAM System | Kuan Xu et.al. | 2408.03520 | link |
| 2024-08-06 | BodySLAM: A Generalized Monocular Visual SLAM Framework for Surgical Applications | G. Manni et.al. | 2408.03078 | link |
| 2024-08-04 | SLAMS-Propelled Electron Acceleration at High-Mach Number Astrophysical Shocks | Vladimir Zeković et.al. | 2408.02084 | null |
| 2024-08-03 | Visual-Inertial SLAM for Agricultural Robotics: Benchmarking the Benefits and Computational Costs of Loop Closing | Fabian Schmidt et.al. | 2408.01716 | link |
| 2024-08-03 | Deep Patch Visual SLAM | Lahav Lipson et.al. | 2408.01654 | link |
| 2024-08-02 | Momentum Capture and Prediction System Based on Wimbledon Open2023 Tournament Data | Chang Liu et.al. | 2408.01544 | null |
| 2024-08-07 | IG-SLAM: Instant Gaussian SLAM | F. Aykut Sarikamis et.al. | 2408.01126 | null |
| 2024-08-01 | Collecting Larg-Scale Robotic Datasets on a High-Speed Mobile Platform | Yuxin Lin et.al. | 2408.00545 | null |
| 2024-08-01 | High-Quality, ROS Compatible Video Encoding and Decoding for High-Definition Datasets | Jian Li et.al. | 2408.00538 | link |
| 2024-07-31 | SuperVINS: A visual-inertial SLAM framework integrated deep learning features | Hongkun Luo et.al. | 2407.21348 | link |
| 2024-07-30 | NIS-SLAM: Neural Implicit Semantic RGB-D SLAM for 3D Consistent Scene Understanding | Hongjia Zhai et.al. | 2407.20853 | null |
| 2024-07-29 | A flexible framework for accurate LiDAR odometry, map manipulation, and localization | José Luis Blanco-Claraco et.al. | 2407.20465 | link |
| 2024-07-28 | Solving Short-Term Relocalization Problems In Monocular Keyframe Visual SLAM Using Spatial And Semantic Data | Azmyin Md. Kamal et.al. | 2407.19518 | null |
| 2024-07-26 | Real-time Uncertainty-Aware Motion Planning for Magnetic-based Navigation | Aditya Penumarti et.al. | 2407.19046 | null |
| 2024-07-26 | HERO-SLAM: Hybrid Enhanced Robust Optimization of Neural SLAM | Zhe Xin et.al. | 2407.18813 | null |
| 2024-07-25 | CodedVO: Coded Visual Odometry | Sachin Shah et.al. | 2407.18240 | null |
| 2024-07-28 | HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation | Zhenzhi Wang et.al. | 2407.17438 | link |
| 2024-07-22 | Memory Management for Real-Time Appearance-Based Loop Closure Detection | Mathieu Labbé et.al. | 2407.15890 | null |
| 2024-07-22 | Reinforcement Learning Meets Visual Odometry | Nico Messikommer et.al. | 2407.15626 | link |
| 2024-07-22 | Online Global Loop Closure Detection for Large-Scale Multi-Session Graph-Based SLAM | Mathieu Labbe et.al. | 2407.15305 | null |
| 2024-07-21 | Semi-Supervised Pipe Video Temporal Defect Interval Localization | Zhu Huang et.al. | 2407.15170 | null |
| 2024-07-21 | VoxDepth: Rectification of Depth Images on Edge Devices | Yashashwee Chakrabarty et.al. | 2407.15067 | null |
| 2024-07-20 | From Underground Mines to Offices: A Versatile and Robust Framework for Range-Inertial SLAM | Lorenzo Montano-Oliván et.al. | 2407.14797 | null |
| 2024-07-19 | MSSP : A Versatile Multi-Scenario Adaptable Intelligent Robot Simulation Platform Based on LIDAR-Inertial Fusion | Qiyan Li et.al. | 2407.14102 | null |
| 2024-07-18 | A New Tightly-Coupled Dual-VIO for a Mobile Manipulator With Dynamic Locomotion | Jianxiang Xu et.al. | 2407.13878 | link |
| 2024-07-18 | Learn to Memorize and to Forget: A Continual Learning Perspective of Dynamic SLAM | Baicheng Li et.al. | 2407.13338 | null |
| 2024-07-18 | Attenuation-Aware Weighted Optical Flow with Medium Transmission Map for Learning-based Visual Odometry in Underwater terrain | Bach Nguyen Gia et.al. | 2407.13159 | link |
| 2024-07-17 | Is That Rain? Understanding Effects on Visual Odometry Performance for Autonomous UAVs and Efficient DNN-based Rain Classification at the Edge | Andrea Albanese et.al. | 2407.12663 | null |
| 2024-07-17 | Towards Revisiting Visual Place Recognition for Joining Submaps in Multimap SLAM | Markus Weißflog et.al. | 2407.12408 | null |
| 2024-07-19 | Fisheye-Calib-Adapter: An Easy Tool for Fisheye Camera Model Conversion | Sangjun Lee et.al. | 2407.12405 | link |
| 2024-07-17 | Fusion LiDAR-Inertial-Encoder data for High-Accuracy SLAM | Manh Do Duc et.al. | 2407.11870 | null |
| 2024-07-17 | GV-Bench: Benchmarking Local Feature Matching for Geometric Verification of Long-term Loop Closure Detection | Jingwen Yu et.al. | 2407.11736 | link |
| 2024-07-16 | Snail-Radar: A large-scale diverse dataset for the evaluation of 4D-radar-based SLAM systems | Jianzhu Huai et.al. | 2407.11705 | null |
| 2024-07-16 | Batch SLAM with PMBM Data Association Sampling and Graph-Based Optimization | Yu Ge et.al. | 2407.11643 | null |
| 2024-07-16 | I |
Gwangtak Bae et.al. | 2407.11347 | null |
| 2024-07-16 | FR-SLAM: A SLAM Improvement Method Based on Floor Plan Registration | Jiantao Feng et.al. | 2407.11299 | null |
| 2024-07-15 | Evaluating geometric accuracy of NeRF reconstructions compared to SLAM method | Adam Korycki et.al. | 2407.11238 | null |
| 2024-07-12 | An Adaptive Indoor Localization Approach Using WiFi RSSI Fingerprinting with SLAM-Enabled Robotic Platform and Deep Neural Networks | Seyed Alireza Rahimi Azghadi et.al. | 2407.09242 | null |
| 2024-07-11 | SGLC: Semantic Graph-Guided Coarse-Fine-Refine Full Loop Closing for LiDAR SLAM | Neng Wang et.al. | 2407.08106 | link |
| 2024-07-09 | Hyperion -- A fast, versatile symbolic Gaussian Belief Propagation framework for Continuous-Time SLAM | David Hug et.al. | 2407.07074 | link |
| 2024-07-15 | A Neurosymbolic Approach to Adaptive Feature Extraction in SLAM | Yasra Chandio et.al. | 2407.06889 | null |
| 2024-07-08 | Object-Oriented Material Classification and 3D Clustering for Improved Semantic Perception and Mapping in Mobile Robots | Siva Krishna Ravipati et.al. | 2407.06077 | link |
| 2024-07-10 | Co-RaL: Complementary Radar-Leg Odometry with 4-DoF Optimization and Rolling Contact | Sangwoo Jung et.al. | 2407.05820 | null |
| 2024-07-07 | Active Collaborative Visual SLAM exploiting ORB Features | Muhammad Farhan Ahmed et.al. | 2407.05453 | null |
| 2024-07-06 | VIPS-Odom: Visual-Inertial Odometry Tightly-coupled with Parking Slots for Autonomous Parking | Xuefeng Jiang et.al. | 2407.05017 | null |
| 2024-07-06 | Symmetric Linear Arc Monadic Datalog and Gadget Reductions | Manuel Bodirsky et.al. | 2407.04924 | null |
| 2024-07-03 | Ultra-Lightweight Collaborative Mapping for Robot Swarms | Vlad Niculescu et.al. | 2407.03136 | null |
| 2024-07-01 | RoDyn-SLAM: Robust Dynamic Dense RGB-D SLAM with Neural Radiance Fields | Haochen Jiang et.al. | 2407.01303 | link |
| 2024-07-01 | Preserving Relative Localization of FoV-Limited Drone Swarm via Active Mutual Observation | Lianjie Guo et.al. | 2407.01292 | link |
| 2024-07-01 | Collaborative Graph Exploration with Reduced Pose-SLAM Uncertainty via Submodular Optimization | Ruofei Bai et.al. | 2407.01013 | link |
| 2024-06-30 | Ego-to-Exo: Interfacing Third Person Visuals from Egocentric Views in Real-time for Improved ROV Teleoperation | Adnan Abdullah et.al. | 2407.00848 | null |
| 2024-06-30 | OfCaM: Global Human Mesh Recovery via Optimization-free Camera Motion Scale Calibration | Fengyuan Yang et.al. | 2407.00574 | null |
| 2024-06-24 | Compressing Search with Language Models | Thomas Mulc et.al. | 2407.00085 | null |
| 2024-06-28 | CLOi-Mapper: Consistent, Lightweight, Robust, and Incremental Mapper With Embedded Systems for Commercial Robot Services | DongKi Noh et.al. | 2406.19634 | null |
| 2024-06-25 | Benchmarking SLAM Algorithms in the Cloud: The SLAM Hive System | Xinzhe Liu et.al. | 2406.17586 | null |
| 2024-07-02 | SlideSLAM: Sparse, Lightweight, Decentralized Metric-Semantic SLAM for Multi-Robot Navigation | Xu Liu et.al. | 2406.17249 | link |
| 2024-06-24 | From Perfect to Noisy World Simulation: Customizable Embodied Multi-modal Perturbations for SLAM Robustness Benchmarking | Xiaohao Xu et.al. | 2406.16850 | link |
| 2024-06-23 | Imperative Learning: A Self-supervised Neural-Symbolic Learning Framework for Robot Autonomy | Chen Wang et.al. | 2406.16087 | null |
| 2024-06-19 | Simultaneous Map and Object Reconstruction | Nathaniel Chodosh et.al. | 2406.13896 | null |
| 2024-06-14 | Galibr: Targetless LiDAR-Camera Extrinsic Calibration Method via Ground Plane Initialization | Wonho Song et.al. | 2406.11599 | null |
| 2024-06-16 | Self-supervised Pretraining and Finetuning for Monocular Depth and Visual Odometry | Boris Chidlovskii et.al. | 2406.11019 | null |
| 2024-06-15 | Detection and Utilization of Reflections in LiDAR Scans Through Plane Optimization and Plane SLAM | Yinjie Li et.al. | 2406.10494 | link |
| 2024-06-12 | From Variance to Veracity: Unbundling and Mitigating Gradient Variance in Differentiable Bundle Adjustment Layers | Swaminathan Gurumurthy et.al. | 2406.07785 | link |
| 2024-06-27 | Notes on Kalman Filter (KF, EKF, ESKF, IEKF, IESKF) | Gyubeom Im et.al. | 2406.06427 | null |
| 2024-06-10 | Notes on Various Errors and Jacobian Derivations for SLAM | Gyubeom Im et.al. | 2406.06422 | null |
| 2024-06-23 | Multicam-SLAM: Non-overlapping Multi-camera SLAM for Indirect Visual Localization and Navigation | Shenghao Li et.al. | 2406.06374 | link |
| 2024-06-15 | Visual-Inertial SLAM as Simple as A, B, VINS | Nathaniel Merrill et.al. | 2406.05969 | null |
| 2024-06-09 | MAP-ADAPT: Real-Time Quality-Adaptive Semantic 3D Maps | Jianhao Zheng et.al. | 2406.05849 | null |
| 2024-06-06 | Open Problem: Active Representation Learning | Nikola Milosevic et.al. | 2406.03845 | null |
| 2024-06-04 | ProGEO: Generating Prompts through Image-Text Contrastive Learning for Visual Geo-localization | Chen Mao et.al. | 2406.01906 | link |
| 2024-06-03 | The Empirical Impact of Forgetting and Transfer in Continual Visual Odometry | Paolo Cudrano et.al. | 2406.01797 | null |
| 2024-06-03 | Self-Supervised Geometry-Guided Initialization for Robust Monocular Visual Odometry | Takayuki Kanai et.al. | 2406.00929 | null |
| 2024-06-02 | Visual place recognition for aerial imagery: A survey | Ivan Moskalenko et.al. | 2406.00885 | link |
| 2024-05-30 | Structure Gaussian SLAM with Manhattan World Hypothesis | Shuhong Liu et.al. | 2405.20031 | null |
| 2024-05-30 | Semantic Landmark Detection & Classification Using Neural Networks For 3D In-Air Sonar | Wouter Jansen et.al. | 2405.19869 | null |
| 2024-05-30 | SLAM-based Joint Calibration of Multiple Asynchronous Microphone Arrays and Sound Source Localization | Jiang Wang et.al. | 2405.19813 | link |
| 2024-05-30 | TAMBRIDGE: Bridging Frame-Centered Tracking and 3D Gaussian Splatting for Enhanced SLAM | Peifeng Jiang et.al. | 2405.19614 | null |
| 2024-05-27 | CudaSIFT-SLAM: multiple-map visual SLAM for full procedure mapping in real human endoscopy | Richard Elvira et.al. | 2405.16932 | null |
| 2024-05-26 | Splat-SLAM: Globally Optimized RGB-only SLAM with 3D Gaussians | Erik Sandström et.al. | 2405.16544 | link |
| 2024-05-24 | NeB-SLAM: Neural Blocks-based Salable RGB-D SLAM for Unknown Scenes | Lizhi Bai et.al. | 2405.15151 | null |
| 2024-05-23 | ETA-INIT: Enhancing the Translation Accuracy for Stereo Visual-Inertial SLAM Initialization | Han Song et.al. | 2405.15082 | null |
| 2024-05-23 | Synergistic Global-space Camera and Human Reconstruction from Videos | Yizhou Zhao et.al. | 2405.14855 | null |
| 2024-05-23 | CoPeD-Advancing Multi-Robot Collaborative Perception: A Comprehensive Dataset in Real-World Environments | Yang Zhou et.al. | 2405.14731 | link |
| 2024-05-23 | Efficient Robot Learning for Perception and Mapping | Niclas Vödisch et.al. | 2405.14688 | null |
| 2024-05-22 | Monocular Gaussian SLAM with Language Extended Loop Closure | Tian Lan et.al. | 2405.13748 | null |
| 2024-05-26 | NV-LIO: LiDAR-Inertial Odometry using Normal Vectors Towards Robust SLAM in Multifloor Environments | Dongha Chung et.al. | 2405.12563 | link |
| 2024-05-20 | EdgeLoc: A Communication-Adaptive Parallel System for Real-Time Localization in Infrastructure-Assisted Autonomous Driving | Boyi Liu et.al. | 2405.12120 | null |
| 2024-05-24 | Outlier-Robust Long-Term Robotic Mapping Leveraging Ground Segmentation | Hyungtae Lim et.al. | 2405.11176 | null |
| 2024-05-18 | MotionGS : Compact Gaussian Splatting SLAM by Motion Filter | Xinli Guo et.al. | 2405.11129 | link |
| 2024-05-17 | CCTNet: A Circular Convolutional Transformer Network for LiDAR-based Place Recognition Handling Movable Objects Occlusion | Gang Wang et.al. | 2405.10793 | null |
| 2024-05-17 | Occupancy-SLAM: Simultaneously Optimizing Robot Poses and Continuous Occupancy Map | Liang Zhao et.al. | 2405.10743 | null |
| 2024-05-10 | MGS-SLAM: Monocular Sparse Tracking and Gaussian Mapping with Depth Smooth Regularization | Pengcheng Zhu et.al. | 2405.06241 | null |
| 2024-05-07 | Bayesian Simultaneous Localization and Multi-Lane Tracking Using Onboard Sensors and a SD Map | Yuxuan Xia et.al. | 2405.04290 | null |
| 2024-05-07 | IMU-Aided Event-based Stereo Visual Odometry | Junkai Niu et.al. | 2405.04071 | link |
| 2024-04-27 | An Attention-Based Deep Learning Architecture for Real-Time Monocular Visual Odometry: Applications to GPS-free Drone Navigation | Olivier Brochu Dufour et.al. | 2404.17745 | null |
| 2024-04-26 | Camera Motion Estimation from RGB-D-Inertial Scene Flow | Samuel Cerezo et.al. | 2404.17251 | link |
| 2024-04-23 | Multi-Session SLAM with Differentiable Wide-Baseline Pose Optimization | Lahav Lipson et.al. | 2404.15263 | link |
| 2024-04-18 | SPOT: Point Cloud Based Stereo Visual Place Recognition for Similar and Opposing Viewpoints | Spencer Carmichael et.al. | 2404.12339 | null |
| 2024-04-17 | VBR: A Vision Benchmark in Rome | Leonardo Brizi et.al. | 2404.11322 | link |
| 2024-04-14 | Increasing SLAM Pose Accuracy by Ground-to-Satellite Image Registration | Yanhao Zhang et.al. | 2404.09169 | link |
| 2024-04-06 | Salient Sparse Visual Odometry With Pose-Only Supervision | Siyu Chen et.al. | 2404.04677 | null |
| 2024-03-25 | A Comparative Analysis of Visual Odometry in Virtual and Real-World Railways Environments | Gianluca D'Amico et.al. | 2403.17084 | null |
| 2024-03-19 | On Designing Consistent Covariance Recovery from a Deep Learning Visual Odometry Engine | Jagatpreet Singh Nir et.al. | 2403.13170 | null |
| 2024-03-18 | The POLAR Traverse Dataset: A Dataset of Stereo Camera Images Simulating Traverses across Lunar Polar Terrain under Extreme Lighting Conditions | Margaret Hansen et.al. | 2403.12194 | null |
| 2024-03-18 | An Accurate and Real-time Relative Pose Estimation from Triple Point-line Images by Decoupling Rotation and Translation | Zewen Xu et.al. | 2403.11639 | null |
| 2024-03-16 | Efficient Domain Adaptation for Endoscopic Visual Odometry | Junyang Wu et.al. | 2403.10860 | null |
| 2024-03-14 | Visual Inertial Odometry using Focal Plane Binary Features (BIT-VIO) | Matthew Lisondra et.al. | 2403.09882 | null |
| 2024-03-02 | Grid-based Fast and Structural Visual Odometry | Zhang Zhihe et.al. | 2403.01110 | null |
| 2024-02-25 | VOLoc: Visual Place Recognition by Querying Compressed Lidar Map | Xudong Cai et.al. | 2402.15961 | link |
| 2024-02-22 | Secure Navigation using Landmark-based Localization in a GPS-denied Environment | Ganesh Sapkota et.al. | 2402.14280 | null |
| 2024-02-19 | Landmark-based Localization using Stereo Vision and Deep Learning in GPS-Denied Battlefield Environment | Ganesh Sapkota et.al. | 2402.12551 | null |
| 2024-02-07 | Online and Certifiably Correct Visual Odometry and Mapping | Devansh R Agrawal et.al. | 2402.05254 | null |
| 2024-02-06 | YOLOPoint Joint Keypoint and Object Detection | Anton Backhaus et.al. | 2402.03989 | link |
| 2024-01-19 | Motion Consistency Loss for Monocular Visual Odometry with Attention-Based Deep Learning | André O. Françani et.al. | 2401.10857 | null |
| 2024-01-17 | Event-Based Visual Odometry on Non-Holonomic Ground Vehicles | Wanting Xu et.al. | 2401.09331 | link |
| 2024-01-11 | On State Estimation in Multi-Sensor Fusion Navigation: Optimization and Filtering | Feng Zhu et.al. | 2401.05836 | null |
| 2023-12-19 | Loss it right: Euclidean and Riemannian Metrics in Learning-based Visual Odometry | Olaya Álvarez-Tuñón et.al. | 2401.05396 | link |
| 2024-01-07 | Amirkabir campus dataset: Real-world challenges and scenarios of Visual Inertial Odometry (VIO) for visually impaired people | Ali Samadzadeh et.al. | 2401.03604 | link |
| 2024-01-03 | LEAP-VO: Long-term Effective Any Point Tracking for Visual Odometry | Weirong Chen et.al. | 2401.01887 | link |
| 2023-12-28 | SR-LIVO: LiDAR-Inertial-Visual Odometry and Mapping with Sweep Reconstruction | Zikang Yuan et.al. | 2312.16800 | link |
| 2023-12-20 | NeRF-VO: Real-Time Sparse Visual Odometry with Neural Radiance Fields | Jens Naumann et.al. | 2312.13471 | null |
| 2023-12-22 | Ternary-type Opacity and Hybrid Odometry for RGB-only NeRF-SLAM | Junru Lin et.al. | 2312.13332 | null |
| 2023-12-20 | Brain-Inspired Visual Odometry: Balancing Speed and Interpretability through a System of Systems Approach | Habib Boloorchi Tabrizi et.al. | 2312.13162 | link |
| 2023-12-20 | Trajectory Approximation of Video Based on Phase Correlation for Forward Facing Camera | Abdulkadhem A. Abdulkadhem et.al. | 2312.12680 | null |
| 2023-12-15 | Deep Event Visual Odometry | Simon Klenk et.al. | 2312.09800 | link |
| 2023-12-10 | SuperPrimitive: Scene Reconstruction at a Primitive Level | Kirill Mazur et.al. | 2312.05889 | null |
| 2023-12-04 | iMatching: Imperative Correspondence Learning | Zitong Zhan et.al. | 2312.02141 | link |
| 2023-11-30 | Event-based Visual Inertial Velometer | Xiuyuan Lu et.al. | 2311.18189 | null |
| 2023-11-21 | CoVOR-SLAM: Cooperative SLAM using Visual Odometry and Ranges for Multi-Robot Systems | Young-Hee Lee et.al. | 2311.12580 | null |
| 2023-11-10 | Dense Visual Odometry Using Genetic Algorithm | Slimane Djema et.al. | 2311.06149 | null |
| 2023-11-07 | Inertial Guided Uncertainty Estimation of Feature Correspondence in Visual-Inertial Odometry/SLAM | Seongwook Yoon et.al. | 2311.03722 | null |
| 2023-10-23 | Converting Depth Images and Point Clouds for Feature-based Pose Estimation | Robert Lösch et.al. | 2310.14924 | link |
| 2023-10-17 | Open-Structure: a Structural Benchmark Dataset for SLAM Algorithms | Yanyan Li et.al. | 2310.10931 | link |
| 2023-10-12 | Jointly Optimized Global-Local Visual Localization of UAVs | Haoling Li et.al. | 2310.08082 | null |
| 2023-10-10 | l-dyno: framework to learn consistent visual features using robot's motion | Kartikeya Singh et.al. | 2310.06249 | link |
| 2023-10-08 | XVO: Generalized Visual Odometry via Cross-Modal Self-Training | Lei Lai et.al. | 2309.16772 | null |
| 2023-10-22 | ObVi-SLAM: Long-Term Object-Visual SLAM | Amanda Adkins et.al. | 2309.15268 | link |
| 2023-09-23 | Tag-based Visual Odometry Estimation for Indoor UAVs Localization | Massimiliano Bertoni et.al. | 2309.13311 | null |
| 2023-09-22 | Exposing the Unseen: Exposure Time Emulation for Offline Benchmarking of Vision Algorithms | Olivier Gamache et.al. | 2309.13139 | link |
| 2023-09-20 | Conformalized Multimodal Uncertainty Regression and Reasoning | Domenico Parente et.al. | 2309.11018 | null |
| 2023-09-20 | OCC-VO: Dense Mapping via 3D Occupancy-Based Visual Odometry for Autonomous Driving | Heng Li et.al. | 2309.11011 | link |
| 2023-09-19 | LiDAR-Generated Images Derived Keypoints Assisted Point Cloud Registration Scheme in Odometry Estimation | Haizhou Zhang et.al. | 2309.10436 | link |
| 2023-09-21 | Dive Deeper into Rectifying Homography for Stereo Camera Online Self-Calibration | Hongbo Zhao et.al. | 2309.10314 | null |
| 2023-09-18 | End-to-End Learned Event- and Image-based Visual Odometry | Roberto Pellerito et.al. | 2309.09947 | link |
| 2023-09-14 | An Explicit Method for Fast Monocular Depth Recovery in Corridor Environments | Yehao Liu et.al. | 2309.07408 | null |
| 2023-09-11 | Evaluating Visual Odometry Methods for Autonomous Driving in Rain | Yu Xiang Tan et.al. | 2309.05249 | null |
| 2023-09-08 | Robot Localization and Mapping Final Report -- Sequential Adversarial Learning for Self-Supervised Deep Visual Odometry | Akankshya Kar et.al. | 2309.04147 | null |
| 2023-09-04 | EMR-MSF: Self-Supervised Recurrent Monocular Scene Flow Exploiting Ego-Motion Rigidity | Zijie Jiang et.al. | 2309.01296 | null |
| 2023-08-27 | Deep Learning for Visual Localization and Mapping: A Survey | Changhao Chen et.al. | 2308.14039 | null |
| 2023-08-19 | Enhancing State Estimation in Robots: A Data-Driven Approach with Differentiable Ensemble Kalman Filters | Xiao Liu et.al. | 2308.09870 | link |
| 2023-08-12 | 4DRVO-Net: Deep 4D Radar-Visual Odometry Using Multi-Modal and Multi-Scale Adaptive Fusion | Guirong Zhuo et.al. | 2308.06573 | null |
| 2023-08-10 | Mono-hydra: Real-time 3D scene graph construction from monocular camera input with IMU | U. V. B. L. Udugama et.al. | 2308.05515 | null |
| 2023-08-02 | A Small Form Factor Aerial Research Vehicle for Pick-and-Place Tasks with Onboard Real-Time Object Detection and Visual Odometry | Cora A. Dimmig et.al. | 2308.01398 | null |
| 2023-08-02 | Stereo Visual Odometry with Deep Learning-Based Point and Line Feature Matching using an Attention Graph Neural Network | Shenbagaraj Kannapiran et.al. | 2308.01125 | null |
| 2023-08-02 | Preliminary Design of the Dragonfly Navigation Filter | Ben Schilling et.al. | 2307.13513 | null |
| 2023-07-19 | Optimizing the extended Fourier Mellin Transformation Algorithm | Wenqing Jiang et.al. | 2307.10015 | link |
| 2023-07-15 | Tightly-Coupled LiDAR-Visual SLAM Based on Geometric Features for Mobile Agents | Ke Cao et.al. | 2307.07763 | null |
| 2023-07-26 | Event-based Stereo Visual Odometry with Native Temporal Resolution via Continuous-time Gaussian Process Regression | Jianeng Wang et.al. | 2306.01188 | null |
| 2023-07-06 | OSPC: Online Sequential Photometric Calibration | Jawad Haidar et.al. | 2305.17673 | null |
| 2023-05-15 | Event Camera-based Visual Odometry for Dynamic Motion Tracking of a Legged Robot Using Adaptive Time Surface | Shifan Zhu et.al. | 2305.08962 | null |
| 2023-05-10 | Transformer-based model for monocular visual odometry: a video understanding approach | André O. Françani et.al. | 2305.06121 | link |
| 2023-04-29 | Modality-invariant Visual Odometry for Embodied Vision | Marius Memmel et.al. | 2305.00348 | link |
| 2023-04-21 | FSNet: Redesign Self-Supervised MonoDepth for Full-Scale Depth Prediction for Autonomous Driving | Yuxuan Liu et.al. | 2304.10719 | null |
| 2023-07-08 | Visual-LiDAR Odometry and Mapping with Monocular Scale Correction and Visual Bootstrapping | Hanyu Cai et.al. | 2304.08978 | null |
| 2023-04-12 | SiLK -- Simple Learned Keypoints | Pierre Gleize et.al. | 2304.06194 | link |
| 2023-04-11 | ClusterFusion: Real-time Relative Positioning and Dense Reconstruction for UAV Cluster | Yifei Dong et.al. | 2304.04943 | null |
| 2023-03-21 | Learning a Depth Covariance Function | Eric Dexheimer et.al. | 2303.12157 | null |
| 2023-03-21 | Online Learning of Wheel Odometry Correction for Mobile Robots with Attention-based Neural Network | Alessandro Navone et.al. | 2303.11725 | null |
| 2023-03-20 | VR-SLAM: A Visual-Range Simultaneous Localization and Mapping System using Monocular Camera and Ultra-wideband Sensors | Thien Hoang Nguyen et.al. | 2303.10903 | null |
| 2023-03-17 | CoVIO: Online Continual Learning for Visual-Inertial Odometry | Niclas Vödisch et.al. | 2303.10149 | link |
| 2023-03-15 | UMS-VINS: United Monocular-Stereo Features for Visual-Inertial Tightly Coupled Odometry | Chaoyang Jiang et.al. | 2303.08550 | null |
| 2023-03-13 | Discovering Multiple Algorithm Configurations | Leonid Keselman et.al. | 2303.07434 | null |
| 2023-03-09 | Virtual Inverse Perspective Mapping for Simultaneous Pose and Motion Estimation | Masahiro Hirano et.al. | 2303.05192 | null |
| 2023-03-16 | Stereo Event-based Visual-Inertial Odometry | Kunfeng Wang et.al. | 2303.05086 | link |
| 2023-03-07 | Long Distance GNSS-Denied Visual Inertial Navigation for Autonomous Fixed Wing Unmanned Air Vehicles: SO(3) Manifold Filter based on Virtual Vision Sensor | Eduardo Gallo et.al. | 2303.03804 | null |
| 2023-03-03 | Lightweight, Uncertainty-Aware Conformalized Visual Odometry | Alex C. Stutts et.al. | 2303.02207 | null |
| 2023-02-24 | FLSea: Underwater Visual-Inertial and Stereo-Vision Forward-Looking Datasets | Yelena Randall et.al. | 2302.12772 | null |
| 2023-02-27 | CP+: Camera Poses Augmentation with Large-scale LiDAR Maps | Jiadi Cui et.al. | 2302.12198 | null |
| 2023-02-19 | EdgeVO: An Efficient and Accurate Edge-based Visual Odometry | Hui Zhao et.al. | 2302.09493 | null |
| 2023-01-27 | HDPV-SLAM: Hybrid Depth-augmented Panoramic Visual SLAM for Mobile Mapping System with Tilted LiDAR and Panoramic Visual Camera | Mostafa Ahmadi et.al. | 2301.11823 | null |
| 2023-01-26 | Distributed Optimization Methods for Multi-Robot Systems: Part I -- A Tutorial | Ola Shorinwa et.al. | 2301.11313 | null |
| 2023-01-24 | Generalized Object Search | Kaiyu Zheng et.al. | 2301.10121 | null |
| 2023-01-22 | Improving Autonomous Vehicle Mapping and Navigation in Work Zones Using Crowdsourcing Vehicle Trajectories | Hanlin Chen et.al. | 2301.09194 | null |
| 2023-01-21 | Dense RGB SLAM with Neural Implicit Maps | Heng Li et.al. | 2301.08930 | null |
| 2023-01-18 | Extended FastSLAM Using Cellular Multipath Component Delays and Angular Information | Junshi Chen et.al. | 2301.07560 | null |
| 2023-01-17 | COVINS-G: A Generic Back-end for Collaborative Visual-Inertial SLAM | Manthan Patel et.al. | 2301.07147 | link |
| 2023-01-31 | Swarm-SLAM : Sparse Decentralized Collaborative Simultaneous Localization and Mapping Framework for Multi-Robot Systems | Pierre-Yves Lajoie et.al. | 2301.06230 | link |
| 2023-01-13 | A LiDAR-Inertial-Visual SLAM System with Loop Detection | Kangcheng Liu et.al. | 2301.05604 | null |
| 2023-01-11 | AdaptSLAM: Edge-Assisted Adaptive SLAM with Resource Constraints via Uncertainty Minimization | Ying Chen et.al. | 2301.04620 | link |
| 2023-01-12 | TBV Radar SLAM -- trust but verify loop candidates | Daniel Adolfsson et.al. | 2301.04397 | link |
| 2022-12-31 | Digital Twin-Enabled Domain Adaptation for Zero-Touch UAV Networks: Survey and Challenges | Maxwell McManus et.al. | 2301.03359 | null |
| 2023-01-09 | Motion Addition and Motion Optimization | Liqun Qi et.al. | 2301.03174 | null |
| 2023-01-08 | Towards Open World NeRF-Based SLAM | Daniil Lisus et.al. | 2301.03102 | null |
| 2023-01-06 | CyberLoc: Towards Accurate Long-term Visual Localization | Liu Liu et.al. | 2301.02403 | null |
| 2023-01-03 | LunarNav: Crater-based Localization for Long-range Autonomous Lunar Rover Navigation | Shreyansh Daftry et.al. | 2301.01350 | null |
| 2022-12-31 | 4Seasons: Benchmarking Visual SLAM and Long-Term Localization for Autonomous Driving in Challenging Conditions | Patrick Wenzel et.al. | 2301.01147 | null |
| 2023-01-03 | BS3D: Building-scale 3D Reconstruction from RGB-D Images | Janne Mustaniemi et.al. | 2301.01057 | null |
| 2023-01-10 | An Event-based Algorithm for Simultaneous 6-DOF Camera Pose Tracking and Mapping | Masoud Dayani Najafabadi et.al. | 2301.00618 | link |
| 2022-12-25 | A Combined Approach Toward Consistent Reconstructions of Indoor Spaces Based on 6D RGB-D Odometry and KinectFusion | Nadia Figueroa et.al. | 2212.14772 | null |
| 2022-12-29 | An Enhanced LiDAR-Inertial SLAM System for Robotics Localization and Mapping | Kangcheng Liu et.al. | 2212.14209 | link |
| 2022-12-27 | Clock and Orientation-Robust Simultaneous Radio Localization and Mapping at Millimeter Wave Bands | Felipe Gómez-Cuba et.al. | 2212.13477 | link |
| 2022-12-26 | ESVIO: Event-based Stereo Visual Inertial Odometry | Peiyu Chen et.al. | 2212.13184 | link |
| 2022-12-24 | A Comprehensive Review on Autonomous Navigation | Saeid Nahavandi et.al. | 2212.12808 | null |
| 2022-12-23 | Radio SLAM for 6G Systems at THz Frequencies: Design and Experimental Validation | Marina Lotti et.al. | 2212.12388 | null |
| 2022-12-23 | Implementation of a Blind navigation method in outdoors/indoors areas | Mohammad Javadian Farzaneh et.al. | 2212.12185 | null |
| 2022-12-22 | S-Graphs+: Real-time Localization and Mapping leveraging Hierarchical Representations | Hriday Bavle et.al. | 2212.11770 | link |
| 2022-12-22 | Active SLAM: A Review On Last Decade | Muhammad Farhan Ahmed et.al. | 2212.11654 | null |
| 2022-12-27 | Motion, Unit Dual Quaternion and Motion Optimization | Liqun Qi et.al. | 2212.11593 | null |
| 2022-12-22 | Vision-Based Environmental Perception for Autonomous Driving | Fei Liu et.al. | 2212.11453 | null |
| 2022-12-19 | Mu |
Yong Cheng et.al. | 2212.09553 | null |
| 2022-12-16 | Cartographer_glass: 2D Graph SLAM Framework using LiDAR for Glass Environments | Lasitha Weerakoon et.al. | 2212.08633 | null |
| 2022-12-16 | rWiFiSLAM: Effective WiFi Ranging based SLAM System in Ambient Environments | Bo Wei et.al. | 2212.08418 | null |
| 2023-03-02 | AirVO: An Illumination-Robust Point-Line Visual Odometry | Kuan Xu et.al. | 2212.07595 | link |
| 2022-12-14 | Autonomous Vehicle Navigation with LIDAR using Path Planning | Rahul M K et.al. | 2212.07155 | null |
| 2022-12-14 | RIS-Enabled and Access-Point-Free Simultaneous Radio Localization and Mapping | Hyowon Kim et.al. | 2212.07141 | null |
| 2022-12-13 | Know What You Don't Know: Consistency in Sliding Window Filtering with Unobservable States Applied to Visual-Inertial SLAM (Extended Version) | Daniil Lisus et.al. | 2212.06923 | null |
| 2022-12-13 | SST: Real-time End-to-end Monocular 3D Reconstruction via Sparse Spatial-Temporal Guidance | Chenyangguang Zhang et.al. | 2212.06524 | null |
| 2022-12-13 | Localization and Navigation System for Indoor Mobile Robot | Yanbaihui Liu et.al. | 2212.06391 | null |
| 2022-12-12 | Evaluation of RGB-D SLAM in Large Indoor Environments | Kirill Muravyev et.al. | 2212.05980 | null |
| 2022-12-19 | A Light-Weight LiDAR-Inertial SLAM System with Loop Closing | Kangcheng Liu et.al. | 2212.05743 | link |
| 2022-12-12 | An Integrated LiDAR-SLAM System for Complex Environment with Noisy Point Clouds | Kangcheng Liu et.al. | 2212.05705 | link |
| 2022-12-09 | SLAM for Visually Impaired People: A Survey | Marziyeh Bamdad et.al. | 2212.04745 | null |
| 2022-12-09 | Ego-Body Pose Estimation via Ego-Head Pose Estimation | Jiaman Li et.al. | 2212.04636 | null |
| 2022-12-06 | Receding Horizon Planning with Rule Hierarchies for Autonomous Vehicles | Sushant Veer et.al. | 2212.03323 | link |
| 2022-12-06 | PRISM: Probabilistic Real-Time Inference in Spatial World Models | Atanas Mirchev et.al. | 2212.02988 | null |
| 2022-12-06 | RGB-L: Enhancing Indirect Visual SLAM using LiDAR-based Dense Depth Maps | Florian Sauerbeck et.al. | 2212.02085 | link |
| 2022-12-05 | DL-SLOT: Dynamic LiDAR SLAM and object tracking based on collaborative graph optimization | Xuebo Tian et.al. | 2212.02077 | null |
| 2022-12-05 | ObjectMatch: Robust Registration using Canonical Object Correspondences | Can Gümeli et.al. | 2212.01985 | null |
| 2022-12-02 | Sparse SPN: Depth Completion from Sparse Keypoints | Yuqun Wu et.al. | 2212.00987 | null |
| 2022-12-01 | maplab 2.0 -- A Modular and Multi-Modal Mapping Framework | Andrei Cramariuc et.al. | 2212.00654 | link |
| 2022-12-01 | AstroSLAM: Autonomous Monocular Navigation in the Vicinity of a Celestial Small Body -- Theory and Experiments | Mehregan Dor et.al. | 2212.00350 | null |
| 2022-11-30 | MVRackLay: Monocular Multi-View Layout Estimation for Warehouse Racks and Shelves | Pranjali Pathre et.al. | 2211.16882 | null |
| 2022-11-29 | PatchMatch-Stereo-Panorama, a fast dense reconstruction from 360° video images | Hartmut Surmann et.al. | 2211.16266 | link |
| 2022-11-29 | MmWave Mapping and SLAM for 5G and Beyond | Yu Ge et.al. | 2211.16024 | null |
| 2022-11-28 | Safety-quantifiable Line Feature-based Monocular Visual Localization with 3D Prior Map | Xi Zheng et.al. | 2211.15127 | null |
| 2022-11-29 | BALF: Simple and Efficient Blur Aware Local Feature Detector | Zhenjun Zhao et.al. | 2211.14731 | null |
| 2022-11-27 | Development of a Modular Real-time Shared-control System for a Smart Wheelchair | Vaishanth Ramaraj et.al. | 2211.14711 | null |
| 2022-11-26 | A1 SLAM: Quadruped SLAM using the A1's Onboard Sensors | Jerred Chen et.al. | 2211.14432 | link |
| 2022-11-23 | ActiveRMAP: Radiance Field for Active Mapping And Planning | Huangying Zhan et.al. | 2211.12656 | null |
| 2022-11-22 | Vision-based localization methods under GPS-denied conditions | Zihao Lu et.al. | 2211.11988 | null |
| 2022-11-21 | Towards Live 3D Reconstruction from Wearable Video: An Evaluation of V-SLAM, NeRF, and Videogrammetry Techniques | David Ramirez et.al. | 2211.11836 | null |
| 2022-11-21 | ESLAM: Efficient Dense SLAM System Based on Hybrid Representation of Signed Distance Fields | Mohammad Mahdi Johari et.al. | 2211.11704 | null |
| 2022-11-24 | Data Fusion for Multipath-Based SLAM: Combing Information from Multiple Propagation Paths | Erik Leitinger et.al. | 2211.09241 | null |
| 2022-11-16 | Self-supervised Egomotion and Depth Learning via Bi-directional Coarse-to-Fine Scale Recovery | Hao Qu et.al. | 2211.08904 | null |
| 2022-11-20 | Detecting Line Segments in Motion-blurred Images with Events | Huai Yu et.al. | 2211.07365 | link |
| 2022-11-13 | Automatic Eye-in-Hand Calibration using EKF | Aditya Ramakrishnan et.al. | 2211.06881 | null |
| 2022-11-12 | Active View Planning for Visual SLAM in Outdoor Environments Based on Continuous Information Modeling | Zhihao Wang et.al. | 2211.06557 | link |
| 2022-11-11 | Multi-domain Cooperative SLAM: The Enabler for Integrated Sensing and Communications | Jie Yang et.al. | 2211.05982 | null |
| 2022-11-10 | Online Stochastic Variational Gaussian Process Mapping for Large-Scale SLAM in Real Time | Ignacio Torroba et.al. | 2211.05601 | link |
| 2022-11-07 | When Geometry is not Enough: Using Reflector Markers in Lidar SLAM | Gerhard Kurz et.al. | 2211.03484 | null |
| 2022-11-07 | Detecting Invalid Map Merges in Lifelong SLAM | Matthias Holoch et.al. | 2211.03423 | null |
| 2022-11-06 | Wheel-SLAM: Simultaneous Localization and Terrain Mapping Using One Wheel-mounted IMU | Yibin Wu et.al. | 2211.03174 | link |
| 2022-11-07 | Lidar-level localization with radar? The CFEAR approach to accurate, fast and robust large-scale radar odometry in diverse environments | Daniel Adolfsson et.al. | 2211.02445 | link |
| 2022-11-03 | DyOb-SLAM : Dynamic Object Tracking SLAM System | Rushmian Annoy Wadud et.al. | 2211.01941 | null |
| 2022-11-03 | Enhanced Visual Feedback with Decoupled Viewpoint Control in Immersive Humanoid Robot Teleoperation using SLAM | Yang Chen et.al. | 2211.01749 | null |
| 2022-11-04 | Hao Xu et.al. | 2211.01538 | link | |
| 2022-11-02 | Semantic SuperPoint: A Deep Semantic Descriptor | Gabriel S. Gama et.al. | 2211.01098 | link |
| 2022-11-02 | Ambiguity-Aware Multi-Object Pose Optimization for Visually-Assisted Robot Manipulation | Myung-Hwan Jeon et.al. | 2211.00960 | link |
| 2022-10-31 | Mapping Extended Landmarks for Radar SLAM | Shuai Sun et.al. | 2210.17207 | null |
| 2022-10-25 | MAROAM: Map-based Radar SLAM through Two-step Feature Selection | Dequan Wang et.al. | 2210.13797 | null |
| 2022-10-25 | S3E: A Large-scale Multimodal Dataset for Collaborative SLAM | Dapeng Feng et.al. | 2210.13723 | link |
| 2022-10-24 | NeRF-SLAM: Real-Time Dense Monocular SLAM with Neural Radiance Fields | Antoni Rosinol et.al. | 2210.13641 | link |
| 2022-10-24 | Compact simultaneous label-free autofluorescence multi-harmonic (SLAM) microscopy for user-friendly photodamage-monitored imaging | Geng Wang et.al. | 2210.13556 | null |
| 2022-10-28 | VP-SLAM: A Monocular Real-time Visual SLAM with Points, Lines and Vanishing Points | Andreas Georgis et.al. | 2210.12756 | null |
| 2022-10-22 | SLAM: Semantic Learning based Activation Map for Weakly Supervised Semantic Segmentation | Junliang Chen et.al. | 2210.12417 | null |
| 2022-10-21 | DCL-SLAM: A Distributed Collaborative LiDAR SLAM Framework for a Robotic Swarm | Shipeng Zhong et.al. | 2210.11978 | link |
| 2022-10-21 | Motion Primitives Based Kinodynamic RRT for Autonomous Vehicle Navigation in Complex Environments | Shubham Kedia et.al. | 2210.11652 | null |
| 2022-10-22 | Visual SLAM: What are the Current Trends and What to Expect? | Ali Tourani et.al. | 2210.10491 | null |
| 2022-10-18 | Split-KalmanNet: A Robust Model-Based Deep Learning Approach for SLAM | Geon Choi et.al. | 2210.09636 | null |
| 2022-10-16 | D2SLAM: Semantic visual SLAM based on the influence of Depth for Dynamic environments | Ayman Beghdadi et.al. | 2210.08647 | null |
| 2022-10-16 | Indoor Smartphone SLAM with Learned Echoic Location Features | Wenjie Luo et.al. | 2210.08493 | null |
| 2022-10-15 | Self-Improving SLAM in Dynamic Environments: Learning When to Mask | Adrian Bojko et.al. | 2210.08350 | link |
| 2022-10-13 | Design and Evaluation of a Generic Visual SLAM Framework for Multi-Camera Systems | Pushyami Kaveti et.al. | 2210.07315 | link |
| 2022-10-12 | RING++: Roto-translation Invariant Gram for Global Localization on a Sparse Scan Map | Xuecheng Xu et.al. | 2210.05984 | link |
| 2022-10-11 | Observability Analysis of Graph SLAM-Based Joint Calibration of Multiple Microphone Arrays and Sound Source Localization | Yuanzheng He et.al. | 2210.05600 | null |
| 2022-10-11 | Autonomous Asteroid Characterization Through Nanosatellite Swarming | Kaitlin Dennison et.al. | 2210.05518 | null |
| 2022-10-11 | DeepMLE: A Robust Deep Maximum Likelihood Estimator for Two-view Structure from Motion | Yuxi Xiao et.al. | 2210.05517 | null |
| 2022-10-11 | Multi-Object Navigation with dynamically learned neural implicit representations | Pierre Marza et.al. | 2210.05129 | link |
| 2022-10-12 | Spectral Sparsification for Communication-Efficient Collaborative Rotation and Translation Estimation | Yulun Tian et.al. | 2210.05020 | null |
| 2022-10-10 | Using Detection, Tracking and Prediction in Visual SLAM to Achieve Real-time Semantic Mapping of Dynamic Scenarios | Xingyu Chen et.al. | 2210.04562 | null |
| 2022-10-09 | Fusing Event-based Camera and Radar for SLAM Using Spiking Neural Networks with Continual STDP Learning | Ali Safa et.al. | 2210.04236 | null |
| 2022-10-06 | SCORE: A Second-Order Conic Initialization for Range-Aided SLAM | Alan Papalia et.al. | 2210.03177 | link |
| 2022-10-06 | Feature-Realistic Neural Fusion for Real-Time, Open Set Scene Understanding | Kirill Mazur et.al. | 2210.03043 | null |
| 2022-10-06 | Feasibility on Detecting Door Slamming towards Monitoring Early Signs of Domestic Violence | Osian Morgan et.al. | 2210.02642 | null |
| 2022-10-05 | MOTSLAM: MOT-assisted monocular dynamic SLAM using single-view depth estimation | Hanwei Zhang et.al. | 2210.02038 | null |
| 2022-10-04 | O2S: Open-source open shuttle | Nwankwo Linus et.al. | 2210.01627 | null |
| 2022-10-04 | Wi-Closure: Reliable and Efficient Search of Inter-robot Loop Closures Using Wireless Sensing | Weiying Wang et.al. | 2210.01320 | null |
| 2022-10-03 | Probabilistic Volumetric Fusion for Dense Monocular SLAM | Antoni Rosinol et.al. | 2210.01276 | null |
| 2022-10-03 | DRACo-SLAM: Distributed Robust Acoustic Communication-efficient SLAM for Imaging Sonar Equipped Underwater Robot Teams | John McConnell et.al. | 2210.00867 | link |
| 2022-10-03 | A Benchmark for Multi-Modal Lidar SLAM with Ground Truth in GNSS-Denied Environments | Ha Sier et.al. | 2210.00812 | link |
| 2022-10-01 | Det-SLAM: A semantic visual SLAM for highly dynamic scenes using Detectron2 | Ali Eslamian et.al. | 2210.00278 | null |
| 2022-09-30 | PyPose: A Library for Robot Learning with Physics-based Optimization | Chen Wang et.al. | 2209.15428 | link |
| 2022-09-29 | DirectTracker: 3D Multi-Object Tracking Using Direct Image Alignment and Photometric Bundle Adjustment | Mariia Gladkova et.al. | 2209.14965 | null |
| 2022-09-28 | Robust Incremental Smoothing and Mapping (riSAM) | Daniel McGann et.al. | 2209.14359 | null |
| 2022-09-27 | Orbeez-SLAM: A Real-time Monocular Visual SLAM with ORB Features and NeRF-realized Mapping | Chi-Ming Chung et.al. | 2209.13274 | link |
| 2022-09-24 | Graph Neural Networks for Multi-Robot Active Information Acquisition | Mariliza Tzes et.al. | 2209.12091 | null |
| 2022-09-24 | Closing the Loop: Graph Networks to Unify Semantic Objects and Visual Features for Multi-object Scenes | Jonathan J. Y. Kim et.al. | 2209.11894 | null |
| 2022-09-23 | involve-MI: Informative Planning with High-Dimensional Non-Parametric Beliefs | Gilad Rotman et.al. | 2209.11591 | null |
| 2022-09-23 | Automatic Sign Reading and Localization for Semantic Mapping with an Office Robot | David Balaban et.al. | 2209.11432 | null |
| 2022-09-22 | SQ-SLAM: Monocular Semantic SLAM Based on Superquadric Object Representation | Xiao Han et.al. | 2209.10817 | null |
| 2022-09-22 | Acoustic SLAM based on the Direction-of-Arrival and the Direct-to-Reverberant Energy Ratio | Wenhao Qiu et.al. | 2209.10726 | null |
| 2022-09-21 | Visual Localization and Mapping in Dynamic and Changing Environments | João Carlos Virgolino Soares et.al. | 2209.10710 | null |
| 2022-09-20 | Uncertainty-Aware Tightly-Coupled GPS Fused LIO-SLAM | Sabir Hossain et.al. | 2209.10047 | null |
| 2022-09-20 | WGICP: Differentiable Weighted GICP-Based Lidar Odometry | Sanghyun Son et.al. | 2209.09777 | null |
| 2022-09-20 | PADLoC: LiDAR-Based Deep Loop Closure Detection and Registration using Panoptic Attention | José Arce et.al. | 2209.09699 | link |
| 2022-09-19 | MeSLAM: Memory Efficient SLAM based on Neural Fields | Evgenii Kruzhkov et.al. | 2209.09357 | null |
| 2022-09-19 | LMBAO: A Landmark Map for Bundle Adjustment Odometry in LiDAR SLAM | Letian Zhang et.al. | 2209.08810 | null |
| 2022-09-18 | HGI-SLAM: Loop Closure With Human and Geometric Importance Features | Shuhul Mujoo et.al. | 2209.08608 | null |
| 2022-09-18 | Data-driven Loop Closure Detection in Bathymetric Point Clouds for Underwater SLAM | Jiarui Tan et.al. | 2209.08578 | link |
| 2022-09-17 | DytanVO: Joint Refinement of Visual Odometry and Motion Segmentation in Dynamic Environments | Shihao Shen et.al. | 2209.08430 | link |
| 2022-09-17 | OA-SLAM: Leveraging Objects for Camera Relocalization in Visual SLAM | Matthieu Zins et.al. | 2209.08338 | null |
| 2022-09-17 | PlaneSLAM: Plane-based LiDAR SLAM for Motion Planning in Structured 3D Environments | Adam Dai et.al. | 2209.08248 | link |
| 2022-09-16 | ViWiD: Leveraging WiFi for Robust and Resource-Efficient SLAM | Aditya Arun et.al. | 2209.08091 | null |
| 2022-09-16 | iDF-SLAM: End-to-End RGB-D SLAM with Neural Implicit Mapping and Deep Feature Tracking | Yuhang Ming et.al. | 2209.07919 | null |
| 2022-09-16 | TwistSLAM++: Fusing multiple modalities for accurate dynamic semantic SLAM | Mathieu Gonzalez et.al. | 2209.07888 | null |
| 2022-09-15 | Landmark Management in the Application of Radar SLAM | Shuai Sun et.al. | 2209.07199 | link |
| 2022-09-15 | PROB-SLAM: Real-time Visual SLAM Based on Probabilistic Graph Optimization | Xianwei Meng et.al. | 2209.07061 | null |
| 2022-09-14 | Semantic Visual Simultaneous Localization and Mapping: A Survey | Kaiqi Chen et.al. | 2209.06428 | null |
| 2022-09-13 | Optimizing SLAM Evaluation Footprint Through Dynamic Range Coverage Analysis of Datasets | Islam Ali et.al. | 2209.06316 | null |
| 2022-09-12 | A Review on Visual-SLAM: Advancements from Geometric Modelling to Learning-based Semantic Scene Understanding | Tin Lai et.al. | 2209.05222 | null |
| 2022-09-12 | Attitude-Guided Loop Closure for Cameras with Negative Plane | Ze Wang et.al. | 2209.05167 | link |
| 2022-09-09 | General Place Recognition Survey: Towards the Real-world Autonomy Age | Peng Yin et.al. | 2209.04497 | link |
| 2022-09-08 | ExplORB-SLAM: Active Visual SLAM Exploiting the Pose-graph Topology | Julio A. Placed et.al. | 2209.03693 | link |
| 2022-09-08 | R |
Jiarong Lin et.al. | 2209.03666 | link |
| 2022-09-06 | Group- |
Brendon Forsgren et.al. | 2209.02658 | link |
| 2022-09-05 | Neuromorphic Visual Odometry with Resonator Networks | Alpha Renner et.al. | 2209.02000 | null |
| 2022-09-05 | MuCaSLAM: CNN-Based Frame Quality Assessment for Mobile Robot with Omnidirectional Visual SLAM | Pavel Karpyshev et.al. | 2209.01936 | null |
| 2022-09-05 | ElasticROS: An Elastically Collaborative Robot Operation System for Fog and Cloud Robotics | Boyi Liu et.al. | 2209.01774 | null |
| 2022-09-04 | CloudVision: DNN-based Visual Localization of Autonomous Robots using Prebuilt LiDAR Point Cloud | Evgeny Yudin et.al. | 2209.01605 | null |
| 2022-08-31 | PFilter: Building Persistent Maps through Feature Filtering for Fast and Accurate LiDAR-based SLAM | Yifan Duan et.al. | 2208.14848 | null |
| 2022-08-30 | BioSLAM: A Bio-inspired Lifelong Memory System for General Place Recognition | Peng Yin et.al. | 2208.14543 | null |
| 2022-08-27 | Learning to SLAM on the Fly in Unknown Environments: A Continual Learning Approach for Drones in Visually Ambiguous Scenes | Ali Safa et.al. | 2208.12997 | null |
| 2022-08-25 | FusionPortable: A Multi-Sensor Campus-Scene Dataset for Evaluation of Localization and Mapping Accuracy on Diverse Platforms | Jianhao Jiao et.al. | 2208.11865 | null |
| 2022-08-25 | Lidar SLAM for Autonomous Driving Vehicles | Farhad Aghili et.al. | 2208.11855 | null |
| 2022-08-24 | DynaVINS: A Visual-Inertial SLAM for Dynamic Environments | Seungwon Song et.al. | 2208.11500 | link |
| 2022-08-22 | Doppler Exploitation in Bistatic mmWave Radio SLAM | Yu Ge et.al. | 2208.10204 | null |
| 2022-08-21 | Hilti-Oxford Dataset: A Millimetre-Accurate Benchmark for Simultaneous Localization and Mapping | Lintong Zhang et.al. | 2208.09825 | link |
| 2022-08-26 | JVLDLoc: a Joint Optimization of Visual-LiDAR Constraints and Direction Priors for Localization in Driving Scenario | Longrui Dong et.al. | 2208.09777 | null |
| 2022-08-15 | BoW3D: Bag of Words for Real-time Loop Closing in 3D LiDAR SLAM | Yunge Cui et.al. | 2208.07473 | link |
| 2022-08-12 | Handling Constrained Optimization in Factor Graphs for Autonomous Navigation | Barbara Bazzana et.al. | 2208.06325 | null |
| 2022-08-11 | RelPose: Predicting Probabilistic Relative Rotation for Single Objects in the Wild | Jason Y. Zhang et.al. | 2208.05963 | null |
| 2022-08-08 | Visual-Inertial Multi-Instance Dynamic SLAM with Object-level Relocalisation | Yifei Ren et.al. | 2208.04274 | link |
| 2022-08-08 | SLAM-TKA: Real-time Intra-operative Measurement of Tibial Resection Plane in Conventional Total Knee Arthroplasty | Shuai Zhang et.al. | 2208.03945 | link |
| 2022-08-05 | A Survey on Visual Map Localization Using LiDARs and Cameras | Elhousni Mahdi et.al. | 2208.03376 | null |
| 2022-08-04 | SROS2: Usable Cyber Security Tools for ROS 2 | Victor Mayoral Vilches et.al. | 2208.02615 | link |
| 2022-08-03 | Evaluation and comparison of eight popular Lidar and Visual SLAM algorithms | Bharath Garigipati et.al. | 2208.02063 | null |
| 2022-08-02 | Present and Future of SLAM in Extreme Underground Environments | Kamak Ebadi et.al. | 2208.01787 | null |
| 2022-08-01 | Visual-Inertial SLAM with Tightly-Coupled Dropout-Tolerant GPS Fusion | Simon Boche et.al. | 2208.00709 | null |
| 2022-07-29 | Neural Density-Distance Fields | Itsuki Ueda et.al. | 2207.14455 | link |
| 2022-07-25 | DeepFusion: Real-Time Dense 3D Reconstruction for Monocular SLAM using Single-View Depth and Gradient Predictions | Tristan Laidlow et.al. | 2207.12244 | null |
| 2022-07-25 | Scalable Fiducial Tag Localization on a 3D Prior Map via Graph-Theoretic Global Tag-Map Registration | Kenji Koide et.al. | 2207.11942 | null |
| 2022-07-22 | NeurAR: Neural Uncertainty for Autonomous 3D Reconstruction | Yunlong Ran et.al. | 2207.10985 | null |
| 2022-07-22 | Dense RGB-D-Inertial SLAM with Map Deformations | Tristan Laidlow et.al. | 2207.10940 | null |
| 2022-07-22 | PLD-SLAM: A Real-Time Visual SLAM Using Points and Line Segments in Dynamic Scenes | BaoSheng Zhang et.al. | 2207.10916 | null |
| 2022-07-21 | Multi-Event-Camera Depth Estimation and Outlier Rejection by Refocused Events Fusion | Suman Ghosh et.al. | 2207.10494 | link |
| 2022-07-21 | Online Localisation and Colored Mesh Reconstruction Architecture for 3D Visual Feedback in Robotic Exploration Missions | Quentin Serdel et.al. | 2207.10489 | link |
| 2022-07-21 | On applicability of von Karman's momentum theory in predicting the water entry load of V-shaped structures with varying initial velocity | Yujin Lu et.al. | 2207.10413 | null |
| 2022-07-19 | Hybrid Belief Pruning with Guarantees for Viewpoint-Dependent Semantic SLAM | Tuvy Lemberg et.al. | 2207.09103 | null |
| 2022-07-18 | DeFlowSLAM: Self-Supervised Scene Motion Decomposition for Dynamic Dense SLAM | Weicai Ye et.al. | 2207.08794 | link |
| 2022-07-18 | Revisiting PatchMatch Multi-View Stereo for Urban 3D Reconstruction | Marco Orsingher et.al. | 2207.08439 | null |
| 2022-07-18 | ORB-based SLAM accelerator on SoC FPGA | Vibhakar Vemulapati et.al. | 2207.08405 | null |
| 2022-07-14 | Challenges of SLAM in extremely unstructured environments: the DLR Planetary Stereo, Solid-State LiDAR, Inertial Dataset | Riccardo Giubilato et.al. | 2207.06815 | null |
| 2022-07-14 | Semi-supervised Vector-Quantization in Visual SLAM using HGCN | Amir Zarringhalam et.al. | 2207.06738 | null |
| 2022-07-14 | Self-supervised Vector-Quantization in Visual SLAM using Deep Convolutional Autoencoders | Amir Zarringhalam et.al. | 2207.06732 | null |
| 2022-07-13 | SLAM: SLO-Aware Memory Optimization for Serverless Applications | Gor Safaryan et.al. | 2207.06183 | null |
| 2022-07-19 | Structure PLP-SLAM: Efficient Sparse Mapping and Localization using Point, Line and Plane for Monocular, RGB-D and Stereo Cameras | Fangwen Shu et.al. | 2207.06058 | link |
| 2022-07-12 | Accelerating Certifiable Estimation with Preconditioned Eigensolvers | David M. Rosen et.al. | 2207.05257 | null |
| 2022-07-12 | Robust Key-Frame Stereo Visual SLAM with low-threshold Point and Line Features | Meiyu Zhi et.al. | 2207.05244 | null |
| 2022-07-14 | SLAM Backends with Objects in Motion: A Unifying Framework and Tutorial | Chih-Yuan Chiu et.al. | 2207.05043 | null |
| 2022-07-08 | BlindSpotNet: Seeing Where We Cannot See | Taichi Fukuda et.al. | 2207.03870 | null |
| 2022-07-08 | Continuous Target-free Extrinsic Calibration of a Multi-Sensor System from a Sequence of Static Viewpoints | Philipp Glira et.al. | 2207.03785 | null |
| 2022-07-08 | Distributed Ranging SLAM for Multiple Robots with Ultra-WideBand and Odometry Measurements | Ran Liu et.al. | 2207.03700 | null |
| 2022-07-07 | RWT-SLAM: Robust Visual SLAM for Highly Weak-textured Environments | Qihao Peng et.al. | 2207.03539 | null |
| 2022-07-06 | VI-SLAM2tag: Low-Effort Labeled Dataset Collection for Fingerprinting-Based Indoor Localization | Marius Laska et.al. | 2207.02668 | null |
| 2022-07-06 | A Novel Hybrid Endoscopic Dataset for Evaluating Machine Learning-based Photometric Image Enhancement Models | Axel Garcia-Vega et.al. | 2207.02396 | null |
| 2022-07-04 | VECtor: A Versatile Event-Centric Benchmark for Multi-Sensor SLAM | Ling Gao et.al. | 2207.01404 | null |
| 2022-07-04 | VIP-SLAM: An Efficient Tightly-Coupled RGB-D Visual Inertial Planar SLAM | Danpeng Chen et.al. | 2207.01158 | null |
| 2022-07-03 | Wireless Channel Prediction in Partially Observed Environments | Mingsheng Yin et.al. | 2207.00934 | null |
| 2022-07-01 | A Survey on Active Simultaneous Localization and Mapping: State of the Art and New Frontiers | Julio A. Placed et.al. | 2207.00254 | null |
| 2022-07-01 | Keeping Less is More: Point Sparsification for Visual SLAM | Yeonsoo Park et.al. | 2207.00225 | null |
| 2022-06-30 | Controlled and impulsive compression of an entrapped air bubble during impact | Utkarsh Jain et.al. | 2206.15297 | null |
| 2022-06-30 | Neural Rendering for Stereo 3D Reconstruction of Deformable Tissues in Robotic Surgery | Yuehao Wang et.al. | 2206.15255 | link |
| 2022-06-27 | IBISCape: A Simulated Benchmark for multi-modal SLAM Systems Evaluation in Large-scale Dynamic Environments | Abanob Soliman et.al. | 2206.13455 | link |
| 2022-06-26 | An Efficient Global Optimality Certificate for Landmark-Based SLAM | Connor Holmes et.al. | 2206.12961 | link |
| 2022-06-21 | Object Structural Points Representation for Graph-based Semantic Monocular Localization and Mapping | Davide Tateo et.al. | 2206.10263 | link |
| 2022-06-20 | Data Fusion for Radio Frequency SLAM with Robust Sampling | Erik Leitinger et.al. | 2206.09746 | null |
| 2022-06-19 | RF-LIO: Removal-First Tightly-coupled Lidar Inertial Odometry in High Dynamic Environments | Chenglong Qian et.al. | 2206.09463 | null |
| 2022-06-17 | Efficient WiFi LiDAR SLAM for Autonomous Robots in Large Environments | Khairuldanial Ismail et.al. | 2206.08733 | null |
| 2022-06-17 | An Algorithm for the SE(3)-Transformation on Neural Implicit Maps for Remapping Functions | Yijun Yuan et.al. | 2206.08712 | link |
| 2022-06-13 | ICP Algorithm: Theory, Practice And Its SLAM-oriented Taxonomy | Hao Bai et.al. | 2206.06435 | null |
| 2022-06-10 | Experimental Evaluation of Visual-Inertial Odometry Systems for Arable Farming | Javier Cremona et.al. | 2206.05066 | link |
| 2022-06-09 | SparseFormer: Attention-based Depth Completion Network | Frederik Warburg et.al. | 2206.04557 | null |
| 2022-06-07 | Robot Self-Calibration Using Actuated 3D Sensors | Arne Peters et.al. | 2206.03430 | null |
| 2022-06-07 | Object Scan Context: Object-centric Spatial Descriptor for Place Recognition within 3D Point Cloud Map | Haodong Yuan et.al. | 2206.03062 | null |
| 2022-06-05 | DarkSLAM: GAN-assisted Visual SLAM for Reliable Operation in Low-light Conditions | Alena Savinykh et.al. | 2206.02199 | null |
| 2022-06-04 | C |
Erez Posner et.al. | 2206.01961 | null |
| 2022-06-01 | PaGO-LOAM: Robust Ground-Optimized LiDAR Odometry | Dong-Uk Seo et.al. | 2206.00266 | link |
| 2022-05-27 | A Look at Improving Robustness in Visual-inertial SLAM by Moment Matching | Arno Solin et.al. | 2205.13821 | null |
| 2022-05-31 | LAMP 2.0: A Robust Multi-Robot SLAM System for Operation in Challenging Large-Scale Underground Environments | Yun Chang et.al. | 2205.13135 | link |
| 2022-05-25 | Wildcat: Online Continuous-Time 3D Lidar-Inertial SLAM | Milad Ramezani et.al. | 2205.12595 | null |
| 2022-05-24 | Loop Closure Prioritization for Efficient and Scalable Multi-Robot SLAM | Christopher E. Denniston et.al. | 2205.12402 | link |
| 2022-05-22 | ALITA: A Large-scale Incremental Dataset for Long-term Autonomy | Peng Yin et.al. | 2205.10737 | link |
| 2022-05-19 | FogROS 2: An Adaptive and Extensible Platform for Cloud and Fog Robotics Using ROS 2 | Jeffrey Ichnowski et.al. | 2205.09778 | link |
| 2022-05-17 | Global Data Association for SLAM with 3D Grassmannian Manifold Objects | Parker C. Lusk et.al. | 2205.08556 | null |
| 2022-05-19 | Cluster on Wheels | Yuanyuan Yang et.al. | 2205.08151 | null |
| 2022-05-12 | Dynamic Dense RGB-D SLAM using Learning-based Visual Odometry | Shihao Shen et.al. | 2205.05916 | link |
| 2022-05-12 | S3E-GNN: Sparse Spatial Scene Embedding with Graph Neural Networks for Camera Relocalization | Ran Cheng et.al. | 2205.05861 | null |
| 2022-05-14 | Multi-modal Semantic SLAM for Complex Dynamic Environments | Han Wang et.al. | 2205.04300 | link |
| 2022-05-06 | OROS: Orchestrating ROS-driven Collaborative Connected Robots in Mission-Critical Operations | Carmen Delgado et.al. | 2205.03256 | null |
| 2022-05-05 | CNN-Augmented Visual-Inertial SLAM with Planar Constraints | Pan Ji et.al. | 2205.02940 | null |
| 2022-05-05 | PMBM-based SLAM Filters in 5G mmWave Vehicular Networks | Hyowon Kim et.al. | 2205.02502 | null |
| 2022-05-04 | BodySLAM: Joint Camera Localisation, Mapping, and Human Motion Tracking | Dorian Henning et.al. | 2205.02301 | null |
| 2022-05-04 | A Global Asymptotic Convergent Observer for SLAM | Seyed Hamed Hashemi et.al. | 2205.01953 | null |
| 2022-05-04 | Symmetry and Uncertainty-Aware Object SLAM for 6DoF Object Pose Estimation | Nathaniel Merrill et.al. | 2205.01823 | link |
| 2022-05-03 | GeoRefine: Self-Supervised Online Depth Refinement for Accurate Dense Mapping | Pan Ji et.al. | 2205.01656 | null |
| 2022-04-29 | Struct-MDC: Mesh-Refined Unsupervised Depth Completion Leveraging Structural Regularities from Visual SLAM | Jinwoo Jeon et.al. | 2204.13877 | link |
| 2022-04-27 | The Revisiting Problem in Simultaneous Localization and Mapping: A Survey on Visual Loop Closure Detection | Konstantinos A. Tsintotas et.al. | 2204.12831 | null |
| 2022-04-27 | Dynamic Registration: Joint Ego Motion Estimation and 3D Moving Object Detection in Dynamic Environment | Wenyu Li et.al. | 2204.12769 | null |
| 2022-04-29 | MLO: Multi-Object Tracking and Lidar Odometry in Dynamic Environment | Tingchen Ma et.al. | 2204.11621 | null |
| 2022-04-23 | Indoor simultaneous localization and mapping based on fringe projection profilometry | Yang Zhao et.al. | 2204.11020 | null |
| 2022-04-22 | Enough is Enough: Towards Autonomous Uncertainty-driven Stopping Criteria | Julio A. Placed et.al. | 2204.10631 | null |
| 2022-04-22 | Fast Autonomous Robotic Exploration Using the Underlying Graph Structure | Julio A. Placed et.al. | 2204.10610 | null |
| 2022-04-22 | Making Parameterization and Constrains of Object Landmark Globally Consistent via SPD(3) Manifold and Improved Cost Functions | Yutong Hu et.al. | 2204.10552 | null |
| 2022-04-22 | Implicit Object Mapping With Noisy Data | Jad Abou-Chakra et.al. | 2204.10516 | link |
| 2022-04-19 | Photometric single-view dense 3D reconstruction in endoscopy | Victor M. Batlle et.al. | 2204.09083 | null |
| 2022-04-18 | Pulsar skips: Understanding variations in the regular periods of rotating neutron stars | Clayton Miller et.al. | 2204.08449 | null |
| 2022-04-18 | Tracking monocular camera pose and deformation for SLAM inside the human body | Juan J. Gomez Rodriguez et.al. | 2204.08309 | null |
| 2022-04-18 | Mapping While Following: 2D LiDAR SLAM in Indoor Dynamic Environments with a Person Tracker | Hanjing Ye et.al. | 2204.08163 | null |
| 2022-04-14 | ViViD++: Vision for Visibility Dataset | Alex Junho Lee et.al. | 2204.06183 | null |
| 2022-04-12 | HiTPR: Hierarchical Transformer for Place Recognition in Point Cloud | Zhixing Hou et.al. | 2204.05481 | null |
| 2022-04-12 | RGB-D Semantic SLAM for Surgical Robot Navigation in the Operating Room | Cong Gao et.al. | 2204.05467 | null |
| 2022-04-11 | Optimized SC-F-LOAM: Optimized Fast LiDAR Odometry and Mapping Using Scan Context | Lizhou Liao et.al. | 2204.04932 | link |
| 2022-04-04 | Monitoring social distancing with single image depth estimation | Alessio Mingozzi et.al. | 2204.01693 | null |
| 2022-04-01 | Bi-directional Loop Closure for Visual SLAM | Ihtisham Ali et.al. | 2204.01524 | null |
| 2022-04-04 | IMOT: General-Purpose, Fast and Robust Estimation for Spatial Perception Problems with Outliers | Lei Sun et.al. | 2204.01324 | link |
| 2022-04-03 | Indoor Navigation Assistance for Visually Impaired People via Dynamic SLAM and Panoptic Segmentation with an RGB-D Sensor | Wenyan Ou et.al. | 2204.01154 | null |
| 2022-04-02 | UrbanFly: Uncertainty-Aware Planning for Navigation Amongst High-Rises with Monocular Visual-Inertial SLAM Maps | Ayyappa Swamy Thatavarthy et.al. | 2204.00865 | link |
| 2022-03-31 | Curiosity Driven Self-supervised Tactile Exploration of Unknown Objects | Yujie Lu et.al. | 2204.00035 | null |
| 2022-03-30 | GTP-SLAM: Game-Theoretic Priors for Simultaneous Localization and Mapping in Multi-Agent Scenarios | Chih-Yuan Chiu et.al. | 2203.16690 | null |
| 2022-03-29 | Indoor SLAM Using a Foot-mounted IMU and the local Magnetic Field | Mostafa Osman et.al. | 2203.15866 | null |
| 2022-03-29 | Eventor: An Efficient Event-Based Monocular Multi-View Stereo Accelerator on FPGA Platform | Mingjun Li et.al. | 2203.15439 | null |
| 2022-03-29 | Sparse Image based Navigation Architecture to Mitigate the need of precise Localization in Mobile Robots | Pranay Mathur et.al. | 2203.15272 | null |
| 2022-03-28 | Are High-Resolution Event Cameras Really Needed? | Daniel Gehrig et.al. | 2203.14672 | null |
| 2022-03-25 | Spectral Measurement Sparsification for Pose-Graph SLAM | Kevin J. Doherty et.al. | 2203.13897 | link |
| 2022-03-25 | FD-SLAM: 3-D Reconstruction Using Features and Dense Matching | Xingrui Yang et.al. | 2203.13861 | null |
| 2022-03-25 | Gravity-constrained point cloud registration | Vladimír Kubelka et.al. | 2203.13799 | null |
| 2022-03-24 | MD-SLAM: Multi-cue Direct SLAM | Luca Di Giammarino et.al. | 2203.13237 | link |
| 2022-03-24 | Unsupervised Simultaneous Learning for Camera Re-Localization and Depth Estimation from Video | Shun Taguchi et.al. | 2203.12804 | null |
| 2022-03-19 | Hybrid Active and Passive Sensing for SLAM in Wireless Communication Systems | Jie Yang et.al. | 2203.10267 | null |
| 2022-03-16 | Any Way You Look At It: Semantic Crossview Localization and Mapping with LiDAR | Ian D. Miller et.al. | 2203.08925 | link |
| 2022-03-15 | Neural RF SLAM for unsupervised positioning and mapping with channel state information | Shreya Kadambi et.al. | 2203.08264 | null |
| 2022-03-15 | Simultaneous Localisation and Mapping with Quadric Surfaces | Tristan Laidlow et.al. | 2203.08040 | null |
| 2022-03-14 | Drift Reduced Navigation with Deep Explainable Features | Mohd Omama et.al. | 2203.06897 | link |
| 2022-03-11 | An Efficient Accelerator for Deep Learning-based Point Cloud Registration on FPGAs | Keisuke Sugiura et.al. | 2203.05763 | null |
| 2022-03-10 | High Definition, Inexpensive, Underwater Mapping | Bharat Joshi et.al. | 2203.05640 | link |
| 2022-03-10 | SelfTune: Metrically Scaled Monocular Depth Estimation through Self-Supervised Learning | Jaehoon Choi et.al. | 2203.05332 | null |
| 2022-03-08 | Tune your Place Recognition: Self-Supervised Domain Calibration via Robust SLAM | Pierre-Yves Lajoie et.al. | 2203.04446 | link |
| 2022-03-08 | SLAM-Supported Self-Training for 6D Object Pose Estimation | Ziqi Lu et.al. | 2203.04424 | link |
| 2022-03-08 | An Online Semantic Mapping System for Extending and Enhancing Visual SLAM | Thorsten Hempel et.al. | 2203.03944 | null |
| 2022-03-07 | Multi-Modal Lidar Dataset for Benchmarking General-Purpose Localization and Mapping Algorithms | Qingqing Li et.al. | 2203.03454 | link |
| 2022-03-07 | OverlapTransformer: An Efficient and Rotation-Invariant Transformer Network for LiDAR-Based Place Recognition | Junyi Ma et.al. | 2203.03397 | link |
| 2022-03-06 | Minimum Cost Multicuts for Incorrect Landmark Edge Detection in Pose-graph SLAM | Kazushi Aiba et.al. | 2203.02887 | null |
| 2022-03-06 | RGB-D SLAM in Indoor Planar Environments with Multiple Large Dynamic Objects | Ran Long et.al. | 2203.02882 | null |
| 2022-03-03 | STUN: Self-Teaching Uncertainty Estimation for Place Recognition | Kaiwen Cai et.al. | 2203.01851 | link |
| 2022-03-03 | Continual SLAM: Beyond Lifelong Simultaneous Localization and Mapping through Continual Learning | Niclas Vödisch et.al. | 2203.01578 | link |
| 2022-03-02 | FAST-LIVO: Fast and Tightly-coupled Sparse-Direct LiDAR-Inertial-Visual Odometry | Chunran Zheng et.al. | 2203.00893 | link |
| 2022-03-02 | Distributed Riemannian Optimization with Lazy Communication for Collaborative Geometric Estimation | Yulun Tian et.al. | 2203.00851 | null |
| 2022-03-01 | Descriptellation: Deep Learned Constellation Descriptors for SLAM | Chunwei Xing et.al. | 2203.00567 | null |
| 2022-03-01 | Collaborative Robot Mapping using Spectral Graph Analysis | Lukas Bernreiter et.al. | 2203.00308 | null |
| 2022-02-26 | RL-PGO: Reinforcement Learning-based Planar Pose-Graph Optimization | Nikolaos Kourtzanidis et.al. | 2202.13221 | link |
| 2022-02-25 | Probabilistic Data Association for Semantic SLAM at Scale | Elad Michael et.al. | 2202.12802 | link |
| 2022-02-24 | TwistSLAM: Constrained SLAM in Dynamic Environment | Mathieu Gonzalez et.al. | 2202.12384 | null |
| 2022-02-24 | Light Robust Monocular Depth Estimation For Outdoor Environment Via Monochrome And Color Camera Fusion | Hyeonsoo Jang et.al. | 2202.12108 | null |
| 2022-02-23 | MITI: SLAM Benchmark for Laparoscopic Surgery | Regine Hartwig et.al. | 2202.11496 | null |
| 2022-02-23 | DL-SLOT: Dynamic Lidar SLAM and Object Tracking Based On Graph Optimization | Xuebo Tian et.al. | 2202.11431 | null |
| 2022-02-23 | Are We Ready for Robust and Resilient SLAM? A Framework For Quantitative Characterization of SLAM Datasets | Islam Ali et.al. | 2202.11312 | null |
| 2022-02-22 | SAGE: SLAM with Appearance and Geometry Prior for Endoscopy | Xingtong Liu et.al. | 2202.09487 | link |
| 2022-02-18 | OKVIS2: Realtime Scalable Visual-Inertial SLAM with Loop Closure | Stefan Leutenegger et.al. | 2202.09199 | null |
| 2022-02-18 | MultiRes-NetVLAD: Augmenting Place Recognition Training with Low-Resolution Imagery | Ahmad Khaliq et.al. | 2202.09146 | link |
| 2022-02-18 | An Energy-Efficient and Runtime-Reconfigurable FPGA-Based Accelerator for Robotic Localization Systems | Qiang Liu et.al. | 2202.08952 | null |
| 2022-02-17 | Continuous-Time vs. Discrete-Time Vision-based SLAM: A Comparative Study | Giovanni Cioffi et.al. | 2202.08894 | link |
| 2022-02-17 | LiDAR-Inertial 3D SLAM with Plane Constraint for Multi-story Building | Jiashi Zhang et.al. | 2202.08487 | null |
| 2022-02-16 | Virtual Maps for Autonomous Exploration of Cluttered Underwater Environments | Jinkun Wang et.al. | 2202.08359 | null |
| 2022-02-11 | Overhead Image Factors for Underwater Sonar-based SLAM | John McConnell et.al. | 2202.05811 | null |
| 2022-02-10 | Scale Estimation with Dual Quadrics for Monocular Object SLAM | Shuangfu Song et.al. | 2202.04816 | null |
| 2022-02-08 | A Novel Image Descriptor with Aggregated Semantic Skeleton Representation for Long-term Visual Place Recognition | Nie Jiwei et.al. | 2202.03677 | null |
| 2022-01-25 | Autonomous Vehicles: Open-Source Technologies, Considerations, and Development | Oussama Saoudi et.al. | 2202.03148 | null |
| 2022-02-07 | Temporal Point Cloud Completion with Pose Disturbance | Jieqi Shi et.al. | 2202.03084 | null |
| 2022-02-04 | DYP-SLAM: A Real-time Visual SLAM Based on YOLO and Probability in Dynamic Environments | Xinggang Hu et.al. | 2202.01938 | null |
| 2022-02-01 | A Model for Multi-View Residual Covariances based on Perspective Deformation | Alejandro Fontan et.al. | 2202.00765 | null |
| 2022-01-30 | Joint Vehicular Localization and Reflective Mapping Based on Team Channel-SLAM | Xinghe Chu et.al. | 2201.12726 | null |
| 2022-01-28 | RGB-D SLAM Using Attention Guided Frame Association | Ali Caglayan et.al. | 2201.12047 | null |
| 2022-02-04 | Learning to Act with Affordance-Aware Multimodal Neural SLAM | Zhiwei Jia et.al. | 2201.09862 | link |
| 2022-01-22 | Phase-SLAM: Phase Based Simultaneous Localization and Mapping for Mobile Structured Light Illumination Systems | Xi Zheng et.al. | 2201.09048 | link |
| 2022-01-17 | SC-LiDAR-SLAM: a Front-end Agnostic Versatile LiDAR SLAM System | Giseop Kim et.al. | 2201.06423 | null |
| 2022-01-14 | SRVIO: Super Robust Visual Inertial Odometry for dynamic environments and challenging Loop-closure conditions | Ali Samadzadeh et.al. | 2201.05386 | link |
| 2022-01-19 | Multi-Hypothesis Scan Matching through Clustering | Giorgio Iavicoli et.al. | 2201.03814 | null |
| 2022-01-11 | Performance Guarantees for Spectral Initialization in Rotation Averaging and Pose-Graph SLAM | Kevin J. Doherty et.al. | 2201.03773 | null |
| 2022-01-10 | High-resolution Ecosystem Mapping in Repetitive Environments Using Dual Camera SLAM | Brian M. Hopkinson et.al. | 2201.03364 | link |
| 2022-01-10 | Why-So-Deep: Towards Boosting Previously Trained Models for Visual Place Recognition | M. Usman Maqbool Bhutta et.al. | 2201.03212 | link |
| 2022-01-04 | Formulations of Hydrodynamic Force in the Transition Stage of the Water Entry of Linear Wedges with Constant and Varying Speeds | Xueliang Wen et.al. | 2201.00959 | null |
| 2021-12-29 | Efficient Belief Space Planning in High-Dimensional State Spaces using PIVOT: Predictive Incremental Variable Ordering Tactic | Khen Elimelech et.al. | 2112.14428 | null |
| 2021-12-19 | M2DGR: A Multi-sensor and Multi-scenario SLAM Dataset for Ground Robots | Jie Yin et.al. | 2112.13659 | link |
| 2021-12-27 | UV-SLAM: Unconstrained Line-based SLAM Using Vanishing Points for Structural Mapping | Hyunjun Lim et.al. | 2112.13515 | link |
| 2021-12-25 | Simultaneous Location of Rail Vehicles and Mapping of Environment with Multiple LiDARs | Yusheng Wang et.al. | 2112.13224 | null |
| 2021-12-25 | Edge Robotics: Edge-Computing-Accelerated Multi-Robot Simultaneous Localization and Mapping | Peng Huang et.al. | 2112.13222 | null |
| 2021-12-24 | 3D Point Cloud Reconstruction and SLAM as an Input | Ziyu Li et.al. | 2112.12907 | null |
| 2021-12-22 | NICE-SLAM: Neural Implicit Scalable Encoding for SLAM | Zihan Zhu et.al. | 2112.12130 | link |
| 2021-12-18 | Fast and Robust Registration of Partially Overlapping Point Clouds | Eduardo Arnold et.al. | 2112.09922 | link |
| 2021-12-17 | Symmetry-aware Neural Architecture for Embodied Visual Navigation | Shuang Liu et.al. | 2112.09515 | null |
| 2021-12-27 | Homography Decomposition Networks for Planar Object Tracking | Xinrui Zhan et.al. | 2112.07909 | link |
| 2021-12-14 | Autonomous Navigation System from Simultaneous Localization and Mapping | Micheal Caracciolo et.al. | 2112.07723 | link |
| 2021-12-12 | 360-DFPE: Leveraging Monocular 360-Layouts for Direct Floor Plan Estimation | Bolivar Solarte et.al. | 2112.06180 | link |
| 2021-12-11 | Simultaneous Localization and Mapping: Through the Lens of Nonlinear Optimization | Amay Saxena et.al. | 2112.05921 | null |
| 2021-12-07 | Hybrid Visual SLAM for Underwater Vehicle Manipulator Systems | Gideon Billings et.al. | 2112.03826 | link |
| 2021-12-05 | Iterated Posterior Linearization PMB Filter for 5G SLAM | Yu Ge et.al. | 2112.02575 | null |
| 2021-12-03 | Fast Direct Stereo Visual SLAM | Jiawei Mo et.al. | 2112.01890 | link |
| 2021-12-02 | MegBA: A High-Performance and Distributed Library for Large-Scale Bundle Adjustment | Jie Ren et.al. | 2112.01349 | link |
| 2021-12-01 | Research on Event Accumulator Settings for Event-Based SLAM | Kun Xiao et.al. | 2112.00427 | link |
| 2021-11-29 | An in-depth experimental study of sensor usage and visual reasoning of robots navigating in real environments | Assem Sadek et.al. | 2111.14666 | null |
| 2021-11-29 | Deployment of Aerial Robots after a major fire of an industrial hall with hazardous substances, a report | Hartmut Surmann et.al. | 2111.14542 | null |
| 2021-11-24 | Automatic Mapping with Obstacle Identification for Indoor Human Mobility Assessment | V. Ayala-Alfaro et.al. | 2111.12690 | null |
| 2021-11-24 | Autonomous bot with ML-based reactive navigation for indoor environment | Yash Srivastava et.al. | 2111.12542 | null |
| 2021-11-22 | A General Framework for Lifelong Localization and Mapping in Changing Environment | Min Zhao et.al. | 2111.10946 | link |
| 2021-11-17 | Probabilistic Spatial Distribution Prior Based Attentional Keypoints Matching Network | Xiaoming Zhao et.al. | 2111.09006 | null |
| 2021-11-10 | Comparing dominance of tennis' big three via multiple-output Bayesian quantile regression models | Bruno Santos et.al. | 2111.05631 | null |
| 2021-11-10 | TomoSLAM: factor graph optimization for rotation angle refinement in microtomography | Mark Griguletskii et.al. | 2111.05562 | null |
| 2021-11-07 | Hierarchical Segment-based Optimization for SLAM | Yuxin Tian et.al. | 2111.04101 | null |
| 2021-11-07 | Online Mutual Adaptation of Deep Depth Prediction and Visual SLAM | Shing Yan Loo et.al. | 2111.04096 | null |
| 2021-11-05 | MSC-VO: Exploiting Manhattan and Structural Constraints for Visual Odometry | Joan P. Company-Corcoles et.al. | 2111.03408 | null |
| 2021-10-31 | Loop closure detection using local 3D deep descriptors | Youjie Zhou et.al. | 2111.00440 | link |
| 2021-10-27 | Millimeter Wave Wireless Assisted Robot Navigation with Link State Classification | Mingsheng Yin et.al. | 2110.14789 | link |
| 2021-10-27 | Efficient Placard Discovery for Semantic Mapping During Frontier Exploration | David Balaban et.al. | 2110.14742 | null |
| 2021-10-26 | Robust Multi-view Registration of Point Sets with Laplacian Mixture Model | Jin Zhang et.al. | 2110.13744 | null |
| 2021-10-25 | WOLF: A modular estimation framework for robotics based on factor graphs | Joan Sola et.al. | 2110.12919 | null |
| 2021-10-21 | Real-Time Ground-Plane Refined LiDAR SLAM | Fan Yang et.al. | 2110.11517 | null |
| 2021-10-21 | SymbioLCD: Ensemble-Based Loop Closure Detection using CNN-Extracted Objects and Visual Bag-of-Words | Jonathan J. Y. Kim et.al. | 2110.11491 | null |
| 2021-10-21 | InterpolationSLAM: A Novel Robust Visual SLAM System in Rotational Motion | Zhenkun Zhu et.al. | 2110.11040 | null |
| 2021-10-20 | SLAM: A Unified Encoder for Speech and Language Modeling via Speech-Text Joint Pre-Training | Ankur Bapna et.al. | 2110.10329 | null |
| 2021-10-18 | Enhancing exploration algorithms for navigation with visual SLAM | Kirill Muravyev et.al. | 2110.09156 | null |
| 2021-10-18 | Accurate and Robust Object-oriented SLAM with 3D Quadric Landmark Construction in Outdoor Environment | Rui Tian et.al. | 2110.08977 | null |
| 2021-10-16 | Partial Hierarchical Pose Graph Optimization for SLAM | Alexander Korovko et.al. | 2110.08639 | null |
| 2021-10-14 | Active SLAM over Continuous Trajectory and Control: A Covariance-Feedback Approach | Shumon Koga et.al. | 2110.07546 | null |
| 2021-10-13 | Collaborative Radio SLAM for Multiple Robots based on WiFi Fingerprint Similarity | Ran Liu et.al. | 2110.06541 | null |
| 2021-10-12 | Learning Efficient Multi-Agent Cooperative Visual Exploration | Chao Yu et.al. | 2110.05734 | null |
| 2021-10-07 | Self-Supervised Depth Completion for Active Stereo | Frederik Warburg et.al. | 2110.03234 | null |
| 2021-10-06 | InterpolationSLAM: A Novel Robust Visual SLAM System in Rotating Scenes | Zhenkun Zhu et.al. | 2110.02593 | null |
| 2021-10-03 | AEROS: Adaptive RObust least-Squares for Graph-Based SLAM | Milad Ramezani et.al. | 2110.02018 | null |
| 2021-10-04 | Fast Uncertainty Quantification for Active Graph SLAM | Julio A. Placed et.al. | 2110.01289 | link |
| 2021-10-04 | Geometry-based Graph Pruning for Lifelong SLAM | Gerhard Kurz et.al. | 2110.01286 | null |
| 2021-10-03 | Quadrotor Control on |
Marcus Greiff et.al. | 2110.01099 | null |
| 2021-10-02 | Online Incremental Non-Gaussian Inference for SLAM Using Normalizing Flows | Qiangqiang Huang et.al. | 2110.00876 | link |
| Publish Date | Title | Authors | Code | |
|---|---|---|---|---|
| 2025-12-04 | Deep infant brain segmentation from multi-contrast MRI | Malte Hoffmann et.al. | 2512.05114 | null |
| 2025-12-04 | QKAN-LSTM: Quantum-inspired Kolmogorov-Arnold Long Short-term Memory | Yu-Chao Hsu et.al. | 2512.05049 | null |
| 2025-12-04 | Geometric Data Science | Olga D Anosova et.al. | 2512.05040 | null |
| 2025-12-04 | Internal superfluid response and torque evolution in the giant glitch of PSR J1718-3718 | Peng Liu et.al. | 2512.04972 | null |
| 2025-12-04 | Canonical Rough Path over Tempered Fractional Brownian Motion: Existence, Construction, and Applications | Atef Lechiheb et.al. | 2512.04646 | null |
| 2025-12-04 | Refaçade: Editing Object with Given Reference Texture | Youze Huang et.al. | 2512.04534 | null |
| 2025-12-04 | Development of a 15-Degree-of-Freedom Bionic Hand with Cable-Driven Transmission and Distributed Actuation | Haoqi Han et.al. | 2512.04399 | null |
| 2025-12-03 | Gamma-from-Mono: Road-Relative, Metric, Self-Supervised Monocular Geometry for Vehicular Applications | Gasser Elazab et.al. | 2512.04303 | null |
| 2025-12-03 | Emergent Outlier View Rejection in Visual Geometry Grounded Transformers | Jisang Han et.al. | 2512.04012 | null |
| 2025-12-03 | DirectDrag: High-Fidelity, Mask-Free, Prompt-Free Drag-based Image Editing via Readout-Guided Feature Alignment | Sheng-Hao Liao et.al. | 2512.03981 | null |
| 2025-11-26 | TraceGen: World Modeling in 3D Trace Space Enables Learning from Cross-Embodiment Videos | Seungjae Lee et.al. | 2511.21690 | null |
| 2025-11-26 | UAVLight: A Benchmark for Illumination-Robust 3D Reconstruction in Unmanned Aerial Vehicle (UAV) Scenes | Kang Du et.al. | 2511.21565 | null |
| 2025-11-26 | From Observation to Action: Latent Action-based Primitive Segmentation for VLA Pre-training in Industrial Settings | Jiajie Zhang et.al. | 2511.21428 | null |
| 2025-11-26 | DeepRFTv2: Kernel-level Learning for Image Deblurring | Xintian Mao et.al. | 2511.21132 | null |
| 2025-11-25 | Hund-projected Kanamori model: an effective description of Hund's metals near the Mott insulating regime | Johan Carlström et.al. | 2511.20788 | null |
| 2025-11-25 | From Observations to Simulations: A Neural-Network Approach to Intracluster Medium Kinematics | E. Gatuzz et.al. | 2511.20755 | null |
| 2025-11-25 | Diverse Video Generation with Determinantal Point Process-Guided Policy Optimization | Tahira Kazimi et.al. | 2511.20647 | null |
| 2025-11-25 | Dance Style Classification using Laban-Inspired and Frequency-Domain Motion Features | Ben Hamscher et.al. | 2511.20469 | null |
| 2025-11-25 | AMB3R: Accurate Feed-forward Metric-scale 3D Reconstruction with Backend | Hengyi Wang et.al. | 2511.20343 | null |
| 2025-11-25 | Stochastic Dynamics of Skyrmions on a Racetrack: Impact of Equilibrium and Nonequilibrium Noise | Anton V. Hlushchenko et.al. | 2511.20287 | null |
| 2025-11-24 | Dynamic Multi-Species Bird Soundscape Generation with Acoustic Patterning and 3D Spatialization | Ellie L. Zhang et.al. | 2511.19275 | null |
| 2025-11-24 | A Deep-Learning-Based Framework for Focal Mechanism Determination and Its Application to the 2022 Luding Earthquake Sequence | Ziye Yu et.al. | 2511.19185 | null |
| 2025-11-24 | MetroGS: Efficient and Stable Reconstruction of Geometrically Accurate High-Fidelity Large-Scale Scenes | Kehua Chen et.al. | 2511.19172 | null |
| 2025-11-24 | The variability of blazars throughout the electromagnetic spectrum | Claudia M. Raiteri et.al. | 2511.18975 | null |
| 2025-11-24 | MagicWorld: Interactive Geometry-driven Video World Exploration | Guangyuan Li et.al. | 2511.18886 | null |
| 2025-11-24 | STCDiT: Spatio-Temporally Consistent Diffusion Transformer for High-Quality Video Super-Resolution | Junyang Chen et.al. | 2511.18786 | null |
| 2025-11-24 | On the role of fractional Brownian motion in models of chemotaxis and stochastic gradient ascent | Gustavo Cornejo-Olea et.al. | 2511.18745 | null |
| 2025-11-23 | C3Po: Cross-View Cross-Modality Correspondence by Pointmap Prediction | Kuan Wei Huang et.al. | 2511.18559 | null |
| 2025-11-23 | Non-Symplectic Deformations of Geometric Quantisation | Kerr Maxwell et.al. | 2511.18549 | null |
| 2025-11-23 | Zero-Shot Video Deraining with Video Diffusion Models | Tuomas Varanka et.al. | 2511.18537 | null |
| 2025-11-23 | Expanding the Workspace of Electromagnetic Navigation Systems Using Dynamic Feedback for Single- and Multi-agent Control | Jasan Zughaibi et.al. | 2511.18486 | null |
| 2025-11-23 | Escape from end-pinching in Herschel-Bulkley ligaments | Shu Yang et.al. | 2511.18388 | null |
| 2025-11-23 | EgoVITA: Learning to Plan and Verify for Egocentric Video Reasoning | Yogesh Kulkarni et.al. | 2511.18242 | null |
| 2025-11-22 | MotionDuet: Dual-Conditioned 3D Human Motion Generation with Video-Regularized Text Learning | Yi-Yang Zhang et.al. | 2511.18209 | null |
| 2025-11-22 | A Unified Multi-Dynamics Framework for Perception-Oriented Modeling in Tendon-Driven Continuum Robots | Ibrahim Alsarraj et.al. | 2511.18088 | null |
| 2025-11-22 | Plan-X: Instruct Video Generation via Semantic Planning | Lun Huang et.al. | 2511.17986 | null |
| 2025-11-22 | Dynamic Slowdown and Spatial Correlations in Viscous Silica Melt: Perspectives from Dynamic Disorder | Shubham Kumar et.al. | 2511.17887 | null |
| 2025-11-21 | Lane-Frame Quantum Multimodal Driving Forecasts for the Trajectory of Autonomous Vehicles | Navneet Singh et.al. | 2511.17675 | null |
| 2025-11-18 | Unified Low-Light Traffic Image Enhancement via Multi-Stage Illumination Recovery and Adaptive Noise Suppression | Siddiqua Namrah et.al. | 2511.17612 | null |
| 2025-10-24 | RadioMapMotion: A Dataset and Baseline for Proactive Spatio-Temporal Radio Environment Prediction | Honggang Jia et.al. | 2511.17526 | null |
| 2025-11-21 | TRAO Survey of the Nearby Filamentary Molecular Clouds, the Universal Nursery of Stars (TRAO-FUNS). IV. Filaments and Dense Cores in the W40 and Serpens South Regions of Aquila | Satyajeet Moharana et.al. | 2511.16978 | null |
| 2025-11-21 | One Walk is All You Need: Data-Efficient 3D RF Scene Reconstruction with Human Movements | Yiheng Bian et.al. | 2511.16966 | null |
| 2025-11-20 | TriDiff-4D: Fast 4D Generation through Diffusion-based Triplane Re-posing | Eddie Pokming Sheung et.al. | 2511.16662 | null |
| 2025-11-20 | Flow and Depth Assisted Video Prediction with Latent Transformer | Eliyas Suleyman et.al. | 2511.16484 | null |
| 2025-11-20 | Two Epochs of VLBI Observations of 8 KISSR Seyfert & LINER Galaxies: Suggestions of Fast and Filamentary Outflows | Preeti Kharb et.al. | 2511.16159 | null |
| 2025-11-19 | MambaIO: Global-Coordinate Inertial Odometry for Pedestrians via Multi-Scale Frequency-Decoupled Modeling | Shanshan Zhang et.al. | 2511.15645 | null |
| 2025-11-19 | Covariant Measures of Non-Markovianity in Curved Spacetime | Tushar Waghmare et.al. | 2511.15365 | null |
| 2025-11-19 | Generating Natural-Language Surgical Feedback: From Structured Representation to Domain-Grounded Evaluation | Firdavs Nasriddinov et.al. | 2511.15159 | null |
| 2025-11-19 | SceneEdited: A City-Scale Benchmark for 3D HD Map Updating via Image-Guided Change Detection | Chun-Jung Lin et.al. | 2511.15153 | null |
| 2025-11-18 | Gaussian See, Gaussian Do: Semantic 3D Motion Transfer from Multiview Video | Yarin Bekor et.al. | 2511.14848 | null |
| 2025-11-18 | Blur-Robust Detection via Feature Restoration: An End-to-End Framework for Prior-Guided Infrared UAV Target Detection | Xiaolin Wang et.al. | 2511.14371 | null |
| 2025-11-18 | Hubble Space Telescope proper motions of Large Magellanic Cloud star clusters -- II. Kinematic structure of young and intermediate-age clusters | F. Niederhofer et.al. | 2511.14351 | null |
| 2025-11-18 | Vortex stability in pseudo-Hermitian theories | R. A. Battye et.al. | 2511.14300 | null |
| 2025-11-18 | Model-Based Clustering of Football Event Sequences: A Marked Spatio-Temporal Point Process Mixture Approach | Koffi Amezouwui et.al. | 2511.14297 | null |
| 2025-11-18 | Newborn jet in the symbiotic system R Aquarii | T. Liimets et.al. | 2511.14243 | null |
| 2025-11-18 | FreeMusco: Motion-Free Learning of Latent Control for Morphology-Adaptive Locomotion in Musculoskeletal Characters | Minkwan Kim et.al. | 2511.14205 | null |
| 2025-11-18 | AsyncVLA: Asynchronous Flow Matching for Vision-Language-Action Models | Yuhua Jiang et.al. | 2511.14148 | null |
| 2025-11-17 | B2F: End-to-End Body-to-Face Motion Generation with Style Reference | Bokyung Jang et.al. | 2511.13988 | null |
| 2025-11-17 | Enabling Real-Time Volumetric Imaging in Interventional Radiology Suits via a Deep Learning Framework Robust to C-arm Tilt | Fawazilla Utomo et.al. | 2511.13980 | null |
| 2025-11-17 | Ultrafast electron diffractive imaging of the dissociation of pre-excited molecules | Yanwei Xiong et.al. | 2511.13479 | null |
| 2025-11-17 | An Automated Framework for Analyzing Structural Evolution in On-the-fly Non-adiabatic Molecular Dynamics Using Autoencoder and Multiple Molecular Descriptors | Hangxu Liu et.al. | 2511.13364 | null |
| 2025-11-17 | The Spontaneous Genesis of Solar Prominence Structures Driven by Supergranulation in Three-Dimensional Simulations | Huanxin Chen et.al. | 2511.13252 | null |
| 2025-11-17 | Infrared photometry and CaT spectroscopy of the most metal-poor in-situ globular cluster VVV-CL001 | W. Haro Moya et.al. | 2511.13161 | null |
| 2025-11-16 | Kagome metals | Domenico Di Sante et.al. | 2511.12731 | null |
| 2025-11-16 | Examining Turbulence in Galactic Molecular Clouds - II: Continuity of Turbulence Cascading in a Portion of the Local Arm | Yuehui Ma et.al. | 2511.12418 | null |
| 2025-11-16 | Towards Rotation-only Imaging Geometry: Rotation Estimation | Xinrui Li et.al. | 2511.12415 | null |
| 2025-11-14 | Free3D: 3D Human Motion Emerges from Single-View 2D Supervision | Sheng Liu et.al. | 2511.11368 | null |
| 2025-11-14 | YCB-Ev SD: Synthetic event-vision dataset for 6DoF object pose estimation | Pavel Rojtberg et.al. | 2511.11344 | null |
| 2025-11-14 | The Spatial Evolution of Star Clusters in NGC 628 with JWST | Anne S. M. Buckner et.al. | 2511.11115 | null |
| 2025-11-14 | Discovery of an X-ray bridge between the comma-shaped gas and the main cluster in MCXC J0157.4-0550 | Chong Yang et.al. | 2511.10968 | null |
| 2025-11-14 | DEFT-LLM: Disentangled Expert Feature Tuning for Micro-Expression Recognition | Ren Zhang et.al. | 2511.10948 | null |
| 2025-11-14 | A High-Precision Dynamical Model of Callisto: Incorporating Rotation Effects within Multi-Layer Internal Structure Models | Kai Huang et.al. | 2511.10929 | null |
| 2025-11-14 | Collaborative Multi-Robot Non-Prehensile Manipulation via Flow-Matching Co-Generation | Yorai Shaoul et.al. | 2511.10874 | null |
| 2025-11-13 | A validated lumped-element model for bioinspired acoustic flow sensing toward the performance limit | Wei Sun et.al. | 2511.10830 | null |
| 2025-11-13 | From Attention to Frequency: Integration of Vision Transformer and FFT-ReLU for Enhanced Image Deblurring | Syed Mumtahin Mahmud et.al. | 2511.10806 | null |
| 2025-11-13 | Towards Attribution of Generators and Emotional Manipulation in Cross-Lingual Synthetic Speech using Geometric Learning | Girish et.al. | 2511.10790 | null |
| 2025-11-13 | The Quiescent Merging Nature of the Coma Cluster Revealed by ICM Velocity Structure | E. Gatuzz et.al. | 2511.10740 | null |
| 2025-11-13 | From Fold to Function: Dynamic Modeling and Simulation-Driven Design of Origami Mechanisms | Tianhui Han et.al. | 2511.10580 | null |
| 2025-11-13 | M3Scope a 3D multimode multiplane microscope for imaging nanoscale dynamics in soft matter | Steven Huysecom et.al. | 2511.10174 | null |
| 2025-11-13 | Physics-informed Machine Learning for Static Friction Modeling in Robotic Manipulators Based on Kolmogorov-Arnold Networks | Yizheng Wang et.al. | 2511.10079 | null |
| 2025-11-13 | Mitigating Error Accumulation in Co-Speech Motion Generation via Global Rotation Diffusion and Multi-Level Constraints | Xiangyue Zhang et.al. | 2511.10076 | null |
| 2025-11-13 | PuffyBot: An Untethered Shape Morphing Robot for Multi-environment Locomotion | Shashwat Singh et.al. | 2511.09885 | null |
| 2025-11-13 | AHA! Animating Human Avatars in Diverse Scenes with Gaussian Splatting | Aymen Mir et.al. | 2511.09827 | null |
| 2025-11-12 | DreamPose3D: Hallucinative Diffusion with Prompt Learning for 3D Human Pose Estimation | Jerrin Bright et.al. | 2511.09502 | null |
| 2025-11-12 | SPIDER: Scalable Physics-Informed Dexterous Retargeting | Chaoyi Pan et.al. | 2511.09484 | null |
| 2025-11-12 | 3D PIC simulation and theoretical modeling of RF Laser pulse in magnetized plasma for the generation of multidimensional relativistic Wakefields | A. A. Molavi Choobini et.al. | 2511.09079 | null |
| 2025-11-12 | Group-Theoretic Structure Governing Identifiability in Inverse Problems | Isshin Arai et.al. | 2511.08995 | null |
| 2025-11-11 | Resolving Thermospheric Vertical Wind Ambiguities and Energy Processes | Jeffrey P. Thayer et.al. | 2511.08830 | null |
| 2025-11-11 | Analytical Description of Baryonic Matter Fluctuations Using Jeans Filtering Functions in Second-Order Cosmological Perturbation Theory | Diego Fernando Fonseca et.al. | 2511.08820 | null |
| 2025-11-11 | 3D MHD simulations of coronal loops heated via magnetic braiding I. Continuous driving | Gabriele Cozzo et.al. | 2511.08726 | null |
| 2025-11-11 | Coordinated Space- and Ground-based Monitoring of Accretion Bursts in a Protoplanetary Disk: The Orbital and Accretion Properties of DQ Tau | Hala Alqubelat et.al. | 2511.08311 | null |
| 2025-11-11 | Direction and speed selectivity properties for spatio-temporal receptive fields according to the generalized Gaussian derivative model for visual receptive fields | Tony Lindeberg et.al. | 2511.08101 | null |
| 2025-11-17 | Silicon-photonic optomechanical magnetometer | Fernando Gottardo et.al. | 2511.07852 | null |
| 2025-11-11 | Human Motion Synthesis in 3D Scenes via Unified Scene Semantic Occupancy | Gong Jingyu et.al. | 2511.07819 | null |
| 2025-11-10 | DIMO: Diverse 3D Motion Generation for Arbitrary Objects | Linzhan Mou et.al. | 2511.07409 | null |
| 2025-11-10 | Ultrafast Topological Transitions Driven by Permittivity Modulation in Non-Hermitian Multilayers | Giuseppina Simone et.al. | 2511.06963 | null |
| 2025-11-10 | Ambiguity-aware Truncated Flow Matching for Ambiguous Medical Image Segmentation | Fanding Li et.al. | 2511.06857 | null |
| 2025-11-10 | SDSS-ALMA Legacy Value Archival Gas Exploration (SALVAGE) -- I: global star formation is governed by central (not global) molecular gas | Scott Wilkinson et.al. | 2511.06775 | null |
| 2025-11-08 | Development and testing of novel soft sleeve actuators | Mohammed Abboodi et.al. | 2511.06102 | null |
| 2025-11-08 | Hybrid CNN-ViT Framework for Motion-Blurred Scene Text Restoration | Umar Rashid et.al. | 2511.06087 | null |
| 2025-11-08 | Equilibrium Portfolio Selection under Utility-Variance Analysis of Log Returns in Incomplete Markets | Yue Cao et.al. | 2511.05861 | null |
| 2025-11-08 | Supermassive Black Hole and Broad-line Region in NGC 5548: 2023 Reverberation Mapping Results | Wen-Zhe Xi et.al. | 2511.05851 | null |
| 2025-11-07 | A dual grid geometric electromagnetic particle in cell method | Katharina Kormann et.al. | 2511.05032 | null |
| 2025-11-06 | Kinematic and extinction analysis of a potential spiral arm beyond the Galactic bar | Simran Joharle et.al. | 2511.04778 | null |
| 2025-11-06 | Sub-Gyr variability around the SFMS and its contribution to the scatter | A. Camps-Fariña et.al. | 2511.04745 | null |
| 2025-11-06 | Dissecting coherent motions in extreme wall shear stress events within adverse pressure gradient turbulent boundary layers | Leandro J. O. Silva et.al. | 2511.04620 | null |
| 2025-11-21 | Disentangled Concepts Speak Louder Than Words: Explainable Video Action Recognition | Jongseo Lee et.al. | 2511.03725 | null |
| 2025-11-05 | Extreme-Mass-Ratio Inspirals Embedded in Dark Matter Halo I:Existence of Homoclinic Orbit and Near-Horizon Chaos | Surajit Das et.al. | 2511.03657 | null |
| 2025-11-04 | Comparative Investigations on Active and Passive Tails of Undulating Swimmers | Dev Pradeepkumar Nayak et.al. | 2511.03057 | null |
| 2025-11-04 | Distributions and evolution of the equatorial rotation velocities of 2937 BAF-type main-sequence stars from asteroseismology | Conny Aerts et.al. | 2511.02909 | null |
| 2025-11-04 | Cycle-Sync: Robust Global Camera Pose Estimation through Enhanced Cycle-Consistent Synchronization | Shaohan Li et.al. | 2511.02329 | null |
| 2025-11-04 | Characterizing the astrometric quality of AGNs in Gaia-CRF3 | Shilong Liao et.al. | 2511.02204 | null |
| 2025-11-03 | Fractional Diffusion Bridge Models | Gabriel Nobis et.al. | 2511.01795 | null |
| 2025-11-03 | Phason-driven temperature-dependent transport in moiré graphene | Alex Boschi et.al. | 2511.01691 | null |
| 2025-11-03 | Apsidal motion in massive binaries | Sophie Rosu et.al. | 2511.01522 | null |
| 2025-11-12 | Robust topological invariants of timelike circular orbits for spinning test particles in black hole spacetimes | Yong Song et.al. | 2511.01447 | null |
| 2025-11-04 | Kinematify: Open-Vocabulary Synthesis of High-DoF Articulated Objects | Jiawei Wang et.al. | 2511.01294 | null |
| 2025-11-03 | Speech-DRAME: A Framework for Human-Aligned Benchmarks in Speech Role-Play | Jiatong Shi et.al. | 2511.01261 | null |
| 2025-11-02 | From Spray to Metric: The Geometric Construction of the Jacobi Metric | Zonghai Li et.al. | 2511.01004 | null |
| 2025-11-02 | The CatWISE2020 Quasar dipole: A Reassessment of the Cosmic Dipole Anomaly | Masroor Bashir et.al. | 2511.00822 | null |
| 2025-11-02 | Real-Time Learning of Predictive Dynamic Obstacle Models for Robotic Motion Planning | Stella Kombo et.al. | 2511.00814 | null |
| 2025-11-01 | Oitijjo-3D: Generative AI Framework for Rapid 3D Heritage Reconstruction from Street View Imagery | Momen Khandoker Ope et.al. | 2511.00362 | null |
| 2025-11-17 | Deep Chandra X-ray Observations of Abell 2029: the Merger History of a Relaxed, Strong Cool Core Cluster | Courtney B. Watson et.al. | 2511.00250 | null |
| 2025-10-30 | Comparing the magnetic Rayleigh-Taylor instability dynamics in two- and three-dimensions | Manohar Teja Kalluri et.al. | 2510.27053 | null |
| 2025-10-30 | HEIR: Learning Graph-Based Motion Hierarchies | Cheng Zheng et.al. | 2510.26786 | null |
| 2025-10-30 | Wrinkle-Induced Hexagonal Boron Nitride Nanochannels for Biomolecule Localization and Imaging | Xiliang Yang et.al. | 2510.26370 | null |
| 2025-10-30 | Ram pressure shaping HVC droplets -- FAST HI observations of HVC AC-III and theoretical interpretation | Xunchuan Liu et.al. | 2510.26077 | null |
| 2025-10-29 | Spherically Symmetric Quantum-Corrected Black Holes with String Clouds: A Multi-Observable Analysis | Faizuddin Ahmed et.al. | 2510.25764 | null |
| 2025-10-29 | Lost in Phonation: Voice Quality Variation as an Evaluation Dimension for Speech Foundation Models | Harm Lameris et.al. | 2510.25577 | null |
| 2025-10-29 | 4-Doodle: Text to 3D Sketches that Move! | Hao Chen et.al. | 2510.25319 | null |
| 2025-10-27 | SFMS-ALR: Script-First Multilingual Speech Synthesis with Adaptive Locale Resolution | Dharma Teja Donepudi et.al. | 2510.25178 | null |
| 2025-10-29 | Magnetic Fields in Massive Star-forming Regions (MagMaR). VI. Magnetic Field Dragging in the Filamentary High-mass Star-forming Region G35.20--0.74N due to Gravity | Jihye Hwang et.al. | 2510.25078 | null |
| 2025-10-28 | The Binary Ballet: Mapping Local Expansion Around M81 & M82 | Jenny Wagner et.al. | 2510.24840 | null |
| 2025-10-29 | Leveraging Scale Separation and Stochastic Closure for Data-Driven Prediction of Chaotic Dynamics | Ismaël Zighed et.al. | 2510.24583 | null |
| 2025-10-28 | Tracking the normal modes of an overpass highway bridge using Distributed Acoustic Sensing | E. Diego Mercerat et.al. | 2510.24212 | null |
| 2025-10-28 | High-energy droplet collisions in multi-interacting hollow cone sprays | Narendra Dev et.al. | 2510.24207 | null |
| 2025-10-27 | Adaptive Keyframe Selection for Scalable 3D Scene Reconstruction in Dynamic Environments | Raman Jha et.al. | 2510.23928 | null |
| 2025-10-27 | Non-Markovian quantum Mpemba effect in strongly correlated quantum dots | YuanDong Wang et.al. | 2510.23445 | null |
| 2025-10-27 | FlowCapX: Physics-Grounded Flow Capture with Long-Term Consistency | Ningxiao Tao et.al. | 2510.23122 | null |
| 2025-10-27 | EndoWave: Rational-Wavelet 4D Gaussian Splatting for Endoscopic Reconstruction | Taoyu Wu et.al. | 2510.23087 | null |
| 2025-10-27 | Adapting Speech Foundation Models with Large Language Models for Unified Speech Recognition | Jing-Xuan Zhang et.al. | 2510.22961 | null |
| 2025-10-26 | MAGIC-Talk: Motion-aware Audio-Driven Talking Face Generation with Customizable Identity Control | Fatemeh Nazarieh et.al. | 2510.22810 | null |
| 2025-10-26 | Kinematics of Acceleration-Induced Excitations in Confined Quantum Fields | Hemansh Shah et.al. | 2510.22797 | null |
| 2025-10-25 | Learning 3D Anisotropic Noise Distributions Improves Molecular Force Field Modeling | Xixian Liu et.al. | 2510.22123 | null |
| 2025-10-21 | Vertex and front-tracking methods for the modeling of microstructure evolution at the solid state: a brief review | Marc Bernacki et.al. | 2510.21818 | null |
| 2025-10-14 | Beyond mechanochromism: Programmable multimodal actuation in cholesteric liquid crystal elastomer hollow fibers | Jiazhe Ma et.al. | 2510.21765 | null |
| 2025-10-24 | Group Inertial Poser: Multi-Person Pose and Global Translation from Sparse Inertial Sensors and Ultra-Wideband Ranging | Ying Xue et.al. | 2510.21654 | null |
| 2025-10-24 | Magnetic Field Configuration of a Quiescent Prominence Revealed by Large-amplitude Longitudinal Oscillations in End-view Observations | Jun Dai et.al. | 2510.21487 | null |
| 2025-10-23 | Kinetics of Peierls dimerization transition: Machine learning force-field approach | Ho Jang et.al. | 2510.20659 | null |
| 2025-10-23 | RubbleSim: A Photorealistic Structural Collapse Simulator for Confined Space Mapping | Constantine Frost et.al. | 2510.20529 | null |
| 2025-10-23 | A simple model for PDFs and nPDFs | A. V. Kotikov et.al. | 2510.20139 | null |
| 2025-10-22 | Stochastic dynamics of quasiparticles in the hard rod gas | Seema Chahal et.al. | 2510.19693 | null |
| 2025-10-22 | Probing Accretion Disk Winds of Stratified Nature with Fe XXVI Doublet in Black Hole X-ray Binaries | Keigo Fukumura et.al. | 2510.19539 | null |
| 2025-10-22 | PRGCN: A Graph Memory Network for Cross-Sequence Pattern Reuse in 3D Human Pose Estimation | Zhuoyang Xie et.al. | 2510.19475 | null |
| 2025-10-22 | Advances in 4D Representation: Geometry, Motion, and Interaction | Mingrui Zhao et.al. | 2510.19255 | null |
| 2025-10-21 | The slope and scatter of the star forming main sequence at z~5 : reconciling observations with simulations | Claudia Di Cesare et.al. | 2510.19044 | null |
| 2025-10-21 | Zhirui Dai et.al. | 2510.18999 | null | |
| 2025-10-21 | Uniqueness of Angular Velocity Reconstruction in Parallel-Beam and Diffraction Tomography | Peter Elbau et.al. | 2510.18829 | null |
| 2025-10-21 | Nonthermal electron acceleration in turbulent post-flare coronal loops | Clarissa Mora et.al. | 2510.18742 | null |
| 2025-10-21 | Observational Tests of Regular Black Holes with Scalar Hair and their Stability | P. A. González et.al. | 2510.18647 | null |
| 2025-10-21 | Multiscale transitional flow in anisotropic nanoparticle suspensions revealed by time-resolved x-ray scatter microscopy | Kesavan Sekar et.al. | 2510.18444 | null |
| 2025-10-21 | MMRHP: A Miniature Mixed-Reality HIL Platform for Auditable Closed-Loop Evaluation | Mingxin Li et.al. | 2510.18371 | null |
| 2025-10-21 | The selection function of the Gaia DR3 open cluster census | Emily L. Hunt et.al. | 2510.18343 | null |
| 2025-10-21 | Hyperbolic Space Learning Method Leveraging Temporal Motion Priors for Human Mesh Recovery | Xiang Zhang et.al. | 2510.18256 | null |
| 2025-10-20 | Geometric Field Theory for Elastohydrodynamics of Cosserat Rods | Mingjia Yan et.al. | 2510.18097 | null |
| 2025-10-20 | Bifurcations of planar balanced configurations for the |
Katharina Kormanna et.al. | 2510.17749 | null |
| 2025-10-20 | Initialize to Generalize: A Stronger Initialization Pipeline for Sparse-View 3DGS | Feng Zhou et.al. | 2510.17479 | null |
| 2025-10-20 | Segmenting infant brains across magnetic fields: Domain randomization and annotation curation in ultra-low field MRI | Vladyslav Zalevskyi et.al. | 2510.17436 | null |
| 2025-10-21 | Leveraging AV1 motion vectors for Fast and Dense Feature Matching | Julien Zouein et.al. | 2510.17434 | null |
| 2025-10-21 | DeepDetect: Learning All-in-One Dense Keypoints | Shaharyar Ahmed Khan Tareen et.al. | 2510.17422 | null |
| 2025-10-20 | Enhanced Motion Forecasting with Plug-and-Play Multimodal Large Language Models | Katie Luo et.al. | 2510.17274 | null |
| 2025-10-20 | Kinetically-induced bound states in a frustrated Rydberg tweezer array | Mu Qiao et.al. | 2510.17183 | null |
| 2025-10-19 | The Lorentz-Violating effects in charged particle systems | E. Maciel et.al. | 2510.17055 | null |
| 2025-10-18 | CryoDyna: Multiscale end-to-end modeling of cryo-EM macromolecule dynamics with physics-aware neural network | Chengwei Zhang et.al. | 2510.16510 | null |
| 2025-10-18 | HGC-Avatar: Hierarchical Gaussian Compression for Streamable Dynamic 3D Avatars | Haocheng Tang et.al. | 2510.16463 | null |
| 2025-10-18 | LightGlueStick: a Fast and Robust Glue for Joint Point-Line Matching | Aidyn Ubingazhibov et.al. | 2510.16438 | null |
| 2025-10-18 | Manual2Skill++: Connector-Aware General Robotic Assembly from Instruction Manuals via Vision-Language Models | Chenrui Tie et.al. | 2510.16344 | null |
| 2025-10-18 | XRISM-Subaru views of Abell 754: an off-axis, near-line-of-sight merging cluster | Nobuhiro Okabe et.al. | 2510.16291 | null |
| 2025-10-17 | DGME-T: Directional Grid Motion Encoding for Transformer-Based Historical Camera Movement Classification | Tingyu Lin et.al. | 2510.15725 | null |
| 2025-10-17 | A single optically detectable tumbling spin in silicon | Félix Cache et.al. | 2510.15590 | null |
| 2025-10-17 | Airway Mucus Rheology: Physical Insights for Navigating through Health to Pathology and Clinical Applications | Zhiwei Liu et.al. | 2510.15562 | null |
| 2025-10-17 | ClapperText: A Benchmark for Text Recognition in Low-Resource Archival Documents | Tingyu Lin et.al. | 2510.15557 | null |
| 2025-10-17 | MRASfM: Multi-Camera Reconstruction and Aggregation through Structure-from-Motion in Driving Scenes | Lingfeng Xuan et.al. | 2510.15467 | null |
| 2025-10-17 | Modeling and Dynamic Simulation of a Hybrid Wind-Wave System on a Hexagonal Semi-Submersible Platform | Saeid Bayat et.al. | 2510.15285 | null |
| 2025-10-17 | CuSfM: CUDA-Accelerated Structure-from-Motion | Jingrui Yu et.al. | 2510.15271 | null |
| 2025-10-16 | OmniMotion: Multimodal Motion Generation with Continuous Masked Autoregression | Zhe Li et.al. | 2510.14954 | null |
| 2025-10-16 | A Physics Prior-Guided Dual-Stream Attention Network for Motion Prediction of Elastic Bragg Breakwaters | Lianzi Jiang et.al. | 2510.14250 | null |
| 2025-10-15 | Is Gravity Truly Balanced? A Historical-Critical Journey Through the Equivalence Principle and the Genesis of Spacetime Geometry | Jaume de Haro et.al. | 2510.13938 | null |
| 2025-10-15 | Turbulent transport for wall shear stress fluctuations | Myoungkyu Lee et.al. | 2510.13758 | null |
| 2025-10-15 | Orbital dynamics and precession in magnetized Kerr spacetime | Karthik Iyer et.al. | 2510.13569 | null |
| 2025-10-15 | Learning Neural Parametric 3D Breast Shape Models for Metrical Surface Reconstruction From Monocular RGB Videos | Maximilian Weiherer et.al. | 2510.13540 | null |
| 2025-10-15 | InstantSfM: Fully Sparse and Parallel Structure-from-Motion | Jiankun Zhong et.al. | 2510.13310 | null |
| 2025-10-15 | Investigating Buoyant Plume Dynamics Induced by Localized Fire-Simulated Heating over Plant Canopies Using LES | Ajinkya Desai et.al. | 2510.13196 | null |
| 2025-11-06 | Dependency of the Bar Formation Timescale On The Halo Spin | Bin-Hui Chen et.al. | 2510.13153 | null |
| 2025-10-15 | Edit-Your-Interest: Efficient Video Editing via Feature Most-Similar Propagation | Yi Zuo et.al. | 2510.13084 | null |
| 2025-10-14 | Mapping the Perseus Galaxy Cluster with XRISM: Gas Kinematic Features and their Implications for Turbulence | Congyao Zhang et.al. | 2510.12782 | null |
| 2025-10-14 | PET Head Motion Estimation Using Supervised Deep Learning with Attention | Zhuotong Cai et.al. | 2510.12758 | null |
| 2025-10-14 | Widespread Hot Molecular Gas Heated by Shear-induced Turbulence in the Galactic Center | Juan Li et.al. | 2510.12518 | null |
| 2025-10-14 | M3D-skin: Multi-material 3D-printed Tactile Sensor with Hierarchical Infill Structures for Pressure Sensing | Shunnosuke Yoshimura et.al. | 2510.12419 | null |
| 2025-10-14 | Scene Coordinate Reconstruction Priors | Wenjing Bian et.al. | 2510.12387 | null |
| 2025-10-14 | Holographic Turbulence and the Fractal Dimension of the Turbulent Horizon | Jia Du et.al. | 2510.12198 | null |
| 2025-10-14 | VIDMP3: Video Editing by Representing Motion with Pose and Position Priors | Sandeep Mishra et.al. | 2510.12069 | null |
| 2025-10-13 | NaviGait: Navigating Dynamically Feasible Gait Libraries using Deep Reinforcement Learning | Neil C. Janwani et.al. | 2510.11542 | null |
| 2025-10-13 | Behavior of passive polymeric tracers of different topologies in a dilute bath of active Brownian particles | Ramanand Singh Yadav et.al. | 2510.11337 | null |
| 2025-10-13 | The chemodynamical memory of a major merger in a NIHAO-UHD Milky Way analogue I: A golden thread through time and space | Sven Buder et.al. | 2510.11284 | null |
| 2025-10-13 | High-Resolution Spatiotemporal Modeling with Global-Local State Space Models for Video-Based Human Pose Estimation | Runyang Feng et.al. | 2510.11017 | null |
| 2025-10-12 | Align2Act: Instruction-Tuned Models for Human-Aligned Autonomous Driving | Kanishkha Jaisankar et.al. | 2510.10503 | null |
| 2025-10-12 | Mesh-Gait: A Unified Framework for Gait Recognition Through Multi-Modal Representation Learning from 2D Silhouettes | Zhao-Yang Wang et.al. | 2510.10406 | null |
| 2025-10-11 | sqrtVINS: Robust and Ultrafast Square-Root Filter-based 3D Motion Tracking | Yuxiang Peng et.al. | 2510.10346 | null |
| 2025-10-11 | Ordinal Scale Traffic Congestion Classification with Multi-Modal Vision-Language and Motion Analysis | Yu-Hsuan Lin et.al. | 2510.10342 | null |
| 2025-10-11 | Detection of Quadruple Structure Near the ASCC 32 Region via Machine Learning Methods | Mohammad Noormohammadi et.al. | 2510.10296 | null |
| 2025-10-11 | Are Video Models Emerging as Zero-Shot Learners and Reasoners in Medical Imaging? | Yuxiang Lai et.al. | 2510.10254 | null |
| 2025-10-11 | BurstDeflicker: A Benchmark Dataset for Flicker Removal in Dynamic Scenes | Lishen Qu et.al. | 2510.09996 | null |
| 2025-10-11 | A no-contact result for a plate-fluid interaction system in dimension three | Mario Bukal et.al. | 2510.09992 | null |
| 2025-10-13 | Guiding Energy-Efficient Locomotion through Impact Mitigation Rewards | Chenghao Wang et.al. | 2510.09543 | null |
| 2025-10-10 | Two-Stage Gaussian Splatting Optimization for Outdoor Scene Reconstruction | Deborah Pintani et.al. | 2510.09489 | null |
| 2025-10-10 | What is the contribution of gravitational infall on the mass assembly of star-forming clouds? A case study in a numerical simulation of the interstellar medium | Noé Brucy et.al. | 2510.09480 | null |
| 2025-10-11 | The Visual Iconicity Challenge: Evaluating Vision-Language Models on Sign Language Form-Meaning Mapping | Onur Keleş et.al. | 2510.08482 | null |
| 2025-10-09 | Video-STAR: Reinforcing Open-Vocabulary Action Recognition with Tools | Zhenlong Yuan et.al. | 2510.08480 | null |
| 2025-10-09 | Scalar-tensor theories in the Lyra geometry: Invariance under local transformations of length units and the Jordan-Einstein frame conundrum | E. C. Valadão et.al. | 2510.08433 | null |
| 2025-10-09 | Beyond hospital reach: Autonomous lightweight ultrasound robot for liver sonography | Zihan Li et.al. | 2510.08106 | null |
| 2025-10-09 | Executable Analytic Concepts as the Missing Link Between VLM Insight and Precise Manipulation | Mingyang Sun et.al. | 2510.07975 | null |
| 2025-10-08 | XRISM/Resolve observations of Hercules X-1: vertical structure and kinematics of the disk wind | Peter Kosec et.al. | 2510.07615 | null |
| 2025-10-08 | Curve separation in supercritical half-space last passage percolation | Evgeni Dimitrov et.al. | 2510.07508 | null |
| 2025-10-07 | Learning from Limited Multi-Phase CT: Dual-Branch Prototype-Guided Framework for Early Recurrence Prediction in HCC | Hsin-Pei Yu et.al. | 2510.07347 | null |
| 2025-10-08 | Dispersion and the transport of exciton-polaritons in an optical conveyor belt | Xingran Xu et.al. | 2510.07049 | null |
| 2025-10-08 | The Star-forming Main Sequence and Bursty Star-formation Histories at |
Leonardo Clarke et.al. | 2510.06681 | null |
| 2025-10-08 | Classical Polymerization of the Bianchi I Model with Deformed Poisson Structure | Babak Vakili et.al. | 2510.06628 | null |
| 2025-10-07 | Text2Interact: High-Fidelity and Diverse Text-to-Two-Person Interaction Generation | Qingxuan Wu et.al. | 2510.06504 | null |
| 2025-10-07 | The first proper motion measurement of the acceleration regions in the large-scale jets of SS 433 powering the W50 nebula | Naomi Tsuji et.al. | 2510.06431 | null |
| 2025-10-07 | Gravitational deflection of charged massive particle around charged galactic wormhole | Md Khalid Hossain et.al. | 2510.06294 | null |
| 2025-10-07 | Cross-Embodiment Dexterous Hand Articulation Generation via Morphology-Aware Learning | Heng Zhang et.al. | 2510.06068 | null |
| 2025-10-07 | Midway Network: Learning Representations for Recognition and Motion from Latent Dynamics | Christopher Hoang et.al. | 2510.05558 | null |
| 2025-10-06 | The Prevalence of Bursty Star Formation in Low-Mass Galaxies at z=1-7 from Hα-to-UV Diagnostics | Marissa N. Perry et.al. | 2510.05388 | null |
| 2025-10-06 | StaMo: Unsupervised Learning of Generalizable Robot Motion from Compact State Representation | Mingyu Liu et.al. | 2510.05057 | null |
| 2025-10-06 | Thermal effects in fluid structure interactions | Sourav Mitra et.al. | 2510.04801 | null |
| 2025-10-06 | Equilibrium properties of strongly confined fluids | Ana M. Montero et.al. | 2510.04546 | null |
| 2025-10-05 | Physics-Inspired All-Pair Interaction Learning for 3D Dynamics Modeling | Kai Yang et.al. | 2510.04233 | null |
| 2025-10-05 | From Shadow to Light: Toward Safe and Efficient Policy Learning Across MPC, DeePC, RL, and LLM Agents | Amin Vahidi-Moghaddam et.al. | 2510.04076 | null |
| 2025-10-04 | Dissecting Larval Zebrafish Hunting using Deep Reinforcement Learning Trained RNN Agents | Raaghav Malik et.al. | 2510.03699 | null |
| 2025-10-03 | Bloch Oscillations and Landau-Zener Transitions in Flat-Band Lattices with Quadratic and Linear Band Touchings | Chenhaoyue Wang et.al. | 2510.03530 | null |
| 2025-10-03 | Selective disruption of reach-related saccade timing following a middle-cerebral artery stroke | Mahya Beheshti et.al. | 2510.03076 | null |
| 2025-10-03 | A Trajectory Generator for High-Density Traffic and Diverse Agent-Interaction Scenarios | Ruining Yang et.al. | 2510.02627 | null |
| 2025-10-23 | DragFlow: Unleashing DiT Priors with Region Based Supervision for Drag Editing | Zihan Zhou et.al. | 2510.02253 | null |
| 2025-10-02 | Non-Gaussian Rotational Diffusion and Swing Motion of Dumbbell Probes in Two Dimensional Colloids | Jeongmin Kim et.al. | 2510.01847 | null |
| 2025-10-02 | Non-Rigid Structure-from-Motion via Differential Geometry with Recoverable Conformal Scale | Yongbo Chen et.al. | 2510.01665 | null |
| 2025-10-01 | Depinning of KPZ Interfaces in Fractional Brownian Landscapes | Neda Valizadeh et.al. | 2510.01103 | null |
| 2025-10-01 | Can World Models Benefit VLMs for World Dynamics? | Kevin Zhang et.al. | 2510.00855 | null |
| 2025-09-30 | Learning Human Reaching Optimality Principles from Minimal Observation Inverse Reinforcement Learning | Sarmad Mehrdad et.al. | 2510.00329 | null |
| 2025-09-30 | JADES: An Abundance of Ultra-Distant T- and Y-Dwarfs in Deep Extragalactic Data | Kevin N. Hainline et.al. | 2510.00111 | null |
| 2025-10-03 | The warm outer layer of a Little Red Dot as the source of [Fe II] and collisional Balmer lines with scattering wings | Alberto Torralba et.al. | 2510.00103 | null |
| 2025-09-30 | Seeing Space and Motion: Enhancing Latent Actions with Spatial and Dynamic Awareness for VLA | Zhejia Cai et.al. | 2509.26251 | null |
| 2025-09-30 | Droplets sliding on single and multiple vertical fibers | Matteo Leonard et.al. | 2509.25898 | null |
| 2025-09-30 | Hierarchical Diffusion Motion Planning with Task-Conditioned Uncertainty-Aware Priors | Amelie Minji Kim et.al. | 2509.25685 | null |
| 2025-09-30 | On the shape of pancakes: catastrophe theory and Gaussian statistics in 2D | Abineet Parichha et.al. | 2509.25608 | null |
| 2025-10-06 | CoTaP: Compliant Task Pipeline and Reinforcement Learning of Its Controller with Compliance Modulation | Zewen He et.al. | 2509.25443 | null |
| 2025-09-29 | Data-Augmented Resolvent Analysis of Wall-Bounded High-Pressure Transcritical Flow | M. Bernades et.al. | 2509.25398 | null |
| 2025-09-29 | Seeking Kinematic Association of Known FU Orionis Stars with Young Clusters in Cygnus | Tamojeet Roychowdhury et.al. | 2509.25341 | null |
| 2025-10-08 | VGGT-X: When VGGT Meets Dense Novel View Synthesis | Yang Liu et.al. | 2509.25191 | null |
| 2025-09-29 | Fast Feature Field ( |
Richeek Das et.al. | 2509.25146 | null |
| 2025-09-29 | Impact of Atomic Substitution on Core-Hole Relaxation Dynamics: A Study of Br |
Nivedita Bhat et.al. | 2509.24915 | null |
| 2025-09-29 | Understanding Cognitive States from Head & Hand Motion Data | Kaiang Wen et.al. | 2509.24255 | null |
| 2025-09-28 | BOSfM: A View Planning Framework for Optimal 3D Reconstruction of Agricultural Scenes | Athanasios Bacharis et.al. | 2509.24126 | null |
| 2025-09-28 | RPG360: Robust 360 Depth Estimation with Perspective Foundation Models and Graph Optimization | Dongki Jung et.al. | 2509.23991 | null |
| 2025-09-28 | CrashSplat: 2D to 3D Vehicle Damage Segmentation in Gaussian Splatting | Dragoş-Andrei Chileban et.al. | 2509.23947 | null |
| 2025-09-28 | Witnessing Magnetic Reconnection in Tangled Superpenumbral Fibrils Around a Sunspot | Hechao Chen et.al. | 2509.23636 | null |
| 2025-09-27 | Orientation-anchored Hyper-Gaussian for 4D Reconstruction from Casual Videos | Junyi Wu et.al. | 2509.23492 | null |
| 2025-09-27 | Geometry-Aware Losses for Structure-Preserving Text-to-Sign Language Generation | Zetian Wu et.al. | 2509.23011 | null |
| 2025-09-26 | Scallop Theorem for Swimming in Anisotropic Fluids | Mojtaba Rajabi et.al. | 2509.22249 | null |
| 2025-09-26 | Taming Flow-based I2V Models for Creative Video Editing | Xianghao Kong et.al. | 2509.21917 | null |
| 2025-09-25 | First results from ALPPS: a sub-Alfvénic streamer in SVS13A | P. C. Cortes et.al. | 2509.21701 | null |
| 2025-09-25 | Multireference equation-of-motion driven similarity renormalization group for X-ray photoelectron spectra | Shuhang Li et.al. | 2509.21646 | null |
| 2025-09-25 | Taxonomy-aware Dynamic Motion Generation on Hyperbolic Manifolds | Luis Augenstein et.al. | 2509.21281 | null |
| 2025-09-24 | Pattern Formation in Agent-Based and PDE Models for Evolutionary Games with Payoff-Driven Motion | Tianyong Yao et.al. | 2509.20538 | null |
| 2025-09-24 | Glassy dynamics in two-dimensional ring polymers: size versus stiffness polydispersity | Rahul Nayak et.al. | 2509.20066 | null |
| 2025-09-24 | Modelling and Analysis of Non-Contacting Mechanical Face Seals with Axial Disturbances and Misalignment | Ben S Ashby et.al. | 2509.19993 | null |
| 2025-09-24 | Aerial-Ground Image Feature Matching via 3D Gaussian Splatting-based Intermediate View Rendering | Jiangxue Yu et.al. | 2509.19898 | null |
| 2025-09-23 | Probing the Origin of X-ray Flares in the Low-Hard State of GRS 1915+105 Using AstroSat and NuSTAR | Shahzada Akhter et.al. | 2509.19546 | null |
| 2025-10-30 | Reaction/Diffusion Competition Drives Anomalous Relaxation of Vitrimers | Makayla R. Branham-Ferrari et.al. | 2509.19496 | null |
| 2025-09-23 | Internal dynamics and structure of Cepheus OB4. The asymmetric expansion of Berkeley 59 | Bruno Wiesneth et.al. | 2509.19175 | null |
| 2025-09-23 | DeblurSplat: SfM-free 3D Gaussian Splatting with Event Camera for Robust Deblurring | Pengteng Li et.al. | 2509.18898 | null |
| 2025-09-23 | Kinematics of the interstellar medium using Gaia: A catalogue of 102 YSO-MC associations within 3.5 kpc from the Sun with 3D velocities | Ji-Xuan Zhou et.al. | 2509.18496 | null |
| 2025-09-22 | Efficient Particle Acceleration in 2.5-Dimensional, Hybrid-Kinetic Simulations of Decaying, Supersonic, Plasma Turbulence | Keyan Gootkin et.al. | 2509.18374 | null |
| 2025-09-22 | Waves drive the rise and fall of 2D flows in rotating turbulence | Sébastien Gomé et.al. | 2509.18323 | null |
| 2025-09-22 | VideoFrom3D: 3D Scene Video Generation via Complementary Image and Video Diffusion Models | Geonung Kim et.al. | 2509.17985 | null |
| 2025-09-22 | Tensor-Based Self-Calibration of Cameras via the TrifocalCalib Method | Gregory Schroeder et.al. | 2509.17620 | null |
| 2025-10-15 | Energy Correlators Resolving Proton Spin | Jun Gao et.al. | 2509.17596 | null |
| 2025-09-22 | Learning Dexterous Manipulation with Quantized Hand State | Ying Feng et.al. | 2509.17450 | null |
| 2025-09-21 | Reference-aware SFM layers for intrusive intelligibility prediction | Hanlin Yu et.al. | 2509.17270 | null |
| 2025-09-21 | Beat on Gaze: Learning Stylized Generation of Gaze and Head Dynamics | Chengwei Shi et.al. | 2509.17168 | null |
| 2025-11-19 | Asymptotic Higher Spin Symmetries: Noether Realization & Algebraic Structure in Einstein-Yang-Mills Theory | Nicolas Cresto et.al. | 2509.17137 | null |
| 2025-09-21 | Insensitivity-induced potential non-uniqueness in system identification of Bouc-Wen models | Adrita Kundu et.al. | 2509.17122 | null |
| 2025-09-21 | Dynamics of the |
Elham Nazari et.al. | 2509.17017 | null |
| 2025-09-21 | VidCLearn: A Continual Learning Approach for Text-to-Video Generation | Luca Zanchetta et.al. | 2509.16956 | null |
| 2025-09-27 | HDMI: Learning Interactive Humanoid Whole-Body Control from Human Videos | Haoyang Weng et.al. | 2509.16757 | null |
| 2025-09-19 | On the application of refractive index matching to study the buoyancy-driven motion of spheres | Jibu Tom Jose et.al. | 2509.16384 | null |
| 2025-09-19 | Investigating Polyglot Speech Foundation Models for Learning Collective Emotion from Crowds | Orchid Chetia Phukan et.al. | 2509.16329 | null |
| 2025-11-05 | Modeling Elastic-Body Dynamics of Robotic Fish Using a Variational Framework | Zhiheng Chen et.al. | 2509.16145 | null |
| 2025-10-09 | Hierarchical Reinforcement Learning with Low-Level MPC for Multi-Agent Control | Max Studt et.al. | 2509.15799 | null |
| 2025-09-19 | Search for cosmic-ray induced gamma-ray emission from local galaxy clusters using Fermi-LAT data | Judit Pérez-Romero et.al. | 2509.15720 | null |
| 2025-10-24 | MS-GS: Multi-Appearance Sparse-View 3D Gaussian Splatting in the Wild | Deming Li et.al. | 2509.15548 | null |
| 2025-10-21 | SAMPO:Scale-wise Autoregression with Motion PrOmpt for generative world models | Sen Wang et.al. | 2509.15536 | null |
| 2025-09-18 | Dynamical Analysis of the HD 169142 Planet-Forming Disk: Twelve Years of High-Contrast Polarimetry | Miles Lucas et.al. | 2509.15323 | null |
| 2025-09-18 | Static AdS Black Holes Surrounded by Strings and Quintessence-like Field within Rastall Gravity Framework | Allan. R. P. Moreira et.al. | 2509.15274 | null |
| 2025-09-27 | WorldForge: Unlocking Emergent 3D/4D Generation in Video Diffusion Model via Training-Free Guidance | Chenxi Song et.al. | 2509.15130 | null |
| 2025-09-17 | Repulsive Trajectory Modification and Conflict Resolution for Efficient Multi-Manipulator Motion Planning | Junhwa Hong et.al. | 2509.13882 | null |
| 2025-09-18 | MapAnything: Universal Feed-Forward Metric 3D Reconstruction | Nikhil Keetha et.al. | 2509.13414 | null |
| 2025-09-16 | Optimal Annuitization with stochastic mortality: Piecewise Deterministic Mortality Force | Matteo Buttarazzi et.al. | 2509.13091 | null |
| 2025-09-16 | Spatiotemporal graph neural process for reconstruction, extrapolation, and classification of cardiac trajectories | Jaume Banus et.al. | 2509.12953 | null |
| 2025-09-18 | A-TDOM: Active TDOM via On-the-Fly 3DGS | Yiwei Xu et.al. | 2509.12759 | null |
| 2025-10-21 | Neural 3D Object Reconstruction with Small-Scale Unmanned Aerial Vehicles | Àlmos Veres-Vitàlyos et.al. | 2509.12458 | null |
| 2025-09-15 | DYNAMO: Dependency-Aware Deep Learning Framework for Articulated Assembly Motion Prediction | Mayank Patel et.al. | 2509.12430 | null |
| 2025-11-20 | End-to-End 4D Heart Mesh Recovery Across Full-Stack and Sparse Cardiac MRI | Yihong Chen et.al. | 2509.12090 | null |
| 2025-11-18 | Segmentation-Driven Initialization for Sparse-view 3D Gaussian Splatting | Yi-Hsin Li et.al. | 2509.11853 | null |
| 2025-09-15 | WAFER: A new method to retrieve sun-induced fluorescence based on spectral wavelet decompositions | Veronika Oehl et.al. | 2509.11829 | null |
| 2025-09-14 | Understanding the effect of wall elasticity in turbulent channel flows | M. Koseki et.al. | 2509.11142 | null |
| 2025-09-14 | 3DAeroRelief: The first 3D Benchmark UAV Dataset for Post-Disaster Assessment | Nhut Le et.al. | 2509.11097 | null |
| 2025-09-13 | Space Astrometry with Gaia: Advances in Understanding our Galaxy | Michael Perryman et.al. | 2509.10883 | null |
| 2025-11-04 | Stable Part Diffusion 4D: Multi-View RGB and Kinematic Parts Video Generation | Hao Zhang et.al. | 2509.10687 | null |
| 2025-09-12 | Nanosculpting lateral weak link junctions in superconducting Fe(Te,Se)/Bi2Te3 with focused Si++ ions and implications on vortex pinning | Debarghya Mallick et.al. | 2509.10606 | null |
| 2025-09-17 | DECAMP: Towards Scene-Consistent Multi-Agent Motion Prediction with Disentangled Context-Aware Pre-Training | Jianxin Shi et.al. | 2509.10426 | null |
| 2025-09-12 | Breakdown of the critical state in the ferromagnetic superconductor EuFe $2$(As${1-x}$P$_x$)$_2$ | William Robert Fern et.al. | 2509.10339 | null |
| 2025-09-12 | A MeerKAT view of the parsec-scale jets in the black-hole X-ray binary GRS 1758-258 | I. Mariani et.al. | 2509.10275 | null |
| 2025-09-12 | Robustness and Diagnostic Performance of Super-Resolution Fetal Brain MRI | Ema Masterl et.al. | 2509.10257 | null |
| 2025-09-12 | Cluster Ages to Reconstruct the Milky Way Assembly (CARMA) IV. Chrono-dynamics of seven old star clusters in the Large Magellanic Cloud and the peculiar origin of NGC 1841 | F. Niederhofer et.al. | 2509.10144 | null |
| 2025-09-11 | Initial conditions for tidal synchronisation of a planet by its moon | Valeri V. Makarov et.al. | 2509.09858 | null |
| 2025-09-09 | Australian Supermarket Object Set (ASOS): A Benchmark Dataset of Physical Objects and 3D Models for Robotics and Computer Vision | Akansel Cosgun et.al. | 2509.09720 | null |
| 2025-09-11 | MOFU: Development of a MOrphing Fluffy Unit with Expansion and Contraction Capabilities and Evaluation of the Animacy of Its Movements | Taisei Mogi et.al. | 2509.09613 | null |
| 2025-09-11 | DualTrack: Sensorless 3D Ultrasound needs Local and Global Context | Paul F. R. Wilson et.al. | 2509.09530 | null |
| 2025-09-11 | BagIt! An Adaptive Dual-Arm Manipulation of Fabric Bags for Object Bagging | Peng Zhou et.al. | 2509.09484 | null |
| 2025-09-11 | A Hybrid Hinge-Beam Continuum Robot with Passive Safety Capping for Real-Time Fatigue Awareness | Tongshun Chen et.al. | 2509.09404 | null |
| 2025-09-11 | Video Understanding by Design: How Datasets Shape Architectures and Insights | Lei Wang et.al. | 2509.09151 | null |
| 2025-09-11 | Exploration on the Two-stream Instability in the Polar Cusp Under Solar Storm Disturbances and its Potential Impacts on Spacecraft | Jikai Sun et.al. | 2509.09126 | null |
| 2025-09-11 | Propulsive transitions and scaling relations of a heaving flexible foil in a cylinder wake | Guojun Li et.al. | 2509.09102 | null |
| 2025-10-18 | Kinetostatics and Particle-Swarm Optimization of Vehicle-Mounted Underactuated Metamorphic Loading Manipulators | Nan Mao et.al. | 2509.09093 | null |
| 2025-10-04 | A comprehensive view of nuclear shapes, rotations and vibrations from fully quantum mechanical perspectives | Takaharu Otsuka et.al. | 2509.08552 | null |
| 2025-09-10 | The GECKOS survey: Jeans anisotropic models of edge-on discs uncover the impact of dust and kinematic structures | T. H. Rutherford et.al. | 2509.08371 | null |
| 2025-08-26 | Analog-based ensembles to characterize turbulent dynamics from observed data | Carlos Granero-Belinchon et.al. | 2509.07992 | null |
| 2025-09-09 | Graph-Fused Vision-Language-Action for Policy Reasoning in Multi-Arm Robotic Manipulation | Shunlei Li et.al. | 2509.07957 | null |
| 2025-09-09 | Mode-coupling theory of the glass transition for a liquid in a periodic potential | Abolfazl Ahmadirahmat et.al. | 2509.07697 | null |
| 2025-09-09 | Beyond Motion Cues and Structural Sparsity: Revisiting Small Moving Target Detection | Guoyi Zhang et.al. | 2509.07654 | null |
| 2025-09-10 | VIM-GS: Visual-Inertial Monocular Gaussian Splatting via Object-level Guidance in Large Scenes | Shengkai Zhang et.al. | 2509.06685 | null |
| 2025-09-08 | From Skin to Skeleton: Towards Biomechanically Accurate 3D Digital Humans | Marilyn Keller et.al. | 2509.06607 | null |
| 2025-09-08 | Nonlinear planar Hall effect from superconducting vortex motion | Mio Hashimoto et.al. | 2509.06313 | null |
| 2025-11-11 | Limiting distribution of the chemical distance in high dimensional critical percolation | Shirshendu Chatterjee et.al. | 2509.06236 | null |
| 2025-09-07 | Micro-Expression Recognition via Fine-Grained Dynamic Perception | Zhiwen Shao et.al. | 2509.06015 | null |
| 2025-09-07 | Modeling Magnetoelastic Wave Interactions in Magnetic Films and Heterostructures: A finite-difference approach | Peter Flauger et.al. | 2509.06007 | null |
| 2025-09-07 | Skyrmion manipulation and logic gate functionality in transition metal multilayers | Tamali Mukherjee et.al. | 2509.05951 | null |
| 2025-09-06 | Depth Profiling of Oxygen Migration in Ta/HfO2 Stacks During Ionic Liquid Gating | Beatrice Bednarz et.al. | 2509.05748 | null |
| 2025-09-05 | Resolving Tangling in Multi-Conformer Refinement via Iterative Projections | Avinash Mandaiya et.al. | 2509.05189 | null |
| 2025-09-04 | Disentangling Multiple Gas Kinematic Drivers in the Perseus Galaxy Cluster | XRISM Collaboration et.al. | 2509.04421 | null |
| 2025-09-07 | Hyperuniformity and conservation laws in non-equilibrium systems | Raphaël Maire et.al. | 2509.04242 | null |
| 2025-09-03 | Exploiting correlations in multi-coincidence Coulomb explosion patterns for differentiating molecular structures using machine learning | Anbu Selvam Venkatachalam et.al. | 2509.03776 | null |
| 2025-09-03 | Beyond the Clouds: S3 as the most distant extended Milky Way stream, not of LMC origin | Ó. Jiménez-Arranz et.al. | 2509.03424 | null |
| 2025-09-02 | Voter Model stability with respect to conservative noises | Gideon Amir et.al. | 2509.02717 | null |
| 2025-09-02 | Doctoral Thesis: Geometric Deep Learning For Camera Pose Prediction, Registration, Depth Estimation, and 3D Reconstruction | Xueyang Kang et.al. | 2509.01873 | null |
| 2025-09-01 | Optimal information injection and transfer mechanisms for active matter reservoir computing | Mario U. Gaimann et.al. | 2509.01799 | null |
| 2025-09-01 | An Accurate Comprehensive Approach to Substructure: IV. Dynamical Friction | Eduard Salvador-Solé et.al. | 2509.01553 | null |
| 2025-08-31 | Origin and control of pseudo-rotating spiral jets | Karol Wawrzak et.al. | 2509.00763 | null |
| 2025-09-30 | Intramolecular Singlet Fission Through a Coherently Coupled Excimer-like Intermediate | Sanjoy Patra et.al. | 2508.21568 | null |
| 2025-08-28 | Coherent motions to predict Lagrangian trajectories | Ali R Khojasteh et.al. | 2508.21191 | null |
| 2025-08-28 | First-Order Viscous Relativistic Hydrodynamics on the Two-Sphere | Lennox S. Keeble et.al. | 2508.20998 | null |
| 2025-08-28 | Scaling Fabric-Based Piezoresistive Sensor Arrays for Whole-Body Tactile Sensing | Curtis C. Johnson et.al. | 2508.20959 | null |
| 2025-08-28 | Language-Enhanced Mobile Manipulation for Efficient Object Search in Indoor Environments | Liding Zhang et.al. | 2508.20899 | null |
| 2025-08-28 | On W-algebras and ODE/IM correspondence | Matěj Kudrna et.al. | 2508.20793 | null |
| 2025-08-28 | AvatarBack: Back-Head Generation for Complete 3D Avatars from Front-View Images | Shiqi Xin et.al. | 2508.20623 | null |
| 2025-08-26 | PRISM: A Framework Harnessing Unsupervised Visual Representations and Textual Prompts for Explainable MACE Survival Prediction from Cardiac Cine MRI | Haoyang Su et.al. | 2508.19325 | null |
| 2025-08-26 | Thermoelectric evidence of the electronic structure changes from the charge-density-wave transition in FeGe | Kaila Jenkins et.al. | 2508.19116 | null |
| 2025-08-26 | WIde Separation Planets In Time (WISPIT): A Gap-clearing Planet in a Multi-ringed Disk around the Young Solar-type Star WISPIT 2 | Richelle F. van Capelleveen et.al. | 2508.19053 | null |
| 2025-08-27 | Striking Similarities in Dynamics and Vibrations of 2D Quasicrystals and Supercooled Liquids | Edwin A. Bedolla-Montiel et.al. | 2508.18856 | null |
| 2025-08-26 | Locally tuned hydrodynamics of active polymer chains | Lisa Sappl et.al. | 2508.18789 | null |
| 2025-08-26 | Chemical control of polymorphism and ferroelectricity in PbTiO3 and SrTiO3 monolayers and bilayers | Shaowen Xu et.al. | 2508.18777 | null |
| 2025-08-26 | A New Evidence of Interplay Between Tetrahedral and Octahedral Symmetries and Symmetry Breaking: Exotic Rotational Bands in |
S. Basak et.al. | 2508.18686 | null |
| 2025-11-24 | Warm Chat: Diffuse Emotion-aware Interactive Talking Head Avatar with Tree-Structured Guidance | Haijie Yang et.al. | 2508.18337 | null |
| 2025-08-25 | Cellular Flow Architecture Exposes the Hidden Mechanics of Biological Matter | Tianxiang Ma et.al. | 2508.17974 | null |
| 2025-08-25 | SAIL-Recon: Large SfM by Augmenting Scene Regression with Localization | Junyuan Deng et.al. | 2508.17972 | null |
| 2025-08-25 | On the complexity of parametrized motion planning algorithms | Navnath Daundkar et.al. | 2508.17629 | null |
| 2025-10-07 | MoSA: Motion-Coherent Human Video Generation via Structure-Appearance Decoupling | Haoyu Wang et.al. | 2508.17404 | null |
| 2025-08-24 | Mimicking the Physicist's Eye:A VLM-centric Approach for Physics Formula Discovery | Jiaqi Liu et.al. | 2508.17380 | null |
| 2025-08-23 | A fluxonium qubit-based hybrid electromechanical system | Roson Nongthombam et.al. | 2508.17105 | null |
| 2025-08-27 | A Black Hole Solution in Kalb-Ramond Gravity with Quintessence Field: From Geodesic Dynamics to Thermal Criticality | Ahmad Al-Badawi et.al. | 2508.16693 | null |
| 2025-11-10 | Stable black holes in lower dimensional |
G. G. L. Nashed et.al. | 2508.16679 | null |
| 2025-08-07 | Thermal convection in huddling emperor penguins | Dmitry Bratsun et.al. | 2508.16586 | null |
| 2025-08-22 | Seeing Clearly, Forgetting Deeply: Revisiting Fine-Tuned Video Generators for Driving Simulation | Chun-Peng Chang et.al. | 2508.16512 | null |
| 2025-08-25 | HOSt3R: Keypoint-free Hand-Object 3D Reconstruction from RGB images | Anilkumar Swamy et.al. | 2508.16465 | null |
| 2025-08-26 | Prompting with Sign Parameters for Low-resource Sign Language Instruction Generation | Md Tariquzzaman et.al. | 2508.16076 | null |
| 2025-08-22 | NeuralMeshing: Complete Object Mesh Extraction from Casual Captures | Floris Erich et.al. | 2508.16026 | null |
| 2025-08-21 | WorldWeaver: Generating Long-Horizon Video Worlds via Rich Perception | Zhiheng Liu et.al. | 2508.15720 | null |
| 2025-08-21 | Enhancing Novel View Synthesis from extremely sparse views with SfM-free 3D Gaussian Splatting Framework | Zongqi He et.al. | 2508.15457 | null |
| 2025-09-21 | DriveSplat: Decoupled Driving Scene Reconstruction with Geometry-enhanced Partitioned Neural Gaussians | Cong Wang et.al. | 2508.15376 | null |
| 2025-09-04 | A Spectroscopic Hunt for Post-Red Supergiants in the Large Magellanic Cloud II: Turbulent Line Broadening in the Spectra of LMC Yellow Supergiants | Trevor Z. Dorn-Wallenstein et.al. | 2508.14971 | null |
| 2025-08-22 | The Alma catalogue of OB stars. III. A cross-match with Gaia DR3 and an extension based on new spectral classifications | M. Pantaleoni González et.al. | 2508.14875 | null |
| 2025-08-20 | Probing the farthest star clusters to the Small Magellanic Cloud | A. E. Piatti et.al. | 2508.14701 | null |
| 2025-08-20 | GeMS: Efficient Gaussian Splatting for Extreme Motion Blur | Gopi Raju Matta et.al. | 2508.14682 | null |
| 2025-08-20 | Identifying Monochromatic Signals in LISA and Taiji via Spectral Split: Gravitational Waves versus Ultralight Dark Matter | Yue-Hui Yao et.al. | 2508.14655 | null |
| 2025-08-20 | From Slices to Structures: Unsupervised 3D Reconstruction of Female Pelvic Anatomy from Freehand Transvaginal Ultrasound | Max Krähenmann et.al. | 2508.14552 | null |
| 2025-08-20 | Singularity of the axisymmetric stagnation-point-like solution within a cylinder of the 3D Euler incompressible fluid equations | Yinshen Xu et.al. | 2508.14550 | null |
| 2025-08-20 | Anisotropic Neutrino Emission from Spinning, Moving, and Charged Primordial Black Holes | Arnab Chaudhuri et.al. | 2508.14510 | null |
| 2025-08-19 | Gravitational Influence from Planets on the Measured Rates of Period Change of Pulsating White Dwarfs | Ling Xuan Yao et.al. | 2508.14195 | null |
| 2025-08-20 | Properties of the temporal transfer matrix in integrable Floquet circuits | Ilya Vilkoviskiy et.al. | 2508.13883 | null |
| 2025-10-31 | Smooth Flow Matching | Jianbin Tan et.al. | 2508.13831 | null |
| 2025-08-18 | Towards Routine Condensed Phase Simulations with Delta-Learned Coupled Cluster Accuracy: Application to Liquid Water | Niamh O'Neill et.al. | 2508.13391 | null |
| 2025-08-18 | Dynamic stall of a hydrofoil with tubercles in surface gravity waves | Guillaume Ricard et.al. | 2508.13329 | null |
| 2025-08-18 | MaskSem: Semantic-Guided Masking for Learning 3D Hybrid High-Order Motion Representation | Wei Wei et.al. | 2508.12948 | null |
| 2025-10-20 | Visual-Neural-Inspired Image Inpainting for Specific Objects-of-Interest Imaging | Yonghao Wu et.al. | 2508.12808 | null |
| 2025-08-18 | Discerning and quantifying high frequency activities in EEG under normal and epileptic conditions | Jyotiraj Nath et.al. | 2508.12670 | null |
| 2025-08-17 | HuBERT-VIC: Improving Noise-Robust Automatic Speech Recognition of Speech Foundation Model via Variance-Invariance-Covariance Regularization | Hyebin Ahn et.al. | 2508.12292 | null |
| 2025-08-17 | What do Speech Foundation Models Learn? Analysis and Applications | Ankita Pasad et.al. | 2508.12255 | null |
| 2025-08-16 | KP-INR: A Dual-Branch Implicit Neural Representation Model for Cardiac Cine MRI Reconstruction | Donghang Lyu et.al. | 2508.12147 | null |
| 2025-08-16 | Applied causality to infer protein dynamics and kinetics | Akashnathan Aranganathan et.al. | 2508.12060 | null |
| 2025-09-15 | WiseLVAM: A Novel Framework For Left Ventricle Automatic Measurements | Durgesh Kumar Singh et.al. | 2508.12023 | null |
| 2025-08-19 | Colloidal hydrodynamic interactions in viscoelastic fluids | Dae Yeon Kim et.al. | 2508.11948 | null |
| 2025-08-16 | Mapping feedback signatures in 3C 297: A quasar-host merger at Cosmic Noon | Chetna Duggal et.al. | 2508.11926 | null |
| 2025-09-08 | Deformation Driven Suction Cups: A Mechanics-Based Approach to Wearable Electronics | Seola Lee et.al. | 2508.11838 | null |
| 2025-08-01 | Multimodal Quantitative Measures for Multiparty Behaviour Evaluation | Ojas Shirekar et.al. | 2508.10916 | null |
| 2025-08-14 | Reduction of motion artifacts from photoplethysmography signals using learned convolutional sparse coding | Giulio Basso et.al. | 2508.10805 | null |
| 2025-08-14 | Snap-through time of arches is controlled by slenderness and imperfections | William Simpkins et.al. | 2508.10802 | null |
| 2025-08-14 | On the Derivation of Equations of Motion from Symmetries in Quantum-Mechanical Systems via Heisenberg's Uncertainty | Enrique Casanova et.al. | 2508.10661 | null |
| 2025-08-14 | EgoMusic-driven Human Dance Motion Estimation with Skeleton Mamba | Quang Nguyen et.al. | 2508.10522 | null |
| 2025-08-13 | Coulomb excitation of $^{124}$Te: Emerging collectivity and persisting seniority structure in the |
M. Reece et.al. | 2508.09643 | null |
| 2025-08-12 | A Galactic Interloper: A Study of the Cam OB1 Association's Clusters and its Visitor from the Perseus Arm | Joseph Mullen et.al. | 2508.09393 | null |
| 2025-08-12 | CLF-RL: Control Lyapunov Function Guided Reinforcement Learning | Kejun Li et.al. | 2508.09354 | null |
| 2025-08-12 | Quadrupolar gyration of a Brownian particle in a confining ring | Iman Abdoli et.al. | 2508.08792 | null |
| 2025-08-11 | Weak solutions and incompressible limit of a quasi-incompressible Navier--Stokes/Cahn--Hilliard model for viscous two-phase flows | Mingwen Fei et.al. | 2508.08090 | null |
| 2025-08-11 | Joint Transcription of Acoustic Guitar Strumming Directions and Chords | Sebastian Murgul et.al. | 2508.07973 | null |
| 2025-08-12 | Mem4D: Decoupling Static and Dynamic Memory for Dynamic Scene Reconstruction | Xudong Cai et.al. | 2508.07908 | null |
| 2025-08-11 | Tracking Any Point Methods for Markerless 3D Tissue Tracking in Endoscopic Stereo Images | Konrad Reuter et.al. | 2508.07851 | null |
| 2025-08-11 | Optimization of a Nonlinear Acoustics -- Structure Interaction Model | Barbara Kaltenbacher et.al. | 2508.07728 | null |
| 2025-08-10 | GS4Buildings: Prior-Guided Gaussian Splatting for 3D Building Reconstruction | Qilin Zhang et.al. | 2508.07355 | null |
| 2025-11-17 | Understanding Dynamic Scenes in Ego Centric 4D Point Clouds | Junsheng Huang et.al. | 2508.07251 | null |
| 2025-08-27 | From Imitation to Optimization: A Comparative Study of Offline Learning for Autonomous Driving | Antonio Guillen-Perez et.al. | 2508.07029 | null |
| 2025-08-09 | Evaluating Fisheye-Compatible 3D Gaussian Splatting Methods on Real Images Beyond 180 Degree Field of View | Ulas Gunes et.al. | 2508.06968 | null |
| 2025-08-08 | Restage4D: Reanimating Deformable 3D Reconstruction from a Single Video | Jixuan He et.al. | 2508.06715 | null |
| 2025-08-08 | Low temperature jet spectra of (DFE)2, DFE-He, DFE-He2 and DFE in the 2210-3105 cm-1 region (DFE = 1,1 difluoroethylene) | A. J. Barclay et.al. | 2508.06629 | null |
| 2025-08-08 | V: An Efficient Motion Planning Algorithm for Autonomous Vehicles* | Abdullah Zareh Andaryan et.al. | 2508.06404 | null |
| 2025-08-08 | Topological edge states and amplitude-dependent delocalization in quasiperiodic elliptically geared lattices | Shuaifeng Li et.al. | 2508.06286 | null |
| 2025-08-07 | CleanUpBench: Embodied Sweeping and Grasping Benchmark | Wenbo Li et.al. | 2508.05543 | null |
| 2025-08-07 | F2PASeg: Feature Fusion for Pituitary Anatomy Segmentation in Endoscopic Surgery | Lumin Chen et.al. | 2508.05465 | null |
| 2025-08-07 | Information-Theoretic Graph Fusion with Vision-Language-Action Model for Policy Reasoning and Dual Robotic Control | Shunlei Li et.al. | 2508.05342 | null |
| 2025-10-08 | Regular black hole's impact on the gravitational waveforms from periodic orbits | Mirzabek Alloqulov et.al. | 2508.05245 | null |
| 2025-08-07 | EndoMatcher: Generalizable Endoscopic Image Matcher via Multi-Domain Pre-training for Robot-Assisted Surgery | Bingyu Yang et.al. | 2508.05205 | null |
| 2025-08-07 | Refining Gaussian Splatting: A Volumetric Densification Approach | Mohamed Abdul Gafoor et.al. | 2508.05187 | null |
| 2025-09-02 | XRISM/Resolve View of Abell 2319: Turbulence, Sloshing, and ICM Dynamics | XRISM Collaboration et.al. | 2508.05067 | null |
| 2025-11-04 | Bursting at the seams: the star-forming main sequence and its scatter at z=3-9 using NIRCam photometry from JADES | C. Simmonds et.al. | 2508.04410 | null |
| 2025-09-19 | Variational mode decomposition analysis of the relationship between low-frequency shock-wave oscillations and buffet cells | Yuya Ohmichi et.al. | 2508.04250 | null |
| 2025-08-06 | PIS3R: Very Large Parallax Image Stitching via Deep 3D Reconstruction | Muhua Zhu et.al. | 2508.04236 | null |
| 2025-08-06 | SplitGaussian: Reconstructing Dynamic Scenes via Visual Geometry Decomposition | Jiahui Li et.al. | 2508.04224 | null |
| 2025-08-06 | Probing globular clusters using modulated gravitational waves from binary black holes | Jie Wu et.al. | 2508.04021 | null |
| 2025-10-21 | Can Large Language Models Adequately Perform Symbolic Reasoning Over Time Series? | Zewen Liu et.al. | 2508.03963 | null |
| 2025-09-26 | Next Generation Equation-Free Multiscale Modelling of Crowd Dynamics via Machine Learning | Hector Vargas Alvarez et.al. | 2508.03926 | null |
| 2025-08-05 | High-Resolution Dynamic Full-Field Optical Coherence Microscopy: Illuminating Intracellular Activity in Deep Tissue | Erikas Tarvydas et.al. | 2508.03657 | null |
| 2025-08-05 | WaMo: Wavelet-Enhanced Multi-Frequency Trajectory Analysis for Fine-Grained Text-Motion Retrieval | Junlong Ren et.al. | 2508.03343 | null |
| 2025-08-04 | A fluid--peridynamic structure model of deformation and damage of microchannels | Ziyu Wang et.al. | 2508.02875 | null |
| 2025-08-04 | Text2Lip: Progressive Lip-Synced Talking Face Generation from Text via Viseme-Guided Rendering | Xu Wang et.al. | 2508.02362 | null |
| 2025-08-04 | Newtons First Law Is Not a Special Case of the Second Law | Indresh Yadav et.al. | 2508.02246 | null |
| 2025-08-04 | IMoRe: Implicit Program-Guided Reasoning for Human Motion Q&A | Chen Li et.al. | 2508.01984 | null |
| 2025-08-03 | CVD-SfM: A Cross-View Deep Front-end Structure-from-Motion System for Sparse Localization in Multi-Altitude Scenes | Yaxuan Li et.al. | 2508.01936 | null |
| 2025-10-16 | Orbital angular momentum of entangled photons as a probe for relativistic effects | Fazilah Nothlawala et.al. | 2508.01716 | null |
| 2025-08-02 | Rim destabilization and re-formation upon severance from its expanding sheet | M. Kharbedia et.al. | 2508.01308 | null |
| 2025-10-16 | UniEgoMotion: A Unified Model for Egocentric Motion Reconstruction, Forecasting, and Generation | Chaitanya Patel et.al. | 2508.01126 | null |
| 2025-08-01 | Counting topological interface modes using simplicial characteristic classes | N. Bohlsen et.al. | 2508.01063 | null |
| 2025-08-01 | 3D Reconstruction via Incremental Structure From Motion | Muhammad Zeeshan et.al. | 2508.01019 | null |
| 2025-08-01 | GeoMoE: Divide-and-Conquer Motion Field Modeling with Mixture-of-Experts for Two-View Geometry | Jiajun Le et.al. | 2508.00592 | null |
| 2025-08-01 | TopoDiffuser: A Diffusion-Based Multimodal Trajectory Prediction Model with Topometric Maps | Zehui Xu et.al. | 2508.00303 | null |
| 2025-07-30 | X-NeMo: Expressive Neural Motion Reenactment via Disentangled Latent Attention | Xiaochen Zhao et.al. | 2507.23143 | null |
| 2025-07-30 | Eddy population based model for the wall-pressure spectrum at high Reynolds number | Jonathan M. O. Massey et.al. | 2507.23098 | null |
| 2025-08-01 | Segment Anything for Video: A Comprehensive Review of Video Object Segmentation and Tracking from Past to Future | Guoping Xu et.al. | 2507.22792 | null |
| 2025-08-14 | A Linear N-Point Solver for Structure and Motion from Asynchronous Tracks | Hang Su et.al. | 2507.22733 | null |
| 2025-07-29 | Probing Turbulence, Gravity, Supernovae, and Magnetic Field Effects with the 6D Kinematics of Young Stars in Milky Way Star-Forming Regions | Benjamin N. Velguth et.al. | 2507.22107 | null |
| 2025-07-28 | Projecting the New Body: How Body Image Evolves During Learning to Walk with a Wearable Robot | I-Chieh Lee et.al. | 2507.21384 | null |
| 2025-07-28 | FED-PsyAU: Privacy-Preserving Micro-Expression Recognition via Psychological AU Coordination and Dynamic Facial Motion Modeling | Jingting Li et.al. | 2507.20557 | null |
| 2025-07-27 | Model-Structured Neural Networks to Control the Steering Dynamics of Autonomous Race Cars | Mattia Piccinini et.al. | 2507.20427 | null |
| 2025-07-27 | Motion-example-controlled Co-speech Gesture Generation Leveraging Large Language Models | Bohong Chen et.al. | 2507.20220 | null |
| 2025-07-27 | Unveiling the Sagittarius Dwarf Spheroidal Galaxy Core with Gaia DR3 | Ellie K. H. Toguchi-Tani et.al. | 2507.20212 | null |
| 2025-07-27 | PUMPS: Skeleton-Agnostic Point-based Universal Motion Pre-Training for Synthesis in Human Motion Tasks | Clinton Ansun Mo et.al. | 2507.20170 | null |
| 2025-10-04 | RESCUE: Crowd Evacuation Simulation via Controlling SDM-United Characters | Xiaolin Liu et.al. | 2507.20117 | null |
| 2025-07-26 | Nonlinear causality of Israel-Stewart theory with diffusion | Ian Cordeiro et.al. | 2507.20064 | null |
| 2025-07-26 | TrackAny3D: Transferring Pretrained 3D Models for Category-unified 3D Point Cloud Tracking | Mengmeng Wang et.al. | 2507.19908 | null |
| 2025-11-08 | RaGS: Unleashing 3D Gaussian Splatting from 4D Radar and Monocular Cues for 3D Object Detection | Xiaokai Bai et.al. | 2507.19856 | null |
| 2025-07-25 | The phase spiral's origin and evolution: indications from its varying properties across the Milky Way disk | Axel Widmark et.al. | 2507.19579 | null |
| 2025-08-02 | GS-Occ3D: Scaling Vision-only Occupancy Reconstruction with Gaussian Splatting | Baijun Ye et.al. | 2507.19451 | null |
| 2025-11-10 | A multi-dynamic low-rank deep image prior (ML-DIP) for 3D real-time cardiovascular MRI | Chong Chen et.al. | 2507.19404 | null |
| 2025-07-25 | NerT-CA: Efficient Dynamic Reconstruction from Sparse-view X-ray Coronary Angiography | Kirsten W. H. Maas et.al. | 2507.19328 | null |
| 2025-07-31 | MVG4D: Image Matrix-Based Multi-View and Motion Generation for 4D Content Creation from a Single Image | DongFu Yin et.al. | 2507.18371 | null |
| 2025-07-23 | Zero-Shot Dynamic Concept Personalization with Grid-Based LoRA | Rameen Abdal et.al. | 2507.17963 | null |
| 2025-07-23 | MCM: Mamba-based Cardiac Motion Tracking using Sequential Images in MRI | Jiahui Yin et.al. | 2507.17678 | null |
| 2025-07-23 | Constraints on Axion Dark Matter by Spin-Dependent Macroscopic Force | Dongyi Yang et.al. | 2507.17148 | null |
| 2025-10-01 | A Tutorial on MRI Reconstruction: From Modern Methods to Clinical Implications | Tolga Çukur et.al. | 2507.16715 | null |
| 2025-07-22 | Dyna3DGR: 4D Cardiac Motion Tracking with Dynamic 3D Gaussian Representation | Xueming Fu et.al. | 2507.16608 | null |
| 2025-07-22 | Sparse-View 3D Reconstruction: Recent Advances and Open Challenges | Tanveer Younis et.al. | 2507.16406 | null |
| 2025-07-22 | MotionShot: Adaptive Motion Transfer across Arbitrary Objects for Text-to-Video Generation | Yanchen Liu et.al. | 2507.16310 | null |
| 2025-07-22 | Universal Wavelet Units in 3D Retinal Layer Segmentation | An D. Le et.al. | 2507.16119 | null |
| 2025-09-24 | Interpretable Embeddings of Speech Enhance and Explain Brain Encoding Performance of Audio Models | Riki Shimizu et.al. | 2507.16080 | null |
| 2025-07-21 | Relationship between Structure and Dynamics of an Icosahedral Quasicrystal using Unsupervised Machine Learning | Edwin A. Bedolla-Montiel et.al. | 2507.15731 | null |
| 2025-07-21 | Hi^2-GSLoc: Dual-Hierarchical Gaussian-Specific Visual Relocalization for Remote Sensing | Boni Hu et.al. | 2507.15683 | null |
| 2025-08-28 | Edge-effects in the turbulent flow over flexible aquatic vegetation | Giulio Foggi Rota et.al. | 2507.15477 | null |
| 2025-07-21 | Low-Latency Event-Based Velocimetry for Quadrotor Control in a Narrow Pipe | Leonard Bauersfeld et.al. | 2507.15444 | null |
| 2025-07-21 | Few-Shot Object Detection via Spatial-Channel State Space Model | Zhimeng Xin et.al. | 2507.15308 | null |
| 2025-10-11 | TinyIO: Lightweight Reparameterized Inertial Odometry | Shanshan Zhang et.al. | 2507.15293 | null |
| 2025-10-24 | An Evaluation of DUSt3R/MASt3R/VGGT 3D Reconstruction on Photogrammetric Aerial Blocks | Xinyi Wu et.al. | 2507.14798 | null |
| 2025-07-20 | Flow Equivariant Recurrent Neural Networks | T. Anderson Keller et.al. | 2507.14793 | null |
| 2025-07-19 | The Serpent Eating Its Own Tail: Dust Destruction in the Apep Colliding-Wind Nebula | Ryan M. T. White et.al. | 2507.14610 | null |
| 2025-07-19 | BT-TL-DMPs: A Novel Robot TAMP Framework Combining Behavior Tree, Temporal Logic and Dynamical Movement Primitives | Zezhi Liu et.al. | 2507.14582 | null |
| 2025-07-19 | Motion Segmentation and Egomotion Estimation from Event-Based Normal Flow | Zhiyuan Hua et.al. | 2507.14500 | null |
| 2025-07-18 | DUSTrack: Semi-automated point tracking in ultrasound videos | Praneeth Namburi et.al. | 2507.14368 | null |
| 2025-07-18 | Efficient Variational Dynamics of Open Quantum Bosonic Systems via Automatic Differentiation | Jacopo Tosca et.al. | 2507.14076 | null |
| 2025-07-29 | DreamScene: 3D Gaussian-based End-to-end Text-to-3D Scene Generation | Haoran Li et.al. | 2507.13985 | null |
| 2025-07-18 | Gaussian kernel-based motion measurement | Hongyi Liu et.al. | 2507.13693 | null |
| 2025-10-20 | Moving Object Detection from Moving Camera Using Focus of Expansion Likelihood and Segmentation | Masahiro Ogawa et.al. | 2507.13628 | null |
| 2025-07-16 | Enhancing In-Domain and Out-Domain EmoFake Detection via Cooperative Multilingual Speech Foundation Models | Orchid Chetia Phukan et.al. | 2507.12595 | null |
| 2025-07-16 | BRUM: Robust 3D Vehicle Reconstruction from 360 Sparse Images | Davide Di Nucci et.al. | 2507.12095 | null |
| 2025-07-16 | Spatial Frequency Modulation for Semantic Segmentation | Linwei Chen et.al. | 2507.11893 | null |
| 2025-07-14 | Supporting SENĆOTEN Language Documentation Efforts with Automatic Speech Recognition | Mengzhe Geng et.al. | 2507.10827 | null |
| 2025-07-11 | Review of Feed-forward 3D Reconstruction: From DUSt3R to VGGT | Wei Zhang et.al. | 2507.08448 | null |
| 2025-07-04 | MGSfM: Multi-Camera Geometry Driven Global Structure-from-Motion | Peilin Tao et.al. | 2507.03306 | null |
| 2025-06-30 | Towards Initialization-free Calibrated Bundle Adjustment | Carl Olsson et.al. | 2506.23808 | null |
| 2025-06-30 | AttentionGS: Towards Initialization-Free 3D Gaussian Splatting via Structural Attention | Ziao Liu et.al. | 2506.23611 | null |
| 2025-06-27 | Single-Scanline Relative Pose Estimation for Rolling Shutter Cameras | Petr Hruby et.al. | 2506.22069 | null |
| 2025-06-24 | ICP-3DGS: SfM-free 3D Gaussian Splatting for Large-scale Unbounded Scenes | Chenhao Zhang et.al. | 2506.21629 | null |
| 2025-07-08 | Wild refitting for black box prediction | Martin J. Wainwright et.al. | 2506.21460 | null |
| 2025-06-24 | Experimental Assessment of Neural 3D Reconstruction for Small UAV-based Applications | Genís Castillo Gómez-Raya et.al. | 2506.19491 | null |
| 2025-06-23 | ViDAR: Video Diffusion-Aware 4D Reconstruction From Monocular Inputs | Michal Nazarczuk et.al. | 2506.18792 | null |
| 2025-06-23 | Room temperature spin injection into commercial VCSELs at non-resonant wavelengths | Timur Almabetov et.al. | 2506.18376 | null |
| 2025-06-11 | OWSM-Biasing: Contextualizing Open Whisper-Style Speech Models for Automatic Speech Recognition with Dynamic Vocabulary | Yui Sudo et.al. | 2506.09448 | null |
| 2025-06-06 | SurGSplat: Progressive Geometry-Constrained Gaussian Splatting for Surgical Scene Reconstruction | Yuchao Zheng et.al. | 2506.05935 | null |
| 2025-06-05 | On-the-fly Reconstruction for Large-Scale Novel View Synthesis from Unposed Images | Andreas Meuleman et.al. | 2506.05558 | null |
| 2025-06-05 | SupeRANSAC: One RANSAC to Rule Them All | Daniel Barath et.al. | 2506.04803 | link |
| 2025-06-04 | Voyager: Long-Range and World-Consistent Video Diffusion for Explorable 3D Scene Generation | Tianyu Huang et.al. | 2506.04225 | null |
| 2025-06-04 | Accelerating SfM-based Pose Estimation with Dominating Set | Joji Joseph et.al. | 2506.03667 | null |
| 2025-06-03 | Nearby dwarf galaxies with extreme star formation rates: a window into dwarf-galaxy evolution in the early Universe | S. Kaviraj et.al. | 2506.03265 | null |
| 2025-06-02 | Fast and Robust Rotation Averaging with Anisotropic Coordinate Descent | Yaroslava Lochman et.al. | 2506.01940 | null |
| 2025-06-03 | Improving Multilingual Speech Models on ML-SUPERB 2.0: Fine-tuning with Data Augmentation and LID-Aware CTC | Qingzheng Wang et.al. | 2505.24200 | null |
| 2025-05-29 | Rooms from Motion: Un-posed Indoor 3D Object Detection as Localization and Mapping | Justin Lazarow et.al. | 2505.23756 | null |
| 2025-05-30 | FAMA: The First Large-Scale Open-Science Speech Foundation Model for English and Italian | Sara Papi et.al. | 2505.22759 | link |
| 2025-05-28 | UAVPairs: A Challenging Benchmark for Match Pair Retrieval of Large-scale UAV Images | Junhuan Liu et.al. | 2505.22098 | null |
| 2025-05-28 | Fast Feature Matching of UAV Images via Matrix Band Reduction-based GPU Data Schedule | San Jiang et.al. | 2505.22089 | null |
| 2025-05-30 | Towards Robust Assessment of Pathological Voices via Combined Low-Level Descriptors and Foundation Model Representations | Whenty Ariyanti et.al. | 2505.21356 | null |
| 2025-05-27 | Intern-GS: Vision Model Guided Sparse-View 3D Gaussian Splatting | Xiangyu Sun et.al. | 2505.20729 | null |
| 2025-05-26 | Robust fine-tuning of speech recognition models via model merging: application to disordered speech | Alexandre Ducorroy et.al. | 2505.20477 | null |
| 2025-05-29 | Sparse2DGS: Sparse-View Surface Reconstruction using 2D Gaussian Splatting with Dense Point Cloud | Natsuki Takama et.al. | 2505.19854 | null |
| 2025-05-25 | Improving Novel view synthesis of 360 |
Guangan Chen et.al. | 2505.19264 | link |
| 2025-05-24 | Token-Level Logits Matter: A Closer Look at Speech Foundation Models for Ambiguous Emotion Recognition | Jule Valendo Halim et.al. | 2505.18484 | null |
| 2025-05-22 | Tracking the Flight: Exploring a Computational Framework for Analyzing Escape Responses in Plains Zebra (Equus quagga) | Isla Duporge et.al. | 2505.16882 | link |
| 2025-05-21 | A Taxonomy of Structure from Motion Methods | Federica Arrigoni et.al. | 2505.15814 | null |
| 2025-05-18 | Shallow Flow Matching for Coarse-to-Fine Text-to-Speech Synthesis | Dong Yang et.al. | 2505.12226 | null |
| 2025-05-15 | Mapping Semantic Segmentation to Point Clouds Using Structure from Motion for Forest Analysis | Francisco Raverta Capua et.al. | 2505.10751 | link |
| 2025-05-13 | Unveiling the Best Practices for Applying Speech Foundation Models to Speech Intelligibility Prediction for Hearing-Impaired People | Haoshuai Zhou et.al. | 2505.08215 | null |
| 2025-05-12 | RDD: Robust Feature Detector and Descriptor using Deformable Transformer | Gonglin Chen et.al. | 2505.08013 | null |
| 2025-05-12 | Geometric Prior-Guided Neural Implicit Surface Reconstruction in the Wild | Lintao Xiang et.al. | 2505.07373 | null |
| 2025-05-11 | Symmetry in Fundamental Parameters of Galaxies on the Star-forming Main Sequence | Zhicheng He et.al. | 2505.06868 | null |
| 2025-05-10 | TPK: Trustworthy Trajectory Prediction Integrating Prior Knowledge For Interpretability and Kinematic Feasibility | Marius Baden et.al. | 2505.06743 | null |
| 2025-05-08 | DiffusionSfM: Predicting Structure and Motion via Ray Origin and Endpoint Diffusion | Qitao Zhao et.al. | 2505.05473 | null |
| 2025-05-20 | FastMap: Revisiting Dense and Scalable Structure from Motion | Jiahao Li et.al. | 2505.04612 | link |
| 2025-05-15 | Estimating the Diameter at Breast Height of Trees in a Forest With a Single 360 Camera | Siming He et.al. | 2505.03093 | null |
| 2025-05-03 | AquaGS: Fast Underwater Scene Reconstruction with SfM-Free Gaussian Splatting | Junhao Shi et.al. | 2505.01799 | null |
| 2025-05-03 | PosePilot: Steering Camera Pose for Generative World Models with Self-supervised Depth | Bu Jin et.al. | 2505.01729 | null |
| 2025-05-01 | Are Minimal Radial Distortion Solvers Really Necessary for Relative Pose Estimation? | Viktor Kocur et.al. | 2505.00866 | link |
| 2025-04-29 | Large-scale visual SLAM for in-the-wild videos | Shuo Sun et.al. | 2504.20496 | null |
| 2025-04-29 | Sparse2DGS: Geometry-Prioritized Gaussian Splatting for Surface Reconstruction from Sparse Views | Jiang Wu et.al. | 2504.20378 | link |
| 2025-04-28 | MP-SfM: Monocular Surface Priors for Robust Structure-from-Motion | Zador Pataki et.al. | 2504.20040 | link |
| 2025-04-24 | Dynamic Camera Poses and Where to Find Them | Chris Rockwell et.al. | 2504.17788 | null |
| 2025-04-24 | EdgePoint2: Compact Descriptors for Superior Efficiency and Accuracy | Haodi Yao et.al. | 2504.17280 | null |
| 2025-04-23 | A Low-Cost Photogrammetry System for 3D Plant Modeling and Phenotyping | Joe Hrzich et.al. | 2504.16840 | null |
| 2025-04-23 | PRaDA: Projective Radial Distortion Averaging | Daniil Sinitsyn et.al. | 2504.16499 | null |
| 2025-04-21 | Traversing the Star-Forming Main Sequence with Molecular Gas Stacks of z~1.6 Cluster Galaxies | Alex Pigarelli et.al. | 2504.15381 | null |
| 2025-04-21 | Towards Understanding Camera Motions in Any Video | Zhiqiu Lin et.al. | 2504.15376 | null |
| 2025-04-21 | StableQuant: Layer Adaptive Post-Training Quantization for Speech Foundation Models | Yeona Hong et.al. | 2504.14915 | null |
| 2025-04-17 | Volume Encoding Gaussians: Transfer Function-Agnostic 3D Gaussians for Volume Rendering | Landon Dyken et.al. | 2504.13339 | null |
| 2025-04-15 | EDGS: Eliminating Densification for Efficient Convergence of 3DGS | Dmytro Kotovenko et.al. | 2504.13204 | null |
| 2025-04-15 | Deep Learning-based Bathymetry Retrieval without In-situ Depths using Remote Sensing Imagery and SfM-MVS DSMs with Data Gaps | Panagiotis Agrafiotis et.al. | 2504.11416 | link |
| 2025-04-12 | A Constrained Optimization Approach for Gaussian Splatting from Coarsely-posed Images and Noisy Lidar Point Clouds | Jizong Peng et.al. | 2504.09129 | null |
| 2025-04-11 | Stereophotoclinometry Revisited | Travis Driver et.al. | 2504.08252 | null |
| 2025-04-08 | Implementation of a Zed 2i Stereo Camera for High-Frequency Shoreline Change and Coastal Elevation Monitoring | José A. Pilartes-Congo et.al. | 2504.06464 | null |
| 2025-04-07 | Decoding the variability in the star-formation histories of z ~ 0.8 galaxies | Jenny T. Wan et.al. | 2504.05281 | null |
| 2025-04-05 | 3R-GS: Best Practice in Optimizing Camera Poses Along with 3DGS | Zhisheng Huang et.al. | 2504.04294 | null |
| 2025-04-04 | An Algebraic Geometry Approach to Viewing Graph Solvability | Federica Arrigoni et.al. | 2504.03637 | null |
| 2025-04-04 | Endo3R: Unified Online Reconstruction from Dynamic Monocular Endoscopic Video | Jiaxin Guo et.al. | 2504.03198 | null |
| 2025-04-03 | Adaptive Frequency Enhancement Network for Remote Sensing Image Semantic Segmentation | Feng Gao et.al. | 2504.02647 | link |
| 2025-04-09 | FIORD: A Fisheye Indoor-Outdoor Dataset with LIDAR Ground Truth for 3D Scene Reconstruction and Benchmarking | Ulas Gunes et.al. | 2504.01732 | null |
| 2025-03-31 | LITA-GS: Illumination-Agnostic Novel View Synthesis via Reference-Free 3D Gaussian Splatting and Physical Priors | Han Zhou et.al. | 2504.00219 | null |
| 2025-03-30 | AnyCam: Learning to Recover Camera Poses and Intrinsics from Casual Videos | Felix Wimbauer et.al. | 2503.23282 | link |
| 2025-03-24 | Ground Penetrating Radar-Assisted Multimodal Robot Odometry Using Subsurface Feature Matrix | Haifeng Li et.al. | 2503.18301 | null |
| 2025-03-22 | 3D Modeling: Camera Movement Estimation and path Correction for SFM Model using the Combination of Modified A-SIFT and Stereo System | Usha Kumari et.al. | 2503.17668 | null |
| 2025-03-25 | ProtoGS: Efficient and High-Quality Rendering with 3D Gaussian Prototypes | Zhengqing Gao et.al. | 2503.17486 | null |
| 2025-03-21 | ColabSfM: Collaborative Structure-from-Motion by Point Cloud Registration | Johan Edstedt et.al. | 2503.17093 | link |
| 2025-03-20 | From Monocular Vision to Autonomous Action: Guiding Tumor Resection via 3D Reconstruction | Ayberk Acar et.al. | 2503.16263 | null |
| 2025-03-22 | Euclid Quick Data Release (Q1). A first view of the star-forming main sequence in the Euclid Deep Fields | Euclid Collaboration et.al. | 2503.15314 | null |
| 2025-03-18 | Multi-view Reconstruction via SfM-guided Monocular Depth Estimation | Haoyu Guo et.al. | 2503.14483 | null |
| 2025-03-18 | A-SCoRe: Attention-based Scene Coordinate Regression for wide-ranging scenarios | Huy-Hoang Bui et.al. | 2503.13982 | link |
| 2025-03-17 | Improving Geometric Consistency for 360-Degree Neural Radiance Fields in Indoor Scenarios | Iryna Repinetska et.al. | 2503.13710 | null |
| 2025-03-17 | Gaussian On-the-Fly Splatting: A Progressive Framework for Robust Near Real-Time 3DGS Optimization | Yiwei Xu et.al. | 2503.13086 | null |
| 2025-03-15 | SFMNet: Sparse Focal Modulation for 3D Object Detection | Oren Shrout et.al. | 2503.12093 | null |
| 2025-03-11 | A Framework for Reducing the Complexity of Geometric Vision Problems and its Application to Two-View Triangulation with Approximation Bounds | Felix Rydell et.al. | 2503.08142 | null |
| 2025-03-11 | DaD: Distilled Reinforcement Learning for Diverse Keypoint Detection | Johan Edstedt et.al. | 2503.07347 | link |
| 2025-03-18 | Endo-FASt3r: Endoscopic Foundation model Adaptation for Structure from motion | Mona Sheikh Zeinoddin et.al. | 2503.07204 | null |
| 2025-03-10 | VidBot: Learning Generalizable 3D Actions from In-the-Wild 2D Human Videos for Zero-Shot Robotic Manipulation | Hanzhi Chen et.al. | 2503.07135 | null |
| 2025-03-09 | AxisPose: Model-Free Matching-Free Single-Shot 6D Object Pose Estimation via Axis Generation | Yang Zou et.al. | 2503.06660 | null |
| 2025-03-07 | LiDAR-enhanced 3D Gaussian Splatting Mapping | Jian Shen et.al. | 2503.05425 | null |
| 2025-03-06 | PLMP -- Point-Line Minimal Problems for Projective SfM | Kim Kiehn et.al. | 2503.04351 | null |
| 2025-03-03 | MUSt3R: Multi-view Network for Stereo 3D Reconstruction | Yohann Cabon et.al. | 2503.01661 | link |
| 2025-03-03 | ecg2o: A Seamless Extension of g2o for Equality-Constrained Factor Graph Optimization | Anas Abdelkarim et.al. | 2503.01311 | link |
| 2025-03-05 | A Multi-Sensor Fusion Approach for Rapid Orthoimage Generation in Large-Scale UAV Mapping | Jialei He et.al. | 2503.01202 | null |
| 2025-03-02 | MTReD: 3D Reconstruction Dataset for Fly-over Videos of Maritime Domain | Rui Yi Yong et.al. | 2503.00853 | null |
| 2025-03-02 | PSRGS:Progressive Spectral Residual of 3D Gaussian for High-Frequency Recovery | BoCheng Li et.al. | 2503.00848 | null |
| 2025-03-02 | Multi-Cali Anything: Dense Feature Multi-Frame Structure-from-Motion for Large-Scale Camera Array Calibration | Jinjiang You et.al. | 2503.00737 | link |
| 2025-02-28 | The THESAN-ZOOM project: Burst, quench, repeat -- unveiling the evolution of high-redshift galaxies along the star-forming main sequence | William McClymont et.al. | 2503.00106 | null |
| 2025-02-27 | Best Foot Forward: Robust Foot Reconstruction in-the-wild | Kyle Fogarty et.al. | 2502.20511 | null |
| 2025-02-26 | SLAM in the Dark: Self-Supervised Learning of Pose, Depth and Loop-Closure from Thermal Images | Yangfan Xu et.al. | 2502.18932 | null |
| 2025-03-04 | Unposed Sparse Views Room Layout Reconstruction in the Age of Pretrain Model | Yaxuan Huang et.al. | 2502.16779 | null |
| 2025-02-20 | CDGS: Confidence-Aware Depth Regularization for 3D Gaussian Splatting | Qilin Zhang et.al. | 2502.14684 | link |
| 2025-02-19 | Structure-from-Sherds++: Robust Incremental 3D Reassembly of Axially Symmetric Pots from Unordered and Mixed Fragment Collections | Seong Jong Yoo et.al. | 2502.13986 | null |
| 2025-02-19 | IM360: Textured Mesh Reconstruction for Large-scale Indoor Mapping with 360 |
Dongki Jung et.al. | 2502.12545 | null |
| 2025-02-12 | Causal Analysis of ASR Errors for Children: Quantifying the Impact of Physiological, Cognitive, and Extrinsic Factors | Vishwanath Pratap Singh et.al. | 2502.08587 | null |
| 2025-02-10 | FOCUS -- Multi-View Foot Reconstruction From Synthetically Trained Dense Correspondences | Oliver Boyne et.al. | 2502.06367 | link |
| 2025-02-09 | Audio-Visual Representation Learning via Knowledge Distillation from Speech Foundation Models | Jing-Xuan Zhang et.al. | 2502.05766 | link |
| 2025-02-10 | Building Rome with Convex Optimization | Haoyu Han et.al. | 2502.04640 | null |
| 2025-02-04 | SiLVR: Scalable Lidar-Visual Radiance Field Reconstruction with Uncertainty Quantification | Yifu Tao et.al. | 2502.02657 | null |
| 2025-02-05 | GP-GS: Gaussian Processes for Enhanced Gaussian Splatting | Zhihao Guo et.al. | 2502.02283 | link |
| 2025-02-03 | XR-VIO: High-precision Visual Inertial Odometry with Fast Initialization for XR Applications | Shangjin Zhai et.al. | 2502.01297 | null |
| 2025-01-29 | Segmentation-Aware Generative Reinforcement Network (GRN) for Tissue Layer Segmentation in 3-D Ultrasound Images for Chronic Low-back Pain (cLBP) Assessment | Zixue Zeng et.al. | 2501.17690 | link |
| 2025-01-28 | Automatic Calibration of a Multi-Camera System with Limited Overlapping Fields of View for 3D Surgical Scene Reconstruction | Tim Flückiger et.al. | 2501.16221 | null |
| 2025-01-25 | Towards Better Robustness: Progressively Joint Pose-3DGS Learning for Arbitrarily Long Videos | Zhen-Hui Dong et.al. | 2501.15096 | null |
| 2025-01-24 | MATCHA:Towards Matching Anything | Fei Xue et.al. | 2501.14945 | null |
| 2025-01-24 | Light3R-SfM: Towards Feed-forward Structure-from-Motion | Sven Elflein et.al. | 2501.14914 | null |
| 2025-01-24 | Dense-SfM: Structure from Motion with Dense Consistent Matching | JongMin Lee et.al. | 2501.14277 | null |
| 2025-01-21 | Theory of quantum-geometric charge and spin Josephson diode effects in strongly spin-polarized hybrid structures with noncoplanar spin textures | Niklas L. Schulz et.al. | 2501.12232 | null |
| 2025-01-14 | Selective Attention Merging for low resource tasks: A case study of Child ASR | Natarajan Balaji Shankar et.al. | 2501.08468 | link |
| 2025-01-14 | SplatMAP: Online Dense Monocular SLAM with 3D Gaussian Splatting | Yue Hu et.al. | 2501.07015 | null |
| 2025-02-02 | CULTURE3D: Cultural Landmarks and Terrain Dataset for 3D Applications | Xinyi Zheng et.al. | 2501.06927 | link |
| 2025-01-11 | Aug3D: Augmenting large scale outdoor datasets for Generalizable Novel View Synthesis | Aditya Rauniyar et.al. | 2501.06431 | null |
| 2025-01-09 | Existence of dynamical fluctuation in AMPT generated data for Au+Au collisions at 10 AGeV | Somen Gope et.al. | 2501.05175 | null |
| 2025-01-06 | Targetless Intrinsics and Extrinsic Calibration of Multiple LiDARs and Cameras with IMU using Continuous-Time Estimation | Yuezhang Lv et.al. | 2501.02821 | null |
| 2025-01-02 | On Unifying Video Generation and Camera Pose Estimation | Chun-Hao Paul Huang et.al. | 2501.01409 | null |
| 2025-01-02 | EasySplat: View-Adaptive Learning makes 3D Gaussian Splatting Easy | Ao Gao et.al. | 2501.01003 | null |
| 2024-12-30 | KeyGS: A Keyframe-Centric Gaussian Splatting Method for Monocular Image Sequences | Keng-Wei Chang et.al. | 2412.20767 | null |
| 2024-12-27 | Dust to Tower: Coarse-to-Fine Photo-Realistic Scene Reconstruction from Sparse Uncalibrated Images | Xudong Cai et.al. | 2412.19518 | null |
| 2024-12-25 | Structured Speaker-Deficiency Adaptation of Foundation Models for Dysarthric and Elderly Speech Recognition | Shujie Hu et.al. | 2412.18832 | null |
| 2024-12-23 | Reconstructing People, Places, and Cameras | Lea Müller et.al. | 2412.17806 | link |
| 2024-12-18 | Foundation Models Meet Low-Cost Sensors: Test-Time Adaptation for Rescaling Disparity for Zero-Shot Metric Depth Estimation | Rémi Marsal et.al. | 2412.14103 | null |
| 2024-12-16 | Speech Foundation Models and Crowdsourcing for Efficient, High-Quality Data Collection | Beomseok Lee et.al. | 2412.11978 | null |
| 2024-12-18 | SplineGS: Robust Motion-Adaptive Spline for Real-Time Dynamic 3D Gaussians from Monocular Video | Jongmin Park et.al. | 2412.09982 | null |
| 2024-12-12 | CoDTS: Enhancing Sparsely Supervised Collaborative Perception with a Dual Teacher-Student Framework | Yushan Han et.al. | 2412.08344 | null |
| 2024-12-10 | Deep Non-rigid Structure-from-Motion Revisited: Canonicalization and Sequence Modeling | Hui Deng et.al. | 2412.07230 | null |
| 2024-12-08 | Unveiling True Talent: The Soccer Factor Model for Skill Evaluation | Alexandre Andorra et.al. | 2412.05911 | null |
| 2024-12-08 | Doppelgangers++: Improved Visual Disambiguation with Geometric 3D Features | Yuanbo Xiangli et.al. | 2412.05826 | null |
| 2024-12-06 | MegaSaM: Accurate, Fast, and Robust Structure and Motion from Casual Dynamic Videos | Zhengqi Li et.al. | 2412.04463 | null |
| 2024-12-03 | ASANet: Asymmetric Semantic Aligning Network for RGB and SAR image land cover classification | Pan Zhang et.al. | 2412.02044 | link |
| 2024-12-02 | SfM-Free 3D Gaussian Splatting via Hierarchical Training | Bo Ji et.al. | 2412.01553 | link |
| 2024-12-02 | MVImgNet2.0: A Larger-scale Dataset of Multi-view Images | Xiaoguang Han et.al. | 2412.01430 | null |
| 2024-12-02 | TAS-TsC: A Data-Driven Framework for Estimating Time of Arrival Using Temporal-Attribute-Spatial Tri-space Coordination of Truck Trajectories | Mengran Li et.al. | 2412.01122 | null |
| 2024-12-02 | Look Ma, No Ground Truth! Ground-Truth-Free Tuning of Structure from Motion and Visual SLAM | Alejandro Fontan et.al. | 2412.01116 | null |
| 2024-11-27 | RoMo: Robust Motion Segmentation Improves Structure from Motion | Lily Goli et.al. | 2411.18650 | null |
| 2024-11-26 | The MAGPI Survey: radial trends in star formation across different cosmological simulations in comparison with observations at |
Marcie Mun et.al. | 2411.17882 | null |
| 2024-11-25 | Characterizing Stellar and Gas Properties in NGC 628: Spatial Distributions, Radial Gradients, and Resolved Scaling Relations | Peng Wei et.al. | 2411.16150 | null |
| 2024-11-24 | ZeroGS: Training 3D Gaussian Splatting from Unposed Images | Yu Chen et.al. | 2411.15779 | null |
| 2024-11-20 | DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild | Weicai Ye et.al. | 2411.13291 | null |
| 2024-11-15 | SPARS3R: Semantic Prior Alignment and Regularization for Sparse 3D Reconstruction | Yutao Tang et.al. | 2411.12592 | link |
| 2024-11-15 | The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual Localisation, Reconstruction and Radiance Field Methods | Yifu Tao et.al. | 2411.10546 | null |
| 2024-11-13 | 4D Gaussian Splatting in the Wild with Uncertainty-Aware Regularization | Mijeong Kim et.al. | 2411.08879 | null |
| 2024-11-13 | Biomass phenotyping of oilseed rape through UAV multi-view oblique imaging with 3DGS and SAM model | Yutao Shen et.al. | 2411.08453 | null |
| 2024-11-08 | From Transparent to Opaque: Rethinking Neural Implicit Surfaces with |
Haoran Zhang et.al. | 2411.05362 | link |
| 2024-10-29 | A Cascade Approach for APT Campaign Attribution in System Event Logs: Technique Hunting and Subgraph Matching | Yi-Ting Huang et.al. | 2410.22602 | null |
| 2024-10-29 | LiVisSfM: Accurate and Robust Structure-from-Motion with LiDAR and Visual Cues | Hanqing Jiang et.al. | 2410.22213 | null |
| 2024-10-17 | Stochastic Flow Matching for Resolving Small-Scale Physics | Stathi Fotiadis et.al. | 2410.19814 | null |
| 2024-10-25 | A Robust and Efficient Visual-Inertial Initialization with Probabilistic Normal Epipolar Constraint | Changshi Mu et.al. | 2410.19473 | link |
| 2024-10-30 | Large Spatial Model: End-to-end Unposed Images to Semantic 3D | Zhiwen Fan et.al. | 2410.18956 | link |
| 2024-10-23 | CO-CAVITY project: Molecular gas and star formation in void galaxies | M. I. Rodríguez et.al. | 2410.18078 | null |
| 2024-10-23 | PLGS: Robust Panoptic Lifting with 3D Gaussian Splatting | Yu Wang et.al. | 2410.17505 | null |
| 2024-10-20 | Neural Active Structure-from-Motion in Dark and Textureless Environment | Kazuto Ichimaru et.al. | 2410.15378 | null |
| 2024-10-17 | SemSim: Revisiting Weak-to-Strong Consistency from a Semantic Similarity Perspective for Semi-supervised Medical Image Segmentation | Shiao Xie et.al. | 2410.13486 | null |
| 2024-10-16 | Multi-View Multi-Task Modeling with Speech Foundation Models for Speech Forensic Tasks | Orchid Chetia Phukan et.al. | 2410.12947 | null |
| 2024-10-16 | Gravity-aligned Rotation Averaging with Circular Regression | Linfei Pan et.al. | 2410.12763 | link |
| 2024-10-16 | Beyond Speech and More: Investigating the Emergent Ability of Speech Foundation Models for Classifying Physiological Time-Series Signals | Orchid Chetia Phukan et.al. | 2410.12645 | null |
| 2024-10-15 | SplatPose+: Real-time Image-Based Pose-Agnostic 3D Anomaly Detection | Yizhe Liu et.al. | 2410.12080 | link |
| 2024-10-15 | LoGS: Visual Localization via Gaussian Splatting with Fewer Training Images | Yuzhou Cheng et.al. | 2410.11505 | null |
| 2024-10-15 | Multiview Scene Graph | Juexiao Zhang et.al. | 2410.11187 | link |
| 2024-10-12 | Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence | Felipe Cadar et.al. | 2410.09533 | link |
| 2024-10-09 | Surgical Depth Anything: Depth Estimation for Surgical Scenes using Foundation Models | Ange Lou et.al. | 2410.07434 | null |
| 2024-10-09 | Deep HI Mapping of M 106 Group with FAST | Yao Liu et.al. | 2410.07038 | null |
| 2024-10-09 | MaD-Scientist: AI-based Scientist solving Convection-Diffusion-Reaction Equations Using Massive PINN-Based Prior Data | Mingu Kang et.al. | 2410.06442 | null |
| 2024-10-08 | Are Minimal Radial Distortion Solvers Necessary for Relative Pose Estimation? | Charalambos Tzamos et.al. | 2410.05984 | link |
| 2024-10-04 | Refinement of Monocular Depth Maps via Multi-View Differentiable Rendering | Laura Fink et.al. | 2410.03861 | link |
| 2024-10-01 | MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages | Marco Gaido et.al. | 2410.01036 | link |
| 2024-10-01 | Seamless Augmented Reality Integration in Arthroscopy: A Pipeline for Articular Reconstruction and Guidance | Hongchao Shu et.al. | 2410.00386 | null |
| 2024-09-29 | Robust Incremental Structure-from-Motion with Hybrid Features | Shaohui Liu et.al. | 2409.19811 | null |
| 2024-09-27 | MASt3R-SfM: a Fully-Integrated Solution for Unconstrained Structure-from-Motion | Bardienus Duisterhof et.al. | 2409.19152 | null |
| 2024-09-27 | Exploiting Motion Prior for Accurate Pose Estimation of Dashboard Cameras | Yipeng Lu et.al. | 2409.18673 | null |
| 2024-09-26 | BlinkTrack: Feature Tracking over 100 FPS via Events and Images | Yichen Shen et.al. | 2409.17981 | null |
| 2024-09-25 | How to Connect Speech Foundation Models and Large Language Models? What Matters and What Does Not | Francesco Verdini et.al. | 2409.17044 | null |
| 2024-09-24 | Frequency-based View Selection in Gaussian Splatting Reconstruction | Monica M. Q. Li et.al. | 2409.16470 | null |
| 2024-10-07 | Initialization of Monocular Visual Navigation for Autonomous Agents Using Modified Structure from Small Motion | Juan-Diego Florez et.al. | 2409.16465 | null |
| 2024-09-24 | Exploring the potential of collaborative UAV 3D mapping in Kenyan savanna for wildlife research | Vandita Shukla et.al. | 2409.15914 | null |
| 2024-09-23 | Assessment of Submillimeter Precision via Structure from Motion Technique in Close-Range Capture Environments | Francisco Roza de Moraes et.al. | 2409.15602 | null |
| 2024-09-23 | Evaluating Robot Influence on Pedestrian Behavior Models for Crowd Simulation and Benchmarking | Subham Agrawal et.al. | 2409.14844 | null |
| 2024-09-21 | Are Music Foundation Models Better at Singing Voice Deepfake Detection? Far-Better Fuse them with Speech Foundation Models | Orchid Chetia Phukan et.al. | 2409.14131 | null |
| 2024-09-17 | GS-Net: Generalizable Plug-and-Play 3D Gaussian Splatting Module | Yichen Zhang et.al. | 2409.11307 | null |
| 2024-09-13 | Dense Point Clouds Matter: Dust-GS for Scene Reconstruction from Sparse Viewpoints | Shan Chen et.al. | 2409.08613 | null |
| 2024-09-09 | KRONC: Keypoint-based Robust Camera Optimization for 3D Car Reconstruction | Davide Di Nucci et.al. | 2409.05407 | null |
| 2024-09-06 | The Arizona Molecular ISM Survey with the SMT: Variations in the CO(2-1)/CO(1-0) Line Ratio Across the Galaxy Population | Ryan P. Keenan et.al. | 2409.03963 | null |
| 2024-09-05 | Active Galactic Nuclei in the Green Valley at z |
Charity Woodrum et.al. | 2409.03197 | null |
| 2024-09-04 | Object Gaussian for Monocular 6D Pose Estimation from Sparse Views | Luqing Luo et.al. | 2409.02581 | null |
| 2024-09-11 | Geometry-aware Feature Matching for Large-Scale Structure from Motion | Gonglin Chen et.al. | 2409.02310 | null |
| 2024-09-04 | The study of strongly intensive observables for |
Tumpa Biswas et.al. | 2409.00525 | null |
| 2024-09-04 | Augmented Reality without Borders: Achieving Precise Localization Without Maps | Albert Gassol Puigjaner et.al. | 2408.17373 | null |
| 2024-09-15 | Mismatched: Evaluating the Limits of Image Matching Approaches and Benchmarks | Sierra Bonilla et.al. | 2408.16445 | link |
| 2024-08-21 | Visual Localization in 3D Maps: Comparing Point Cloud, Mesh, and NeRF Representations | Lintong Zhang et.al. | 2408.11966 | null |
| 2024-08-20 | TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks | Jinjie Mai et.al. | 2408.10739 | null |
| 2024-08-16 | Correspondence-Guided SfM-Free 3D Gaussian Splatting for NVS | Wei Sun et.al. | 2408.08723 | null |
| 2024-08-15 | CorrAdaptor: Adaptive Local Context Learning for Correspondence Pruning | Wei Zhu et.al. | 2408.08134 | link |
| 2024-08-13 | A Miniature Vision-Based Localization System for Indoor Blimps | Shicong Ma et.al. | 2408.06648 | null |
| 2024-08-07 | Towards Real-Time Gaussian Splatting: Accelerating 3DGS through Photometric SLAM | Yan Song Hu et.al. | 2408.03825 | null |
| 2024-08-05 | Context-aware Mamba-based Reinforcement Learning for social robot navigation | Syed Muhammad Mustafa et.al. | 2408.02661 | null |
| 2024-08-04 | Birational geometry of critical loci in Algebraic Vision | Marina Bertolini et.al. | 2408.02067 | null |
| 2024-08-04 | PanicleNeRF: low-cost, high-precision in-field phenotypingof rice panicles with smartphone | Xin Yang et.al. | 2408.02053 | null |
| 2024-08-02 | Structure from Motion-based Motion Estimation and 3D Reconstruction of Unknown Shaped Space Debris | Kentaro Uno et.al. | 2408.01035 | null |
| 2024-08-01 | LoopSparseGS: Loop Based Sparse-View Friendly Gaussian Splatting | Zhenyu Bao et.al. | 2408.00254 | null |
| 2024-07-29 | Global Structure-from-Motion Revisited | Linfei Pan et.al. | 2407.20219 | link |
| 2024-08-06 | Revisit Self-supervised Depth Estimation with Local Structure-from-Motion | Shengjie Zhu et.al. | 2407.19166 | null |
| 2024-07-23 | The Hidden Variables: Harnessing Half-Shell Potentials for Enhanced Precision in Nuclear Reaction Calculations | Hao Liu et.al. | 2407.16452 | null |
| 2024-07-22 | Enhancement of 3D Gaussian Splatting using Raw Mesh for Photorealistic Recreation of Architectures | Ruizhe Wang et.al. | 2407.15435 | null |
| 2024-07-16 | NeuSurfEmb: A Complete Pipeline for Dense Correspondence-based 6D Object Pose Estimation without CAD Models | Francesco Milano et.al. | 2407.12207 | link |
| 2024-07-15 | LVCP: LiDAR-Vision Tightly Coupled Collaborative Real-time Relative Positioning | Zhuozhu Jian et.al. | 2407.10782 | null |
| 2024-07-15 | Towards Scale-Aware Full Surround Monodepth with Transformers | Yuchen Yang et.al. | 2407.10406 | null |
| 2024-07-14 | 3DEgo: 3D Editing on the Go! | Umar Khalid et.al. | 2407.10102 | null |
| 2024-07-10 | Hybrid Structure-from-Motion and Camera Relocalization for Enhanced Egocentric Localization | Jinjie Mai et.al. | 2407.08023 | link |
| 2024-07-10 | Euclid preparation. Forecasting the recovery of galaxy physical properties and their relations with template-fitting and machine-learning methods | Euclid Collaboration et.al. | 2407.07940 | null |
| 2024-07-10 | Controlling Space and Time with Diffusion Models | Daniel Watson et.al. | 2407.07860 | null |
| 2024-07-09 | Computer vision tasks for intelligent aerospace missions: An overview | Huilin Chen et.al. | 2407.06513 | null |
| 2024-07-08 | Enhancing Neural Radiance Fields with Depth and Normal Completion Priors from Sparse Views | Jiawei Guo et.al. | 2407.05666 | null |
| 2024-07-05 | Efficient Detection of Long Consistent Cycles and its Application to Distributed Synchronization | Shaohan Li et.al. | 2407.04260 | null |
| 2024-07-15 | SfM on-the-fly: Get better 3D from What You Capture | Zongqian Zhan et.al. | 2407.03939 | null |
| 2024-07-03 | Free-SurGS: SfM-Free 3D Gaussian Splatting for Surgical Scene Reconstruction | Jiaxin Guo et.al. | 2407.02918 | link |
| 2024-07-02 | Indoor 3D Reconstruction with an Unknown Camera-Projector Pair | Zhaoshuai Qi et.al. | 2407.01945 | null |
| 2024-06-27 | SALVe: Semantic Alignment Verification for Floorplan Reconstruction from Sparse Panoramas | John Lambert et.al. | 2406.19390 | link |
| 2024-06-27 | STAL3D: Unsupervised Domain Adaptation for 3D Object Detection via Collaborating Self-Training and Adversarial Learning | Yanan Zhang et.al. | 2406.19362 | null |
| 2024-06-26 | VDG: Vision-Only Dynamic Gaussian for Driving Simulation | Hao Li et.al. | 2406.18198 | null |
| 2024-06-25 | Consensus Learning with Deep Sets for Essential Matrix Estimation | Dror Moran et.al. | 2406.17414 | link |
| 2024-06-24 | Crowd-Sourced NeRF: Collecting Data from Production Vehicles for 3D Street View Reconstruction | Tong Qin et.al. | 2406.16289 | null |
| 2024-06-21 | The importance of stochasticity in determining galaxy emissivities and UV LFs during cosmic dawn and reionization | Ivan Nikolić et.al. | 2406.15237 | link |
| 2024-06-19 | MVSBoost: An Efficient Point Cloud-based 3D Reconstruction | Umair Haroon et.al. | 2406.13515 | null |
| 2024-06-17 | MegaScenes: Scene-Level View Synthesis at Scale | Joseph Tung et.al. | 2406.11819 | link |
| 2024-06-15 | Benchmarking Children's ASR with Supervised and Self-supervised Speech Foundation Models | Ruchao Fan et.al. | 2406.10507 | link |
| 2024-06-14 | On the Evaluation of Speech Foundation Models for Spoken Language Understanding | Siddhant Arora et.al. | 2406.10083 | null |
| 2024-06-12 | Self-supervised Learning of Neural Implicit Feature Fields for Camera Pose Refinement | Maxime Pietrantoni et.al. | 2406.08463 | null |
| 2024-06-12 | SVSNet+: Enhancing Speaker Voice Similarity Assessment Models with Representations from Speech Foundation Models | Chun Yin et.al. | 2406.08445 | null |
| 2024-06-10 | Lighting Every Darkness with 3DGS: Fast Training and Real-Time Rendering for HDR View Synthesis | Xin Jin et.al. | 2406.06216 | link |
| 2024-06-07 | The Star-Forming Main Sequence in JADES and CEERS at |
Leonardo Clarke et.al. | 2406.05178 | null |
| 2024-06-13 | Gaussian Splatting with Localized Points Management | Haosen Yang et.al. | 2406.04251 | null |
| 2024-06-05 | L-PR: Exploiting LiDAR Fiducial Marker for Unordered Low Overlap Multiview Point Cloud Registration | Yibo Liu et.al. | 2406.03298 | link |
| 2024-06-04 | CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation | Dejia Xu et.al. | 2406.02509 | null |
| 2024-05-29 | Neural Radiance Fields for Novel View Synthesis in Monocular Gastroscopy | Zijie Jiang et.al. | 2405.18863 | null |
| 2024-05-29 | 3D Reconstruction with Fast Dipole Sums | Hanyu Chen et.al. | 2405.16788 | null |
| 2024-05-26 | MCGMapper: Light-Weight Incremental Structure from Motion and Visual Localization With Planar Markers and Camera Groups | Yusen Xie et.al. | 2405.16599 | null |
| 2024-05-26 | Categorical Flow Matching on Statistical Manifolds | Chaoran Cheng et.al. | 2405.16441 | link |
| 2024-05-22 | Exploring Galaxy Properties of eCALIFA with Contrastive Learning | G. Martínez-Solaeche et.al. | 2405.13471 | null |
| 2024-05-23 | Switched Flow Matching: Eliminating Singularities via Switching ODEs | Qunxi Zhu et.al. | 2405.11605 | null |
| 2024-05-28 | NeRO: Neural Road Surface Reconstruction | Ruibo Wang et.al. | 2405.10554 | link |
| 2024-05-15 | Three Dimensional Spatial Cognition: Bees and Bats | Robert Worden et.al. | 2405.09413 | null |
| 2024-05-09 | Similarity Guided Multimodal Fusion Transformer for Semantic Location Prediction in Social Media | Zhizhen Zhang et.al. | 2405.05760 | null |
| 2024-05-09 | Power Variable Projection for Initialization-Free Large-Scale Bundle Adjustment | Simon Weber et.al. | 2405.05079 | link |
| 2024-05-07 | Novel View Synthesis with Neural Radiance Fields for Industrial Robot Applications | Markus Hillemann et.al. | 2405.04345 | null |
| 2024-05-07 | Non-rigid Structure-from-Motion: Temporally-smooth Procrustean Alignment and Spatially-variant Deformation Modeling | Jiawei Shi et.al. | 2405.04309 | null |
| 2024-05-06 | Transformer-based RGB-T Tracking with Channel and Spatial Feature Fusion | Yunfeng Li et.al. | 2405.03177 | link |
| 2024-05-03 | HoloGS: Instant Depth-based 3D Gaussian Splatting with Microsoft HoloLens 2 | Miriam Jäger et.al. | 2405.02005 | null |
| 2024-04-25 | The MAGPI Survey: Evolution of radial trends in star formation activity across cosmic time | Marcie Mun et.al. | 2404.16319 | null |
| 2024-04-22 | Scene Coordinate Reconstruction: Posing of Image Collections via Incremental Learning of a Relocalizer | Eric Brachmann et.al. | 2404.14351 | null |
| 2024-04-22 | RESFM: Robust Equivariant Multiview Structure from Motion | Fadi Khatib et.al. | 2404.14280 | null |
| 2024-04-22 | Does Gaussian Splatting need SFM Initialization? | Yalda Foroutan et.al. | 2404.12547 | null |
| 2024-05-07 | A Subspace-Constrained Tyler's Estimator and its Applications to Structure from Motion | Feng Yu et.al. | 2404.11590 | link |
| 2024-04-18 | DeblurGS: Gaussian Splatting for Camera Motion Blur | Jeongtaek Oh et.al. | 2404.11358 | null |
| 2024-05-21 | LetsGo: Large-Scale Garage Modeling and Rendering via LiDAR-Assisted Gaussian Primitives | Jiadi Cui et.al. | 2404.09748 | null |
| 2024-04-12 | MonoPatchNeRF: Improving Neural Radiance Fields with Patch-based Monocular Guidance | Yuqun Wu et.al. | 2404.08252 | null |
| 2024-04-11 | Boosting Self-Supervision for Single-View Scene Completion via Knowledge Distillation | Keonhee Han et.al. | 2404.07933 | null |
| 2024-04-07 | NeRF2Points: Large-Scale Point Cloud Generation From Street Views' Radiance Field Optimization | Peng Tu et.al. | 2404.04875 | null |
| 2024-04-04 | GaSpCT: Gaussian Splatting for Novel CT Projection View Synthesis | Emmanouil Nikolakakis et.al. | 2404.03126 | null |
| 2024-03-29 | InstantSplat: Unbounded Sparse-view Pose-free Gaussian Splatting in 40 Seconds | Zhiwen Fan et.al. | 2403.20309 | link |
| 2024-03-29 | HO-Gaussian: Hybrid Optimization of 3D Gaussian Splatting for Urban Scenes | Zhuopeng Li et.al. | 2403.20032 | null |
| 2024-03-26 | NeRF-HuGS: Improved Neural Radiance Fields in Non-static Scenes Using Heuristics-Guided Segmentation | Jiahao Chen et.al. | 2403.17537 | null |
| 2024-03-25 | INPC: Implicit Neural Point Clouds for Radiance Field Rendering | Florian Hahlbohm et.al. | 2403.16862 | null |
| 2024-03-18 | An Accurate and Real-time Relative Pose Estimation from Triple Point-line Images by Decoupling Rotation and Translation | Zewen Xu et.al. | 2403.11639 | null |
| 2024-03-14 | Relaxing Accurate Initialization Constraint for 3D Gaussian Splatting | Jaewoo Jung et.al. | 2403.09413 | link |
| 2024-03-13 | Refractive COLMAP: Refractive Structure-from-Motion Revisited | Mengkun She et.al. | 2403.08640 | null |
| 2024-03-13 | NeRF-Supervised Feature Point Detection and Description | Ali Youssef et.al. | 2403.08156 | link |
| 2024-03-11 | SiLVR: Scalable Lidar-Visual Reconstruction with Neural Radiance Fields for Robotic Inspection | Yifu Tao et.al. | 2403.06877 | null |
| 2024-03-24 | BAGS: Blur Agnostic Gaussian Splatting through Multi-Scale Kernel Modeling | Cheng Peng et.al. | 2403.04926 | link |
| 2024-02-22 | GaussianPro: 3D Gaussian Splatting with Progressive Propagation | Kai Cheng et.al. | 2402.14650 | null |
| 2024-02-25 | A Robust Error-Resistant View Selection Method for 3D Reconstruction | Shaojie Zhang et.al. | 2402.11431 | null |
| 2024-02-17 | Dense Matchers for Dense Tracking | Tomáš Jelínek et.al. | 2402.11287 | null |
| 2024-03-11 | Local Feature Matching Using Deep Learning: A Survey | Shibiao Xu et.al. | 2401.17592 | link |
| 2024-01-22 | HG3-NeRF: Hierarchical Geometric, Semantic, and Photometric Guided Neural Radiance Fields for Sparse View Inputs | Zelin Gao et.al. | 2401.11711 | null |
| 2024-01-19 | SCENES: Subpixel Correspondence Estimation With Epipolar Supervision | Dominik A. Kloepfer et.al. | 2401.10886 | null |
| 2024-01-15 | 3DMASC: Accessible, explainable 3D point clouds classification. Application to Bi-spectral Topo-bathymetric lidar data | Mathilde Letard et.al. | 2401.09481 | link |
| 2024-01-17 | 3D Scene Geometry Estimation from 360 |
Thiago Lopes Trugillo da Silveira et.al. | 2401.09252 | null |
| 2024-01-17 | ICON: Incremental CONfidence for Joint Pose and Radiance Field Optimization | Weiyao Wang et.al. | 2401.08937 | null |
| 2024-01-16 | Cross-Modal Semi-Dense 6-DoF Tracking of an Event Camera in Challenging Conditions | Yi-Fan Zuo et.al. | 2401.08043 | link |
| 2024-01-10 | Structure from Duplicates: Neural Inverse Graphics from a Pile of Objects | Tianhang Cheng et.al. | 2401.05236 | link |
| 2024-01-07 | A Classification of Critical Configurations for any Number of Projective Views | Martin Bråtelund et.al. | 2401.03450 | link |
| 2023-12-24 | Residual Learning for Image Point Descriptors | Rashik Shrestha et.al. | 2312.15471 | null |
| 2023-12-16 | Transformers in Unsupervised Structure-from-Motion | Hemang Chawla et.al. | 2312.10529 | link |
| 2023-12-14 | HeadRecon: High-Fidelity 3D Head Reconstruction from Monocular Video | Xueying Wang et.al. | 2312.08863 | null |
| 2023-12-14 | CF-NeRF: Camera Parameter Free Neural Radiance Fields with Incremental Learning | Qingsong Yan et.al. | 2312.08760 | null |
| 2023-12-11 | Keypoint-based Stereophotoclinometry for Characterizing and Navigating Small Bodies: A Factor Graph Approach | Travis Driver et.al. | 2312.06865 | link |
| 2023-12-11 | Gaussian Splatting SLAM | Hidenobu Matsuki et.al. | 2312.06741 | null |
| 2023-12-10 | SuperPrimitive: Scene Reconstruction at a Primitive Level | Kirill Mazur et.al. | 2312.05889 | null |
| 2023-12-07 | Visual Geometry Grounded Deep Structure From Motion | Jianyuan Wang et.al. | 2312.04563 | null |
| 2023-11-30 | Distributed Global Structure-from-Motion with a Deep Front-End | Ayush Baid et.al. | 2311.18801 | link |
| 2023-11-21 | Robot Hand-Eye Calibration using Structure-from-Motion | Nicolas Andreff et.al. | 2311.11808 | null |
| 2023-11-18 | LOSTU: Fast, Scalable, and Uncertainty-Aware Triangulation | Sébastien Henry et.al. | 2311.11171 | null |
| 2023-11-10 | MonoProb: Self-Supervised Monocular Depth Estimation with Interpretable Uncertainty | Rémi Marsal et.al. | 2311.06137 | link |
| 2023-11-08 | VET: Visual Error Tomography for Point Cloud Completion and High-Quality Neural Rendering | Linus Franke et.al. | 2311.04634 | link |
| 2023-10-22 | A Quantitative Evaluation of Dense 3D Reconstruction of Sinus Anatomy from Monocular Endoscopic Video | Jan Emily Mangulabnan et.al. | 2310.14364 | null |
| 2023-10-20 | FMRT: Learning Accurate Feature Matching with Reconciliatory Transformer | Xinyu Zhang et.al. | 2310.13605 | null |
| 2023-10-09 | Colmap-PCD: An Open-source Tool for Fine Image-to-point cloud Registration | Chunge Bai et.al. | 2310.05504 | link |
| 2023-10-08 | LocoNeRF: A NeRF-based Approach for Local Structure from Motion for Precise Localization | Artem Nenashev et.al. | 2310.05134 | null |
| 2023-11-29 | Pose-Free Generalizable Rendering Transformer | Zhiwen Fan et.al. | 2310.03704 | link |
| 2023-10-02 | Leveraging Cutting Edge Deep Learning Based Image Matching for Reconstructing a Large Scene from Sparse Images | Georg Bökman et.al. | 2310.01092 | null |
| 2023-10-01 | Propagating Semantic Labels in Video Data | David Balaban et.al. | 2310.00783 | null |
| 2023-09-22 | Scalable Semantic 3D Mapping of Coral Reefs with Deep Learning | Jonathan Sauder et.al. | 2309.12804 | null |
| 2023-09-21 | On-the-Fly SfM: What you capture is What you get | Zongqian Zhan et.al. | 2309.11883 | link |
| 2023-09-19 | Using an Uncrewed Surface Vehicle to Create a Volumetric Model of Non-Navigable Rivers and Other Shallow Bodies of Water | Jayesh Tripathi et.al. | 2309.10269 | null |
| 2023-09-16 | DynaMoN: Motion-Aware Fast And Robust Camera Localization for Dynamic NeRF | Mert Asim Karaoglu et.al. | 2309.08927 | link |
| 2023-09-08 | Robot Localization and Mapping Final Report -- Sequential Adversarial Learning for Self-Supervised Deep Visual Odometry | Akankshya Kar et.al. | 2309.04147 | null |
| 2023-09-01 | SQLdepth: Generalizable Self-Supervised Fine-Structured Monocular Depth Estimation | Youhong Wang et.al. | 2309.00526 | null |
| 2023-09-01 | Dense Voxel 3D Reconstruction Using a Monocular Event Camera | Haodong Chen et.al. | 2309.00385 | null |
| 2023-08-30 | Learning Structure-from-Motion with Graph Attention Networks | Lucas Brynte et.al. | 2308.15984 | link |
| 2023-08-26 | Disjoint Pose and Shape for 3D Face Reconstruction | Raja Kumar et.al. | 2308.13903 | null |
| 2023-08-30 | CamP: Camera Preconditioning for Neural Radiance Fields | Keunhong Park et.al. | 2308.10902 | null |
| 2023-08-18 | Unsupervised 3D Pose Estimation with Non-Rigid Structure-from-Motion Modeling | Haorui Ji et.al. | 2308.10705 | null |
| 2023-08-14 | Large-scale environment mapping and immersive human-robot interaction for agricultural mobile robot teleoperation | Tao Liu et.al. | 2308.07231 | link |
| 2023-08-11 | Efficient Large-scale AUV-based Visual Seafloor Mapping | Mengkun She et.al. | 2308.06147 | null |
| 2023-08-04 | EDI: ESKF-based Disjoint Initialization for Visual-Inertial SLAM Systems | Weihan Wang et.al. | 2308.02670 | null |
| 2023-08-15 | Tirtha -- An Automated Platform to Crowdsource Images and Create 3D Models of Heritage Sites | Jyotirmaya Shivottam et.al. | 2308.01246 | link |
| 2023-08-02 | Stereo Visual Odometry with Deep Learning-Based Point and Line Feature Matching using an Attention Graph Neural Network | Shenbagaraj Kannapiran et.al. | 2308.01125 | null |
| 2023-07-27 | PointOdyssey: A Large-Scale Synthetic Dataset for Long-Term Point Tracking | Yang Zheng et.al. | 2307.15055 | link |
| 2023-07-28 | SACReg: Scene-Agnostic Coordinate Regression for Visual Localization | Jerome Revaud et.al. | 2307.11702 | null |
| 2023-07-19 | Lazy Visual Localization via Motion Averaging | Siyan Dong et.al. | 2307.09981 | null |
| 2023-07-10 | Efficient Match Pair Retrieval for Large-scale UAV Images via Graph Indexed Global Descriptor | San Jiang et.al. | 2307.04520 | null |
| 2023-07-07 | RGB-D Mapping and Tracking in a Plenoxel Radiance Field | Andreas L. Teigen et.al. | 2307.03404 | link |
| 2023-06-29 | The Drunkard's Odometry: Estimating Camera Motion in Deforming Scenes | David Recasens et.al. | 2306.16917 | link |
| 2023-06-27 | Detector-Free Structure from Motion | Xingyi He et.al. | 2306.15669 | link |
| 2023-06-28 | PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment | Jianyuan Wang et.al. | 2306.15667 | null |
| 2023-06-24 | 3D Reconstruction of Spherical Images based on Incremental Structure from Motion | San Jiang et.al. | 2306.12770 | link |
| 2023-06-15 | NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations | Varun Jampani et.al. | 2306.09109 | link |
| 2023-06-15 | Yes, we CANN: Constrained Approximate Nearest Neighbors for local feature-based visual localization | Dror Aiger et.al. | 2306.09012 | link |
| 2023-06-10 | 3D reconstruction using Structure for Motion | Kshitij Karnawat et.al. | 2306.06360 | link |
| 2023-06-02 | Self-supervised Interest Point Detection and Description for Fisheye and Perspective Images | Marcela Mera-Trujillo et.al. | 2306.01938 | null |
| 2023-05-31 | FlowCam: Training Generalizable 3D Radiance Fields without Camera Poses via Pixel-Aligned Scene Flow | Cameron Smith et.al. | 2306.00180 | null |
| 2023-05-19 | SIDAR: Synthetic Image Dataset for Alignment & Restoration | Monika Kwiatkowski et.al. | 2305.12036 | link |
| 2023-05-09 | Eiffel Tower: A Deep-Sea Underwater Dataset for Long-Term Visual Localization | Clémentin Boittiaux et.al. | 2305.05301 | link |
| 2023-05-09 | Rotation Synchronization via Deep Matrix Factorization | Gk Tejus et.al. | 2305.05268 | link |
| 2023-04-20 | A Comparative Neural Radiance Field (NeRF) 3D Analysis of Camera Poses from HoloLens Trajectories and Structure from Motion | Miriam Jäger et.al. | 2304.10664 | null |
| 2023-04-14 | Fusing Structure from Motion and Simulation-Augmented Pose Regression from Optical Flow for Challenging Indoor Environments | Felix Ott et.al. | 2304.07250 | null |
| 2023-04-12 | Visual Localization using Imperfect 3D Models from the Internet | Vojtech Panek et.al. | 2304.05947 | link |
| 2023-04-08 | Photometric Correction for Infrared Sensors | Jincheng Zhang et.al. | 2304.03930 | null |
| 2023-04-07 | DualRefine: Self-Supervised Depth and Pose Estimation Through Iterative Epipolar Sampling and Refinement Toward Equilibrium | Antyanta Bangunharcana et.al. | 2304.03560 | link |
| 2023-04-05 | Semantic Validation in Structure from Motion | Joseph Rowell et.al. | 2304.02420 | link |
| 2023-03-31 | Learning Internal Representations of 3D Transformations from 2D Projected Inputs | Marissa Connor et.al. | 2303.17776 | null |
| 2023-03-30 | 3D Line Mapping Revisited | Shaohui Liu et.al. | 2303.17504 | link |
| 2023-03-27 | TMO: Textured Mesh Acquisition of Objects with a Mobile Device by using Differentiable Rendering | Jaehoon Choi et.al. | 2303.15060 | null |
| 2023-03-26 | On the Importance of Accurate Geometry Data for Dense 3D Vision Tasks | HyunJun Jung et.al. | 2303.14840 | link |
| 2023-03-24 | Seeing Through the Glass: Neural 3D Reconstruction of Object Inside a Transparent Container | Jinguang Tong et.al. | 2303.13805 | link |
| 2023-03-24 | Progressively Optimized Local Radiance Fields for Robust View Synthesis | Andreas Meuleman et.al. | 2303.13791 | null |
| 2023-03-15 | RefiNeRF: Modelling dynamic neural radiance fields with inconsistent or missing camera parameters | Shuja Khalid et.al. | 2303.08695 | null |
| 2023-03-09 | Revisiting Rotation Averaging: Uncertainties and Robust Losses | Ganlin Zhang et.al. | 2303.05195 | link |
| 2023-02-28 | Nonlinear Intensity, Scale and Rotation Invariant Matching for Multimodal Images | Zhongli Fan et.al. | 2302.14239 | link |
| 2023-03-25 | BLiRF: Bandlimited Radiance Fields for Dynamic Scene Modeling | Sameera Ramasinghe et.al. | 2302.13543 | null |
| 2023-02-21 | EC-SfM: Efficient Covisibility-based Structure-from-Motion for Both Sequential and Unordered Images | Zhichao Ye et.al. | 2302.10544 | link |
| 2023-02-18 | Bridge Damage Cause Estimation Using Multiple Images Based on Visual Question Answering | Tatsuro Yamane et.al. | 2302.09208 | null |
| 2023-02-12 | Uncertainty-Driven Dense Two-View Structure from Motion | Weirong Chen et.al. | 2302.00523 | null |
| 2023-01-28 | AdaSfM: From Coarse Global to Fine Incremental Adaptive Structure from Motion | Yu Chen et.al. | 2301.12135 | null |
| 2023-01-20 | A vision-based autonomous UAV inspection framework for unknown tunnel construction sites with dynamic obstacles | Zhefan Xu et.al. | 2301.08422 | link |
| 2023-03-21 | Robust Dynamic Radiance Fields | Yu-Lun Liu et.al. | 2301.02239 | link |
| 2022-12-24 | Polarimetric Multi-View Inverse Rendering | Jinyu Zhao et.al. | 2212.12721 | null |
| 2022-12-13 | Accidental Turntables: Learning 3D Pose by Watching Objects Turn | Zezhou Cheng et.al. | 2212.06300 | null |
| 2022-12-04 | 3D Object Aided Self-Supervised Monocular Depth Estimation | Songlin Wei et.al. | 2212.01768 | null |
| 2022-12-02 | High-Res Facial Appearance Capture from Polarized Smartphone Images | Dejan Azinović et.al. | 2212.01160 | null |
| 2022-11-28 | FeatureBooster: Boosting Feature Descriptors with a Lightweight Neural Network | Xinjiang Wang et.al. | 2211.15069 | link |
| 2022-11-24 | JigsawPlan: Room Layout Jigsaw Puzzle Extreme Structure from Motion using Diffusion Models | Sepidehsadat Hosseini et.al. | 2211.13785 | null |
| 2022-11-24 | SfM-TTR: Using Structure from Motion for Test-Time Refinement of Single-View Depth Networks | Sergio Izquierdo et.al. | 2211.13551 | link |
| 2022-11-22 | Level-S |
Yuxi Xiao et.al. | 2211.12018 | link |
| 2022-11-21 | Towards Live 3D Reconstruction from Wearable Video: An Evaluation of V-SLAM, NeRF, and Videogrammetry Techniques | David Ramirez et.al. | 2211.11836 | null |
| 2022-11-14 | Controllable GAN Synthesis Using Non-Rigid Structure-from-Motion | René Haas et.al. | 2211.07195 | null |
| 2022-10-13 | Quantifying and analyzing rock trait distributions of rocky fault scarps using a deep learning approach | Zhiang Chen et.al. | 2210.07349 | null |
| 2022-10-11 | DeepMLE: A Robust Deep Maximum Likelihood Estimator for Two-view Structure from Motion | Yuxi Xiao et.al. | 2210.05517 | null |
| 2022-10-07 | Leveraging Structure from Motion to Localize Inaccessible Bus Stops | Indu Panigrahi et.al. | 2210.03646 | link |
| 2022-10-01 | Structure-Aware NeRF without Posed Camera via Epipolar Constraint | Shu Chen et.al. | 2210.00183 | link |
| 2022-10-05 | FAST-LIO, Then Bayesian ICP, Then GTSFM | Jerred Chen et.al. | 2210.00146 | null |
| 2022-09-20 | BuFF: Burst Feature Finder for Light-Constrained 3D Reconstruction | Ahalya Ravendran et.al. | 2209.09470 | null |
| 2022-09-19 | A Hybrid Cable-Driven Robot for Non-Destructive Leafy Plant Monitoring and Mass Estimation using Structure from Motion | Gerry Chen et.al. | 2209.08690 | null |
| 2022-09-14 | End-to-End Multi-View Structure-from-Motion with Hypercorrelation Volumes | Qiao Chen et.al. | 2209.06926 | null |
| 2022-09-07 | Deployment of Aerial Robots during the Flood Disaster in Erftstadt / Blessem in July 2021 | Hartmut Surmann et.al. | 2209.03084 | null |
| 2022-08-27 | Weakly and Semi-Supervised Detection, Segmentation and Tracking of Table Grapes with Limited and Noisy Data | Thomas A. Ciarfuglia et.al. | 2208.13001 | null |
| 2022-08-12 | Handling Constrained Optimization in Factor Graphs for Autonomous Navigation | Barbara Bazzana et.al. | 2208.06325 | null |
| 2022-08-04 | Globally Consistent Video Depth and Pose Estimation with Efficient Test-Time Training | Yao-Chih Lee et.al. | 2208.02709 | link |
| 2022-07-31 | One Object at a Time: Accurate and Robust Structure From Motion for Robots | Aravind Battaje et.al. | 2208.00487 | null |
| 2022-07-23 | Detection and Initial Assessment of Lunar Landing Sites Using Neural Networks | Daniel Posada et.al. | 2207.11413 | null |
| 2022-07-25 | MeshLoc: Mesh-Based Visual Localization | Vojtech Panek et.al. | 2207.10762 | link |
| 2022-07-19 | ParticleSfM: Exploiting Dense Point Trajectories for Localizing Moving Cameras in the Wild | Wang Zhao et.al. | 2207.09137 | link |
| 2022-07-16 | Organic Priors in Non-Rigid Structure from Motion | Suryansh Kumar et.al. | 2207.06262 | null |
| 2022-07-06 | A Novel Hybrid Endoscopic Dataset for Evaluating Machine Learning-based Photometric Image Enhancement Models | Axel Garcia-Vega et.al. | 2207.02396 | null |
| 2022-06-24 | Parallel Structure from Motion for UAV Images via Weighted Connected Dominating Set | San Jiang et.al. | 2206.11499 | null |
| 2022-06-13 | TC-SfM: Robust Track-Community-Based Structure-from-Motion | Lei Wang et.al. | 2206.05866 | null |
| 2022-06-10 | EigenFairing: 3D Model Fairing using Image Coherence | Pragyana Mishra et.al. | 2206.05309 | null |
| 2022-06-01 | Semantic Room Wireframe Detection from a Single View | David Gillsjö et.al. | 2206.00491 | link |
| 2022-05-31 | Geo-Neus: Geometry-Consistent Neural Implicit Surfaces Learning for Multi-view Reconstruction | Qiancheng Fu et.al. | 2205.15848 | null |
| 2022-05-09 | Is my Depth Ground-Truth Good Enough? HAMMER -- Highly Accurate Multi-Modal Dataset for DEnse 3D Scene Regression | HyunJun Jung et.al. | 2205.04565 | null |
| 2022-05-07 | Optimizing Terrain Mapping and Landing Site Detection for Autonomous UAVs | Pedro F. Proença et.al. | 2205.03522 | null |
| 2022-05-06 | EVIMO2: An Event Camera Dataset for Motion Segmentation, Optical Flow, Structure from Motion, and Visual Inertial Odometry in Indoor Scenes with Monocular or Stereo Algorithms | Levi Burner et.al. | 2205.03467 | null |
| 2022-04-20 | Learned Monocular Depth Priors in Visual-Inertial Initialization | Yunwen Zhou et.al. | 2204.09171 | null |
| 2022-04-10 | Deep Non-rigid Structure-from-Motion: A Sequence-to-Sequence Translation Perspective | Hui Deng et.al. | 2204.04730 | null |
| 2022-04-08 | Constrained Bundle Adjustment for Structure From Motion Using Uncalibrated Multi-Camera Systems | Debao Huang et.al. | 2204.04145 | null |
| 2022-04-07 | SurroundDepth: Entangling Surrounding Views for Self-Supervised Multi-Camera Depth Estimation | Yi Wei et.al. | 2204.03636 | link |
| 2022-04-06 | Georeferencing of Photovoltaic Modules from Aerial Infrared Videos using Structure-from-Motion | Lukas Bommes et.al. | 2204.02733 | link |
| 2022-04-05 | Depth-Guided Sparse Structure-from-Motion for Movies and TV Shows | Sheng Liu et.al. | 2204.02509 | link |
| 2022-03-31 | Fast, Accurate and Memory-Efficient Partial Permutation Synchronization | Shaohan Li et.al. | 2203.16505 | null |
| 2022-03-28 | Visual Odometry for RGB-D Cameras | Afonso Fontes et.al. | 2203.15119 | null |
| 2022-03-28 | Optimizing Elimination Templates by Greedy Parameter Search | Evgeniy Martyushev et.al. | 2203.14901 | link |
| 2022-03-23 | Event-Based Dense Reconstruction Pipeline | Kun Xiao et.al. | 2203.12270 | null |
| 2022-03-21 | DiffPoseNet: Direct Differentiable Camera Pose Estimation | Chethan M. Parameshwara et.al. | 2203.11174 | null |
| 2022-03-02 | Asynchronous Optimisation for Event-based Visual Odometry | Daqi Liu et.al. | 2203.01037 | null |
| 2022-03-02 | Distributed Riemannian Optimization with Lazy Communication for Collaborative Geometric Estimation | Yulun Tian et.al. | 2203.00851 | null |
| 2022-02-18 | MultiRes-NetVLAD: Augmenting Place Recognition Training with Low-Resolution Imagery | Ahmad Khaliq et.al. | 2202.09146 | link |
| 2022-01-20 | GeoFill: Reference-Based Image Inpainting of Scenes with Complex Geometry | Yunhan Zhao et.al. | 2201.08131 | null |
| 2022-01-13 | Scalable Cluster-Consistency Statistics for Robust Multi-Object Matching | Yunpeng Shi et.al. | 2201.04797 | link |
| 2022-01-10 | High-resolution Ecosystem Mapping in Repetitive Environments Using Dual Camera SLAM | Brian M. Hopkinson et.al. | 2201.03364 | link |
| 2022-01-06 | De-rendering 3D Objects in the Wild | Felix Wimbauer et.al. | 2201.02279 | link |
| 2021-12-29 | On the Instability of Relative Pose Estimation and RANSAC's Role | Hongyi Fan et.al. | 2112.14651 | null |
| 2021-12-16 | Road-aware Monocular Structure from Motion and Homography Estimation | Wei Sui et.al. | 2112.08635 | null |
| 2021-12-10 | Critical configurations for three projective views | Martin Bråtelund et.al. | 2112.05478 | null |
| 2021-12-09 | Critical configurations for two projective views, a new approach | Martin Bråtelund et.al. | 2112.05074 | null |
| 2021-12-06 | Dense Depth Priors for Neural Radiance Fields from Sparse Input Views | Barbara Roessle et.al. | 2112.03288 | link |
| 2021-12-10 | MegBA: A High-Performance and Distributed Library for Large-Scale Bundle Adjustment | Jie Ren et.al. | 2112.01349 | link |
| 2021-11-11 | Multi-Resolution Elevation Mapping and Safe Landing Site Detection with Applications to Planetary Rotorcraft | Pascal Schoppmann et.al. | 2111.06271 | null |
| 2021-11-10 | Damage Estimation and Localization from Sparse Aerial Imagery | Rene Garcia Franceschini et.al. | 2111.03708 | null |
| 2021-11-03 | Event and Activity Recognition in Video Surveillance for Cyber-Physical Systems | Swarnabja Bhaumik et.al. | 2111.02064 | null |
| 2021-10-14 | Modeling dynamic target deformation in camera calibration | Annika Hagemann et.al. | 2110.07322 | null |
| 2021-10-13 | Hyperspectral 3D Mapping of Underwater Environments | Maxime Ferrera et.al. | 2110.06571 | null |
| 2021-09-24 | Automatic Map Update Using Dashcam Videos | Aziza Zhanabatyrova et.al. | 2109.12131 | null |
| 2021-09-16 | Rotation Averaging in a Split Second: A Primal-Dual Method and a Closed-Form for Cycle Graphs | Gabriel Moreira et.al. | 2109.08046 | link |
| 2021-09-06 | Single-Camera 3D Head Fitting for Mixed Reality Clinical Applications | Tejas Mane et.al. | 2109.02740 | null |
| 2021-09-02 | Dynamic Scene Novel View Synthesis via Deferred Spatio-temporal Consistency | Beatrix-Emőke Fülöp-Balogh et.al. | 2109.01018 | null |
| 2021-09-01 | On the Limits of Pseudo Ground Truth in Visual Camera Re-localisation | Eric Brachmann et.al. | 2109.00524 | link |
| 2021-08-31 | DensePose 3D: Lifting Canonical Surface Maps of Articulated Objects to the Third Dimension | Roman Shapovalov et.al. | 2109.00033 | null |
| 2021-08-29 | Solving Viewing Graph Optimization for Simultaneous Position and Rotation Registration | Seyed-Mahdi Nasiri et.al. | 2108.12876 | null |
| 2021-08-23 | Burst Imaging for Light-Constrained Structure-From-Motion | Ahalya Ravendran et.al. | 2108.09895 | null |
| Publish Date | Title | Authors | Code | |
|---|---|---|---|---|
| 2025-12-04 | ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning | Shengyuan Ding et.al. | 2512.05111 | null |
| 2025-12-04 | Visual Reasoning Tracer: Object-Level Grounded Reasoning Benchmark | Haobo Yuan et.al. | 2512.05091 | null |
| 2025-12-04 | Semantic-Guided Two-Stage GAN for Face Inpainting with Hybrid Perceptual Encoding | Abhigyan Bhattacharya et.al. | 2512.05039 | null |
| 2025-12-04 | Revealing stimulus-dependent dynamics through statistical complexity | Edson V. de Paula et.al. | 2512.05007 | null |
| 2025-12-04 | Influence of Object Affordance on Action Language Understanding: Evidence from Dynamic Causal Modeling Analysis | Supriya Bordoloi et.al. | 2512.04989 | null |
| 2025-12-04 | LiteVGGT: Boosting Vanilla VGGT via Geometry-aware Cached Token Merging | Zhijian Shu et.al. | 2512.04939 | null |
| 2025-12-04 | Terahertz Fourier Ptychographic Imaging | Pitambar Mukherjee et.al. | 2512.04783 | null |
| 2025-12-04 | TEMPO-VINE: A Multi-Temporal Sensor Fusion Dataset for Localization and Mapping in Vineyards | Mauro Martini et.al. | 2512.04772 | null |
| 2025-12-04 | MemLoRA: Distilling Expert Adapters for On-Device Memory Systems | Massimo Bini et.al. | 2512.04763 | null |
| 2025-12-04 | Spectral micro-CT for quantitative analysis of calcification in fibrocartilage | Vittoria Mazzini et.al. | 2512.04662 | null |
| 2025-11-26 | Attention-Guided Patch-Wise Sparse Adversarial Attacks on Vision-Language-Action Models | Naifu Zhang et.al. | 2511.21663 | null |
| 2025-11-26 | Fast 3D Ultrasound Localization Microscopy via Projection-based Processing Framework | Jingke Zhang et.al. | 2511.21647 | null |
| 2025-11-26 | Qwen3-VL Technical Report | Shuai Bai et.al. | 2511.21631 | null |
| 2025-11-26 | Harmony: Harmonizing Audio and Video Generation through Cross-Task Synergy | Teng Hu et.al. | 2511.21579 | null |
| 2025-11-26 | FITRep: Attention-Guided Item Representation via MLLMs | Guoxiao Zhang et.al. | 2511.21389 | null |
| 2025-11-26 | Thinking With Bounding Boxes: Enhancing Spatio-Temporal Video Grounding via Reinforcement Fine-Tuning | Xin Gu et.al. | 2511.21375 | null |
| 2025-11-26 | HTTM: Head-wise Temporal Token Merging for Faster VGGT | Weitian Wang et.al. | 2511.21317 | null |
| 2025-11-26 | Low-dose Chemically Specific Bioimaging via Deep-UV Lensless Holographic Microscopy on a Standard Camera | Piotr Arcab et.al. | 2511.21311 | null |
| 2025-11-26 | Adaptive Lighting Control in Visible Light Systems: An Integrated Sensing, Communication, and Illumination Framework | Xinyan Xie et.al. | 2511.21271 | null |
| 2025-11-26 | Towards an Effective Action-Region Tracking Framework for Fine-grained Video Action Recognition | Baoli Sun et.al. | 2511.21202 | null |
| 2025-11-24 | Wigner and Gabor phase-space analysis of propagators for evolution equations | Elena Cordero et.al. | 2511.19400 | null |
| 2025-11-24 | Real-Time Object Tracking with On-Device Deep Learning for Adaptive Beamforming in Dynamic Acoustic Environments | Jorge Ortigoso-Narro et.al. | 2511.19396 | null |
| 2025-11-24 | In-vivo imaging with a low-cost MRI scanner and cloud data processing in low-resource settings | Teresa Guallart-Naval et.al. | 2511.19226 | null |
| 2025-11-24 | Can Modern Vision Models Understand the Difference Between an Object and a Look-alike? | Itay Cohen et.al. | 2511.19200 | null |
| 2025-11-24 | From Pixels to Posts: Retrieval-Augmented Fashion Captioning and Hashtag Generation | Moazzam Umer Gondal et.al. | 2511.19149 | null |
| 2025-11-24 | Graph-based 3D Human Pose Estimation using WiFi Signals | Jichao Chen et.al. | 2511.19105 | null |
| 2025-11-24 | Towards Generalizable Deepfake Detection via Forgery-aware Audio-Visual Adaptation: A Variational Bayesian Approach | Fan Nie et.al. | 2511.19080 | null |
| 2025-11-24 | LAA3D: A Benchmark of Detecting and Tracking Low-Altitude Aircraft in 3D Space | Hai Wu et.al. | 2511.19057 | null |
| 2025-11-24 | Multi-Agent Monocular Dense SLAM With 3D Reconstruction Priors | Haihang Wu et.al. | 2511.19031 | null |
| 2025-11-24 | Dynamic Granularity Matters: Rethinking Vision Transformers Beyond Fixed Patch Splitting | Qiyang Yu et.al. | 2511.19021 | null |
| 2025-11-24 | AuViRe: Audio-visual Speech Representation Reconstruction for Deepfake Temporal Localization | Christos Koutlis et.al. | 2511.18993 | null |
| 2025-11-24 | Zero-shot segmentation of skin tumors in whole-slide images with vision-language foundation models | Santiago Moreno et.al. | 2511.18978 | null |
| 2025-11-24 | MagicWorld: Interactive Geometry-driven Video World Exploration | Guangyuan Li et.al. | 2511.18886 | null |
| 2025-11-24 | SP-VINS: A Hybrid Stereo Visual Inertial Navigation System based on Implicit Environmental Map | Xueyu Du et.al. | 2511.18756 | null |
| 2025-11-24 | Seeing What Matters: Visual Preference Policy Optimization for Visual Generation | Ziqi Ni et.al. | 2511.18719 | null |
| 2025-11-24 | CNN-Based Camera Pose Estimation and Localisation of Scan Images for Aircraft Visual Inspection | Xueyan Oh et.al. | 2511.18702 | null |
| 2025-11-24 | Stable Multi-Drone GNSS Tracking System for Marine Robots | Shuo Wen et.al. | 2511.18694 | null |
| 2025-11-23 | Shape-Adapting Gated Experts: Dynamic Expert Routing for Colonoscopic Lesion Segmentation | Gia Huy Thai et.al. | 2511.18493 | null |
| 2025-11-23 | Gaze Beyond the Frame: Forecasting Egocentric 3D Visual Span | Heeseung Yun et.al. | 2511.18470 | null |
| 2025-11-23 | LungX: A Hybrid EfficientNet-Vision Transformer Architecture with Multi-Scale Attention for Accurate Pneumonia Detection | Mansur Yerzhanuly et.al. | 2511.18425 | null |
| 2025-11-23 | 4D-VGGT: A General Foundation Model with SpatioTemporal Awareness for Dynamic Scene Geometry Estimation | Haonan Wang et.al. | 2511.18416 | null |
| 2025-11-23 | NSTR: Neural Spectral Transport Representation for Space-Varying Frequency Fields | Plein Versace et.al. | 2511.18384 | null |
| 2025-11-23 | Learning Visually Interpretable Oscillator Networks for Soft Continuum Robots from Video | Henrik Krauss et.al. | 2511.18322 | null |
| 2025-11-23 | Table Comprehension in Building Codes using Vision Language Models and Domain-Specific Fine-Tuning | Mohammad Aqib et.al. | 2511.18306 | null |
| 2025-11-23 | AIA-UltraNeRF:Acoustic-Impedance-Aware Neural Radiance Field with Hash Encodings for Robotic Ultrasound Reconstruction and Localization | Shuai Zhang et.al. | 2511.18293 | null |
| 2025-11-23 | SwiftVGGT: A Scalable Visual Geometry Grounded Transformer for Large-Scale Scenes | Jungho Lee et.al. | 2511.18290 | null |
| 2025-11-22 | AFT: Appearance-Based Feature Tracking for Markerless and Training-Free Shape Reconstruction of Soft Robots | Shangyuan Yuan et.al. | 2511.18215 | null |
| 2025-11-22 | ProHD: Projection-Based Hausdorff Distance Approximation | Jiuzhou Fu et.al. | 2511.18207 | null |
| 2025-11-22 | ARIAL: An Agentic Framework for Document VQA with Precise Answer Localization | Ahmad Mohammadshirazi et.al. | 2511.18192 | null |
| 2025-11-22 | Bias Is a Subspace, Not a Coordinate: A Geometric Rethinking of Post-hoc Debiasing in Vision-Language Models | Dachuan Zhao et.al. | 2511.18123 | null |
| 2025-11-22 | PromptMoE: Generalizable Zero-Shot Anomaly Detection via Visually-Guided Prompt Mixtures | Yuheng Shao et.al. | 2511.18116 | null |
| 2025-11-22 | Spotlight: Identifying and Localizing Video Generation Errors Using VLMs | Aditya Chinchure et.al. | 2511.18102 | null |
| 2025-11-22 | VK-Det: Visual Knowledge Guided Prototype Learning for Open-Vocabulary Aerial Object Detection | Jianhang Yao et.al. | 2511.18075 | null |
| 2025-11-22 | HyM-UNet: Synergizing Local Texture and Global Context via Hybrid CNN-Mamba Architecture for Medical Image Segmentation | Haodong Chen et.al. | 2511.17988 | null |
| 2025-11-22 | Signal: Selective Interaction and Global-local Alignment for Multi-Modal Object Re-Identification | Yangyang Liu et.al. | 2511.17965 | null |
| 2025-11-22 | MambaTAD: When State-Space Models Meet Long-Range Temporal Action Detection | Hui Lu et.al. | 2511.17929 | null |
| 2025-11-22 | MGA-VQA: Secure and Interpretable Graph-Augmented Visual Question Answering with Memory-Guided Protection Against Unauthorized Knowledge Use | Ahmad Mohammadshirazi et.al. | 2511.17881 | null |
| 2025-11-21 | AEGIS: Preserving privacy of 3D Facial Avatars with Adversarial Perturbations | Dawid Wolkiewicz et.al. | 2511.17747 | null |
| 2025-11-21 | Dual-Path Knowledge-Augmented Contrastive Alignment Network for Spatially Resolved Transcriptomics | Wei Zhang et.al. | 2511.17685 | null |
| 2025-11-18 | Unified Low-Light Traffic Image Enhancement via Multi-Stage Illumination Recovery and Adaptive Noise Suppression | Siddiqua Namrah et.al. | 2511.17612 | null |
| 2025-11-18 | 3D Ground Truth Reconstruction from Multi-Camera Annotations Using UKF | Linh Van Ma et.al. | 2511.17609 | null |
| 2025-11-21 | REMSA: An LLM Agent for Foundation Model Selection in Remote Sensing | Binger Chen et.al. | 2511.17442 | null |
| 2025-11-21 | IndustryNav: Exploring Spatial Reasoning of Embodied Agents in Dynamic Industrial Navigation | Yifan Li et.al. | 2511.17384 | null |
| 2025-11-21 | SVRecon: Sparse Voxel Rasterization for Surface Reconstruction | Seunghun Oh et.al. | 2511.17364 | null |
| 2025-11-21 | NoPe-NeRF++: Local-to-Global Optimization of NeRF with No Pose Prior | Dongbo Shi et.al. | 2511.17322 | null |
| 2025-11-21 | MolSight: Optical Chemical Structure Recognition with SMILES Pretraining, Multi-Granularity Learning and Reinforcement Learning | Wenrui Zhang et.al. | 2511.17300 | null |
| 2025-11-21 | Where Culture Fades: Revealing the Cultural Gap in Text-to-Image Generation | Chuancheng Shi et.al. | 2511.17282 | null |
| 2025-11-21 | A Little More Like This: Text-to-Image Retrieval with Vision-Language Models Using Relevance Feedback | Bulat Khaertdinov et.al. | 2511.17255 | null |
| 2025-11-21 | Mixed Reality Scenic Live Streaming for Cultural Heritage: Visual Interactions in a Historic Landscape | Zeyu Huang et.al. | 2511.17246 | null |
| 2025-11-21 | SING3R-SLAM: Submap-based Indoor Monocular Gaussian SLAM with 3D Reconstruction Priors | Kunyi Li et.al. | 2511.17207 | null |
| 2025-11-21 | Navigating in the Dark: A Multimodal Framework and Dataset for Nighttime Traffic Sign Recognition | Aditya Mishra et.al. | 2511.17183 | null |
| 2025-11-21 | Reflection-Based Relative Localization for Cooperative UAV Teams Using Active Markers | Tim Lakemann et.al. | 2511.17166 | null |
| 2025-11-21 | Progress-Think: Semantic Progress Reasoning for Vision-Language Navigation | Shuo Wang et.al. | 2511.17097 | null |
| 2025-11-21 | Spanning Tree Autoregressive Visual Generation | Sangkyu Lee et.al. | 2511.17089 | null |
| 2025-11-24 | ReBrain: Brain MRI Reconstruction from Sparse CT Slice via Retrieval-Augmented Diffusion | Junming Liu et.al. | 2511.17068 | null |
| 2025-11-21 | Stable Offline Hand-Eye Calibration for any Robot with Just One Mark | Sicheng Xie et.al. | 2511.17001 | null |
| 2025-11-21 | VLM-Augmented Degradation Modeling for Image Restoration Under Adverse Weather Conditions | Qianyi Shao et.al. | 2511.16998 | null |
| 2025-11-21 | DReX: Pure Vision Fusion of Self-Supervised and Convolutional Representations for Image Complexity Prediction | Jonathan Skaza et.al. | 2511.16991 | null |
| 2025-11-21 | The Finer the Better: Towards Granular-aware Open-set Domain Generalization | Yunyun Wang et.al. | 2511.16979 | null |
| 2025-11-21 | Single-Axis Ptychographic Coherent Diffractive Imaging for Spectroscopic and Wavefront Retrieval | Qijun You et.al. | 2511.16950 | null |
| 2025-11-20 | SAM 3: Segment Anything with Concepts | Nicolas Carion et.al. | 2511.16719 | null |
| 2025-11-24 | PairHuman: A High-Fidelity Photographic Dataset for Customized Dual-Person Generation | Ting Pan et.al. | 2511.16712 | null |
| 2025-11-20 | Thinking-while-Generating: Interleaving Textual Reasoning throughout Visual Generation | Ziyu Guo et.al. | 2511.16671 | null |
| 2025-11-23 | Comparison of Text-Based and Image-Based Retrieval in Multimodal Retrieval Augmented Generation Large Language Model Systems | Elias Lumer et.al. | 2511.16654 | null |
| 2025-11-20 | SurvAgent: Hierarchical CoT-Enhanced Case Banking and Dichotomy-Based Multi-Agent System for Multimodal Survival Prediction | Guolin Huang et.al. | 2511.16635 | null |
| 2025-11-21 | POMA-3D: The Point Map Way to 3D Scene Understanding | Ye Mao et.al. | 2511.16567 | null |
| 2025-11-20 | NutriScreener: Retrieval-Augmented Multi-Pose Graph Attention Network for Malnourishment Screening | Misaal Khan et.al. | 2511.16566 | null |
| 2025-11-20 | Contrastive vision-language learning with paraphrasing and negation | Kwun Ho Ngan et.al. | 2511.16527 | null |
| 2025-11-20 | BoxingVI: A Multi-Modal Benchmark for Boxing Action Recognition and Localization | Rahul Kumar et.al. | 2511.16524 | null |
| 2025-11-20 | YOWO: You Only Walk Once to Jointly Map An Indoor Scene and Register Ceiling-mounted Cameras | Fan Yang et.al. | 2511.16521 | null |
| 2025-11-20 | TOFA: Training-Free One-Shot Federated Adaptation for Vision-Language Models | Li Zhang et.al. | 2511.16423 | null |
| 2025-11-20 | CRISTAL: Real-time Camera Registration in Static LiDAR Scans using Neural Rendering | Joni Vanherck et.al. | 2511.16349 | null |
| 2025-11-20 | Real-Time Inference for Distributed Multimodal Systems under Communication Delay Uncertainty | Victor Croisfelt et.al. | 2511.16225 | null |
| 2025-11-20 | Rad-GS: Radar-Vision Integration for 3D Gaussian Splatting SLAM in Outdoor Environments | Renxiang Xiao et.al. | 2511.16091 | null |
| 2025-11-20 | AMS-KV: Adaptive KV Caching in Multi-Scale Visual Autoregressive Transformers | Boxun Xu et.al. | 2511.16047 | null |
| 2025-11-19 | EfficientSAM3: Progressive Hierarchical Distillation for Video Concept Segmentation from SAM1, 2, and 3 | Chengxi Zeng et.al. | 2511.15833 | null |
| 2025-11-19 | IMACT-CXR - An Interactive Multi-Agent Conversational Tutoring System for Chest X-Ray Interpretation | Tuan-Anh Le et.al. | 2511.15825 | null |
| 2025-11-19 | Multidimensional scaling of two-mode three-way asymmetric dissimilarities: finding archetypal profiles and clustering | Aleix Alcacer et.al. | 2511.15813 | null |
| 2025-11-19 | GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization | Yikun Wang et.al. | 2511.15705 | null |
| 2025-11-19 | First Frame Is the Place to Go for Video Content Customization | Jingxi Chen et.al. | 2511.15700 | null |
| 2025-11-19 | Hierarchical Semantic Tree Anchoring for CLIP-Based Class-Incremental Learning | Tao Hu et.al. | 2511.15633 | null |
| 2025-11-19 | Multi-Text Guided Few-Shot Semantic Segmentation | Qiang Jiao et.al. | 2511.15515 | null |
| 2025-11-19 | SIGMMA: Hierarchical Graph-Based Multi-Scale Multi-modal Contrastive Alignment of Histopathology Image and Spatial Transcriptome | Dabin Jeong et.al. | 2511.15464 | null |
| 2025-11-19 | HV-Attack: Hierarchical Visual Attack for Multimodal Retrieval Augmented Generation | Linyin Luo et.al. | 2511.15435 | null |
| 2025-11-19 | The Empowerment of Science of Science by Large Language Models: New Tools and Methods | Guoqiang Liang et.al. | 2511.15370 | null |
| 2025-11-19 | C2F-Space: Coarse-to-Fine Space Grounding for Spatial Instructions using Vision-Language Models | Nayoung Oh et.al. | 2511.15333 | null |
| 2025-11-19 | Towards Unbiased Cross-Modal Representation Learning for Food Image-to-Recipe Retrieval | Qing Wang et.al. | 2511.15201 | null |
| 2025-11-19 | Unbiased Semantic Decoding with Vision Foundation Models for Few-shot Segmentation | Jin Wang et.al. | 2511.15118 | null |
| 2025-11-19 | BBox DocVQA: A Large Scale Bounding Box Grounded Dataset for Enhancing Reasoning in Document Visual Question Answer | Wenhan Yu et.al. | 2511.15090 | null |
| 2025-11-18 | FarSLIP: Discovering Effective CLIP Adaptation for Fine-Grained Remote Sensing Understanding | Zhenshi Li et.al. | 2511.14901 | null |
| 2025-11-18 | Quantum Transport Spectroscopy of Pseudomagnetic Field in Graphene | Divya Sahani et.al. | 2511.14888 | null |
| 2025-09-16 | Image-Seeking Intent Prediction for Cross-Device Product Search | Mariya Hendriksen et.al. | 2511.14764 | null |
| 2025-11-18 | FreeSwim: Revisiting Sliding-Window Attention Mechanisms for Training-Free Ultra-High-Resolution Video Generation | Yunfeng Wu et.al. | 2511.14712 | null |
| 2025-11-18 | Overcoming global sensitivity limitations: using active subspaces to explore discrepancies between global and local parameter sensitivities | Huiyan Zou et.al. | 2511.14687 | null |
| 2025-11-18 | A Specialized Large Language Model for Clinical Reasoning and Diagnosis in Rare Diseases | Tao Yang et.al. | 2511.14638 | null |
| 2025-11-18 | Mind the Gaps: Measuring Visual Artifacts in Dimensionality Reduction | Jaume Ros et.al. | 2511.14544 | null |
| 2025-11-18 | D-PerceptCT: Deep Perceptual Enhancement for Low-Dose CT Images | Taifour Yousra Nabila et.al. | 2511.14518 | null |
| 2025-11-18 | Aerial Assistance System for Automated Firefighting during Turntable Ladder Operations | Jan Quenzel et.al. | 2511.14504 | null |
| 2025-11-18 | DIR-TIR: Dialog-Iterative Refinement for Text-to-Image Retrieval | Zongwei Zhen et.al. | 2511.14449 | null |
| 2025-11-18 | Agentic Video Intelligence: A Flexible Framework for Advanced Video Exploration and Understanding | Hong Gao et.al. | 2511.14446 | null |
| 2025-11-19 | Cheating Stereo Matching in Full-scale: Physical Adversarial Attack against Binocular Depth Estimation in Autonomous Driving | Kangqiao Zhao et.al. | 2511.14386 | null |
| 2025-11-18 | O3SLM: Open Weight, Open Data, and Open Vocabulary Sketch-Language Model | Rishi Gupta et.al. | 2511.14368 | null |
| 2025-11-23 | Simultaneous Localization and 3D-Semi Dense Mapping for Micro Drones Using Monocular Camera and Inertial Sensors | Jeryes Danial et.al. | 2511.14335 | null |
| 2025-11-18 | Statistically controllable microstructure reconstruction framework for heterogeneous materials using sliced-Wasserstein metric and neural networks | Zhenchuan Ma et.al. | 2511.14268 | null |
| 2025-11-18 | EBind: a practical approach to space binding | Jim Broadbent et.al. | 2511.14229 | null |
| 2025-11-18 | LLM-Aligned Geographic Item Tokenization for Local-Life Recommendation | Hao Jiang et.al. | 2511.14221 | null |
| 2025-11-19 | Orion: A Unified Visual Agent for Multimodal Perception, Advanced Visual Reasoning and Execution | N Dinesh Reddy et.al. | 2511.14210 | null |
| 2025-11-19 | PAVE: An End-to-End Dataset for Production Autonomous Vehicle Evaluation | Xiangyu Li et.al. | 2511.14185 | null |
| 2025-11-18 | SMART: Shot-Aware Multimodal Video Moment Retrieval with Audio-Enhanced MLLM | An Yu et.al. | 2511.14143 | null |
| 2025-11-18 | $A^2$GC: $A$symmetric |
Zhenyu Li et.al. | 2511.14109 | null |
| 2025-11-18 | SMGeo: Cross-View Object Geo-Localization with Grid-Level Mixture-of-Experts | Fan Zhang et.al. | 2511.14093 | null |
| 2025-11-18 | HiEAG: Evidence-Augmented Generation for Out-of-Context Misinformation Detection | Junjie Wu et.al. | 2511.14027 | null |
| 2025-11-17 | EchoAgent: Guideline-Centric Reasoning Agent for Echocardiography Measurement and Interpretation | Matin Daghyani et.al. | 2511.13948 | null |
| 2025-11-17 | Start Small, Think Big: Curriculum-based Relative Policy Optimization for Visual Grounding | Qingyang Yan et.al. | 2511.13924 | null |
| 2025-11-17 | GRLoc: Geometric Representation Regression for Visual Localization | Changyang Li et.al. | 2511.13864 | null |
| 2025-11-17 | Hierarchical Prompt Learning for Image- and Text-Based Person Re-Identification | Linhan Zhou et.al. | 2511.13575 | null |
| 2025-11-17 | Language-Guided Invariance Probing of Vision-Language Models | Jae Joong Lee et.al. | 2511.13494 | null |
| 2025-11-17 | Attention Grounded Enhancement for Visual Document Retrieval | Wanqing Cui et.al. | 2511.13415 | null |
| 2025-11-17 | Stray Light Correction for the Helioseismic and Magnetic Imager | A. A. Norton et.al. | 2511.13348 | null |
| 2025-11-17 | Uncovering and Mitigating Transient Blindness in Multimodal Model Editing | Xiaoqi Han et.al. | 2511.13243 | null |
| 2025-11-17 | GaRLILEO: Gravity-aligned Radar-Leg-Inertial Enhanced Odometry | Chiyun Noh et.al. | 2511.13216 | null |
| 2025-11-17 | Large Language Models Meet Extreme Multi-label Classification: Scaling and Multi-modal Framework | Diego Ortego et.al. | 2511.13189 | null |
| 2025-11-17 | THIR: Topological Histopathological Image Retrieval | Zahra Tabatabaei et.al. | 2511.13170 | null |
| 2025-11-17 | SOMA: Feature Gradient Enhanced Affine-Flow Matching for SAR-Optical Registration | Haodong Wang et.al. | 2511.13168 | null |
| 2025-11-17 | MM-Telco: Benchmarks and Multimodal Large Language Models for Telecom Applications | Gagan Raj Gupta et.al. | 2511.13131 | null |
| 2025-11-17 | Region-Point Joint Representation for Effective Trajectory Similarity Learning | Hao Long et.al. | 2511.13125 | null |
| 2025-11-17 | Semantics and Content Matter: Towards Multi-Prior Hierarchical Mamba for Image Deraining | Zhaocheng Yu et.al. | 2511.13113 | null |
| 2025-11-17 | uCLIP: Parameter-Efficient Multilingual Extension of Vision-Language Models with Unpaired Data | Dahyun Chung et.al. | 2511.13036 | null |
| 2025-11-17 | Angular Gradient Sign Method: Uncovering Vulnerabilities in Hyperbolic Networks | Minsoo Jo et.al. | 2511.12985 | null |
| 2025-11-17 | MCAQ-YOLO: Morphological Complexity-Aware Quantization for Efficient Object Detection with Curriculum Learning | Yoonjae Seo et.al. | 2511.12976 | null |
| 2025-11-16 | Enhancing Neuro-Oncology Through Self-Assessing Deep Learning Models for Brain Tumor Unified Model for MRI Segmentation | Andrew Zhou et.al. | 2511.12801 | null |
| 2025-11-16 | Predicting upcoming visual features during eye movements yields scene representations aligned with human visual cortex | Sushrut Thorat et.al. | 2511.12715 | null |
| 2025-11-16 | FLClear: Visually Verifiable Multi-Client Watermarking for Federated Learning | Chen Gu et.al. | 2511.12663 | null |
| 2025-11-16 | D |
Zheyuan Zhang et.al. | 2511.12528 | null |
| 2025-11-16 | Visible Structure Retrieval for Lightweight Image-Based Relocalisation | Fereidoon Zangeneh et.al. | 2511.12503 | null |
| 2025-11-16 | CoTBox-TTT: Grounding Medical VQA with Visual Chain-of-Thought Boxes During Test-time Training | Jiahe Qian et.al. | 2511.12446 | null |
| 2025-11-15 | Calibrated Decomposition of Aleatoric and Epistemic Uncertainty in Deep Features for Inference-Time Adaptation | Divake Kumar et.al. | 2511.12389 | null |
| 2025-11-15 | SpaceVLM: Sub-Space Modeling of Negation in Vision-Language Models | Sepehr Kazemi Ranjbar et.al. | 2511.12331 | null |
| 2025-11-15 | A Disease-Aware Dual-Stage Framework for Chest X-ray Report Generation | Puzhen Wu et.al. | 2511.12259 | null |
| 2025-11-21 | Model Inversion Attack Against Deep Hashing | Dongdong Zhao et.al. | 2511.12233 | null |
| 2025-11-15 | FaNe: Towards Fine-Grained Cross-Modal Contrast with False-Negative Reduction and Text-Conditioned Sparse Attention | Peng Zhang et.al. | 2511.12215 | null |
| 2025-11-18 | OmniSparse: Training-Aware Fine-Grained Sparse Attention for Long-Video MLLMs | Feng Chen et.al. | 2511.12201 | null |
| 2025-11-15 | MAVIS: A Benchmark for Multimodal Source Attribution in Long-form Visual Question Answering | Seokwon Song et.al. | 2511.12142 | null |
| 2025-11-15 | Look As You Think: Unifying Reasoning and Visual Evidence Attribution for Verifiable Document RAG via Reinforcement Learning | Shuochen Liu et.al. | 2511.12003 | null |
| 2025-11-21 | Seeing the Forest and the Trees: Query-Aware Tokenizer for Long-Video Multimodal Language Models | Siyou Li et.al. | 2511.11910 | null |
| 2025-11-14 | TopoPerception: A Shortcut-Free Evaluation of Global Visual Perception in Large Vision-Language Models | Wenhao Zhou et.al. | 2511.11831 | null |
| 2025-11-14 | Lessons Learned from Developing a Privacy-Preserving Multimodal Wearable for Local Voice-and-Vision Inference | Yonatan Tussa et.al. | 2511.11811 | null |
| 2025-11-12 | Task-Aware 3D Affordance Segmentation via 2D Guidance and Geometric Refinement | Lian He et.al. | 2511.11702 | null |
| 2025-11-12 | Doubly Debiased Test-Time Prompt Tuning for Vision-Language Models | Fei Song et.al. | 2511.11690 | null |
| 2025-11-10 | A Deep Learning Model to Predicting Changes in Consumer Attributes for New Line-extended Products | Li Yinxing et.al. | 2511.11646 | null |
| 2025-11-14 | DocLens : A Tool-Augmented Multi-Agent Framework for Long Visual Document Understanding | Dawei Zhu et.al. | 2511.11552 | null |
| 2025-11-14 | STEM EBIC as a Quantitative Probe of Semiconductor Devices | Sebastian Schneider et.al. | 2511.11528 | null |
| 2025-11-14 | Bridging Hidden States in Vision-Language Models | Benjamin Fein-Ashley et.al. | 2511.11526 | null |
| 2025-11-14 | Comprehension of Multilingual Expressions Referring to Target Objects in Visual Inputs | Francisco Nogueira et.al. | 2511.11427 | null |
| 2025-11-14 | Shrinking the Teacher: An Adaptive Teaching Paradigm for Asymmetric EEG-Vision Alignment | Lukun Wu et.al. | 2511.11422 | null |
| 2025-11-14 | Bidimensional measurements of photon statistics within a multimodal temporal framework | C. Hainaut et.al. | 2511.11403 | null |
| 2025-11-18 | GRANITE: High-Resolution Imaging and Electrical Qualification of Large-Area TPC Electrodes | Shumit A. Mitra et.al. | 2511.11401 | null |
| 2025-11-14 | StochEP: Stochastic Equilibrium Propagation for Spiking Convergent Recurrent Neural Networks | Jiaqi Lin et.al. | 2511.11320 | null |
| 2025-11-21 | DocSLM: A Small Vision-Language Model for Long Multimodal Document Understanding | Tanveer Hannan et.al. | 2511.11313 | null |
| 2025-11-18 | MOON Embedding: Multimodal Representation Learning for E-commerce Search Advertising | Chenghan Fu et.al. | 2511.11305 | null |
| 2025-11-14 | 3D Stokes polarimetric imaging at nanoscales | Isael Herrera et.al. | 2511.11222 | null |
| 2025-11-14 | Positional Bias in Multimodal Embedding Models: Do They Favor the Beginning, the Middle, or the End? | Kebin Wu et.al. | 2511.11216 | null |
| 2025-11-21 | Draft and Refine with Visual Experts | Sungheon Jeong et.al. | 2511.11005 | null |
| 2025-11-14 | ERMoE: Eigen-Reparameterized Mixture-of-Experts for Stable Routing and Interpretable Specialization | Anzhe Cheng et.al. | 2511.10971 | null |
| 2025-11-13 | From Attention to Frequency: Integration of Vision Transformer and FFT-ReLU for Enhanced Image Deblurring | Syed Mumtahin Mahmud et.al. | 2511.10806 | null |
| 2025-11-13 | Semantic Property Maps for Driving Applications | Marcus Greiff et.al. | 2511.10798 | null |
| 2025-11-13 | Fast Data Attribution for Text-to-Image Models | Sheng-Yu Wang et.al. | 2511.10721 | null |
| 2025-11-18 | CARScenes: Semantic VLM Dataset for Safe Autonomous Driving | Yuankai He et.al. | 2511.10701 | null |
| 2025-11-12 | DualVision ArthroNav: Investigating Opportunities to Enhance Localization and Reconstruction in Image-based Arthroscopy Navigation via External Cameras | Hongchao Shu et.al. | 2511.10699 | null |
| 2025-11-12 | Dong Liu et.al. | 2511.10696 | null | |
| 2025-11-13 | Mined Prompting and Metadata-Guided Generation for Wound Care Visual Question Answering | Bavana Durgapraveen et.al. | 2511.10591 | null |
| 2025-11-13 | SemanticVLA: Semantic-Aligned Sparsification and Enhancement for Efficient Robotic Manipulation | Wei Li et.al. | 2511.10518 | null |
| 2025-11-13 | Domain Adaptation for Camera-Specific Image Characteristics using Shallow Discriminators | Maximiliane Gruber et.al. | 2511.10424 | null |
| 2025-11-16 | MonkeyOCR v1.5 Technical Report: Unlocking Robust Document Parsing for Complex Patterns | Jiarui Zhang et.al. | 2511.10390 | null |
| 2025-11-17 | Physics informed Transformer-VAE for biophysical parameter estimation: PROSAIL model inversion in Sentinel-2 imagery | Prince Mensah et.al. | 2511.10387 | null |
| 2025-11-13 | Rethinking Visual Information Processing in Multimodal LLMs | Dongwan Kim et.al. | 2511.10301 | null |
| 2025-11-13 | H3Former: Hypergraph-based Semantic-Aware Aggregation via Hyperbolic Hierarchical Contrastive Loss for Fine-Grained Visual Classification | Yongji Zhang et.al. | 2511.10260 | null |
| 2025-11-20 | TubeRMC: Tube-conditioned Reconstruction with Mutual Constraints for Weakly-supervised Spatio-Temporal Video Grounding | Jinxuan Li et.al. | 2511.10241 | null |
| 2025-11-13 | Next-Frame Feature Prediction for Multimodal Deepfake Detection and Temporal Localization | Ashutosh Anshul et.al. | 2511.10212 | null |
| 2025-11-13 | Beyond the Black Box: Demystifying Multi-Turn LLM Reasoning with VISTA | Yiran Zhang et.al. | 2511.10182 | null |
| 2025-11-13 | GEA: Generation-Enhanced Alignment for Text-to-Image Person Retrieval | Hao Zou et.al. | 2511.10154 | null |
| 2025-11-13 | Explicit Temporal-Semantic Modeling for Dense Video Captioning via Context-Aware Cross-Modal Interaction | Mingda Jia et.al. | 2511.10134 | null |
| 2025-11-13 | GridPrune: From "Where to Look" to "What to Select" in Visual Token Pruning for MLLMs | Yuxiang Duan et.al. | 2511.10081 | null |
| 2025-11-13 | Radiology Workflow-Guided Hierarchical Reinforcement Fine-Tuning for Medical Report Generation | Bodong Du et.al. | 2511.10065 | null |
| 2025-11-13 | Trapped by Their Own Light: Deployable and Stealth Retroreflective Patch Attacks on Traffic Sign Recognition Systems | Go Tsuruoka et.al. | 2511.10050 | null |
| 2025-11-13 | AffordBot: 3D Fine-grained Embodied Reasoning via Multimodal Large Language Models | Xinyi Wang et.al. | 2511.10017 | null |
| 2025-11-13 | Learning phase diversity for solving ill-posed inverse problems in imaging | Jasleen Birdi et.al. | 2511.09952 | null |
| 2025-11-13 | MosaicDoc: A Large-Scale Bilingual Benchmark for Visually Rich Document Understanding | Ketong Chen et.al. | 2511.09919 | null |
| 2025-11-12 | From Street to Orbit: Training-Free Cross-View Retrieval via Location Semantics and LLM Guidance | Jeongho Min et.al. | 2511.09820 | null |
| 2025-11-12 | PALMS+: Modular Image-Based Floor Plan Localization Leveraging Depth Foundation Model | Yunqian Cheng et.al. | 2511.09724 | null |
| 2025-11-12 | SliderEdit: Continuous Image Editing with Fine-Grained Instruction Control | Arman Zarei et.al. | 2511.09715 | null |
| 2025-11-12 | IFG: Internet-Scale Guidance for Functional Grasping Generation | Ray Muxin Liu et.al. | 2511.09558 | null |
| 2025-11-12 | Warped Disk Galaxies: Statistical Properties from DESI Legacy Imaging Surveys DR8 | Yiheng Wang et.al. | 2511.09518 | null |
| 2025-11-12 | A general framework for adaptive nonparametric dimensionality reduction | Antonio Di Noia et.al. | 2511.09486 | null |
| 2025-11-12 | BronchOpt : Vision-Based Pose Optimization with Fine-Tuned Foundation Models for Accurate Bronchoscopy Navigation | Hongchao Shu et.al. | 2511.09443 | null |
| 2025-11-12 | NeuroCLIP: Brain-Inspired Prompt Tuning for EEG-to-Image Multimodal Contrastive Learning | Jiyuan Wang et.al. | 2511.09250 | null |
| 2025-11-12 | SMF-VO: Direct Ego-Motion Estimation via Sparse Motion Fields | Sangheon Yang et.al. | 2511.09072 | null |
| 2025-11-12 | ROI-based Deep Image Compression with Implicit Bit Allocation | Kai Hu et.al. | 2511.08918 | null |
| 2025-11-12 | Negative Entity Suppression for Zero-Shot Captioning with Synthetic Images | Zimao Lu et.al. | 2511.08909 | null |
| 2025-11-13 | LLM-Guided Probabilistic Fusion for Label-Efficient Document Layout Analysis | Ibne Farabi Shihab et.al. | 2511.08903 | null |
| 2025-11-11 | SIFT-Graph: Benchmarking Multimodal Defense Against Image Adversarial Attacks With Robust Feature Graph | Jingjie He et.al. | 2511.08810 | null |
| 2025-11-11 | Decoupling Composition and Band Gap in |
Annett Thøgersen et.al. | 2511.08728 | null |
| 2025-11-11 | Spatio-Temporal Cluster-Triggered Encoding for Spiking Neural Networks | Lingyun Ke et.al. | 2511.08469 | null |
| 2025-11-11 | Isolated massive star candidates in NGC 4242 with GULP | Pietro Facchini et.al. | 2511.08447 | null |
| 2025-11-11 | Text-based Aerial-Ground Person Retrieval | Xinyu Zhou et.al. | 2511.08369 | null |
| 2025-11-11 | VLMDiff: Leveraging Vision-Language Models for Multi-Class Anomaly Detection with Diffusion | Samet Hicsonmez et.al. | 2511.08173 | null |
| 2025-11-11 | Multi-Granularity Mutual Refinement Network for Zero-Shot Learning | Ning Wang et.al. | 2511.08163 | null |
| 2025-11-11 | Direction and speed selectivity properties for spatio-temporal receptive fields according to the generalized Gaussian derivative model for visual receptive fields | Tony Lindeberg et.al. | 2511.08101 | null |
| 2025-11-11 | Multi-modal Deepfake Detection and Localization with FPN-Transformer | Chende Zheng et.al. | 2511.08031 | null |
| 2025-11-12 | EAGLE: Episodic Appearance- and Geometry-aware Memory for Unified 2D-3D Visual Query Localization in Egocentric Vision | Yifei Cao et.al. | 2511.08007 | null |
| 2025-11-11 | Towards Fine-Grained Interpretability: Counterfactual Explanations for Misclassification with Saliency Partition | Lintong Zhang et.al. | 2511.07974 | null |
| 2025-11-11 | Exploring the Underwater World Segmentation without Extra Training | Bingyu Li et.al. | 2511.07923 | null |
| 2025-11-11 | Visual Bridge: Universal Visual Perception Representations Generating | Yilin Gao et.al. | 2511.07877 | null |
| 2025-11-11 | MonoCLUE : Object-Aware Clustering Enhances Monocular 3D Object Detection | Sunghun Yang et.al. | 2511.07862 | null |
| 2025-11-11 | Semantic-Consistent Bidirectional Contrastive Hashing for Noisy Multi-Label Cross-Modal Retrieval | Likang Peng et.al. | 2511.07780 | null |
| 2025-11-14 | Multistep Quasimetric Learning for Scalable Goal-conditioned Reinforcement Learning | Bill Chunyuan Zheng et.al. | 2511.07730 | null |
| 2025-11-11 | Operational machine learning for remote spectroscopic detection of CH |
Vít Růžička et.al. | 2511.07719 | null |
| 2025-11-19 | Cross Modal Fine-Grained Alignment via Granularity-Aware and Region-Uncertain Modeling | Jiale Liu et.al. | 2511.07710 | null |
| 2025-11-10 | Designing and Evaluating Malinowski's Lens: An AI-Native Educational Game for Ethnographic Learning | Michael Hoffmann et.al. | 2511.07682 | null |
| 2025-11-10 | CAVER: Curious Audiovisual Exploring Robot | Luca Macesanu et.al. | 2511.07619 | null |
| 2025-11-08 | Multivariate Variational Autoencoder | Mehmet Can Yavuz et.al. | 2511.07472 | null |
| 2025-11-20 | AudAgent: Automated Auditing of Privacy Policy Compliance in AI Agents | Ye Zheng et.al. | 2511.07441 | null |
| 2025-11-10 | TwinOR: Photorealistic Digital Twins of Dynamic Operating Rooms for Embodied AI Research | Han Zhang et.al. | 2511.07412 | null |
| 2025-11-10 | YoNoSplat: You Only Need One Model for Feedforward 3D Gaussian Splatting | Botao Ye et.al. | 2511.07321 | null |
| 2025-11-10 | VADER: Towards Causal Video Anomaly Understanding with Relation-Aware Large Language Models | Ying Cheng et.al. | 2511.07299 | null |
| 2025-11-10 | Direct imaging of magnetotransport at graphene-metal interfaces with a single-spin quantum sensor | C. Ding et.al. | 2511.07181 | null |
| 2025-11-10 | LeCoT: revisiting network architecture for two-view correspondence pruning | Luanyuan Dai et.al. | 2511.07078 | null |
| 2025-11-10 | Integration of Visual SLAM into Consumer-Grade Automotive Localization | Luis Diener et.al. | 2511.06919 | null |
| 2025-11-10 | Mono3DVG-EnSD: Enhanced Spatial-aware and Dimension-decoupled Text Encoding for Monocular 3D Visual Grounding | Yuzhen Li et.al. | 2511.06908 | null |
| 2025-11-10 | NeuroBridge: Bio-Inspired Self-Supervised EEG-to-Image Decoding via Cognitive Priors and Bidirectional Semantic Alignment | Wenjiang Zhang et.al. | 2511.06836 | null |
| 2025-11-10 | Semi-distributed Cross-modal Air-Ground Relative Localization | Weining Lu et.al. | 2511.06749 | null |
| 2025-11-10 | AnoStyler: Text-Driven Localized Anomaly Generation via Lightweight Style Transfer | Yulim So et.al. | 2511.06687 | null |
| 2025-11-10 | HiMo-CLIP: Modeling Semantic Hierarchy and Monotonicity in Vision-Language Alignment | Ruijia Wu et.al. | 2511.06653 | null |
| 2025-11-09 | DiffusionUavLoc: Visually Prompted Diffusion for Cross-View UAV Localization | Tao Liu et.al. | 2511.06422 | null |
| 2025-11-09 | A generalization bound for exit wave reconstruction via deep unfolding | Moussa Atwi et.al. | 2511.06413 | null |
| 2025-11-09 | CINEMAE: Leveraging Frozen Masked Autoencoders for Cross-Generator AI Image Detection | Minsuk Jang et.al. | 2511.06325 | null |
| 2025-11-09 | ALIGN: A Vision-Language Framework for High-Accuracy Accident Location Inference through Geo-Spatial Neural Reasoning | MD Thamed Bin Zaman Chowdhury et.al. | 2511.06316 | null |
| 2025-11-11 | Robust Nearest Neighbour Retrieval Using Targeted Manifold Manipulation | B. Ghosh et.al. | 2511.06261 | null |
| 2025-11-09 | ExpReS-VLA: Specializing Vision-Language-Action Models Through Experience Replay and Retrieval | Shahram Najam Syed et.al. | 2511.06202 | null |
| 2025-11-08 | Real-Time Bundle Adjustment for Ultra-High-Resolution UAV Imagery Using Adaptive Patch-Based Feature Tracking | Selim Ahmet Iz et.al. | 2511.06152 | null |
| 2025-11-11 | When Object-Centric World Models Meet Policy Learning: From Pixels to Policies, and Where It Breaks | Stefano Ferraro et.al. | 2511.06136 | null |
| 2025-11-08 | Hybrid CNN-ViT Framework for Motion-Blurred Scene Text Restoration | Umar Rashid et.al. | 2511.06087 | null |
| 2025-11-08 | Visual Exploration of Feature Relationships in Sparse Autoencoders with Curated Concepts | Xinyuan Yan et.al. | 2511.06048 | null |
| 2025-11-08 | S2ML: Spatio-Spectral Mutual Learning for Depth Completion | Zihui Zhao et.al. | 2511.06033 | null |
| 2025-11-08 | Towards Implicit Aggregation: Robust Image Representation for Place Recognition in the Transformer Era | Feng Lu et.al. | 2511.06024 | null |
| 2025-11-08 | Dissecting the Perseus-Pisces supercluster observed with CFHT-MegaCam: Investigating environmental effects on galaxy morphology | M. Mondelin et.al. | 2511.05925 | null |
| 2025-11-08 | Open-World 3D Scene Graph Generation for Retrieval-Augmented Reasoning | Fei Yu et.al. | 2511.05894 | null |
| 2025-11-08 | HAPS Communication Networks: A Tutorial-cum-Survey on Integration with Optical Atmospheric Sensing | Ali Elkhazraji et.al. | 2511.05877 | null |
| 2025-11-07 | SARCH: Multimodal Search for Archaeological Archives | Nivedita Sinha et.al. | 2511.05667 | null |
| 2025-11-05 | Beyond Softmax: Dual-Branch Sigmoid Architecture for Accurate Class Activation Maps | Yoojin Oh et.al. | 2511.05590 | null |
| 2025-11-07 | Multi-modal Loop Closure Detection with Foundation Models in Severely Unstructured Environments | Laura Alejandra Encinar Gonzalez et.al. | 2511.05404 | null |
| 2025-11-07 | PreResQ-R1: Towards Fine-Grained Rank-and-Score Reinforcement Learning for Visual Quality Assessment via Preference-Response Disentangled Policy Optimization | Zehui Feng et.al. | 2511.05393 | null |
| 2025-11-07 | Turning Adversaries into Allies: Reversing Typographic Attacks for Multimodal E-Commerce Product Retrieval | Janet Jenq et.al. | 2511.05325 | null |
| 2025-11-07 | On the possibility of using decayless kink oscillations of coronal loops to forecast powerful solar flares and coronal mass ejections | A. B. Nechaeva et.al. | 2511.05175 | null |
| 2025-11-07 | Real-World Adverse Weather Image Restoration via Dual-Level Reinforcement Learning with High-Quality Cold Start | Fuyang Liu et.al. | 2511.05095 | null |
| 2025-11-07 | Dynamic Residual Encoding with Slide-Level Contrastive Learning for End-to-End Whole Slide Image Representation | Jing Jin et.al. | 2511.05034 | null |
| 2025-11-07 | DAFM: Dynamic Adaptive Fusion for Multi-Model Collaboration in Composed Image Retrieval | Yawei Cai et.al. | 2511.05020 | null |
| 2025-11-07 | Nuclear Ptychoscopy: A Ptychographic Framework for Nuclear Spectroscopy | Ziyang Yuan et.al. | 2511.04924 | null |
| 2025-11-06 | Learning to reason about rare diseases through retrieval-augmented agents | Ha Young Kim et.al. | 2511.04720 | null |
| 2025-11-06 | PixCLIP: Achieving Fine-grained Visual Language Understanding via Any-granularity Pixel-Text Alignment Learning | Yicheng Xiao et.al. | 2511.04601 | null |
| 2025-11-06 | Multi-Task Learning for Visually Grounded Reasoning in Gastrointestinal VQA | Itbaan Safwan et.al. | 2511.04384 | null |
| 2025-11-06 | High-Resolution Forest Mapping from L-Band Interferometric SAR Time Series using Deep Learning over Northern Spain | Chiara Telli et.al. | 2511.04362 | null |
| 2025-11-06 | Probing the Probes: Methods and Metrics for Concept Alignment | Jacob Lysnæs-Larsen et.al. | 2511.04312 | null |
| 2025-11-06 | DINOv2 Driven Gait Representation Learning for Video-Based Visible-Infrared Person Re-identification | Yujie Yang et.al. | 2511.04281 | null |
| 2025-11-07 | On the Brittleness of CLIP Text Encoders | Allie Tran et.al. | 2511.04247 | null |
| 2025-11-06 | An Efficient Algorithm for Learning-Based Visual Localization | Jindi Zhong et.al. | 2511.04232 | null |
| 2025-11-06 | GraspView: Active Perception Scoring and Best-View Optimization for Robotic Grasping in Cluttered Environments | Shenglin Wang et.al. | 2511.04199 | null |
| 2025-11-06 | Learning to Land Anywhere: Transferable Generative Models for Aircraft Trajectories | Olav Finne Praesteng Larsen et.al. | 2511.04155 | null |
| 2025-11-06 | Learning from Online Videos at Inference Time for Computer-Use Agents | Yujian Liu et.al. | 2511.04137 | null |
| 2025-11-06 | SpatialLock: Precise Spatial Control in Text-to-Image Synthesis | Biao Liu et.al. | 2511.04112 | null |
| 2025-11-06 | Caption Injection for Optimization in Generative Search Engine | Xiaolu Chen et.al. | 2511.04080 | null |
| 2025-11-06 | CaRF: Enhancing Multi-View Consistency in Referring 3D Gaussian Splatting Segmentation | Yuwen Tao et.al. | 2511.03992 | null |
| 2025-11-05 | SILVI: Simple Interface for Labeling Video Interactions | Ozan Kanbertay et.al. | 2511.03819 | null |
| 2025-11-05 | Expert Evaluation of LLM World Models: A High- |
Haoyu Guo et.al. | 2511.03782 | null |
| 2025-11-05 | The OpenHands Software Agent SDK: A Composable and Extensible Foundation for Production Agents | Xingyao Wang et.al. | 2511.03690 | null |
| 2025-11-10 | Coherent Differential Imaging of high-contrast extended sources with VLT/SPHERE | Axel Potier et.al. | 2511.03518 | null |
| 2025-11-05 | Performance Evaluation of a Position-Sensitive SiPM-based Gamma Camera for Intraoperative Imaging | Aramis Raiola et.al. | 2511.03493 | null |
| 2025-11-05 | Lightwave Power Transfer-Enabled Underwater Optical ISAC Systems under Ship Attitude Variation | Kapila W. S. Palitharathna et.al. | 2511.03366 | null |
| 2025-11-05 | Accelerating Physical Property Reasoning for Augmented Visual Cognition | Hongbo Lan et.al. | 2511.03126 | null |
| 2025-11-04 | The Curved Spacetime of Transformer Architectures | Riccardo Di Sipio et.al. | 2511.03060 | null |
| 2025-11-04 | SLIP: Structural-aware Language-Image Pretraining for Vision-Language Alignment | Wenbo Lu et.al. | 2511.03019 | null |
| 2025-11-04 | Unsupervised Learning for Industrial Defect Detection: A Case Study on Shearographic Data | Jessica Plassmann et.al. | 2511.02541 | null |
| 2025-11-04 | Object Detection as an Optional Basis: A Graph Matching Network for Cross-View UAV Localization | Tao Liu et.al. | 2511.02489 | null |
| 2025-11-04 | LUMA-RAG: Lifelong Multimodal Agents with Provably Stable Streaming Alignment | Rohan Wandre et.al. | 2511.02371 | null |
| 2025-11-04 | Learning Spatial Awareness for Laparoscopic Surgery with AI Assisted Visual Feedback | Songyang Liu et.al. | 2511.02233 | null |
| 2025-11-03 | AlloyLens: A Visual Analytics Tool for High-throughput Alloy Screening and Inverse Design | Suyang Li et.al. | 2511.02133 | null |
| 2025-11-10 | Enhancing Multimodal Recommendations with Vision-Language Models and Information-Aware Fusion | Hai-Dang Kieu et.al. | 2511.02113 | null |
| 2025-11-03 | TurboMap: GPU-Accelerated Local Mapping for Visual SLAM | Parsa Hosseininejad et.al. | 2511.02036 | null |
| 2025-11-03 | Topological Expansion of Boehm's Brushes via Structured Light | Dmitry A. Pushin et.al. | 2511.01841 | null |
| 2025-11-05 | TIR-Bench: A Comprehensive Benchmark for Agentic Thinking-with-Images Reasoning | Ming Li et.al. | 2511.01833 | null |
| 2025-11-03 | 3EED: Ground Everything Everywhere in 3D | Rong Li et.al. | 2511.01755 | null |
| 2025-11-03 | Progressive Translation of H&E to IHC with Enhanced Structural Fidelity | Yuhang Kang et.al. | 2511.01698 | null |
| 2025-11-03 | Vote-in-Context: Turning VLMs into Zero-Shot Rank Fusers | Mohamed Eltahir et.al. | 2511.01617 | null |
| 2025-11-03 | Wave-Particle (Continuous-Discrete) Dualistic Visual Tokenization for Unified Understanding and Generation | Yizhu Chen et.al. | 2511.01593 | null |
| 2025-11-03 | Floor Plan-Guided Visual Navigation Incorporating Depth and Directional Cues | Wei Huang et.al. | 2511.01493 | null |
| 2025-11-03 | UniSOT: A Unified Framework for Multi-Modality Single Object Tracking | Yinchao Ma et.al. | 2511.01427 | null |
| 2025-11-03 | Semantic BIM enrichment for firefighting assets: Fire-ART dataset and panoramic image-based 3D reconstruction | Ya Wen et.al. | 2511.01399 | null |
| 2025-11-03 | SEPS: Semantic-enhanced Patch Slimming Framework for fine-grained cross-modal alignment | Xinyu Mao et.al. | 2511.01390 | null |
| 2025-11-03 | MIQ-SAM3D: From Single-Point Prompt to Multi-Instance Segmentation via Competitive Query Refinement | Jierui Qu et.al. | 2511.01345 | null |
| 2025-11-03 | Direct Mapping of Intrinsic Topology of Bound States in the Continuum via Nonlinear Emission | Shuzheng Chen et.al. | 2511.01337 | null |
| 2025-11-03 | MotionStream: Real-Time Video Generation with Interactive Motion Controls | Joonghyuk Shin et.al. | 2511.01266 | null |
| 2025-11-03 | A Saddle Point Remedy: Power of Variable Elimination in Non-convex Optimization | Min Gan et.al. | 2511.01234 | null |
| 2025-11-02 | Efficient Test-Time Retrieval Augmented Generation | Hailong Yin et.al. | 2511.01059 | null |
| 2025-11-02 | Integrating Visual and X-Ray Machine Learning Features in the Study of Paintings by Goya | Hassan Ugail et.al. | 2511.01000 | null |
| 2025-11-02 | Dynamic Multi-level Weighted Alignment Network for Zero-shot Sketch-based Image Retrieval | Hanwen Su et.al. | 2511.00925 | null |
| 2025-11-02 | GraphGeo: Multi-Agent Debate Framework for Visual Geo-localization with Heterogeneous Graph Neural Networks | Heng Zheng et.al. | 2511.00908 | null |
| 2025-11-02 | Enhancing Adversarial Transferability in Visual-Language Pre-training Models via Local Shuffle and Sample-based Attack | Xin Liu et.al. | 2511.00831 | null |
| 2025-11-01 | Applying Medical Imaging Tractography Techniques to Painterly Rendering of Images | Alberto Di Biase et.al. | 2511.00702 | null |
| 2025-11-01 | Multi-Mapcher: Loop Closure Detection-Free Heterogeneous LiDAR Multi-Session SLAM Leveraging Outlier-Robust Registration for Autonomous Vehicles | Hyungtae Lim et.al. | 2511.00635 | null |
| 2025-11-05 | Text-guided Fine-Grained Video Anomaly Detection | Jihao Gu et.al. | 2511.00524 | null |
| 2025-11-01 | OmniTrack++: Omnidirectional Multi-Object Tracking by Learning Large-FoV Trajectory Feedback | Kai Luo et.al. | 2511.00510 | null |
| 2025-11-09 | VinDr-CXR-VQA: A Visual Question Answering Dataset for Explainable Chest X-Ray Analysis with Multi-Task Learning | Dang H. Nguyen et.al. | 2511.00504 | null |
| 2025-11-01 | FedMGP: Personalized Federated Learning with Multi-Group Text-Visual Prompts | Weihao Bo et.al. | 2511.00480 | null |
| 2025-11-20 | Weakly Supervised Pneumonia Localization from Chest X-Rays Using Deep Neural Network and Grad-CAM Explanations | Kiran Shahi et.al. | 2511.00456 | null |
| 2025-11-01 | ToxicTextCLIP: Text-Based Poisoning and Backdoor Attacks on CLIP Pre-training | Xin Yao et.al. | 2511.00446 | null |
| 2025-11-01 | Leveraging Hierarchical Image-Text Misalignment for Universal Fake Image Detection | Daichi Zhang et.al. | 2511.00427 | null |
| 2025-11-01 | VinciCoder: Unifying Multimodal Code Generation via Coarse-to-fine Visual Reinforcement Learning | Xuanle Zhao et.al. | 2511.00391 | null |
| 2025-11-19 | Spot The Ball: A Benchmark for Visual Social Inference | Neha Balamurugan et.al. | 2511.00261 | null |
| 2025-10-31 | Generative Modeling Enables Molecular Structure Retrieval from Coulomb Explosion Imaging | Xiang Li et.al. | 2511.00179 | null |
| 2025-10-31 | Integrating ConvNeXt and Vision Transformers for Enhancing Facial Age Estimation | Gaby Maroun et.al. | 2511.00123 | null |
| 2025-11-03 | Image Hashing via Cross-View Code Alignment in the Age of Foundation Models | Ilyass Moummad et.al. | 2510.27584 | null |
| 2025-10-31 | DP-FedPGN: Finding Global Flat Minima for Differentially Private Federated Learning via Penalizing Gradient Norm | Junkang Liu et.al. | 2510.27504 | null |
| 2025-10-31 | ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use | Mengjie Deng et.al. | 2510.27363 | null |
| 2025-10-31 | RzenEmbed: Towards Comprehensive Multimodal Retrieval | Weijian Jian et.al. | 2510.27350 | null |
| 2025-11-24 | FOCUS: Efficient Keyframe Selection for Long Video Understanding | Zirui Zhu et.al. | 2510.27280 | null |
| 2025-10-31 | Approximate Diverse |
Jiachen Zhao et.al. | 2510.27243 | null |
| 2025-11-04 | Dual-level Progressive Hardness-Aware Reweighting for Cross-View Geo-Localization | Guozheng Zheng et.al. | 2510.27181 | null |
| 2025-10-31 | M^3Detection: Multi-Frame Multi-Level Feature Fusion for Multi-Modal 3D Object Detection with Camera and 4D Imaging Radar | Xiaozhi Li et.al. | 2510.27166 | null |
| 2025-10-31 | AFM-Net: Advanced Fusing Hierarchical CNN Visual Priors with Global Sequence Modeling for Remote Sensing Image Scene Classification | Yuanhao Tang et.al. | 2510.27155 | null |
| 2025-10-31 | WildfireX-SLAM: A Large-scale Low-altitude RGB-D Dataset for Wildfire SLAM and Beyond | Zhicong Sun et.al. | 2510.27133 | null |
| 2025-11-04 | NaviTrace: Evaluating Embodied Navigation of Vision-Language Models | Tim Windecker et.al. | 2510.26909 | null |
| 2025-10-30 | Do Vision-Language Models Measure Up? Benchmarking Visual Measurement Reading with MeasureBench | Fenfen Lin et.al. | 2510.26865 | null |
| 2025-11-03 | Evaluating Perspectival Biases in Cross-Modal Retrieval | Teerapol Saengsukhiran et.al. | 2510.26861 | null |
| 2025-10-29 | Audio-Visual Speech Enhancement In Complex Scenarios With Separation And Dereverberation Joint Modeling | Jiarong Du et.al. | 2510.26825 | null |
| 2025-10-30 | Are Video Models Ready as Zero-Shot Reasoners? An Empirical Study with the MME-CoF Benchmark | Ziyu Guo et.al. | 2510.26802 | null |
| 2025-10-30 | Scaling Image Geo-Localization to Continent Level | Philipp Lindenberger et.al. | 2510.26795 | null |
| 2025-11-03 | ChartAB: A Benchmark for Chart Grounding & Dense Alignment | Aniruddh Bansal et.al. | 2510.26781 | null |
| 2025-10-30 | STaMP: Sequence Transformation and Mixed Precision for Low-Precision Activation Quantization | Marco Federici et.al. | 2510.26771 | null |
| 2025-10-30 | Fire Behavior Monitoring using MeteoSat Third Generation, FCI-FireDyn algorithm: Rate Of Spread and Burnt Area Dynamics for large fire event | Ronan Paugam et.al. | 2510.26677 | null |
| 2025-10-30 | Towards Fine-Grained Vision-Language Alignment for Few-Shot Anomaly Detection | Yuanting Fan et.al. | 2510.26464 | null |
| 2025-10-30 | CorVS: Person Identification via Video Trajectory-Sensor Correspondence in a Real-World Warehouse | Kazuma Kano et.al. | 2510.26369 | null |
| 2025-10-30 | Weak-Lensing Detection of Intercluster Filaments in Three Nearby Cluster Systems | Rahul Shinde et.al. | 2510.26318 | null |
| 2025-10-30 | Self-localization on a 3D map by fusing global and local features from a monocular camera | Satoshi Kikuch et.al. | 2510.26170 | null |
| 2025-10-30 | CRAG-MM: Multi-modal Multi-turn Comprehensive RAG Benchmark | Jiaqi Wang et.al. | 2510.26160 | null |
| 2025-10-30 | Exploring Object-Aware Attention Guided Frame Association for RGB-D SLAM | Ali Caglayan et.al. | 2510.26131 | null |
| 2025-10-30 | Josephson effect with periodic order parameter | Klaus Ziegler et.al. | 2510.26128 | null |
| 2025-10-30 | OracleAgent: A Multimodal Reasoning Agent for Oracle Bone Script Research | Caoshuo Li et.al. | 2510.26114 | null |
| 2025-10-29 | RADRON: Cooperative Localization of Ionizing Radiation Sources by MAVs with Compton Cameras | Petr Stibinger et.al. | 2510.26018 | null |
| 2025-10-29 | DARTS: A Drone-Based AI-Powered Real-Time Traffic Incident Detection System | Bai Li et.al. | 2510.26004 | null |
| 2025-10-31 | Larger Hausdorff Dimension in Scanning Pattern Facilitates Mamba-Based Methods in Low-Light Image Enhancement | Xinhua Wang et.al. | 2510.26001 | null |
| 2025-10-29 | Brain-IT: Image Reconstruction from fMRI via Brain-Interaction Transformer | Roman Beliy et.al. | 2510.25976 | null |
| 2025-10-26 | Towards Piece-by-Piece Explanations for Chess Positions with SHAP | Francesco Spinnato et.al. | 2510.25775 | null |
| 2025-10-29 | Retrieval-Augmented Search for Large-Scale Map Collections with ColPali | Jamie Mahowald et.al. | 2510.25718 | null |
| 2025-10-29 | Instance-Level Composed Image Retrieval | Bill Psomas et.al. | 2510.25387 | null |
| 2025-10-29 | Prompt Estimation from Prototypes for Federated Prompt Tuning of Vision Transformers | M Yashwanth et.al. | 2510.25372 | null |
| 2025-10-29 | Development of a new phase-retrieval algorithm from a single-shot image for X-ray schlieren microscopy | Ryutaro Nishimura et.al. | 2510.25264 | null |
| 2025-10-29 | Spectral analysis of the stiffness matrix sequence in the approximated Stokes equation | Samuele Ferri et.al. | 2510.25252 | null |
| 2025-10-29 | Hybrid Vision Servoing with Depp Alignment and GRU-Based Occlusion Recovery | Jee Won Lee et.al. | 2510.25233 | null |
| 2025-10-29 | MMM-Fact: A Multimodal, Multi-Domain Fact-Checking Dataset with Multi-Level Retrieval Difficulty | Wenyan Xu et.al. | 2510.25120 | null |
| 2025-10-29 | Visual Diversity and Region-aware Prompt Learning for Zero-shot HOI Detection | Chanhyeong Yang et.al. | 2510.25094 | null |
| 2025-10-28 | Defect Mitigation for Robot Arm-based Additive Manufacturing Utilizing Intelligent Control and IOT | Matsive Ali et.al. | 2510.24994 | null |
| 2025-10-28 | DualCap: Enhancing Lightweight Image Captioning via Dual Retrieval with Similar Scenes Visual Prompts | Binbin Li et.al. | 2510.24813 | null |
| 2025-10-28 | Generative AI for Healthcare: Fundamentals, Challenges, and Perspectives | Gang Chen et.al. | 2510.24551 | null |
| 2025-10-28 | GeVI-SLAM: Gravity-Enhanced Stereo Visua Inertial SLAM for Underwater Robots | Yuan Shen et.al. | 2510.24533 | null |
| 2025-10-28 | Fast and accurate neural reflectance transformation imaging through knowledge distillation | Tinsae G. Dulecha et.al. | 2510.24486 | null |
| 2025-10-28 | Deeply-Conditioned Image Compression via Self-Generated Priors | Zhineng Zhao et.al. | 2510.24437 | null |
| 2025-10-28 | Half-Light Radius Measurements of Andromeda Dwarf Satellites from the Isaac Newton Telescope Survey Using Exponential, Plummer, and Sérsic Fits | Hedieh Abdollahi et.al. | 2510.24377 | null |
| 2025-10-28 | Decoupling What to Count and Where to See for Referring Expression Counting | Yuda Zou et.al. | 2510.24374 | null |
| 2025-10-28 | Sound Source Localization for Spatial Mapping of Surgical Actions in Dynamic Scenes | Jonas Hein et.al. | 2510.24332 | null |
| 2025-10-28 | CLFSeg: A Fuzzy-Logic based Solution for Boundary Clarity and Uncertainty Reduction in Medical Image Segmentation | Anshul Kaushal et.al. | 2510.24202 | null |
| 2025-10-28 | LagMemo: Language 3D Gaussian Splatting Memory for Multi-modal Open-vocabulary Multi-goal Visual Navigation | Haotian Zhou et.al. | 2510.24118 | null |
| 2025-10-27 | Explainable Detection of AI-Generated Images with Artifact Localization Using Faster-Than-Lies and Vision-Language Models for Edge Devices | Aryan Mathur et.al. | 2510.23775 | null |
| 2025-10-27 | EgoThinker: Unveiling Egocentric Reasoning with Spatio-Temporal CoT | Baoqi Pei et.al. | 2510.23569 | null |
| 2025-10-27 | MDReID: Modality-Decoupled Learning for Any-to-Any Multi-Modal Object Re-Identification | Yingying Feng et.al. | 2510.23301 | null |
| 2025-10-27 | Learning from Frustration: Torsor CNNs on Graphs | Daiyuan Li et.al. | 2510.23288 | null |
| 2025-10-27 | Moderating Role of Presence in EEG Responses to Visuo-haptic Prediction Error in Virtual Reality | Lukas Gehrke et.al. | 2510.23262 | null |
| 2025-10-27 | Accurate and Scalable Multimodal Pathology Retrieval via Attentive Vision-Language Alignment | Hongyi Wang et.al. | 2510.23224 | null |
| 2025-10-27 | The Sun as an X-ray star V.: A new method to retrieve coronal filling factors | Wilhelmina Maryann Joseph et.al. | 2510.23161 | null |
| 2025-10-27 | Reliable Robotic Task Execution in the Face of Anomalies | Bharath Santhanam et.al. | 2510.23121 | null |
| 2025-10-27 | Multi-Stage Field Extraction of Financial Documents with OCR and Compact Vision-Language Models | Yichao Jin et.al. | 2510.23066 | null |
| 2025-10-26 | Seeing the Unseen: Towards Zero-Shot Inspection for Wind Turbine Blades using Knowledge-Augmented Vision Language Models | Yang Zhang et.al. | 2510.22868 | null |
| 2025-10-26 | Semantic Surgery: Zero-Shot Concept Erasure in Diffusion Models | Lexiang Xiong et.al. | 2510.22851 | null |
| 2025-10-26 | Analytical Swarm Chemistry: Characterization and Analysis of Emergent Swarm Behaviors | Ricardo Vega et.al. | 2510.22821 | null |
| 2025-10-26 | VEHME: A Vision-Language Model For Evaluating Handwritten Mathematics Expressions | Thu Phuong Nguyen et.al. | 2510.22798 | null |
| 2025-11-01 | Jarvis: Towards Personalized AI Assistant via Personal KV-Cache Retrieval | Binxiao Xu et.al. | 2510.22765 | null |
| 2025-10-26 | TWC-SLAM: Multi-Agent Cooperative SLAM with Text Semantics and WiFi Features Integration for Similar Indoor Environments | Chunyu Li et.al. | 2510.22754 | null |
| 2025-10-30 | Cross-view Localization and Synthesis -- Datasets, Challenges and Opportunities | Ningli Xu et.al. | 2510.22736 | null |
| 2025-10-26 | S-Chain: Structured Visual Chain-of-Thought For Medicine | Khai Le-Duc et.al. | 2510.22728 | null |
| 2025-10-26 | SpoofTrackBench: Interpretable AI for Spoof-Aware UAV Tracking and Benchmarking | Van Le et.al. | 2510.22726 | null |
| 2025-10-26 | LRW-Persian: Lip-reading in the Wild Dataset for Persian Language | Zahra Taghizadeh et.al. | 2510.22716 | null |
| 2025-10-26 | SARCLIP: A Vision Language Foundation Model for Semantic Understanding and Target Recognition in SAR Imagery | Qiwei Ma et.al. | 2510.22665 | null |
| 2025-10-26 | CLIN-LLM: A Safety-Constrained Hybrid Framework for Clinical Diagnosis and Treatment Generation | Md. Mehedi Hasan et.al. | 2510.22609 | null |
| 2025-10-26 | SWAN: Self-supervised Wavelet Neural Network for Hyperspectral Image Unmixing | Yassh Ramchandani et.al. | 2510.22607 | null |
| 2025-10-26 | RoGER-SLAM: A Robust Gaussian Splatting SLAM System for Noisy and Low-light Environment Resilience | Huilin Yin et.al. | 2510.22600 | null |
| 2025-10-26 | STATUS Bench: A Rigorous Benchmark for Evaluating Object State Understanding in Vision-Language Models | Mahiro Ukai et.al. | 2510.22571 | null |
| 2025-10-26 | Structure Aware Image Downscaling | G B Kevin Arjun et.al. | 2510.22551 | null |
| 2025-10-26 | Low-Light Image Enhancement Using Gamma Learning And Attention-Enabled Encoder-Decoder Networks | Bibhabasu Debnath et.al. | 2510.22547 | null |
| 2025-10-26 | Bag-of-Word-Groups (BoWG): A Robust and Efficient Loop Closure Detection Method Under Perceptual Aliasing | Xiang Fei et.al. | 2510.22529 | null |
| 2025-10-26 | Open Multimodal Retrieval-Augmented Factual Image Generation | Yang Tian et.al. | 2510.22521 | null |
| 2025-10-25 | Moving Beyond Diffusion: Hierarchy-to-Hierarchy Autoregression for fMRI-to-Image Reconstruction | Xu Zhang et.al. | 2510.22335 | null |
| 2025-10-25 | From Slides to Chatbots: Enhancing Large Language Models with University Course Materials | Tu Anh Dinh et.al. | 2510.22272 | null |
| 2025-10-25 | Scaling Non-Parametric Sampling with Representation | Vincent Lu et.al. | 2510.22196 | null |
| 2025-10-24 | Earth Analogs in Reflected Light: Insights from Early Spectral Characterization in Unconstrained Orbits | Arnaud Salvador et.al. | 2510.21973 | null |
| 2025-10-23 | TernaryCLIP: Efficiently Compressing Vision-Language Models with Ternary Weights and Distilled Knowledge | Shu-Hao Zhang et.al. | 2510.21879 | null |
| 2025-10-22 | SCoPE VLM: Selective Context Processing for Efficient Document Navigation in Vision-Language Models | Gyubeum Lim et.al. | 2510.21850 | null |
| 2025-10-24 | Modest-Align: Data-Efficient Alignment for Vision-Language Models | Jiaxiang Liu et.al. | 2510.21606 | null |
| 2025-10-23 | GranViT: A Fine-Grained Vision Model With Autoregressive Perception For MLLMs | Guanghao Zheng et.al. | 2510.21501 | null |
| 2025-10-24 | MUVR: A Multi-Modal Untrimmed Video Retrieval Benchmark with Multi-Level Visual Correspondence | Yue Feng et.al. | 2510.21406 | null |
| 2025-10-24 | Dynamic Semantic-Aware Correlation Modeling for UAV Tracking | Xinyu Zhou et.al. | 2510.21351 | null |
| 2025-10-24 | CT-CLIP: A Multi-modal Fusion Framework for Robust Apple Leaf Disease Recognition in Complex Environments | Lemin Liu et.al. | 2510.21346 | null |
| 2025-10-24 | FineRS: Fine-grained Reasoning and Segmentation of Small Objects with Reinforcement Learning | Lu Zhang et.al. | 2510.21311 | null |
| 2025-10-24 | Underwater Visual-Inertial-Acoustic-Depth SLAM with DVL Preintegration for Degraded Environments | Shuoshuo Ding et.al. | 2510.21215 | null |
| 2025-10-24 | A visual big data system for the prediction of weather-related variables: Jordan-Spain case study | Shadi Aljawarneh et.al. | 2510.21176 | null |
| 2025-10-24 | MedAlign: A Synergistic Framework of Multimodal Preference Optimization and Federated Meta-Cognitive Reasoning | Siyong Chen et.al. | 2510.21093 | null |
| 2025-10-27 | LayerComposer: Interactive Personalized T2I via Spatially-Aware Layered Canvas | Guocheng Gordon Qian et.al. | 2510.20820 | null |
| 2025-10-23 | Small Drafts, Big Verdict: Information-Intensive Visual Reasoning via Speculation | Yuhan Liu et.al. | 2510.20812 | null |
| 2025-10-23 | Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence | Jiahao Meng et.al. | 2510.20579 | null |
| 2025-10-23 | Deep Learning-Powered Visual SLAM Aimed at Assisting Visually Impaired Navigation | Marziyeh Bamdad et.al. | 2510.20549 | null |
| 2025-10-24 | Robust Preference Alignment via Directional Neighborhood Consensus | Ruochen Mao et.al. | 2510.20498 | null |
| 2025-10-23 | Degradation-Aware Cooperative Multi-Modal GNSS-Denied Localization Leveraging LiDAR-Based Robot Detections | Václav Pritzl et.al. | 2510.20480 | null |
| 2025-11-20 | Conan: Progressive Learning to Reason Like a Detective over Multi-Scale Visual Evidence | Kun Ouyang et.al. | 2510.20470 | null |
| 2025-10-23 | Mitigating Cross-modal Representation Bias for Multicultural Image-to-Recipe Retrieval | Qing Wang et.al. | 2510.20393 | null |
| 2025-10-25 | DB-FGA-Net: Dual Backbone Frequency Gated Attention Network for Multi-Class Brain Tumor Classification with Grad-CAM Interpretability | Saraf Anzum Shreya et.al. | 2510.20299 | null |
| 2025-10-23 | A Parameter-Efficient Mixture-of-Experts Framework for Cross-Modal Geo-Localization | LinFeng Li et.al. | 2510.20291 | null |
| 2025-10-23 | Multimedia-Aware Question Answering: A Review of Retrieval and Cross-Modal Reasoning Architectures | Rahul Raja et.al. | 2510.20193 | null |
| 2025-10-23 | PathFormer: A Transformer with 3D Grid Constraints for Digital Twin Robot-Arm Trajectory Generation | Ahmed Alanazi et.al. | 2510.20161 | null |
| 2025-10-27 | "Learning Together": AI-Mediated Support for Parental Involvement in Everyday Learning | Yao Li et.al. | 2510.20123 | null |
| 2025-10-24 | BioCAP: Exploiting Synthetic Captions Beyond Labels in Biological Foundation Models | Ziheng Zhang et.al. | 2510.20095 | null |
| 2025-10-22 | Exposing Blindspots: Cultural Bias Evaluation in Generative Image Models | Huichan Seo et.al. | 2510.20042 | null |
| 2025-10-22 | Automating Iconclass: LLMs and RAG for Large-Scale Classification of Religious Woodcuts | Drew B. Thomas et.al. | 2510.19986 | null |
| 2025-10-22 | Compressing Biology: Evaluating the Stable Diffusion VAE for Phenotypic Drug Discovery | Télio Cropsal et.al. | 2510.19887 | null |
| 2025-10-22 | Multilayer Perceptron Neural Network Model: A Novel Approach for LFP Contrast Sensitivity Tuning | Sahar Maleki et.al. | 2510.19636 | null |
| 2025-10-22 | XBench: A Comprehensive Benchmark for Visual-Language Explanations in Chest Radiography | Haozhe Luo et.al. | 2510.19599 | null |
| 2025-10-22 | Decomposed Attention Fusion in MLLMs for Training-Free Video Reasoning Segmentation | Su Ho Han et.al. | 2510.19592 | null |
| 2025-10-22 | AegisRF: Adversarial Perturbations Guided with Sensitivity for Protecting Intellectual Property of Neural Radiance Fields | Woo Jae Kim et.al. | 2510.19371 | null |
| 2025-10-22 | Exploring Scale Shift in Crowd Localization under the Context of Domain Generalization | Juncheng Wang et.al. | 2510.19330 | null |
| 2025-10-22 | Step-Aware Residual-Guided Diffusion for EEG Spatial Super-Resolution | Hongjun Liu et.al. | 2510.19166 | null |
| 2025-10-21 | UniHPR: Unified Human Pose Representation via Singular Value Contrastive Learning | Zhongyu Jiang et.al. | 2510.19078 | null |
| 2025-10-21 | Macroscopic EEG Reveals Discriminative Low-Frequency Oscillations in Plan-to-Grasp Visuomotor Tasks | Anna Cetera et.al. | 2510.19057 | null |
| 2025-10-21 | Visually Comparing Graph Vertex Ordering Algorithms through Geometrical and Topological Approaches | Karelia Salinas et.al. | 2510.19009 | null |
| 2025-10-21 | Underwater Dense Mapping with the First Compact 3D Sonar | Chinmay Burgul et.al. | 2510.18991 | null |
| 2025-10-18 | Small Language Models Offer Significant Potential for Science Community | Jian Zhang et.al. | 2510.18890 | null |
| 2025-10-21 | FedDEAP: Adaptive Dual-Prompt Tuning for Multi-Domain Federated Learning | Yubin Zheng et.al. | 2510.18837 | null |
| 2025-10-21 | UltraGen: High-Resolution Video Generation with Hierarchical Attention | Teng Hu et.al. | 2510.18775 | null |
| 2025-10-21 | Topoformer: brain-like topographic organization in Transformer language models through spatial querying and reweighting | Taha Binhuraib et.al. | 2510.18745 | null |
| 2025-10-21 | SSD: Spatial-Semantic Head Decoupling for Efficient Autoregressive Image Generation | Siyong Jian et.al. | 2510.18716 | null |
| 2025-10-21 | Exploring a Unified Vision-Centric Contrastive Alternatives on Multi-Modal Web Documents | Yiqi Lin et.al. | 2510.18703 | null |
| 2025-10-21 | CovMatch: Cross-Covariance Guided Multimodal Dataset Distillation with Trainable Text Encoder | Yongmin Lee et.al. | 2510.18583 | null |
| 2025-11-12 | Large deviations in the many-body localization transition: The case of the random-field XXZ chain | Greivin Alfaro Miranda et.al. | 2510.18545 | null |
| 2025-10-21 | RayPose: Ray Bundling Diffusion for Template Views in Unseen 6D Object Pose Estimation | Junwen Huang et.al. | 2510.18521 | null |
| 2025-10-21 | Zero-Shot Vehicle Model Recognition via Text-Based Retrieval-Augmented Generation | Wei-Chia Chang et.al. | 2510.18502 | null |
| 2025-10-21 | Beyond Single Images: Retrieval Self-Augmented Unsupervised Camouflaged Object Detection | Ji Du et.al. | 2510.18437 | null |
| 2025-10-21 | ImageGem: In-the-wild Generative Image Interaction Dataset for Generative Model Personalization | Yuanhe Guo et.al. | 2510.18433 | null |
| 2025-10-21 | Med-VRAgent: A Framework for Medical Visual Reasoning-Enhanced Agents | Guangfu Guo et.al. | 2510.18424 | null |
| 2025-10-21 | Proactive Reasoning-with-Retrieval Framework for Medical Multimodal Large Language Models | Lehan Wang et.al. | 2510.18303 | null |
| 2025-10-22 | Text or Pixels? It Takes Half: On the Token Efficiency of Visual Text Inputs in Multimodal LLMs | Yanhong Li et.al. | 2510.18279 | null |
| 2025-10-21 | TreeFedDG: Alleviating Global Drift in Federated Domain Generalization for Medical Image Segmentation | Yucheng Song et.al. | 2510.18268 | null |
| 2025-10-21 | UWBench: A Comprehensive Vision-Language Benchmark for Underwater Understanding | Da Zhang et.al. | 2510.18262 | null |
| 2025-10-21 | DualHash: A Stochastic Primal-Dual Algorithm with Theoretical Guarantee for Deep Hashing | Luxuan Li et.al. | 2510.18218 | null |
| 2025-10-20 | AION-1: Omnimodal Foundation Model for Astronomical Sciences | Liam Parker et.al. | 2510.17960 | null |
| 2025-10-13 | Pre to Post-Treatment Glioblastoma MRI Prediction using a Latent Diffusion Model | Alexandre G. Leclercq et.al. | 2510.17851 | null |
| 2025-09-30 | Micromechanical characterisation of osteoarthritic subchondral bone by micropillar compression | Samuel McPhee et.al. | 2510.17824 | null |
| 2025-10-20 | SparseVILA: Decoupling Visual Sparsity for Efficient VLM Inference | Samir Khaki et.al. | 2510.17777 | null |
| 2025-10-20 | Seeing but Not Believing: Probing the Disconnect Between Visual Attention and Answer Correctness in VLMs | Zhining Liu et.al. | 2510.17771 | null |
| 2025-10-20 | Joint Multi-Condition Representation Modelling via Matrix Factorisation for Visual Place Recognition | Timur Ismagilov et.al. | 2510.17739 | null |
| 2025-10-20 | Multilingual Text-to-Image Person Retrieval via Bidirectional Relation Reasoning and Aligning | Min Cao et.al. | 2510.17685 | null |
| 2025-10-20 | MIRAGE: Agentic Framework for Multimodal Misinformation Detection with Web-Grounded Reasoning | Mir Nafis Sharear Shopnil et.al. | 2510.17590 | null |
| 2025-10-20 | BenCao: An Instruction-Tuned Large Language Model for Traditional Chinese Medicine | Jiacheng Xie et.al. | 2510.17415 | null |
| 2025-10-20 | Model Metamers Reveal Invariances in Graph Neural Networks | Wei Xu et.al. | 2510.17378 | null |
| 2025-10-20 | Towards Mixed-Modal Retrieval for Universal Retrieval-Augmented Generation | Chenghao Zhang et.al. | 2510.17354 | null |
| 2025-10-21 | LongInsightBench: A Comprehensive Benchmark for Evaluating Omni-Modal Models on Human-Centric Long-Video Understanding | ZhaoYang Han et.al. | 2510.17305 | null |
| 2025-10-20 | Performance Evaluation of an Integrated System for Visible Light Communication and Positioning Using an Event Camera | Ryota Soga et.al. | 2510.17203 | null |
| 2025-10-20 | Generation then Reconstruction: Accelerating Masked Autoregressive Models via Two-Stage Sampling | Feihong Yan et.al. | 2510.17171 | null |
| 2025-10-22 | OmniVIC: A Self-Improving Variable Impedance Controller with Vision-Language In-Context Learning for Safe Robotic Manipulation | Heng Zhang et.al. | 2510.17150 | null |
| 2025-10-19 | Person Re-Identification via Generalized Class Prototypes | Md Ahmed Al Muzaddid et.al. | 2510.17043 | null |
| 2025-10-19 | A Low-Complexity View Synthesis Distortion Estimation Method for 3D Video with Large Baseline Considerations | Chongyuan Bi et.al. | 2510.17037 | null |
| 2025-10-19 | SAKE: Towards Editing Auditory Attribute Knowledge of Large Audio-Language Models | Chih-Kai Yang et.al. | 2510.16917 | null |
| 2025-10-19 | ArmFormer: Lightweight Transformer Architecture for Real-Time Multi-Class Weapon Segmentation and Classification | Akhila Kambhatla et.al. | 2510.16854 | null |
| 2025-11-24 | ReefNet: A Large scale, Taxonomically Enriched Dataset and Benchmark for Hard Coral Classification | Yahia Battach et.al. | 2510.16822 | null |
| 2025-10-19 | An Efficient Framework for Whole-Page Reranking via Single-Modal Supervision | Zishuai Zhang et.al. | 2510.16803 | null |
| 2025-10-19 | Region in Context: Text-condition Image editing with Human-like semantic reasoning | Thuy Phuong Vu et.al. | 2510.16772 | null |
| 2025-10-19 | See or Say Graphs: Agent-Driven Scalable Graph Understanding with Vision-Language Models | Shuo Han et.al. | 2510.16769 | null |
| 2025-10-19 | Exact Nearest-Neighbor Search on Energy-Efficient FPGA Devices | Patrizio Dazzi et.al. | 2510.16736 | null |
| 2025-10-27 | UKANFormer: Noise-Robust Semantic Segmentation for Coral Reef Mapping via a Kolmogorov-Arnold Network-Transformer Hybrid | Tianyang Dou et.al. | 2510.16730 | null |
| 2025-10-18 | Safire: Similarity Framework for Visualization Retrieval | Huyen N. Nguyen et.al. | 2510.16662 | null |
| 2025-10-18 | A Deep Learning Framework for Real-Time Image Processing in Medical Diagnostics: Enhancing Accuracy and Speed in Clinical Applications | Melika Filvantorkaman et.al. | 2510.16611 | null |
| 2025-10-18 | Image Categorization and Search via a GAT Autoencoder and Representative Models | Duygu Sap et.al. | 2510.16514 | null |
| 2025-10-18 | RefAtomNet++: Advancing Referring Atomic Video Action Recognition using Semantic Retrieval based Multi-Trajectory Mamba | Kunyu Peng et.al. | 2510.16444 | null |
| 2025-10-18 | RL makes MLLMs see better than SFT | Junha Song et.al. | 2510.16333 | null |
| 2025-10-17 | Out-of-Equilibrium Dynamics in a U(1) Lattice Gauge Theory via Local Information Flows: Scattering and String Breaking | Claudia Artiaco et.al. | 2510.16101 | null |
| 2025-10-14 | Frequency domain laser ultrasound microscopy for nanometric layer thickness imaging with GHz elastic plate resonances | Martin Ryzy et.al. | 2510.16000 | null |
| 2025-10-27 | ESCA: Contextualizing Embodied Agents via Scene-Graph Generation | Jiani Huang et.al. | 2510.15963 | null |
| 2025-10-17 | Memory-SAM: Human-Prompt-Free Tongue Segmentation via Retrieval-to-Prompt | Joongwon Chae et.al. | 2510.15849 | null |
| 2025-10-17 | FlexiReID: Adaptive Mixture of Expert for Multi-Modal Person Re-Identification | Zhen Sun et.al. | 2510.15595 | null |
| 2025-10-17 | MCA: Modality Composition Awareness for Robust Composed Multimodal Retrieval | Qiyu Wu et.al. | 2510.15543 | null |
| 2025-10-17 | DPTrack:Directional Kernel-Guided Prompt Learning for Robust Nighttime Aerial Tracking | Zhiqiang Zhu et.al. | 2510.15449 | null |
| 2025-10-17 | Select Less, Reason More: Prioritizing Evidence Purity for Video Reasoning | Xuchen Li et.al. | 2510.15440 | null |
| 2025-10-17 | Semantic4Safety: Causal Insights from Zero-shot Street View Imagery Segmentation for Urban Road Safety | Huan Chen et.al. | 2510.15434 | null |
| 2025-11-07 | Fine-Tuning MedGemma for Clinical Captioning to Enhance Multimodal RAG over Malaysia CPGs | Lee Qi Zun et.al. | 2510.15418 | null |
| 2025-10-17 | PFGS: Pose-Fused 3D Gaussian Splatting for Complete Multi-Pose Object Reconstruction | Ting-Yu Yen et.al. | 2510.15386 | null |
| 2025-10-17 | WebGen-V Bench: Structured Representation for Enhancing Visual Design in LLM-based Web Generation and Evaluation | Kuang-Da Wang et.al. | 2510.15306 | null |
| 2025-10-17 | Post-Processing Methods for Improving Accuracy in MRI Inpainting | Nishad Kulkarni et.al. | 2510.15282 | null |
| 2025-10-17 | CuSfM: CUDA-Accelerated Structure-from-Motion | Jingrui Yu et.al. | 2510.15271 | null |
| 2025-11-02 | Experience-Driven Exploration for Efficient API-Free AI Agents | Chenwei Tang et.al. | 2510.15259 | null |
| 2025-10-17 | LVI-Q: Robust LiDAR-Visual-Inertial-Kinematic Odometry for Quadruped Robots Using Tightly-Coupled and Efficient Alternating Optimization | Kevin Christiansen Marsim et.al. | 2510.15220 | null |
| 2025-10-16 | TGT: Text-Grounded Trajectories for Locally Controlled Video Generation | Guofeng Zhang et.al. | 2510.15104 | null |
| 2025-10-16 | Comprehensive language-image pre-training for 3D medical image understanding | Tassilo Wald et.al. | 2510.15042 | null |
| 2025-10-16 | NANO3D: A Training-Free Approach for Efficient 3D Editing Without Masks | Junliang Ye et.al. | 2510.15019 | null |
| 2025-10-16 | ChangingGrounding: 3D Visual Grounding in Changing Scenes | Miao Hu et.al. | 2510.14965 | null |
| 2025-10-16 | RainDiff: End-to-end Precipitation Nowcasting Via Token-wise Attention Diffusion | Thao Nguyen et.al. | 2510.14962 | null |
| 2025-10-16 | CoT-PL: Visual Chain-of-Thought Reasoning Meets Pseudo-Labeling for Open-Vocabulary Object Detection | Hojun Choi et.al. | 2510.14792 | null |
| 2025-10-16 | Improving Cybercrime Detection and Digital Forensics Investigations with Artificial Intelligence | Silvia Lucia Sanna et.al. | 2510.14638 | null |
| 2025-10-16 | Multimodal RAG for Unstructured Data:Leveraging Modality-Aware Knowledge Graphs with Hybrid Retrieval | Rashmi R et.al. | 2510.14592 | null |
| 2025-10-16 | Talking Points: Describing and Localizing Pixels | Matan Rusanovsky et.al. | 2510.14583 | null |
| 2025-10-16 | Acquisition of interpretable domain information during brain MR image harmonization for content-based image retrieval | Keima Abe et.al. | 2510.14535 | null |
| 2025-11-24 | Structured Random Models for Phase Retrieval with Optical Diffusers | Zhiyuan Hu et.al. | 2510.14490 | null |
| 2025-10-16 | Spatial Preference Rewarding for MLLMs Spatial Understanding | Han Qiu et.al. | 2510.14374 | null |
| 2025-10-14 | K-frames: Scene-Driven Any-k Keyframe Selection for long video understanding | Yifeng Yao et.al. | 2510.13891 | null |
| 2025-10-12 | Multimodal Retrieval-Augmented Generation with Large Language Models for Medical VQA | A H M Rezaul Karim et.al. | 2510.13856 | null |
| 2025-09-19 | GQVis: A Dataset of Genomics Data Questions and Visualizations for Generative AI | Skylar Sargent Walters et.al. | 2510.13816 | null |
| 2025-10-15 | Adaptive Visual Conditioning for Semantic Consistency in Diffusion-Based Story Continuation | Seyed Mohammad Mousavi et.al. | 2510.13787 | null |
| 2025-10-16 | NExT-OMNI: Towards Any-to-Any Omnimodal Foundation Models with Discrete Flow Matching | Run Luo et.al. | 2510.13721 | null |
| 2025-10-15 | Jacobian-Based Interpretation of Nonlinear Neural Encoding Model | Xiaohui Gao et.al. | 2510.13688 | null |
| 2025-11-11 | AVAR-Net: A Lightweight Audio-Visual Anomaly Recognition Framework with a Benchmark Dataset | Amjid Ali et.al. | 2510.13630 | null |
| 2025-10-15 | Characterizing Lidar Point-Cloud Adversities Using a Vector Field Visualization | Daniel Choate et.al. | 2510.13619 | null |
| 2025-10-15 | Accelerated Feature Detectors for Visual SLAM: A Comparative Study of FPGA vs GPU | Ruiqi Ye et.al. | 2510.13546 | null |
| 2025-10-15 | Through the Lens of Doubt: Robust and Efficient Uncertainty Estimation for Visual Place Recognition | Emily Miller et.al. | 2510.13464 | null |
| 2025-10-15 | Improving Visual Recommendation on E-commerce Platforms Using Vision-Language Models | Yuki Yada et.al. | 2510.13359 | null |
| 2025-10-15 | UniVector: Unified Vector Extraction via Instance-Geometry Interaction | Yinglong Yan et.al. | 2510.13234 | null |
| 2025-10-15 | OS-HGAdapter: Open Semantic Hypergraph Adapter for Large Language Models Assisted Entropy-Enhanced Image-Text Alignment | Rongjun Chen et.al. | 2510.13131 | null |
| 2025-10-23 | Epistemic-aware Vision-Language Foundation Model for Fetal Ultrasound Interpretation | Xiao He et.al. | 2510.12953 | null |
| 2025-10-14 | DeepMMSearch-R1: Empowering Multimodal LLMs in Multimodal Web Search | Kartik Narayan et.al. | 2510.12801 | null |
| 2025-10-14 | SRUM: Fine-Grained Self-Rewarding for Unified Multimodal Models | Weiyang Jin et.al. | 2510.12784 | null |
| 2025-10-24 | E-MoFlow: Learning Egomotion and Optical Flow from Event Data via Implicit Regularization | Wenpu Li et.al. | 2510.12753 | null |
| 2025-10-14 | A Text-Image Fusion Method with Data Augmentation Capabilities for Referring Medical Image Segmentation | Shurong Chai et.al. | 2510.12482 | null |
| 2025-10-14 | SMEC: Rethinking Matryoshka Representation Learning for Retrieval Embedding Compression | Biao Zhang et.al. | 2510.12474 | null |
| 2025-10-14 | SpineBench: Benchmarking Multimodal LLMs for Spinal Pathology Analysis | Chenghanyu Zhang et.al. | 2510.12267 | null |
| 2025-10-14 | Local Background Features Matter in Out-of-Distribution Detection | Jinlun Ye et.al. | 2510.12259 | null |
| 2025-10-14 | SDGraph: Multi-Level Sketch Representation Learning by Sparse-Dense Graph Architecture | Xi Cheng et.al. | 2510.12192 | null |
| 2025-10-14 | ImageSentinel: Protecting Visual Datasets from Unauthorized Retrieval-Augmented Image Generation | Ziyuan Luo et.al. | 2510.12119 | null |
| 2025-10-13 | Embedding the Teacher: Distilling vLLM Preferences for Scalable Image Retrieval | Eric He et.al. | 2510.12014 | null |
| 2025-10-11 | Benefits and Limitations of Using GenAI for Political Education and Municipal Elections | Raphael Fischer et.al. | 2510.11749 | null |
| 2025-10-13 | High-resolution Photo Enhancement in Real-time: A Laplacian Pyramid Network | Feng Zhang et.al. | 2510.11613 | null |
| 2025-10-14 | Massive Activations are the Key to Local Detail Synthesis in Diffusion Transformers | Chaofan Gan et.al. | 2510.11538 | null |
| 2025-10-13 | A Modular AIoT Framework for Low-Latency Real-Time Robotic Teleoperation in Smart Cities | Shih-Chieh Sun et.al. | 2510.11421 | null |
| 2025-10-13 | MMAP: A Multi-Magnification and Prototype-Aware Architecture for Predicting Spatial Gene Expression | Hai Dang Nguyen et.al. | 2510.11344 | null |
| 2025-10-13 | A Large-Language-Model Assisted Automated Scale Bar Detection and Extraction Framework for Scanning Electron Microscopic Images | Yuxuan Chen et.al. | 2510.11260 | null |
| 2025-10-13 | PhysHSI: Towards a Real-World Generalizable and Natural Humanoid-Scene Interaction System | Huayi Wang et.al. | 2510.11072 | null |
| 2025-10-13 | Impact of elastic inhomogeneity on collective dynamical properties investigated by field theoretical description in real space | Cunyuan Jiang et.al. | 2510.10928 | null |
| 2025-10-13 | SceneTextStylizer: A Training-Free Scene Text Style Transfer Framework with Diffusion Model | Honghui Yuan et.al. | 2510.10910 | null |
| 2025-10-13 | Spatial Correlation of Superconducting and Pseudogap Dynamics in a Bi-based Cuprate | T. Shimizu et.al. | 2510.10906 | null |
| 2025-10-13 | Where on Earth? A Vision-Language Benchmark for Probing Model Geolocation Skills Across Scales | Zhaofang Qian et.al. | 2510.10880 | null |
| 2025-10-12 | OmniVideoBench: Towards Audio-Visual Understanding Evaluation for Omni MLLMs | Caorui Li et.al. | 2510.10689 | null |
| 2025-10-12 | A Simple and Better Baseline for Visual Grounding | Jingchao Wang et.al. | 2510.10587 | null |
| 2025-10-12 | BitMar: Low-Bit Multimodal Fusion with Episodic Memory for Edge Devices | Euhid Aman et.al. | 2510.10560 | null |
| 2025-10-12 | Taming a Retrieval Framework to Read Images in Humanlike Manner for Augmenting Generation of MLLMs | Suyang Xi et.al. | 2510.10426 | null |
| 2025-10-11 | B2N3D: Progressive Learning from Binary to N-ary Relationships for 3D Object Grounding | Feng Xiao et.al. | 2510.10194 | null |
| 2025-10-11 | TCMA: Text-Conditioned Multi-granularity Alignment for Drone Cross-Modal Text-Video Retrieval | Zixu Zhao et.al. | 2510.10180 | null |
| 2025-10-11 | ViConEx-Med: Visual Concept Explainability via Multi-Concept Token Transformer for Medical Image Analysis | Cristiano Patrício et.al. | 2510.10174 | null |
| 2025-10-11 | Cooperative Pseudo Labeling for Unsupervised Federated Classification | Kuangpu Guo et.al. | 2510.10100 | null |
| 2025-10-11 | Think Twice to See More: Iterative Visual Reasoning in Medical VLMs | Kaitao Chen et.al. | 2510.10052 | null |
| 2025-10-11 | Complementary and Contrastive Learning for Audio-Visual Segmentation | Sitong Gong et.al. | 2510.10051 | null |
| 2025-10-11 | Scaling Traffic Insights with AI and Language Model-Powered Camera Systems for Data-Driven Transportation Decision Making | Fan Zuo et.al. | 2510.09981 | null |
| 2025-10-14 | J-RAS: Enhancing Medical Image Segmentation via Retrieval-Augmented Joint Training | Salma J. Ahmed et.al. | 2510.09953 | null |
| 2025-10-15 | Egocentric Visual Navigation through Hippocampal Sequences | Xiao-Xiong Lin et.al. | 2510.09951 | null |
| 2025-10-10 | The Geometry of Reasoning: Flowing Logics in Representation Space | Yufa Zhou et.al. | 2510.09782 | null |
| 2025-10-10 | VisRAG 2.0: Evidence-Guided Multi-Image Reasoning in Visual Retrieval-Augmented Generation | Yubo Sun et.al. | 2510.09733 | null |
| 2025-10-07 | Semantic-Cohesive Knowledge Distillation for Deep Cross-modal Hashing | Changchang Sun et.al. | 2510.09664 | null |
| 2025-10-10 | MRMR: A Realistic and Expert-Level Multidisciplinary Benchmark for Reasoning-Intensive Multimodal Retrieval | Siyue Zhang et.al. | 2510.09510 | null |
| 2025-10-10 | Diagonal Artifacts in Samsung Images: PRNU Challenges and Solutions | David Vázquez-Padín et.al. | 2510.09509 | null |
| 2025-10-10 | Dynamic Weight-based Temporal Aggregation for Low-light Video Enhancement | Ruirui Lin et.al. | 2510.09450 | null |
| 2025-10-10 | Mono4DEditor: Text-Driven 4D Scene Editing from Monocular Video via Point-Level Localization of Language-Embedded Gaussians | Jin-Chuan Shi et.al. | 2510.09438 | null |
| 2025-10-10 | Sub-Diffraction Chromatin Domains: Architecture, Regulation, and Functional Roles in Nuclear Organization | Vinayak Vinayak et.al. | 2510.09375 | null |
| 2025-10-10 | Hybrid-grained Feature Aggregation with Coarse-to-fine Language Guidance for Self-supervised Monocular Depth Estimation | Wenyao Zhang et.al. | 2510.09320 | null |
| 2025-10-10 | Instance-Level Generation for Representation Learning | Yankun Wu et.al. | 2510.09171 | null |
| 2025-10-10 | Robust Visual Teach-and-Repeat Navigation with Flexible Topo-metric Graph Map Representation | Jikai Wang et.al. | 2510.09089 | null |
| 2025-10-10 | Visual Anomaly Detection for Reliable Robotic Implantation of Flexible Microelectrode Array | Yitong Chen et.al. | 2510.09071 | null |
| 2025-10-10 | HandEval: Taking the First Step Towards Hand Quality Evaluation in Generated Images | Zichuan Wang et.al. | 2510.08978 | null |
| 2025-10-10 | Hierarchical Scheduling for Multi-Vector Image Retrieval | Maoliang Li et.al. | 2510.08976 | null |
| 2025-11-19 | FATHOMS-RAG: A Framework for the Assessment of Thinking and Observation in Multimodal Systems that use Retrieval Augmented Generation | Samuel Hildebrand et.al. | 2510.08945 | null |
| 2025-10-09 | Identifying Video Game Debugging Bottlenecks: An Industry Perspective | Carlos Pinto Gomez et.al. | 2510.08834 | null |
| 2025-10-09 | Whole Body Model Predictive Control for Spin-Aware Quadrupedal Table Tennis | David Nguyen et.al. | 2510.08754 | null |
| 2025-10-08 | Into the Rabbit Hull: From Task-Relevant Concepts in DINO to Minkowski Geometry | Thomas Fel et.al. | 2510.08638 | null |
| 2025-10-11 | MultiCOIN: Multi-Modal COntrollable Video INbetweening | Maham Tanveer et.al. | 2510.08561 | null |
| 2025-10-09 | X2Video: Adapting Diffusion Models for Multimodal Controllable Neural Video Rendering | Zhitong Huang et.al. | 2510.08530 | null |
| 2025-10-09 | Observation of electromagnons in a monolayer multiferroic | Mohammad Amini et.al. | 2510.08253 | null |
| 2025-10-09 | DarkHash: A Data-Free Backdoor Attack Against Deep Hashing | Ziqi Zhou et.al. | 2510.08094 | null |
| 2025-10-09 | CIR-CoT: Towards Interpretable Composed Image Retrieval via End-to-End Chain-of-Thought Reasoning | Weihuang Lin et.al. | 2510.08003 | null |
| 2025-10-09 | MARC: Memory-Augmented RL Token Compression for Efficient Video Understanding | Peiran Wu et.al. | 2510.07915 | null |
| 2025-10-09 | RePainter: Empowering E-commerce Object Removal via Spatial-matting Reinforcement Learning | Zipeng Guo et.al. | 2510.07721 | null |
| 2025-10-09 | Multimodal Safety Evaluation in Generative Agent Social Simulations | Alhim Vera et.al. | 2510.07709 | null |
| 2025-10-09 | Mutual Learning for Hashing: Unlocking Strong Hash Functions from Weak Supervision | Xiaoxu Ma et.al. | 2510.07703 | null |
| 2025-10-16 | Ctrl-VI: Controllable Video Synthesis via Variational Inference | Haoyi Duan et.al. | 2510.07670 | null |
| 2025-10-08 | SpecGuard: Spectral Projection-based Advanced Invisible Watermarking | Inzamamul Alam et.al. | 2510.07302 | null |
| 2025-10-10 | DPL: Depth-only Perceptive Humanoid Locomotion via Realistic Depth Synthesis and Cross-Attention Terrain Reconstruction | Jingkai Sun et.al. | 2510.07152 | null |
| 2025-10-08 | ELMUR: External Layer Memory with Update/Rewrite for Long-Horizon RL | Egor Cherepanov et.al. | 2510.07151 | null |
| 2025-11-14 | Concept Retrieval -- What and How? | Ori Nizan et.al. | 2510.07058 | null |
| 2025-10-08 | High-Performance Imaging in a Dilution Refrigerator | Timo Eikelmann et.al. | 2510.07054 | null |
| 2025-10-08 | Introspection in Learned Semantic Scene Graph Localisation | Manshika Charvi Bissessur et.al. | 2510.07053 | null |
| 2025-10-08 | IAR2: Improving Autoregressive Visual Generation with Semantic-Detail Associated Token Prediction | Ran Yi et.al. | 2510.06928 | null |
| 2025-10-08 | M3Retrieve: Benchmarking Multimodal Retrieval for Medicine | Arkadeep Acharya et.al. | 2510.06888 | null |
| 2025-10-08 | Versatile 3D reconstruction framework for hard X-ray grazing incidence imaging of nanostructures | Luke Besley et.al. | 2510.06877 | null |
| 2025-10-08 | Multi-hop Deep Joint Source-Channel Coding with Deep Hash Distillation for Semantically Aligned Image Retrieval | Didrik Bergström et.al. | 2510.06868 | null |
| 2025-10-08 | Efficient Discriminative Joint Encoders for Large Scale Vision-Language Reranking | Mitchell Keren Taraday et.al. | 2510.06820 | null |
| 2025-10-08 | Capture and Interact: Rapid 3D Object Acquisition and Rendering with Gaussian Splatting in Unity | Islomjon Shukhratov et.al. | 2510.06802 | null |
| 2025-10-08 | DeRainMamba: A Frequency-Aware State Space Model with Detail Enhancement for Image Deraining | Zhiliang Zhu et.al. | 2510.06746 | null |
| 2025-10-08 | ToolMem: Enhancing Multimodal Agents with Learnable Tool Capability Memory | Yunzhong Xiao et.al. | 2510.06664 | null |
| 2025-11-15 | Implicit-Knowledge Visual Question Answering with Structured Reasoning Traces | Zhihao Wen et.al. | 2510.06638 | null |
| 2025-10-07 | TDiff: Thermal Plug-And-Play Prior with Patch-Based Diffusion | Piyush Dashpute et.al. | 2510.06460 | null |
| 2025-10-07 | Vi-TacMan: Articulated Object Manipulation via Vision and Touch | Leiyao Cui et.al. | 2510.06339 | null |
| 2025-10-05 | A Mixed-Methods Analysis of Repression and Mobilization in Bangladesh's July Revolution Using Machine Learning and Statistical Modeling | Md. Saiful Bari Siddiqui et.al. | 2510.06264 | null |
| 2025-10-09 | A Multimodal GUI Architecture for Interfacing with LLM-Based Conversational Assistants | Hans G. W. van Dam et.al. | 2510.06223 | null |
| 2025-10-07 | Human3R: Everyone Everywhere All at Once | Yue Chen et.al. | 2510.06219 | null |
| 2025-10-07 | DYMO-Hair: Generalizable Volumetric Dynamics Modeling for Robot Hair Manipulation | Chengyang Zhao et.al. | 2510.06199 | null |
| **2025-10-0 |