GitHub - Vincentqyw/cv-arxiv-daily: 🎓Automatically Update CV Papers Daily using Github Actions

Updated on 2025.12.06

Usage instructions: here

Table of Contents

SLAM
SFM
Visual Localization
Keypoint Detection
Image Matching
NeRF

SLAM

Publish Date	Title	Authors	PDF	Code
2025-12-04	TEMPO-VINE: A Multi-Temporal Sensor Fusion Dataset for Localization and Mapping in Vineyards	Mauro Martini et.al.	2512.04772	null
2025-12-03	What Is The Best 3D Scene Representation for Robotics? From Geometric to Foundation Models	Tianchen Deng et.al.	2512.03422	null
2025-12-02	VIGS-SLAM: Visual Inertial Gaussian Splatting SLAM	Zihan Zhu et.al.	2512.02293	null
2025-12-01	KM-ViPE: Online Tightly Coupled Vision-Language-Geometry Fusion for Open-Vocabulary Semantic SLAM	Zaid Nasser et.al.	2512.01889	null
2025-12-01	Register Any Point: Scaling 3D Point Cloud Registration by Flow Matching	Yue Pan et.al.	2512.01850	null
2025-12-01	AgriLiRa4D: A Multi-Sensor UAV Dataset for Robust SLAM in Challenging Agricultural Fields	Zhihao Zhan et.al.	2512.01753	null
2025-12-01	EGG-Fusion: Efficient 3D Reconstruction with Geometry-aware Gaussian Surfel on the Fly	Xiaokun Pan et.al.	2512.01296	null
2025-11-30	Integration of UWB Radar on Mobile Robots for Continuous Obstacle and Environment Mapping	Adelina Giurea et.al.	2512.01018	null
2025-11-30	EAG3R: Event-Augmented 3D Geometry Estimation for Dynamic and Extreme-Lighting Scenes	Xiaoshan Wu et.al.	2512.00771	null
2025-11-29	Odometry Without Correspondence from Inertially Constrained Ruled Surfaces	Chenqi Zhu et.al.	2512.00327	null
2025-11-26	Dual-Agent Reinforcement Learning for Adaptive and Cost-Aware Visual-Inertial Odometry	Feiyang Pan et.al.	2511.21083	null
2025-11-25	Estimating Fog Parameters from a Sequence of Stereo Images	Yining Ding et.al.	2511.20865	null
2025-11-25	The origin of B-type runaway stars based on kinematics	Yanjun Guo et.al.	2511.20566	null
2025-11-25	Metric, inertially aligned monocular state estimation via kinetodynamic priors	Jiaxin Liu et.al.	2511.20496	null
2025-11-25	AMB3R: Accurate Feed-forward Metric-scale 3D Reconstruction with Backend	Hengyi Wang et.al.	2511.20343	null
2025-11-25	Stellar Parameters of BOSS M dwarfs in SDSS-V DR19	Dan Qiu et.al.	2511.20005	null
2025-11-26	Multi-Agent Monocular Dense SLAM With 3D Reconstruction Priors	Yuchen Zhou et.al.	2511.19031	null
2025-11-24	AutoOdom: Learning Auto-regressive Proprioceptive Odometry for Legged Locomotion	Changsheng Luo et.al.	2511.18857	null
2025-11-24	SP-VINS: A Hybrid Stereo Visual Inertial Navigation System based on Implicit Environmental Map	Xueyu Du et.al.	2511.18756	null
2025-11-24	Splatonic: Architecture Support for 3D Gaussian Splatting SLAM via Sparse Processing	Xiaotong Huang et.al.	2511.18755	null
2025-11-24	Stable Multi-Drone GNSS Tracking System for Marine Robots	Shuo Wen et.al.	2511.18694	null
2025-11-23	Gaze Beyond the Frame: Forecasting Egocentric 3D Visual Span	Heeseung Yun et.al.	2511.18470	null
2025-11-22	Unobservable Subspace Evolution and Alignment for Consistent Visual-Inertial Navigation	Chungeng Tian et.al.	2511.17992	null
2025-11-21	Target-Bench: Can World Models Achieve Mapless Path Planning with Semantic Targets?	Dingrui Wang et.al.	2511.17792	null
2025-11-21	IndustryNav: Exploring Spatial Reasoning of Embodied Agents in Dynamic Industrial Navigation	Yifan Li et.al.	2511.17384	null
2025-11-21	MonoSpheres: Large-Scale Monocular SLAM-Based UAV Exploration through Perception-Coupled Mapping and Planning	Tomáš Musil et.al.	2511.17299	null
2025-11-21	SING3R-SLAM: Submap-based Indoor Monocular Gaussian SLAM with 3D Reconstruction Priors	Kunyi Li et.al.	2511.17207	null
2025-11-20	CRISTAL: Real-time Camera Registration in Static LiDAR Scans using Neural Rendering	Joni Vanherck et.al.	2511.16349	null
2025-11-20	Building temporally coherent 3D maps with VGGT for memory-efficient Semantic SLAM	Gergely Dinya et.al.	2511.16282	null
2025-11-20	LEGO-SLAM: Language-Embedded Gaussian Optimization SLAM	Sibaek Lee et.al.	2511.16144	null
2025-11-20	Rad-GS: Radar-Vision Integration for 3D Gaussian Splatting SLAM in Outdoor Environments	Renxiang Xiao et.al.	2511.16091	null
2025-11-20	Semantic Glitch: Agency and Artistry in an Autonomous Pixel Cloud	Qing Zhang et.al.	2511.16048	null
2025-11-11	Real-time Point Cloud Data Transmission via L4S for 5G-Edge-Assisted Robotics	Gerasimos Damigos et.al.	2511.15677	null
2025-11-19	Learning from Mistakes: Loss-Aware Memory Enhanced Continual Learning for LiDAR Place Recognition	Xufei Wang et.al.	2511.15597	null
2025-11-18	A visual study of ICP variants for Lidar Odometry	Sebastian Dingler et.al.	2511.14919	null
2025-11-18	SLAM-AGS: Slide-Label Aware Multi-Task Pretraining Using Adaptive Gradient Surgery in Computational Cytology	Marco Acerbis et.al.	2511.14639	null
2025-11-23	Simultaneous Localization and 3D-Semi Dense Mapping for Micro Drones Using Monocular Camera and Inertial Sensors	Jeryes Danial et.al.	2511.14335	null
2025-11-18	MA-SLAM: Active SLAM in Large-Scale Unknown Environment using Map Aware Deep Reinforcement Learning	Yizhen Yin et.al.	2511.14330	null
2025-11-18	iGaussian: Real-Time Camera Pose Estimation via Feed-Forward 3D Gaussian Splatting Inversion	Hao Wang et.al.	2511.14149	null
2025-11-17	GaRLILEO: Gravity-aligned Radar-Leg-Inertial Enhanced Odometry	Chiyun Noh et.al.	2511.13216	null
2025-11-16	DPVO-QAT++: Heterogeneous QAT and CUDA Kernel Fusion for High-Performance Deep Patch Visual Odometry	Cheng Liao et.al.	2511.12653	null
2025-11-14	Autonomous Underwater Cognitive System for Adaptive Navigation: A SLAM-Integrated Cognitive Architecture	K. A. I. N Jayarathne et.al.	2511.11845	null
2025-11-12	DualVision ArthroNav: Investigating Opportunities to Enhance Localization and Reconstruction in Image-based Arthroscopy Navigation via External Cameras	Hongchao Shu et.al.	2511.10699	null
2025-11-12	Generation-Agnostic Zero-Energy Devices for Sustainable Connectivity, Sensing, and Localization	Navid Amani et.al.	2511.09372	null
2025-11-12	UMIGen: A Unified Framework for Egocentric Point Cloud Generation and Cross-Embodiment Robotic Imitation Learning	Yan Huang et.al.	2511.09302	null
2025-11-12	SMF-VO: Direct Ego-Motion Estimation via Sparse Motion Fields	Sangheon Yang et.al.	2511.09072	null
2025-11-10	Integration of Visual SLAM into Consumer-Grade Automotive Localization	Luis Diener et.al.	2511.06919	null
2025-11-10	Robust and High-Fidelity 3D Gaussian Splatting: Fusing Pose Priors and Geometry Constraints for Texture-Deficient Outdoor Scenes	Meijun Guo et.al.	2511.06765	null
2025-11-10	Semi-distributed Cross-modal Air-Ground Relative Localization	Weining Lu et.al.	2511.06749	null
2025-11-08	ViTaMIn-B: A Reliable and Efficient Visuo-Tactile Bimanual Manipulation Interface	Chuanyu Li et.al.	2511.05858	null
2025-11-08	3D Mapping Using a Lightweight and Low-Power Monocular Camera Embedded inside a Gripper of Limbed Climbing Robots	Taku Okawara et.al.	2511.05816	null
2025-11-07	Multi-modal Loop Closure Detection with Foundation Models in Severely Unstructured Environments	Laura Alejandra Encinar Gonzalez et.al.	2511.05404	null
2025-11-06	Synchronous Observer Design for Landmark-Inertial SLAM with Almost-Global Convergence	Arkadeep Saha et.al.	2511.04531	null
2025-11-06	PUL-SLAM: Path-Uncertainty Co-Optimization with Lightweight Stagnation Detection for Efficient Robotic Exploration	Yizhen Yin et.al.	2511.04180	null
2025-11-04	Analytical modelling of a stop-less modular bus service with an application to charging strategies comparison	Haoran Zhao et.al.	2511.03754	null
2025-11-04	Self-Supervised Moving Object Segmentation of Sparse and Noisy Radar Point Clouds	Leon Schwarzer et.al.	2511.02395	null
2025-11-03	TurboMap: GPU-Accelerated Local Mapping for Visual SLAM	Parsa Hosseininejad et.al.	2511.02036	null
2025-11-03	CM-LIUW-Odometry: Robust and High-Precision LiDAR-Inertial-UWB-Wheel Odometry for Extreme Degradation Coal Mine Tunnels	Kun Hu et.al.	2511.01379	null
2025-11-11	Tackling the Kidnapped Robot Problem via Sparse Feasible Hypothesis Sampling and Reliable Batched Multi-Stage Inference	Muhua Zhang et.al.	2511.01219	null
2025-11-03	LiDAR-VGGT: Cross-Modal Coarse-to-Fine Fusion for Globally Consistent and Metric-Scale Dense Mapping	Lijie Wang et.al.	2511.01186	null
2025-11-01	Multi-Mapcher: Loop Closure Detection-Free Heterogeneous LiDAR Multi-Session SLAM Leveraging Outlier-Robust Registration for Autonomous Vehicles	Hyungtae Lim et.al.	2511.00635	null
2025-10-31	WildfireX-SLAM: A Large-scale Low-altitude RGB-D Dataset for Wildfire SLAM and Beyond	Zhicong Sun et.al.	2510.27133	null
2025-10-30	AgriGS-SLAM: Orchard Mapping Across Seasons via Multi-View Gaussian Splatting SLAM	Mirko Usuelli et.al.	2510.26358	null
2025-10-30	Exploring Object-Aware Attention Guided Frame Association for RGB-D SLAM	Ali Caglayan et.al.	2510.26131	null
2025-10-29	EA3D: Online Open-World 3D Object Extraction from Streaming Videos	Xiaoyu Zhou et.al.	2510.25146	null
2025-10-28	Spatiotemporal Calibration of Doppler Velocity Logs for Underwater Robots	Hongxu Zhao et.al.	2510.24571	null
2025-10-28	GeVI-SLAM: Gravity-Enhanced Stereo Visua Inertial SLAM for Underwater Robots	Yuan Shen et.al.	2510.24533	null
2025-10-28	A Survey on Collaborative SLAM with 3D Gaussian Splatting	Phuc Nguyen Xuan et.al.	2510.23988	null
2025-10-26	TWC-SLAM: Multi-Agent Cooperative SLAM with Text Semantics and WiFi Features Integration for Similar Indoor Environments	Chunyu Li et.al.	2510.22754	null
2025-10-26	Policies over Poses: Reinforcement Learning based Distributed Pose-Graph Optimization for Multi-Robot SLAM	Sai Krishna Ghanta et.al.	2510.22740	null
2025-10-26	LVD-GS: Gaussian Splatting SLAM for Dynamic Scenes via Hierarchical Explicit-Implicit Representation Collaboration Rendering	Wenkai Zhu et.al.	2510.22669	null
2025-10-26	RoGER-SLAM: A Robust Gaussian Splatting SLAM System for Noisy and Low-light Environment Resilience	Huilin Yin et.al.	2510.22600	null
2025-10-26	UltraVoice: Scaling Fine-Grained Style-Controlled Speech Conversations for Spoken Dialogue Models	Wenming Tu et.al.	2510.22588	null
2025-10-26	Bag-of-Word-Groups (BoWG): A Robust and Efficient Loop Closure Detection Method Under Perceptual Aliasing	Xiang Fei et.al.	2510.22529	null
2025-10-24	Underwater Visual-Inertial-Acoustic-Depth SLAM with DVL Preintegration for Degraded Environments	Shuoshuo Ding et.al.	2510.21215	null
2025-10-23	Deep Learning-Powered Visual SLAM Aimed at Assisting Visually Impaired Navigation	Marziyeh Bamdad et.al.	2510.20549	null
2025-10-23	Degradation-Aware Cooperative Multi-Modal GNSS-Denied Localization Leveraging LiDAR-Based Robot Detections	Václav Pritzl et.al.	2510.20480	null
2025-10-21	Underwater Dense Mapping with the First Compact 3D Sonar	Chinmay Burgul et.al.	2510.18991	null
2025-10-21	DeepDetect: Learning All-in-One Dense Keypoints	Shaharyar Ahmed Khan Tareen et.al.	2510.17422	null
2025-10-18	LightGlueStick: a Fast and Robust Glue for Joint Point-Line Matching	Aidyn Ubingazhibov et.al.	2510.16438	null
2025-10-17	VAR-SLAM: Visual Adaptive and Robust SLAM for Dynamic Environments	João Carlos Virgolino Soares et.al.	2510.16205	null
2025-10-17	Dynamic Recalibration in LiDAR SLAM: Integrating AI and Geometric Methods with Real-Time Feedback Using INAF Fusion	Zahra Arjmandi et.al.	2510.15803	null
2025-10-17	LVI-Q: Robust LiDAR-Visual-Inertial-Kinematic Odometry for Quadruped Robots Using Tightly-Coupled and Efficient Alternating Optimization	Kevin Christiansen Marsim et.al.	2510.15220	null
2025-10-16	3D Scene Prompting for Scene-Consistent Camera-Controllable Video Generation	JoungBin Lee et.al.	2510.14945	null
2025-10-15	Accelerated Feature Detectors for Visual SLAM: A Comparative Study of FPGA vs GPU	Ruiqi Ye et.al.	2510.13546	null
2025-10-15	Through the Lens of Doubt: Robust and Efficient Uncertainty Estimation for Visual Place Recognition	Emily Miller et.al.	2510.13464	null
2025-10-15	DAMM-LOAM: Degeneracy Aware Multi-Metric LiDAR Odometry and Mapping	Nishant Chandna et.al.	2510.13287	null
2025-10-14	SPORTS: Simultaneous Panoptic Odometry, Rendering, Tracking and Segmentation for Urban Scenes Understanding	Zhiliu Yang et.al.	2510.12749	null
2025-10-14	PolygMap: A Perceptive Locomotion Framework for Humanoid Robot Stair Climbing	Bingquan Li et.al.	2510.12346	null
2025-10-09	ARTDECO: Towards Efficient and High-Fidelity On-the-Fly 3D Reconstruction with Structured Scene Representation	Guanghao Li et.al.	2510.08551	null
2025-10-09	RTGS: Real-Time 3D Gaussian Splatting SLAM via Multi-Level Redundancy Reduction	Leshu Li et.al.	2510.06644	null
2025-10-07	Human3R: Everyone Everywhere All at Once	Yue Chen et.al.	2510.06219	null
2025-11-02	Dropping the D: RGB-D SLAM Without the Depth Sensor	Mert Kiray et.al.	2510.06216	null
2025-10-07	Coordinate-Consistent Localization via Continuous-Time Calibration and Fusion of UWB and SLAM Observations	Tien-Dat Nguyen et.al.	2510.05992	null
2025-10-06	OKVIS2-X: Open Keyframe-based Visual-Inertial SLAM Configurable with Dense Depth or LiDAR, and GNSS	Simon Boche et.al.	2510.04612	null
2025-10-04	TCB-VIO: Tightly-Coupled Focal-Plane Binary-Enhanced Visual Inertial Odometry	Matthew Lisondra et.al.	2510.03919	null
2025-11-19	Visual Odometry with Transformers	Vlardimir Yugay et.al.	2510.03348	null
2025-10-02	RSV-SLAM: Toward Real-Time Semantic Visual SLAM in Indoor Dynamic Environments	Mobin Habibpour et.al.	2510.02616	null
2025-10-02	EC3R-SLAM: Efficient and Consistent Monocular Dense SLAM with Feed-Forward 3D Reconstruction	Lingxiang Hu et.al.	2510.02080	null
2025-10-02	Non-Rigid Structure-from-Motion via Differential Geometry with Recoverable Conformal Scale	Yongbo Chen et.al.	2510.01665	null
2025-10-02	Statistical Uncertainty Learning for Robust Visual-Inertial State Estimation	Seungwon Choi et.al.	2510.01648	null
2025-10-01	Instant4D: 4D Gaussian Splatting in Minutes	Zhanpeng Luo et.al.	2510.01119	null
2025-10-01	Semantic Visual Simultaneous Localization and Mapping: A Survey on State of the Art, Challenges, and Future Directions	Thanh Nguyen Canh et.al.	2510.00783	null
2025-09-30	Benchmarking Egocentric Visual-Inertial SLAM at City Scale	Anusha Krishnan et.al.	2509.26639	null
2025-09-30	Graphite: A GPU-Accelerated Mixed-Precision Graph Optimization Framework	Shishir Gopinath et.al.	2509.26581	null
2025-09-30	Radio-based Multi-Robot Odometry and Relative Localization	Andrés Martínez-Silva et.al.	2509.26558	null
2025-09-30	DEPTHOR++: Robust Depth Enhancement from a Real-World Lightweight dToF and RGB Guidance	Jijun Xiang et.al.	2509.26498	null
2025-09-30	Side Scan Sonar-based SLAM for Autonomous Algae Farm Monitoring	Julian Valdez et.al.	2509.26121	null
2025-09-30	User-Centric Communication Service Provision for Edge-Assisted Mobile Augmented Reality	Conghao Zhou et.al.	2509.25905	null
2025-09-29	PROFusion: Robust and Accurate Dense Reconstruction via Camera Pose Regression and Optimization	Siyan Dong et.al.	2509.24236	null
2025-09-28	GRS-SLAM3R: Real-Time Dense SLAM with Gated Recurrent State	Guole Shen et.al.	2509.23737	null
2025-09-28	From Fields to Splats: A Cross-Domain Survey of Real-Time Neural Scene Representations	Javed Ahmad et.al.	2509.23555	null
2025-09-27	EKF-Based Fusion of Wi-Fi/LiDAR/IMU for Indoor Localization and Navigation	Zeyi Li et.al.	2509.23118	null
2025-09-26	Good Weights: Proactive, Adaptive Dead Reckoning Fusion for Continuous and Robust Visual SLAM	Yanwei Du et.al.	2509.22910	null
2025-09-26	IMU-Preintegrated Radar Factors for Asynchronous Radar-LiDAR-Inertial SLAM	Johan Hatleskog et.al.	2509.22288	null
2025-09-25	Real-Time Indoor Object SLAM with LLM-Enhanced Priors	Yang Jiao et.al.	2509.21602	null
2025-09-25	PL-VIWO2: A Lightweight, Fast and Robust Visual-Inertial-Wheel Odometry Using Points and Lines	Zhixin Zhang et.al.	2509.21563	null
2025-09-25	AnywhereVLA: Language-Conditioned Exploration and Mobile Manipulation	Konstantin Gubernatorov et.al.	2509.21006	null
2025-11-16	MASt3R-Fusion: Integrating Feed-Forward Visual Model with IMU, GNSS for High-Functionality SLAM	Yuxuan Zhou et.al.	2509.20757	null
2025-09-25	SLAM-Free Visual Navigation with Hierarchical Vision-Language Perception and Coarse-to-Fine Semantic Topological Planning	Guoyang Zhao et.al.	2509.20739	null
2025-09-24	Optical Ocean Recipes: Creating Realistic Datasets to Facilitate Underwater Vision Research	Patricia Schöntag et.al.	2509.20171	null
2025-09-23	Bioinspired SLAM Approach for Unmanned Surface Vehicle	Fabio Coelho et.al.	2509.19522	null
2025-09-23	CU-Multi: A Dataset for Multi-Robot Collaborative Perception	Doncey Albin et.al.	2509.19463	null
2025-09-23	Towards Robust LiDAR Localization: Deep Learning-based Uncertainty Estimation	Minoo Dolatabadi et.al.	2509.18954	null
2025-09-23	An Extended Kalman Filter for Systems with Infinite-Dimensional Measurements	Maxwell M. Varley et.al.	2509.18749	null
2025-09-22	Semantic-Aware Particle Filter for Reliable Vineyard Robot Localisation	Rajitha de Silva et.al.	2509.18342	null
2025-09-22	ProDyG: Progressive Dynamic Scene Reconstruction via Gaussian Splatting from Monocular Videos	Shi Chen et.al.	2509.17864	null
2025-09-21	SLAM-Former: Putting SLAM into One Transformer	Yijun Yuan et.al.	2509.16909	null
2025-09-21	ConfidentSplat: Confidence-Weighted Depth Fusion for Accurate 3D Gaussian Splatting SLAM	Amanuel T. Dufera et.al.	2509.16863	null
2025-09-19	SLaM-DiMM: Shared Latent Modeling for Diffusion Based Missing Modality Synthesis in MRI	Bhavesh Sandbhor et.al.	2509.16019	null
2025-09-19	Omni-LIVO: Robust RGB-Colored Multi-Camera Visual-Inertial-LiDAR Odometry via Photometric Migration and ESIKF Fusion	Yinong Cao et.al.	2509.15673	null
2025-09-19	STARC: See-Through-Wall Augmented Reality Framework for Human-Robot Collaboration in Emergency Response	Shenghai Yuan et.al.	2509.15507	null
2025-09-18	Human Interaction for Collaborative Semantic SLAM using Extended Reality	Laura Ribeiro et.al.	2509.14949	null
2025-09-18	BEV-ODOM2: Enhanced BEV-based Monocular Visual Odometry with PV-BEV Fusion and Dense Flow Supervision for Ground Robots	Yufei Wei et.al.	2509.14636	null
2025-09-18	Event-LAB: Towards Standardized Evaluation of Neuromorphic Localization Methods	Adam D. Hines et.al.	2509.14516	null
2025-10-03	MCGS-SLAM: A Multi-Camera SLAM Framework Using Gaussian Splatting for High-Fidelity Mapping	Zhihao Cao et.al.	2509.14191	null
2025-10-08	BIM Informed Visual SLAM for Construction Monitoring	Asier Bikandi-Noya et.al.	2509.13972	null
2025-09-17	UM-Depth : Uncertainty Masked Self-Supervised Monocular Depth Estimation with Visual Odometry	Tae-Wook Um et.al.	2509.13713	null
2025-09-17	Barometer-Aided Attitude Estimation	Méloné Nyoba Tchonkeu et.al.	2509.13649	null
2025-09-16	Semantic 3D Reconstructions with SLAM for Central Airway Obstruction	Ayberk Acar et.al.	2509.13541	null
2025-09-16	MemGS: Memory-Efficient Gaussian Splatting for Real-Time SLAM	Yinlong Bai et.al.	2509.13536	null
2025-09-18	MATTER: Multiscale Attention for Registration Error Regression	Shipeng Liu et.al.	2509.12924	null
2025-09-16	Match Chat: Real Time Generative AI and Generative Computing for Tennis	Aaron Baughman et.al.	2509.12592	null
2025-09-15	See What I Mean? Mobile Eye-Perspective Rendering for Optical See-through Head-mounted Displays	Gerlinde Emsenhuber et.al.	2509.11653	null
2025-09-15	Gaussian-Plus-SDF SLAM: High-fidelity 3D Reconstruction at 150+ fps	Zhexi Peng et.al.	2509.11574	null
2025-09-28	Autonomous Close-Proximity Photovoltaic Panel Coating Using a Quadcopter	Dimitri Jacquemont et.al.	2509.10979	null
2025-09-13	FastTrack: GPU-Accelerated Tracking for Visual SLAM	Kimia Khabiri et.al.	2509.10757	null
2025-09-12	Robust Localization in Modern Cellular Networks using Global Map Features	Junshi Chen et.al.	2509.10433	null
2025-09-12	Efficient and Accurate Downfacing Visual Inertial Odometry	Jonas Kühne et.al.	2509.10021	null
2025-10-10	SMapper: A Multi-Modal Data Acquisition Platform for SLAM Benchmarking	Pedro Miguel Bastos Soares et.al.	2509.09509	null
2025-09-11	S-BEVLoc: BEV-based Self-supervised Framework for Large-scale LiDAR Global Localization	Chenghao Zhang et.al.	2509.09110	null
2025-09-10	Good Deep Features to Track: Self-Supervised Feature Extraction and Tracking in Visual Odometry	Sai Puneeth Reddy Gottam et.al.	2509.08333	null
2025-09-10	Behaviorally Heterogeneous Multi-Agent Exploration Using Distributed Task Allocation	Nirabhra Mandal et.al.	2509.08242	null
2025-09-10	Deep Visual Odometry for Stereo Event Cameras	Sheng Zhong et.al.	2509.08235	null
2025-09-10	Online Dynamic SLAM with Incremental Smoothing and Mapping	Jesse Morris et.al.	2509.08197	null
2025-09-09	Sensing with Mobile Devices through Radio SLAM: Models, Methods, Opportunities, and Challenges	Yu Ge et.al.	2509.07775	null
2025-11-04	Radar-Based Odometry for Low-Speed Driving	Luis Diener et.al.	2509.07683	null
2025-09-09	Aerial-ground Cross-modal Localization: Dataset, Ground-truth, and Benchmark	Yandi Yang et.al.	2509.07362	null
2025-09-08	Detection and Recovery of Adversarial Slow-Pose Drift in Offloaded Visual-Inertial Odometry	Soruya Saha et.al.	2509.07130	null
2025-09-08	Co-Located VR with Hybrid SLAM-based HMD Tracking and Motion Capture Synchronization	Carlos A. Pinheiro de Sousa et.al.	2509.06582	null
2025-09-15	Real-time Photorealistic Mapping for Situational Awareness in Robot Teleoperation	Ian Page et.al.	2509.06433	null
2025-09-07	DVLO4D: Deep Visual-Lidar Odometry with Sparse Spatial-temporal Fusion	Mengmeng Liu et.al.	2509.06023	null
2025-09-06	Multi-LVI-SAM: A Robust LiDAR-Visual-Inertial Odometry for Multiple Fisheye Cameras	Xinyu Zhang et.al.	2509.05740	null
2025-09-30	LiDAR-BIND-T: Improved and Temporally Consistent Sensor Modality Translation and Fusion for Robotic Applications	Niels Balemans et.al.	2509.05728	null
2025-09-04	Stitching the Story: Creating Panoramic Incident Summaries from Body-Worn Footage	Dor Cohen et.al.	2509.04370	null
2025-09-04	Odometry Calibration and Pose Estimation of a 4WIS4WID Mobile Wall Climbing Robot	Branimir Ćaran et.al.	2509.04016	null
2025-09-03	IL-SLAM: Intelligent Line-assisted SLAM Based on Feature Awareness for Dynamic Environments	Haolan Zhang et.al.	2509.02972	null
2025-09-02	Coral: A Unifying Abstraction Layer for Composable Robotics Software	Steven Swanbeck et.al.	2509.02453	null
2025-09-02	Doctoral Thesis: Geometric Deep Learning For Camera Pose Prediction, Registration, Depth Estimation, and 3D Reconstruction	Xueyang Kang et.al.	2509.01873	null
2025-09-01	ViSTA-SLAM: Visual SLAM with Symmetric Two-view Association	Ganlin Zhang et.al.	2509.01584	null
2025-09-01	FGO-SLAM: Enhancing Gaussian SLAM with Globally Consistent Opacity Radiance Field	Fan Zhu et.al.	2509.01547	null
2025-09-01	SR-SLAM: Scene-reliability Based RGB-D SLAM in Diverse Environments	Haolan Zhang et.al.	2509.01111	null
2025-08-31	DyPho-SLAM : Real-time Photorealistic SLAM in Dynamic Environments	Yi Liu et.al.	2509.00741	null
2025-08-30	AGS: Accelerating 3D Gaussian Splatting SLAM via CODEC-Assisted Frame Covisibility Detection	Houshu He et.al.	2509.00433	null
2025-08-29	The Rosario Dataset v2: Multimodal Dataset for Agricultural Robotics	Nicolas Soncini et.al.	2508.21635	null
2025-08-28	Observer Design for Optical Flow-Based Visual-Inertial Odometry with Almost-Global Convergence	Tarek Bouazza et.al.	2508.21163	null
2025-08-28	Adam SLAM - the last mile of camera calibration with 3DGS	Matthieu Gendrin et.al.	2508.20526	null
2025-08-24	SEER-VAR: Semantic Egocentric Environment Reasoner for Vehicle Augmented Reality	Yuzhi Lai et.al.	2508.17255	null
2025-08-24	VROOM - Visual Reconstruction over Onboard Multiview	Yajat Yadav et.al.	2508.17172	null
2025-08-23	DualReg: Dual-Space Filtering and Reinforcement for Rigid Registration	Jiayi Li et.al.	2508.17034	null
2025-08-23	A Workflow for Map Creation in Autonomous Vehicle Simulations	Zubair Islam et.al.	2508.16856	null
2025-09-12	COSMO-Bench: A Benchmark for Collaborative SLAM Optimization	Daniel McGann et.al.	2508.16731	null
2025-08-22	GPL-SLAM: A Laser SLAM Framework with Gaussian Process Based Extended Landmarks	Ali Emre Balcı et.al.	2508.16459	null
2025-08-21	GelSLAM: A Real-time, High-Fidelity, and Robust 3D Tactile SLAM System	Hung-Jui Huang et.al.	2508.15990	null
2025-08-19	SLAM-based Safe Indoor Exploration Strategy	Omar Mostafa et.al.	2508.14235	null
2025-09-05	Online 3D Gaussian Splatting Modeling with Novel View Selection	Byeonggwon Lee et.al.	2508.14014	null
2025-08-19	ROVER: Robust Loop Closure Verification with Trajectory Prior in Repetitive Environments	Jingwen Yu et.al.	2508.13488	null
2025-08-18	XR-NPE: High-Throughput Mixed-precision SIMD Neural Processing Engine for Extended Reality Perception Workloads	Tejas Chaudhari et.al.	2508.13049	null
2025-08-16	DynamicPose: Real-time and Robust 6D Object Pose Tracking for Fast-Moving Cameras and Objects	Tingbang Liang et.al.	2508.11950	null
2025-08-14	CVIRO: A Consistent and Tightly-Coupled Visual-Inertial-Ranging Odometry on Lie Groups	Yizhi Zhou et.al.	2508.10867	null
2025-08-14	Super LiDAR Reflectance for Robotic Perception	Wei Gao et.al.	2508.10398	null
2025-08-12	Transient Noise Removal via Diffusion-based Speech Inpainting	Mordehay Moradi et.al.	2508.08890	null
2025-08-09	EGS-SLAM: RGB-D Gaussian Splatting SLAM with Events	Siyu Chen et.al.	2508.07003	null
2025-08-07	A Multi-view Landmark Representation Approach with Application to GNSS-Visual-Inertial Odometry	Tong Hua et.al.	2508.05368	null
2025-08-07	Speech LLMs in Low-Resource Scenarios: Data Volume Requirements and the Impact of Pretraining on High-Resource Languages	Seraphina Fong et.al.	2508.05149	null
2025-08-06	Pseudo Depth Meets Gaussian: A Feed-forward RGB SLAM Baseline	Linqing Zhao et.al.	2508.04597	null
2025-10-15	Inland-LOAM: Voxel-Based Structural Semantic LiDAR Odometry and Mapping for Inland Waterway Navigation	Zhongbi Luo et.al.	2508.03672	null
2025-08-04	A Moment Matching-Based Method for Sparse and Noisy Point Cloud Registration	Xingyi Li et.al.	2508.02187	null
2025-08-04	AID4AD: Aerial Image Data for Automated Driving Perception	Daniel Lengerer et.al.	2508.02140	null
2025-08-01	CoProU-VO: Combining Projected Uncertainty for End-to-End Unsupervised Monocular Visual Odometry	Jingchao Xie et.al.	2508.00568	null
2025-07-31	The Monado SLAM Dataset for Egocentric Visual-Inertial Tracking	Mateo de Mayo et.al.	2508.00088	null
2025-07-31	Stereo 3D Gaussian Splatting SLAM for Outdoor Urban Scenes	Xiaohan Li et.al.	2507.23677	null
2025-07-31	DRACo-SLAM2: Distributed Robust Acoustic Communication-efficient SLAM for Imaging Sonar EquippedUnderwater Robot Teams with Object Graph Matching	Yewei Huang et.al.	2507.23629	null
2025-07-31	GSFusion:Globally Optimized LiDAR-Inertial-Visual Mapping for Gaussian Splatting	Jaeseok Park et.al.	2507.23273	null
2025-07-30	Modality-Aware Feature Matching: A Comprehensive Review of Single- and Cross-Modality Techniques	Weide Liu et.al.	2507.22791	null
2025-07-30	UAVScenes: A Multi-Modal Dataset for UAVs	Sijie Wang et.al.	2507.22412	null
2025-07-29	Impact of Underwater Image Enhancement on Feature Matching	Jason M. Summers et.al.	2507.21715	null
2025-07-29	Adaptive Prior Scene-Object SLAM for Dynamic Environments	Haolan Zhang et.al.	2507.21709	null
2025-08-01	Multi-robot LiDAR SLAM: a practical case study in underground tunnel environments	Federica Di Lauro et.al.	2507.21553	null
2025-07-28	$S^3$ LAM: Surfel Splatting SLAM for Geometrically Accurate Tracking and Mapping	Ruoyu Fan et.al.	2507.20854	null
2025-07-28	Large-Scale LiDAR-Inertial Dataset for Degradation-Robust High-Precision Mapping	Xiaofeng Jin et.al.	2507.20516	null
2025-07-26	DOA: A Degeneracy Optimization Agent with Adaptive Pose Compensation Capability based on Deep Reinforcement Learning	Yanbin Li et.al.	2507.19742	null
2025-07-25	DINO-SLAM: DINO-informed RGB-D SLAM for Neural Implicit and Explicit Representations	Ziren Gong et.al.	2507.19474	null
2025-07-25	The Eloquence team submission for task 1 of MLC-SLM challenge	Lorenzo Concina et.al.	2507.19308	null
2025-07-31	SmartPNT-MSF: A Multi-Sensor Fusion Dataset for Positioning and Navigation Research	Feng Zhu et.al.	2507.19079	null
2025-07-25	A Fast and Light-weight Non-Iterative Visual Odometry with RGB-D Cameras	Zheng Yang et.al.	2507.18886	null
2025-07-24	G2S-ICP SLAM: Geometry-aware Gaussian Splatting ICP SLAM	Gyuhyeon Pak et.al.	2507.18344	null
2025-07-23	Physics-based Human Pose Estimation from a Single Moving RGB Camera	Ayce Idil Aytekin et.al.	2507.17406	null
2025-08-01	CasP: Improving Semi-Dense Feature Matching Pipeline Leveraging Cascaded Correspondence Priors for Guidance	Peiqi Chen et.al.	2507.17312	null
2025-07-21	DiffPF: Differentiable Particle Filtering with Generative Sampling via Conditional Diffusion Models	Ziyu Wan et.al.	2507.15716	null
2025-07-21	Dense-depth map guided deep Lidar-Visual Odometry with Sparse Point Clouds and Images	JunYing Huang et.al.	2507.15496	null
2025-07-21	All-UWB SLAM Using UWB Radar and UWB AOA	Charith Premachandra et.al.	2507.15474	null
2025-07-21	BenchDepth: Are We on the Right Way to Evaluate Depth Foundation Models?	Zhenyu Li et.al.	2507.15321	null
2025-07-20	LoopNet: A Multitasking Few-Shot Learning Approach for Loop Closure in Large Scale SLAM	Mohammad-Maher Nakshbandi et.al.	2507.15109	null
2025-11-04	Advances in Feed-Forward 3D Reconstruction and View Synthesis: A Survey	Jiahui Zhang et.al.	2507.14501	null
2025-07-18	SaWa-ML: Structure-Aware Pose Correction and Weight Adaptation-Based Robust Multi-Robot Localization	Junho Choi et.al.	2507.13702	null
2025-07-17	DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model	Maulana Bisyir Azhari et.al.	2507.13145	null
2025-07-17	MoCap2GT: A High-Precision Ground Truth Estimator for SLAM Benchmarking Based on Motion Capture and IMU Fusion	Zichao Shu et.al.	2507.12920	null
2025-07-17	Next-Gen Museum Guides: Autonomous Navigation and Visitor Interaction with an Agentic Robot	Luca Garello et.al.	2507.12273	null
2025-07-16	Tree-SLAM: semantic object SLAM for efficient mapping of individual trees in orchards	David Rapado-Rincon et.al.	2507.12093	null
2025-07-11	Towards Robust Sensor-Fusion Ground SLAM: A Comprehensive Benchmark and A Resilient Framework	Deteng Zhang et.al.	2507.08364	null
2025-07-10	Hardware-Aware Feature Extraction Quantisation for Real-Time Visual Odometry on FPGA Platforms	Mateusz Wasala et.al.	2507.07903	null
2025-07-10	IRAF-SLAM: An Illumination-Robust and Adaptive Feature-Culling Front-End for Visual SLAM in Challenging Environments	Thanh Nguyen Canh et.al.	2507.07752	null
2025-07-09	g2o vs. Ceres: Optimizing Scan Matching in Cartographer SLAM	Quanjie Qiu et.al.	2507.07142	null
2025-07-08	Mapping the Catacombs: An Underwater Cave Segment of the Devil's Eye System	Michalis Chatzispyrou et.al.	2507.06397	null
2025-07-08	Cooperative Mapping, Localization, and Beam Management via Multi-Modal SLAM in ISAC Systems	Hang Que et.al.	2507.05718	null
2025-07-07	Simultaneous Localization and Mapping Using Active mmWave Sensing in 5G NR	Tao Du et.al.	2507.04662	null
2025-07-06	Lidar Variability: A Novel Dataset and Comparative Study of Solid-State and Spinning Lidars	Doumegna Mawuto Koudjo Felix et.al.	2507.04321	null
2025-07-09	Gaussian-LIC2: LiDAR-Inertial-Camera Gaussian Splatting SLAM	Xiaolei Lang et.al.	2507.04004	null
2025-07-04	Outdoor Monocular SLAM with Global Scale-Consistent 3D Gaussian Pointmaps	Chong Cheng et.al.	2507.03737	null
2025-07-01	RaGNNarok: A Light-Weight Graph Neural Network for Enhancing Radar Point Clouds on Unmanned Ground Vehicles	David Hunt et.al.	2507.00937	null
2025-07-01	Generation of Indoor Open Street Maps for Robot Navigation from CAD Files	Jiajie Zhang et.al.	2507.00552	null
2025-06-30	VOCAL: Visual Odometry via ContrAstive Learning	Chi-Yao Huang et.al.	2507.00243	null
2025-06-29	TVG-SLAM: Robust Gaussian Splatting SLAM with Tri-view Geometric Constraints	Zhen Tan et.al.	2506.23207	null
2025-06-29	Event-based Stereo Visual-Inertial Odometry with Voxel Map	Zhaoxing Zhang et.al.	2506.23078	null
2025-06-26	Adaptive Multipath-Based SLAM for Distributed MIMO Systems	Xuhong Li et.al.	2506.21798	null
2025-06-24	Ark: An Open-source Python-based Framework for Robot Learning	Magnus Dierking et.al.	2506.21628	null
2025-06-26	EndoFlow-SLAM: Real-Time Endoscopic SLAM with Flow-Constrained Gaussian Splatting	Taoyu Wu et.al.	2506.21420	null
2025-06-26	CURL-SLAM: Continuous and Compact LiDAR Mapping	Kaicheng Zhang et.al.	2506.21077	null
2025-06-25	SPARK: Graph-Based Online Semantic Integration System for Robot Task Planning	Mimo Shirasaka et.al.	2506.20394	null
2025-06-25	Real-Time Obstacle Avoidance Algorithms for Unmanned Aerial and Ground Vehicles	Jingwen Wei et.al.	2506.20311	null
2025-06-24	Posterior Cramér-Rao Bounds on Localization and Mapping Errors in Distributed MIMO SLAM	Benjamin J. B. Deutschmann et.al.	2506.19957	null
2025-06-23	GRAND-SLAM: Local Optimization for Globally Consistent Large-Scale Multi-Agent Gaussian SLAM	Annika Thomas et.al.	2506.18885	null
2025-06-23	MCN-SLAM: Multi-Agent Collaborative Neural SLAM with Hybrid Implicit Neural Scene Representation	Tianchen Deng et.al.	2506.18678	null
2025-06-24	Multimodal Fusion SLAM with Fourier Attention	Youjie Zhou et.al.	2506.18204	null
2025-06-22	ADA-DPM: A Neural Descriptors-based Adaptive Noise Point Filtering Strategy for SLAM	Yongxin Shao et.al.	2506.18016	null
2025-06-21	Optimizing Exploration with a New Uncertainty Framework for Active SLAM Systems	Sebastian Sansoni et.al.	2506.17775	null
2025-06-18	MCOO-SLAM: A Multi-Camera Omnidirectional Object SLAM System	Miaoxin Pan et.al.	2506.15402	null
2025-06-24	RA-NeRF: Robust Neural Radiance Field Reconstruction with Accurate Camera Pose Estimation under Complex Trajectories	Qingsong Yan et.al.	2506.15242	null
2025-06-18	SHeRLoc: Synchronized Heterogeneous Radar Place Recognition for Cross-Modal Localization	Hanjun Kim et.al.	2506.15175	null
2025-06-18	VIMS: A Visual-Inertial-Magnetic-Sonar SLAM System in Underwater Environments	Bingbing Zhang et.al.	2506.15126	null
2025-06-16	Slanted light-sheet array microscopy for large volume imaging at rates exceeding 100 Hz	Kai Long et.al.	2506.13664	null
2025-06-16	Cognitive Synergy Architecture: SEGO for Human-Centric Collaborative Robots	Jaehong Oh et.al.	2506.13149	null
2025-06-16	A Novel ViDAR Device With Visual Inertial Encoder Odometry and Reinforcement Learning-Based Active SLAM Method	Zhanhua Xin et.al.	2506.13100	null
2025-06-16	SuperPoint-SLAM3: Augmenting ORB-SLAM3 with Deep Features, Adaptive NMS, and Learning-Based Loop Closure	Shahram Najam Syed et.al.	2506.13089	link
2025-06-12	LRSLAM: Low-rank Representation of Signed Distance Fields in Dense Visual SLAM System	Hongbeen Park et.al.	2506.10567	null
2025-06-11	VAULT: A Mobile Mapping System for ROS 2-based Autonomous Robots	Miguel Á. González-Santamarta et.al.	2506.09583	null
2025-06-10	UFM: A Simple Path towards Unified Dense Correspondence with Flow	Yuchen Zhang et.al.	2506.09278	null
2025-06-10	Princeton365: A Diverse Dataset with Accurate Camera Pose	Karhan Kayan et.al.	2506.09035	null
2025-06-10	Planar Collisionless Shock Simulations with Semi-Implicit Particle-in-Cell Model FLEKS	Hongyang Zhou et.al.	2506.08384	null
2025-06-09	ZeroVO: Visual Odometry with Minimal Assumptions	Lei Lai et.al.	2506.08005	null
2025-06-08	Faster than Fast: Accelerating Oriented FAST Feature Detection on Low-end Embedded GPUs	Qiong Chang et.al.	2506.07164	null
2025-06-08	UNO: Unified Self-Supervised Monocular Odometry for Platform-Agnostic Deployment	Wentao Zhao et.al.	2506.07013	null
2025-06-06	GS4: Generalizable Sparse Splatting Semantic SLAM	Mingqi Jiang et.al.	2506.06517	null
2025-06-06	Enhancing Situational Awareness in Underwater Robotics with Multi-modal Spatial Perception	Pushyami Kaveti et.al.	2506.06476	null
2025-06-04	Seeing in the Dark: Benchmarking Egocentric 3D Vision with the Oxford Day-and-Night Dataset	Zirui Wang et.al.	2506.04224	null
2025-06-03	LEG-SLAM: Real-Time Language-Enhanced Gaussian Splatting for SLAM	Roman Titkov et.al.	2506.03073	null
2025-06-03	Online Performance Assessment of Multi-Source-Localization for Autonomous Driving Systems Using Subjective Logic	Stefan Orf et.al.	2506.02932	null
2025-06-03	VTGaussian-SLAM: RGBD SLAM for Large Scale Scenes with Splatting View-Tied 3D Gaussians	Pengchong Hu et.al.	2506.02741	null
2025-06-03	GeneA-SLAM2: Dynamic SLAM with AutoEncoder-Preprocessed Genetic Keypoints Resampling and Depth Variance-Guided Dynamic Region Removal	Shufan Qing et.al.	2506.02736	link
2025-06-03	Olfactory Inertial Odometry: Methodology for Effective Robot Navigation by Scent	Kordel K. France et.al.	2506.02373	null
2025-06-01	Globally Consistent RGB-D SLAM with 2D Gaussian Splatting	Xingguang Zhong et.al.	2506.00970	link
2025-05-30	Black-box Adversarial Attacks on CNN-based SLAM Algorithms	Maria Rafaela Gkeka et.al.	2505.24654	null
2025-05-28	Semantic Exploration and Dense Mapping of Complex Environments using Ground Robots Equipped with LiDAR and Panoramic Camera	Xiaoyang Zhan et.al.	2505.22880	null
2025-05-28	4DTAM: Non-Rigid Tracking and Mapping via Dynamic Surface Gaussians	Hidenobu Matsuki et.al.	2505.22859	null
2025-05-28	UP-SLAM: Adaptively Structured Gaussian SLAM with Uncertainty Prediction in Dynamic Environments	Wancai Zheng et.al.	2505.22335	null
2025-05-27	HS-SLAM: A Fast and Hybrid Strategy-Based SLAM Approach for Low-Speed Autonomous Driving	Bingxiang Kang et.al.	2505.20906	null
2025-05-27	ProBA: Probabilistic Bundle Adjustment with the Bhattacharyya Coefficient	Jason Chui et.al.	2505.20858	null
2025-05-26	ADD-SLAM: Adaptive Dynamic Dense SLAM with Gaussian Splatting	Wenhua Wu et.al.	2505.19420	null
2025-05-25	VPGS-SLAM: Voxel-based Progressive 3D Gaussian SLAM in Large-Scale Scenes	Tianchen Deng et.al.	2505.18992	link
2025-05-23	CU-Multi: A Dataset for Multi-Robot Data Association	Doncey Albin et.al.	2505.17576	null
2025-05-22	TAT-VPR: Ternary Adaptive Transformer for Dynamic and Efficient Visual Place Recognition	Oliver Grainge et.al.	2505.16447	null
2025-05-20	A Methodological Framework for Measuring Spatial Labeling Similarity	Yihang Du et.al.	2505.14128	link
2025-05-22	Place Recognition: A Comprehensive Review, Current Challenges and Future Directions	Zhenyu Li et.al.	2505.14068	link
2025-05-19	eStonefish-scenes: A synthetically generated dataset for underwater event-based optical flow prediction tasks	Jad Mansour et.al.	2505.13309	null
2025-05-23	VGGT-SLAM: Dense RGB SLAM Optimized on the SL(4) Manifold	Dominic Maggio et.al.	2505.12549	null
2025-05-18	Is Semantic SLAM Ready for Embedded Systems ? A Comparative Survey	Calvin Galagain et.al.	2505.12384	null
2025-05-18	Structureless VIO	Junlin Song et.al.	2505.12337	null
2025-05-16	EgoDex: Learning Dexterous Manipulation from Large-Scale Egocentric Video	Ryan Hoque et.al.	2505.11709	null
2025-05-16	Improved Bag-of-Words Image Retrieval with Geometric Constraints for Ground Texture Localization	Aaron Wilhelm et.al.	2505.11620	null
2025-05-16	Robust 2D lidar-based SLAM in arboreal environments without IMU/GNSS	Paola Nazate-Burgos et.al.	2505.10847	null
2025-05-15	TartanGround: A Large-Scale Dataset for Ground Robot Perception and Navigation	Manthan Patel et.al.	2505.10696	null
2025-05-15	A hybrid SLAM-Payne framework for atmospheric parameter and abundance determination of early-type Stars from LAMOST DR9 low-resolution Spectra	Weijia Sun et.al.	2505.10310	null
2025-05-15	Large-Scale Gaussian Splatting SLAM	Zhe Xin et.al.	2505.09915	null
2025-05-13	Automated Meta Prompt Engineering for Alignment with the Theory of Mind	Aaron Baughman et.al.	2505.09024	null
2025-05-13	MDF: Multi-Modal Data Fusion with CNN-Based Object Detection for Enhanced Indoor Localization Using LiDAR-SLAM	Saqi Hussain Kalan et.al.	2505.08388	null
2025-05-13	SKiD-SLAM: Robust, Lightweight, and Distributed Multi-Robot LiDAR SLAM in Resource-Constrained Field Environments	Hogyun Kim et.al.	2505.08230	null
2025-05-12	RDD: Robust Feature Detector and Descriptor using Deformable Transformer	Gonglin Chen et.al.	2505.08013	null
2025-05-12	Ranking-aware Continual Learning for LiDAR Place Recognition	Xufei Wang et.al.	2505.07198	null
2025-05-07	Scalable Aerial GNSS Localization for Marine Robots	Shuo Wen et.al.	2505.04095	link
2025-05-06	Thermal-LiDAR Fusion for Robust Tunnel Localization in GNSS-Denied and Low-Visibility Conditions	Lukas Schichler et.al.	2505.03565	null
2025-05-06	AquaticVision: Benchmarking Visual SLAM in Underwater Environment with Events and Frames	Yifan Peng et.al.	2505.03448	null
2025-05-06	LiftFeat: 3D Geometry-Aware Local Feature Matching	Yepeng Liu et.al.	2505.03422	link
2025-05-05	LiDAR-Inertial SLAM-Based Navigation and Safety-Oriented AI-Driven Control System for Skid-Steer Robots	Mehdi Heydari Shahna et.al.	2505.02598	null
2025-05-04	Robust Localization, Mapping, and Navigation for Quadruped Robots	Dyuman Aditya et.al.	2505.02272	null
2025-05-04	SafeNav: Safe Path Navigation using Landmark Based Localization in a GPS-denied Environment	Ganesh Sapkota et.al.	2505.01956	null
2025-05-03	GauS-SLAM: Dense RGB-D SLAM with Gaussian Surfels	Yongxin Su et.al.	2505.01934	null
2025-05-02	Tightly Coupled Range Inertial Odometry and Mapping with Exact Point Cloud Downsampling	Kenji Koide et.al.	2505.01017	null
2025-04-30	An Underwater, Fault-Tolerant, Laser-Aided Robotic Multi-Modal Dense SLAM System for Continuous Underwater In-Situ Observation	Yaming Ou et.al.	2504.21826	null
2025-04-30	eNCApsulate: NCA for Precision Diagnosis on Capsule Endoscopes	Henry John Krumb et.al.	2504.21562	null
2025-04-29	Large-scale visual SLAM for in-the-wild videos	Shuo Sun et.al.	2504.20496	null
2025-04-28	Transformation & Translation Occupancy Grid Mapping: 2-Dimensional Deep Learning Refined SLAM	Leon Davies et.al.	2504.19654	null
2025-04-28	GAN-SLAM: Real-Time GAN Aided Floor Plan Creation Through SLAM	Leon Davies et.al.	2504.19653	null
2025-04-28	GSFF-SLAM: 3D Semantic Gaussian Splatting SLAM via Feature Field	Zuxing Lu et.al.	2504.19409	null
2025-04-27	Beyond Physical Reach: Comparing Head- and Cane-Mounted Cameras for Last-Mile Navigation by Blind Users	Apurv Varshney et.al.	2504.19345	null
2025-04-27	NANO-SLAM : Natural Gradient Gaussian Approximation for Vehicle SLAM	Tianyi Zhang et.al.	2504.19195	null
2025-04-27	MISO: Multiresolution Submap Optimization for Efficient Globally Consistent Neural Implicit Reconstruction	Yulun Tian et.al.	2504.19104	null
2025-04-25	Certifiably-Correct Mapping for Safe Navigation Despite Odometry Drift	Devansh R. Agrawal et.al.	2504.18713	null
2025-04-25	Range-based 6-DoF Monte Carlo SLAM with Gradient-guided Particle Filter on GPU	Takumi Nakao et.al.	2504.18056	null
2025-04-24	Autonomous Navigation Of Quadrupeds Using Coverage Path Planning	Alexander James Becoy et.al.	2504.17880	null
2025-04-24	BIM-Constrained Optimization for Accurate Localization and Deviation Correction in Construction Monitoring	Asier Bikandi et.al.	2504.17693	null
2025-04-24	Occlusion-Aware Self-Supervised Monocular Depth Estimation for Weak-Texture Endoscopic Images	Zebo Huang et.al.	2504.17582	null
2025-04-24	Bias-Eliminated PnP for Stereo Visual Odometry: Provably Consistent and Large-Scale Localization	Guangyang Zeng et.al.	2504.17410	null
2025-04-24	EdgePoint2: Compact Descriptors for Superior Efficiency and Accuracy	Haodi Yao et.al.	2504.17280	null
2025-04-23	ToF-Splatting: Dense SLAM using Sparse Time-of-Flight Depth and Multi-Frame Integration	Andrea Conti et.al.	2504.16545	null
2025-04-22	DERD-Net: Learning Depth from Event-based Ray Densities	Diego de Oliveira Hitzges et.al.	2504.15863	null
2025-04-23	SLAM-Based Navigation and Fault Resilience in a Surveillance Quadcopter with Embedded Vision Systems	Abhishek Tyagi et.al.	2504.15305	null
2025-04-20	Back on Track: Bundle Adjustment for Dynamic Scene Reconstruction	Weirong Chen et.al.	2504.14516	null
2025-04-20	SG-Reg: Generalizable and Efficient Scene Graph Registration	Chuhao Liu et.al.	2504.14440	link
2025-04-19	Unreal Robotics Lab: A High-Fidelity Robotics Simulator with Advanced Physics and Rendering	Jonathan Embley-Riches et.al.	2504.14135	null
2025-04-16	An Online Adaptation Method for Robust Depth Estimation and Visual Odometry in the Open World	Xingwu Ji et.al.	2504.11698	link
2025-04-18	Doppler-SLAM: Doppler-Aided Radar-Inertial and LiDAR-Inertial Simultaneous Localization and Mapping	Dong Wang et.al.	2504.11634	link
2025-04-14	Region Based SLAM-Aware Exploration: Efficient and Robust Autonomous Mapping Strategy That Can Scale	Megha Maheshwari et.al.	2504.10416	null
2025-04-14	RoboCup Rescue 2025 Team Description Paper UruBots	Kevin Farias et.al.	2504.09778	null
2025-04-11	FindAnything: Open-Vocabulary and Object-Centric Mapping for Robot Exploration in Any Environment	Sebastián Barbas Laina et.al.	2504.08603	null
2025-04-11	PNE-SGAN: Probabilistic NDT-Enhanced Semantic Graph Attention Network for LiDAR Loop Closure Detection	Xiong Li et.al.	2504.08280	null
2025-04-11	II-NVM: Enhancing Map Accuracy and Consistency with Normal Vector-Assisted Mapping	Chengwei Zhao et.al.	2504.08204	link
2025-04-10	UWB Anchor Based Localization of a Planetary Rover	Andreas Nüchter et.al.	2504.07658	null
2025-04-10	Event Signal Filtering via Probability Flux Estimation	Jinze Chen et.al.	2504.07503	null
2025-04-07	Embracing Dynamics: Dynamics-aware 4D Gaussian Splatting SLAM	Zhicong Sun et.al.	2504.04844	link
2025-04-06	SELC: Self-Supervised Efficient Local Correspondence Learning for Low Quality Images	Yuqing Wang et.al.	2504.04497	null
2025-04-06	VSLAM-LAB: A Comprehensive Framework for Visual SLAM Methods and Datasets	Alejandro Fontan et.al.	2504.04457	link
2025-04-05	Nonlinear Observer Design for Landmark-Inertial Simultaneous Localization and Mapping	Mouaad Boughellaba et.al.	2504.04239	null
2025-04-04	WildGS-SLAM: Monocular Gaussian Splatting SLAM in Dynamic Environments	Jianhao Zheng et.al.	2504.03886	null
2025-04-03	SLACK: Attacking LiDAR-based SLAM with Adversarial Point Injections	Prashant Kumar et.al.	2504.03089	null
2025-04-03	Multimodal Fusion and Vision-Language Models: A Survey for Robot Vision	Xiaofeng Han et.al.	2504.02477	null
2025-04-03	MonoGS++: Fast and Accurate Monocular RGB Gaussian SLAM	Renwu Li et.al.	2504.02437	null
2025-04-02	A Chefs KISS -- Utilizing semantic information in both ICP and SLAM framework	Sven Ochs et.al.	2504.02086	null
2025-04-01	Semantic SLAM with Rolling-Shutter Cameras and Low-Precision INS in Outdoor Environments	Yuchen Zhang et.al.	2504.01997	null
2025-04-02	Strengthening Multi-Robot Systems for SAR: Co-Designing Robotics and Communication Towards 6G	Juan Bravo-Arrabal et.al.	2504.01940	null
2025-04-02	Dynamic Initialization for LiDAR-inertial SLAM	Jie Xu et.al.	2504.01451	link
2025-04-02	ForestVO: Enhancing Visual Odometry in Forest Environments through ForestGlue	Thomas Pritchard et.al.	2504.01261	link
2025-03-31	SuperEvent: Cross-Modal Learning of Event-based Keypoint Detection	Yannick Burkhardt et.al.	2504.00139	null
2025-03-30	A Visual-Inertial Motion Prior SLAM for Dynamic Environments	Weilong Sun et.al.	2503.23429	null
2025-03-30	AnyCam: Learning to Recover Camera Poses and Intrinsics from Casual Videos	Felix Wimbauer et.al.	2503.23282	link
2025-03-27	HS-SLAM: Hybrid Representation with Structural Supervision for Improved Dense SLAM	Ziren Gong et.al.	2503.21778	null
2025-03-27	STAMICS: Splat, Track And Map with Integrated Consistency and Semantics for Dense RGB-D SLAM	Yongxu Wang et.al.	2503.21425	null
2025-03-25	Scene-agnostic Pose Regression for Visual Localization	Junwei Zheng et.al.	2503.19543	null
2025-03-25	First Results on UAV-aided User Localization Using ToA and OpenAirInterface in 5G NR	Omid Esrafilian et.al.	2503.19529	null
2025-03-25	MM-LINS: a Multi-Map LiDAR-Inertial System for Over-Degenerate Environments	Yongxin Ma et.al.	2503.19506	link
2025-03-24	Cooperative Control of Multi-Quadrotors for Transporting Cable-Suspended Payloads: Obstacle-Aware Planning and Event-Based Nonlinear Model Predictive Control	Tohid Kargar Tasooji et.al.	2503.19135	null
2025-03-24	GI-SLAM: Gaussian-Inertial SLAM	Xulang Liu et.al.	2503.18275	null
2025-03-22	LightLoc: Learning Outdoor LiDAR Localization at Light Speed	Wen Li et.al.	2503.17814	link
2025-03-21	Autonomous Exploration-Based Precise Mapping for Mobile Robots through Stepwise and Consistent Motions	Muhua Zhang et.al.	2503.17005	null
2025-03-20	4D Gaussian Splatting SLAM	Yanyan Li et.al.	2503.16710	null
2025-03-20	Speeding up design and making to reduce time-to-project and time-to-market: an AI-Enhanced approach in engineering education	Giovanni Adorni et.al.	2503.16307	null
2025-03-20	Loop Closure from Two Views: Revisiting PGO for Scalable Trajectory Estimation through Monocular Priors	Tian Yi Lim et.al.	2503.16275	null
2025-03-19	A Sigma Point-based Low Complexity Algorithm for Multipath-based SLAM in MIMO Systems	Anna Masiero et.al.	2503.15286	null
2025-03-19	ChatStitch: Visualizing Through Structures via Surround-View Unsupervised Deep Image Stitching with Collaborative LLM-Agents	Hao Liang et.al.	2503.14948	null
2025-03-18	3D Densification for Multi-Map Monocular VSLAM in Endoscopy	X. Anadón et.al.	2503.14346	null
2025-03-18	GeoFlow-SLAM: A Robust Tightly-Coupled RGBD-Inertial Fusion SLAM for Dynamic Legged Robotics	Tingyang Xiao et.al.	2503.14247	link
2025-03-18	A-SCoRe: Attention-based Scene Coordinate Regression for wide-ranging scenarios	Huy-Hoang Bui et.al.	2503.13982	link
2025-03-17	Digital Beamforming Enhanced Radar Odometry	Jingqi Jiang et.al.	2503.13252	link
2025-03-17	Dynamic-Dark SLAM: RGB-Thermal Cooperative Robot Vision Strategy for Multi-Person Tracking in Both Well-Lit and Low-Light Scenes	Tatsuro Sakai et.al.	2503.12768	null
2025-03-16	KISS-SLAM: A Simple, Robust, and Accurate 3D LiDAR SLAM System With Enhanced Generalization Capabilities	Tiziano Guadagnino et.al.	2503.12660	null
2025-03-16	Deblur Gaussian Splatting SLAM	Francesco Girlanda et.al.	2503.12572	null
2025-03-16	M2UD: A Multi-model, Multi-scenario, Uneven-terrain Dataset for Ground Robot with Localization and Mapping Evaluation	Yanpeng Jia et.al.	2503.12387	null
2025-03-13	OSMa-Bench: Evaluating Open Semantic Mapping Under Varying Lighting Conditions	Maxim Popov et.al.	2503.10331	null
2025-03-12	Online Language Splatting	Saimouli Katragadda et.al.	2503.09447	null
2025-03-12	MonoSLAM: Robust Monocular SLAM with Global Structure Optimization	Bingzheng Jiang et.al.	2503.09296	null
2025-03-11	Keypoint Detection and Description for Raw Bayer Images	Jiakai Lin et.al.	2503.08673	null
2025-03-11	GigaSLAM: Large-Scale Monocular SLAM with Hierachical Gaussian Splats	Kai Deng et.al.	2503.08071	link
2025-03-10	POp-GS: Next Best View in 3D-Gaussian Splatting with P-Optimality	Joey Wilson et.al.	2503.07819	null
2025-03-08	HIPPO-MAT: Decentralized Task Allocation Using GraphSAGE and Multi-Agent Deep Reinforcement Learning	Lavanya Ratnabala et.al.	2503.07662	null
2025-03-10	AirSwarm: Enabling Cost-Effective Multi-UAV Research with COTS drones	Xiaowei Li et.al.	2503.06890	link
2025-03-08	InfoFusion Controller: Informed TRRT Star with Mutual Information based on Fusion of Pure Pursuit and MPC for Enhanced Path Planning	Seongjun Choi et.al.	2503.06010	link
2025-03-07	THE-SEAN: A Heart Rate Variation-Inspired Temporally High-Order Event-Based Visual Odometry with Self-Supervised Spiking Event Accumulation Networks	Chaoran Xiong et.al.	2503.05112	null
2025-03-07	Adaptive-LIO: Enhancing Robustness and Precision through Environmental Adaptation in LiDAR Inertial Odometry	Chengwei Zhao et.al.	2503.05077	link
2025-03-06	MarsLGPR: Mars Rover Localization with Ground Penetrating Radar	Anja Sheppard et.al.	2503.04944	null
2025-03-06	On the Connection Between Magnetic-Field Odometry Aided Inertial Navigation and Magnetic-Field SLAM	Isaac Skog et.al.	2503.04286	null
2025-03-06	Geometry-Constrained Monocular Scale Estimation Using Semantic Segmentation for Dynamic Scenes	Hui Zhang et.al.	2503.04235	null
2025-03-06	DVM-SLAM: Decentralized Visual Monocular Simultaneous Localization and Mapping for Multi-Agent Systems	Joshua Bird et.al.	2503.04126	null
2025-03-05	Equivariant Filter Design for Range-only SLAM	Yixiao Ge et.al.	2503.03973	null
2025-03-05	Direct Sparse Odometry with Continuous 3D Gaussian Maps for Indoor Environments	Jie Deng et.al.	2503.03373	link
2025-03-05	OpenGV 2.0: Motion prior-assisted calibration and SLAM with vehicle-mounted surround-view systems	Kun Huang et.al.	2503.03230	null
2025-03-05	Distributed Certifiably Correct Range-Aided SLAM	Alexander Thoms et.al.	2503.03192	link
2025-03-04	Introspective Loop Closure for SLAM with 4D Imaging Radar	Maximilian Hilger et.al.	2503.02383	null
2025-03-04	DQO-MAP: Dual Quadrics Multi-Object mapping with Gaussian Splatting	Haoyuan Li et.al.	2503.02223	link
2025-03-03	Constraint-Based Modeling of Dynamic Entities in 3D Scene Graphs for Robust SLAM	Marco Giberna et.al.	2503.02050	null
2025-03-03	vS-Graphs: Integrating Visual SLAM and Situational Graphs through Multi-level Scene Understanding	Ali Tourani et.al.	2503.01783	null
2025-03-03	MUSt3R: Multi-view Network for Stereo 3D Reconstruction	Yohann Cabon et.al.	2503.01661	link
2025-03-03	OpenGS-SLAM: Open-Set Dense Semantic SLAM with 3D Gaussian Splatting for Object-Level Scene Understanding	Dianyi Yang et.al.	2503.01646	null
2025-03-03	MLINE-VINS: Robust Monocular Visual-Inertial SLAM With Flow Manhattan and Line Features	Chao Ye et.al.	2503.01571	link
2025-03-03	AI-Driven Relocation Tracking in Dynamic Kitchen Environments	Arash Nasr Esfahani et.al.	2503.01547	link
2025-03-03	Exo-ViHa: A Cross-Platform Exoskeleton System with Visual and Haptic Feedback for Efficient Dexterous Skill Learning	Xintao Chao et.al.	2503.01543	null
2025-03-03	RUSSO: Robust Underwater SLAM with Sonar Optimization against Visual Degradation	Shu Pan et.al.	2503.01434	null
2025-02-27	BEV-DWPVO: BEV-based Differentiable Weighted Procrustes for Low Scale-drift Monocular Visual Odometry on Ground	Yufei Wei et.al.	2502.20078	null
2025-02-26	Increasing the Task Flexibility of Heavy-Duty Manipulators Using Visual 6D Pose Estimation of Objects	Petri Mäkinen et.al.	2502.19169	null
2025-02-26	SLAM in the Dark: Self-Supervised Learning of Pose, Depth and Loop-Closure from Thermal Images	Yangfan Xu et.al.	2502.18932	null
2025-02-25	S-Graphs 2.0 -- A Hierarchical-Semantic Optimization and Loop Closure for SLAM	Hriday Bavle et.al.	2502.18044	link
2025-02-25	MegaLoc: One Retrieval to Place Them All	Gabriele Berton et.al.	2502.17237	link
2025-02-24	SLABIM: A SLAM-BIM Coupled Dataset in HKUST Main Building	Haoming Huang et.al.	2502.16856	link
2025-02-27	Orchestrating Joint Offloading and Scheduling for Low-Latency Edge SLAM	Yao Zhang et.al.	2502.16495	null
2025-02-19	Slamming: Training a Speech Language Model on One GPU in a Day	Gallil Maimon et.al.	2502.15814	link
2025-02-21	RGB-Only Gaussian Splatting SLAM for Unbounded Outdoor Scenes	Sicheng Yu et.al.	2502.15633	null
2025-02-20	Hier-SLAM++: Neuro-Symbolic Semantic SLAM with a Hierarchically Categorical Gaussian Splatting	Boying Li et.al.	2502.14931	null
2025-02-19	3D Gaussian Splatting aided Localization for Large and Complex Indoor-Environments	Vincent Ress et.al.	2502.13803	null
2025-02-19	Active Illumination for Visual Ego-Motion Estimation in the Dark	Francesco Crocetti et.al.	2502.13708	null
2025-02-17	From Gaming to Research: GTA V for Synthetic Data Generation for Robotics and Navigations	Matteo Scucchia et.al.	2502.12303	null
2025-02-19	pySLAM: An Open-Source, Modular, and Extensible Framework for SLAM	Luigi Freda et.al.	2502.11955	link
2025-02-17	Anti-Degeneracy Scheme for Lidar SLAM based on Particle Filter in Geometry Feature-Less Environments	Yanbin Li et.al.	2502.11486	null
2025-02-16	GS-GVINS: A Tightly-integrated GNSS-Visual-Inertial Navigation System Augmented by 3D Gaussian Splatting	Zelin Zhou et.al.	2502.10975	null
2025-02-19	MonoForce: Learnable Image-conditioned Physics Engine	Ruslan Agishev et.al.	2502.10156	link
2025-02-13	Vision-based Geo-Localization of Future Mars Rotorcraft in Challenging Illumination Conditions	Dario Pisanti et.al.	2502.09795	null
2025-02-13	DenseSplat: Densifying Gaussian Splatting SLAM with Neural Radiance Prior	Mingrui Li et.al.	2502.09111	null
2025-02-12	LIR-LIVO: A Lightweight,Robust LiDAR/Vision/Inertial Odometry with Illumination-Resilient Deep Features	Shujie Zhou et.al.	2502.08676	link
2025-02-10	Occupancy-SLAM: An Efficient and Robust Algorithm for Simultaneously Optimizing Robot Poses and Occupancy Map	Yingyu Wang et.al.	2502.06292	link
2025-02-09	PINGS: Gaussian Splatting Meets Distance Fields within a Point-Based Implicit Neural Map	Yue Pan et.al.	2502.05752	link
2025-02-07	Joint State and Noise Covariance Estimation	Kasra Khosoussi et.al.	2502.04584	null
2025-02-05	GARAD-SLAM: 3D GAussian splatting for Real-time Anti Dynamic SLAM	Mingrui Li et.al.	2502.03228	null
2025-02-04	SiLVR: Scalable Lidar-Visual Radiance Field Reconstruction with Uncertainty Quantification	Yifu Tao et.al.	2502.02657	null
2025-02-04	HeRCULES: Heterogeneous Radar Dataset in Complex Urban Environment for Multi-session Radar SLAM	Hanjun Kim et.al.	2502.01946	null
2025-02-03	Statistical enhance learning for modeling and prediction tennis matches at Grand Slam tournaments	Nourah Buhamra et.al.	2502.01613	null
2025-02-03	Enhancing Feature Tracking Reliability for Visual Navigation using Real-Time Safety Filter	Dabin Kim et.al.	2502.01092	null
2025-02-01	FlexCloud: Direct, Modular Georeferencing and Drift-Correction of Point Cloud Maps	Maximilian Leitenstern et.al.	2502.00395	link
2025-01-31	LiDAR Loop Closure Detection using Semantic Graphs with Graph Attention Networks	Liudi Yang et.al.	2501.19382	link
2025-01-31	Advancing Dense Endoscopic Reconstruction with Gaussian Splatting-driven Surface Normal-aware Tracking and Mapping	Yiming Huang et.al.	2501.19319	link
2025-01-31	GO: The Great Outdoors Multimodal Dataset	Peng Jiang et.al.	2501.19274	null
2025-01-30	Lifelong 3D Mapping Framework for Hand-held & Robot-mounted LiDAR Mapping Systems	Liudi Yang et.al.	2501.18110	null
2025-01-28	SSF-PAN: Semantic Scene Flow-Based Perception for Autonomous Navigation in Traffic Scenarios	Yinqi Chen et.al.	2501.16754	null
2025-01-27	Visual-Lidar Map Alignment for Infrastructure Inspections	Jake McLaughlin et.al.	2501.14486	link
2025-01-24	Scalable Benchmarking and Robust Learning for Noise-Free Ego-Motion and 3D Reconstruction from Noisy Video	Xiaohao Xu et.al.	2501.14319	link
2025-01-24	HAMMER: Heterogeneous, Multi-Robot Semantic Gaussian Splatting	Javier Yu et.al.	2501.14147	null
2025-01-23	FAST-LIVO2 on Resource-Constrained Platforms: LiDAR-Inertial-Visual Odometry with Efficient Memory and Computation	Bingyang Zhou et.al.	2501.13876	null
2025-01-23	VIGS SLAM: IMU-based Large-Scale 3D Gaussian Splatting SLAM	Gyuhyeon Pak et.al.	2501.13402	null
2025-01-22	Grid-based Submap Joining: An Efficient Algorithm for Simultaneously Optimizing Global Occupancy Map and Local Submap Frames	Yingyu Wang et.al.	2501.12764	null
2025-01-21	DynoSAM: Open-Source Smoothing and Mapping Framework for Dynamic SLAM	Jesse Morris et.al.	2501.11893	link
2025-01-21	Survey on Monocular Metric Depth Estimation	Jiuling Zhang et.al.	2501.11841	null
2025-01-19	OpenLiDARMap: Zero-Drift Point Cloud Mapping using Map Priors	Dominik Kulmer et.al.	2501.11111	link
2025-01-19	Factor Graph-Based Active SLAM for Spacecraft Proximity Operations	Lorenzo Ticozzi et.al.	2501.10950	null
2025-01-23	Mesh2SLAM in VR: A Fast Geometry-Based SLAM Framework for Rapid Prototyping in Virtual Reality Applications	Carlos Augusto Pinheiro de Sousa et.al.	2501.09600	null
2025-01-16	Comparison of Various SLAM Systems for Mobile Robot in an Indoor Environment	Maksim Filipenko et.al.	2501.09490	null
2025-01-15	Unified Few-shot Crack Segmentation and its Precise 3D Automatic Measurement in Concrete Structures	Pengru Deng et.al.	2501.09203	null
2025-01-15	AutoLoop: Fast Visual SLAM Fine-tuning through Agentic Curriculum Learning	Assaf Lahiany et.al.	2501.09160	null
2025-01-15	SLC $^2$ -SLAM: Semantic-guided Loop Closure with Shared Latent Code for NeRF SLAM	Yuhang Ming et.al.	2501.08880	null
2025-01-15	GS-LIVO: Real-Time LiDAR, Inertial, and Visual Multi-sensor Fused Odometry with Gaussian Mapping	Sheng Hong et.al.	2501.08672	null
2025-01-16	BRIGHT-VO: Brightness-Guided Hybrid Transformer for Visual Odometry with Multi-modality Refinement Module	Dongzhihan Wang et.al.	2501.08659	null
2025-01-15	Self-Organizing Edge Computing Distribution Framework for Visual SLAM	Jussi Kalliola et.al.	2501.08629	null
2025-01-14	VINGS-Mono: Visual-Inertial Gaussian Splatting Monocular SLAM in Large Scenes	Ke Wu et.al.	2501.08286	null
2025-01-13	Efficiently Closing Loops in LiDAR-Based SLAM Using Point Cloud Density Maps	Saurabh Gupta et.al.	2501.07399	null
2025-01-13	SplatMAP: Online Dense Monocular SLAM with 3D Gaussian Splatting	Yue Hu et.al.	2501.07015	null
2025-01-12	CULTURE3D: Cultural Landmarks and Terrain Dataset for 3D Applications	Xinyi Zheng et.al.	2501.06927	link
2025-01-11	SP-SLAM: Neural Real-Time Dense SLAM With Scene Priors	Zhen Hong et.al.	2501.06469	null
2025-01-09	Scaffold-SLAM: Structured 3D Gaussians for Simultaneous Localization and Photorealistic Mapping	Wen Tianci et.al.	2501.05242	null
2025-01-07	SLAM: Towards Efficient Multilingual Reasoning via Selective Language Alignment	Yuchun Fan et.al.	2501.03681	link
2025-01-06	HaWoR: World-Space Hand Motion Reconstruction from Egocentric Videos	Jinglei Zhang et.al.	2501.02973	null
2025-01-09	LP-ICP: General Localizability-Aware Point Cloud Registration for Robust Localization in Extreme Unstructured Environments	Haosong Yue et.al.	2501.02580	link
2025-01-04	ROLO-SLAM: Rotation-Optimized LiDAR-Only SLAM in Uneven Terrain with Ground Vehicle	Yinchuan Wang et.al.	2501.02166	link
2024-12-31	PanoSLAM: Panoptic 3D Scene Reconstruction via Gaussian SLAM	Runnan Chen et.al.	2501.00352	null
2024-12-30	Hierarchical Pose Estimation and Mapping with Multi-Scale Neural Feature Fields	Evgenii Kruzhkov et.al.	2412.20976	null
2024-12-28	MambaVO: Deep Visual Odometry Based on Sequential Matching Refinement and Training Smoothing	Shuo Wang et.al.	2412.20082	null
2024-12-27	DAS3R: Dynamics-Aware Gaussian Splatting for Static Scene Reconstruction	Kai Xu et.al.	2412.19584	null
2024-12-26	MVS-GS: High-Quality 3D Gaussian Splatting Mapping via Online Multi-View Stereo	Byeonggwon Lee et.al.	2412.19130	null
2024-12-23	End-to-end Generative Spatial-Temporal Ultrasonic Odometry and Mapping Framework	Fuhua Jia et.al.	2412.17343	null
2024-12-23	LMD-PGN: Cross-Modal Knowledge Distillation from First-Person-View Images to Third-Person-View BEV Maps for Universal Point Goal Navigation	Riku Uemura et.al.	2412.17282	null
2024-12-23	Selective Kalman Filter: When and How to Fuse Multi-Sensor Information to Overcome Degeneracy in SLAM	Jie Xu et.al.	2412.17235	null
2025-01-03	Leveraging Consistent Spatio-Temporal Correspondence for Robust Visual Odometry	Zhaoxing Zhang et.al.	2412.16923	link
2024-12-21	Query Quantized Neural SLAM	Sijia Jiang et.al.	2412.16476	link
2024-12-20	SLAM-Omni: Timbre-Controllable Voice Interaction System with Single-Stage Training	Wenxi Chen et.al.	2412.15649	link
2024-12-18	Energy-Efficient SLAM via Joint Design of Sensing, Communication, and Exploration Speed	Zidong Han et.al.	2412.13912	null
2024-12-18	Immersive Human-in-the-Loop Control: Real-Time 3D Surface Meshing and Physics Simulation	Sait Akturk et.al.	2412.13752	null
2024-12-18	4D Radar-Inertial Odometry based on Gaussian Modeling and Multi-Hypothesis Scan Matching	Fernando Amodeo et.al.	2412.13639	link
2024-12-17	NFL-BA: Improving Endoscopic SLAM with Near-Field Light Bundle Adjustment	Andrea Dunn Beltran et.al.	2412.13176	null
2024-12-18	Dyn-HaMR: Recovering 4D Interacting Hand Motion from a Dynamic Camera	Zhengdi Yu et.al.	2412.12861	null
2024-12-16	Global SLAM in Visual-Inertial Systems with 5G Time-of-Arrival Integration	Meisam Kabiri et.al.	2412.12406	null
2024-12-16	MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction Priors	Riku Murai et.al.	2412.12392	null
2024-12-16	Sonar-based Deep Learning in Underwater Robotics: Overview, Robustness and Challenges	Martin Aubard et.al.	2412.11840	null
2024-12-19	RoMeO: Robust Metric Visual Odometry	Junda Cheng et.al.	2412.11530	null
2024-12-14	Affine EKF: Exploring and Utilizing Sufficient and Necessary Conditions for Observability Maintenance to Improve EKF Consistency	Yang Song et.al.	2412.10809	link
2024-12-13	RP-SLAM: Real-time Photorealistic SLAM with Efficient 3D Gaussian Splatting	Lizhi Bai et.al.	2412.09868	null
2024-12-12	SLAM3R: Real-Time Dense Scene Reconstruction from Monocular RGB Videos	Yuzheng Liu et.al.	2412.09401	link
2024-12-12	eCARLA-scenes: A synthetically generated dataset for event-based optical flow prediction	Jad Mansour et.al.	2412.09209	link
2024-12-12	Drift-free Visual SLAM using Digital Twins	Roxane Merat et.al.	2412.08496	null
2024-12-10	A Real-time Degeneracy Sensing and Compensation Method for Enhanced LiDAR SLAM	Zongbo Liao et.al.	2412.07513	null
2024-12-08	DiTer++: Diverse Terrain and Multi-modal Dataset for Multi-Robot SLAM in Multi-session Environments	Juwon Kim et.al.	2412.05839	null
2024-12-06	MegaSaM: Accurate, Fast, and Robust Structure and Motion from Casual Dynamic Videos	Zhengqi Li et.al.	2412.04463	null
2024-12-05	Multi-cam Multi-map Visual Inertial Localization: System, Validation and Dataset	Fuzhang Han et.al.	2412.04287	link
2024-12-10	MOANA: Multi-Radar Dataset for Maritime Odometry and Autonomous Navigation Application	Hyesu Jang et.al.	2412.03887	null
2024-12-04	Large-Scale Dense 3D Mapping Using Submaps Derived From Orthogonal Imaging Sonars	John McConnell et.al.	2412.03760	null
2024-12-04	BIMCaP: BIM-based AI-supported LiDAR-Camera Pose Refinement	Miguel Arturo Vega Torres et.al.	2412.03434	link
2024-12-04	NeRF and Gaussian Splatting SLAM in the Wild	Fabian Schmidt et.al.	2412.03263	link
2024-12-04	MCVO: A Generic Visual Odometry for Arbitrarily Arranged Multi-Cameras	Huai Yu et.al.	2412.03146	link
2024-12-04	An indoor DSO-based ceiling-vision odometry system for indoor industrial environments	Abdelhak Bougouffa et.al.	2412.02950	null
2024-12-03	ROVER: A Multi-Season Dataset for Visual SLAM	Fabian Schmidt et.al.	2412.02506	link
2024-12-04	RGBDS-SLAM: A RGB-D Semantic Dense SLAM Based on 3D Multi Level Pyramid Gaussian Splatting	Zhenzhong Cao et.al.	2412.01217	link
2024-11-28	Visual SLAMMOT Considering Multiple Motion Models	Peilin Tian et.al.	2411.19134	null
2024-11-27	ORB-SLAM3AB: Augmenting ORB-SLAM3 to Counteract Bumps with Optical Flow Inter-frame Matching	Yangrui Dong et.al.	2411.18174	null
2024-11-27	HI-SLAM2: Geometry-Aware Gaussian SLAM for Fast Monocular Scene Reconstruction	Wei Zhang et.al.	2411.17982	link
2024-11-26	MapEval: Towards Unified, Robust and Efficient SLAM Map Evaluation Framework	Xiangcheng Hu et.al.	2411.17928	link
2024-11-29	DROID-Splat: Combining end-to-end SLAM with 3D Gaussian Splatting	Christian Homeyer et.al.	2411.17660	link
2024-11-25	MAGiC-SLAM: Multi-Agent Gaussian Globally Consistent SLAM	Vladimir Yugay et.al.	2411.16785	null
2024-11-24	Gaussian Scenes: Pose-Free Sparse-View Scene Reconstruction using Depth-Enhanced Diffusion Priors	Soumava Paul et.al.	2411.15966	null
2024-11-24	Near-Range Environmental Perception for Inland Waterway Vessels: A Comparative Study of LiDAR and Automotive FMCW RADAR Sensors	R. Herrmann et.al.	2411.15901	null
2024-11-24	PG-SLAM: Photo-realistic and Geometry-aware RGB-D SLAM in Dynamic Environments	Haoang Li et.al.	2411.15800	null
2024-11-23	Gassidy: Gaussian Splatting SLAM in Dynamic Environments	Long Wen et.al.	2411.15476	null
2024-11-22	OVO-SLAM: Open-Vocabulary Online Simultaneous Localization and Mapping	Tomas Berriel Martins et.al.	2411.15043	link
2024-11-22	A Benchmark Dataset for Collaborative SLAM in Service Environments	Harin Park et.al.	2411.14775	link
2024-11-21	InCrowd-VI: A Realistic Visual-Inertial Dataset for Evaluating SLAM in Indoor Pedestrian-Rich Spaces for Human Navigation	Marziyeh Bamdad et.al.	2411.14358	link
2024-11-20	Robust Monocular Visual Odometry using Curriculum Learning	Assaf Lahiany et.al.	2411.13438	null
2024-11-20	Moving Horizon Estimation for Simultaneous Localization and Mapping with Robust Estimation Error Bounds	Jelena Trisovic et.al.	2411.13310	null
2024-11-19	3D Reconstruction by Looking: Instantaneous Blind Spot Detector for Indoor SLAM through Mixed Reality	Hanbeom Chang et.al.	2411.12514	null
2024-11-19	LiV-GS: LiDAR-Vision Integration for 3D Gaussian Splatting SLAM in Outdoor Environments	Renxiang Xiao et.al.	2411.12185	null
2024-11-18	Exploring Emerging Trends and Research Opportunities in Visual Place Recognition	Antonios Gasteratos et.al.	2411.11481	null
2024-11-18	The Blue Horizontal-Branch Stars From the LAMOST Survey: Atmospheric Parameters	Jie Ju et.al.	2411.11250	null
2024-11-17	A Monocular SLAM-based Multi-User Positioning System with Image Occlusion in Augmented Reality	Wei-Hsiang Lien et.al.	2411.10940	null
2024-11-16	DGS-SLAM: Gaussian Splatting SLAM in Dynamic Environment	Mangyu Kong et.al.	2411.10722	link
2024-11-15	The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual Localisation, Reconstruction and Radiance Field Methods	Yifu Tao et.al.	2411.10546	null
2024-11-15	BEV-ODOM: Reducing Scale Drift in Monocular Visual Odometry with BEV Representation	Yufei Wei et.al.	2411.10195	null
2024-11-13	DG-SLAM: Robust Dynamic Gaussian Splatting SLAM with Hybrid Pose Optimization	Yueming Xu et.al.	2411.08373	null
2024-11-13	MBA-SLAM: Motion Blur Aware Dense Visual SLAM with Radiance Fields Representation	Peng Wang et.al.	2411.08279	link
2024-11-12	Enhanced Monocular Visual Odometry with AR Poses and Integrated INS-GPS for Robust Localization in Urban Environments	Ankit Shaw et.al.	2411.08231	null
2024-11-12	NL-SLAM for OC-VLN: Natural Language Grounded SLAM for Object-Centric VLN	Sonia Raychaudhuri et.al.	2411.07848	null
2024-11-11	Lost in Tracking Translation: A Comprehensive Analysis of Visual SLAM in Human-Centered XR and IoT Ecosystems	Yasra Chandio et.al.	2411.07146	null
2024-11-11	Learning from Feedback: Semantic Enhancement for Object SLAM Using Foundation Models	Jungseok Hong et.al.	2411.06752	null
2024-11-11	HomoMatcher: Dense Feature Matching Results with Semi-Dense Efficiency by Homography Estimation	Xiaolong Wang et.al.	2411.06700	null
2024-11-08	Development of an indoor localization and navigation system based on monocular SLAM for mobile robots	Thanh Nguyen Canh et.al.	2411.05337	null
2024-11-07	Development of a Service Robot for Hospital Environments in Rehabilitation Medicine with LiDAR Based Simultaneous Localization and Mapping	Sayat Ibrayev et.al.	2411.04797	null
2024-11-07	MPVO: Motion-Prior based Visual Odometry for PointGoal Navigation	Sayan Paul et.al.	2411.04796	null
2024-11-09	DEIO: Deep Event Inertial Odometry	Weipeng Guan et.al.	2411.03928	link
2024-11-06	Performance evaluation of SLAM-ASR: The Good, the Bad, the Ugly, and the Way Forward	Shashi Kumar et.al.	2411.03866	null
2024-11-06	LCP-Fusion: A Neural Implicit SLAM with Enhanced Local Constraints and Computable Prior	Jiahui Wang et.al.	2411.03610	link
2024-11-05	LVI-GS: Tightly-coupled LiDAR-Visual-Inertial SLAM using 3D Gaussian Splatting	Huibin Zhao et.al.	2411.02703	null
2024-11-04	Map++: Towards User-Participatory Visual SLAM Systems with Efficient Map Expansion and Sharing	Xinran Zhang et.al.	2411.02553	null
2024-11-04	Semantic Masking and Visual Feature Matching for Robust Localization	Luisa Mao et.al.	2411.01804	null
2024-10-31	XRDSLAM: A Flexible and Modular Framework for Deep Learning based SLAM	Xiaomeng Wang et.al.	2410.23690	link
2024-10-30	LGU-SLAM: Learnable Gaussian Uncertainty Matching with Deformable Correlation Sampling for Deep Visual SLAM	Yucheng Huang et.al.	2410.23231	link
2024-10-30	ISAC Prototype System for Multi-Domain Cooperative Communication Networks	Jie Yang et.al.	2410.22956	null
2024-10-30	SCRREAM : SCan, Register, REnder And Map:A Framework for Annotating Accurate and Dense 3D Indoor Scenes with a Benchmark	HyunJun Jung et.al.	2410.22715	link
2024-10-29	LiVisSfM: Accurate and Robust Structure-from-Motion with LiDAR and Visual Cues	Hanqing Jiang et.al.	2410.22213	null
2024-10-29	EnvoDat: A Large-Scale Multisensory Dataset for Robotic Spatial Awareness and Semantic Reasoning in Heterogeneous Environments	Linus Nwankwo et.al.	2410.22200	null
2024-10-28	NYC-Event-VPR: A Large-Scale High-Resolution Event-Based Visual Place Recognition Dataset in Dense Urban Environments	Taiyi Pan et.al.	2410.21615	link
2024-10-28	coVoxSLAM: GPU Accelerated Globally Consistent Dense SLAM	Emiliano Höss et.al.	2410.21149	link
2024-11-01	RopeTP: Global Human Motion Recovery via Integrating Robust Pose Estimation with Diffusion Trajectory Prior	Mingjiang Liang et.al.	2410.20358	null
2024-10-25	Context-Based Visual-Language Place Recognition	Soojin Woo et.al.	2410.19341	link
2024-10-22	AG-SLAM: Active Gaussian Splatting SLAM	Wen Jiang et.al.	2410.17422	null
2024-10-22	Impact of 3D LiDAR Resolution in Graph-based SLAM Approaches: A Comparative Study	J. Jorge et.al.	2410.17171	null
2024-10-19	EndoMetric: Near-light metric scale monocular SLAM	Raúl Iranzo et.al.	2410.15065	null
2024-10-17	Automatic Navigation and Voice Cloning Technology Deployment on a Humanoid Robot	Dongkun Han et.al.	2410.13612	null
2024-10-17	TRLO: An Efficient LiDAR Odometry with 3D Dynamic Object Tracking and Removal	Yanpeng Jia et.al.	2410.13240	null
2024-10-16	QueensCAMP: an RGB-D dataset for robust Visual SLAM	Hudson M. S. Bruno et.al.	2410.12520	link
2024-10-18	PAPL-SLAM: Principal Axis-Anchored Monocular Point-Line SLAM	Guanghao Li et.al.	2410.12324	null
2024-10-16	Towards Autonomous Indoor Parking: A Globally Consistent Semantic SLAM System and A Semantic Localization Subsystem	Yichen Sha et.al.	2410.12169	null
2024-10-15	V3D-SLAM: Robust RGB-D SLAM in Dynamic Environments with 3D Semantic Geometry Voting	Tuan Dang et.al.	2410.12068	link
2024-10-15	GSORB-SLAM: Gaussian Splatting SLAM benefits from ORB features and Transmittance information	Wancai Zheng et.al.	2410.11356	null
2024-10-15	Multiview Scene Graph	Juexiao Zhang et.al.	2410.11187	link
2024-10-14	MLP-SLAM: Multilayer Perceptron-Based Simultaneous Localization and Mapping With a Dynamic and Static Object Discriminator	Taozhe Li et.al.	2410.10669	null
2024-10-13	Markerless Aerial-Terrestrial Co-Registration of Forest Point Clouds using a Deformable Pose Graph	Benoit Casseau et.al.	2410.09896	null
2024-10-12	SLAM-AAC: Enhancing Audio Captioning with Paraphrasing Augmentation and CLAP-Refine through LLMs	Wenxi Chen et.al.	2410.09503	link
2024-10-12	An Expeditious Spatial Mean Radiant Temperature Mapping Framework using Visual SLAM and Semantic Segmentation	Wei Liang et.al.	2410.09443	null
2024-10-12	ESVO2: Direct Visual-Inertial Odometry with Stereo Event Cameras	Junkai Niu et.al.	2410.09374	link
2024-10-11	Voxel-SLAM: A Complete, Accurate, and Versatile LiDAR-Inertial SLAM System	Zheng Liu et.al.	2410.08935	link
2024-10-11	Optimizing NeRF-based SLAM with Trajectory Smoothness Constraints	Yicheng He et.al.	2410.08780	null
2024-10-10	ROMAN: Open-Set Object Map Alignment for Robust View-Invariant Global Localization	Mason B. Peterson et.al.	2410.08262	link
2024-10-10	IncEventGS: Pose-Free Gaussian Splatting from a Single Event Camera	Jian Huang et.al.	2410.08107	link
2024-10-08	Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching	Gongxin Yao et.al.	2410.06285	null
2024-10-08	Submodular Optimization for Keyframe Selection & Usage in SLAM	David Thorne et.al.	2410.05576	null
2024-10-07	SharpSLAM: 3D Object-Oriented Visual SLAM with Deblurring for Agile Drones	Denis Davletshin et.al.	2410.05405	null
2024-10-07	Enhanced Multi-Robot SLAM System with Cross-Validation Matching and Exponential Threshold Keyframe Selection	Ang He et.al.	2410.05017	null
2024-10-05	A Framework for Reproducible Benchmarking and Performance Diagnosis of SLAM Systems	Nikola Radulov et.al.	2410.04242	link
2024-10-05	High-Speed Stereo Visual SLAM for Low-Powered Computing Devices	Ashish Kumar et.al.	2410.04090	link
2024-10-04	EvenNICER-SLAM: Event-based Neural Implicit Encoding SLAM	Shi Chen et.al.	2410.03812	null
2024-10-04	Estimating Body and Hand Motion in an Ego-sensed World	Brent Yi et.al.	2410.03665	null
2024-10-03	LiDAR Inertial Odometry And Mapping Using Learned Registration-Relevant Features	Zihao Dong et.al.	2410.02961	null
2024-10-02	ReFeree: Radar-Based Lightweight and Robust Localization using Feature and Free space	Hogyun Kim et.al.	2410.01325	null
2024-10-01	Under Pressure: Altimeter-Aided ICP for 3D Maps Consistency	William Dubois et.al.	2410.00758	null
2024-10-02	CaRtGS: Computational Alignment for Real-Time Gaussian Splatting SLAM	Dapeng Feng et.al.	2410.00486	link
2024-09-30	Additively Manufactured Open-Source Quadruped Robots for Multi-Robot SLAM Applications	Zachary Fuge et.al.	2410.00122	null
2024-09-30	Direct Multipath-Based SLAM	Mingchao Liang et.al.	2409.20552	null
2024-09-30	Robust Gaussian Splatting SLAM by Leveraging Loop Closure	Zunjie Zhu et.al.	2409.20111	null
2024-09-30	DynORecon: Dynamic Object Reconstruction for Navigation	Yiduo Wang et.al.	2409.19928	null
2024-09-29	CELLmap: Enhancing LiDAR SLAM through Elastic and Lightweight Spherical Map Representation	Yifan Duan et.al.	2409.19597	null
2024-09-29	CoT-ST: Enhancing LLM-based Speech Translation with Multimodal Chain-of-Thought	Yexing Du et.al.	2409.19510	link
2024-09-29	Fast-UMI: A Scalable and Hardware-Independent Universal Manipulation Interface	Ziniu Wu et.al.	2409.19499	null
2024-09-27	Royal Reveals: LiDAR Mapping of Kronborg Castle, Echoes of Hamlet's Halls	Leon Davies et.al.	2409.18752	null
2024-09-26	BlinkTrack: Feature Tracking over 100 FPS via Events and Images	Yichen Shen et.al.	2409.17981	null
2024-09-26	Neural Implicit Representation for Highly Dynamic LiDAR Mapping and Odometry	Qi Zhang et.al.	2409.17729	null
2024-09-26	Event-based Stereo Depth Estimation: A Survey	Suman Ghosh et.al.	2409.17680	null
2024-09-25	Efficient Submap-based Autonomous MAV Exploration using Visual-Inertial SLAM Configurable for LiDARs or Depth Cameras	Sotiris Papatheodorou et.al.	2409.16972	null
2024-09-25	Go-SLAM: Grounded Object Segmentation and Localization with Gaussian Splatting SLAM	Phu Pham et.al.	2409.16944	null
2024-09-25	Inline Photometrically Calibrated Hybrid Visual SLAM	Nicolas Abboud et.al.	2409.16810	link
2024-09-25	Topological SLAM in colonoscopies leveraging deep features and topological priors	Javier Morlana et.al.	2409.16806	link
2024-09-25	Robo-Platform: A Robotic System for Recording Sensors and Controlling Robots	Masoud Dayani Najafabadi et.al.	2409.16595	link
2024-09-25	Task-driven SLAM Benchmarking	Yanwei Du et.al.	2409.16573	link
2024-09-24	SoMaSLAM: 2D Graph SLAM for Sparse Range Sensing with Soft Manhattan World Constraints	Jeahn Han et.al.	2409.15736	null
2024-09-23	Spectral Graph Theoretic Methods for Enhancing Network Robustness in Robot Localization	Neelkamal Somisetty et.al.	2409.15506	null
2024-09-22	SPAQ-DL-SLAM: Towards Optimizing Deep Learning-based SLAM for Resource-Constrained Embedded Platforms	Niraj Pudasaini et.al.	2409.14515	null
2024-09-21	Point Cloud Structural Similarity-based Underwater Sonar Loop Detection	Donghwi Jung et.al.	2409.14020	link
2024-09-20	HMD $^2$ : Environment-aware Motion Generation from Single Egocentric Head-Mounted Device	Vladimir Guzov et.al.	2409.13426	null
2024-09-20	Learning Visual Information Utility with PIXER	Yash Turkar et.al.	2409.13151	null
2024-09-19	MGSO: Monocular Real-time Photometric SLAM with Efficient 3D Gaussian Splatting	Yan Song Hu et.al.	2409.13055	null
2024-09-19	Hi-SLAM: Scaling-up Semantics in SLAM with a Hierarchically Categorical Gaussian Splatting	Boying Li et.al.	2409.12518	link
2024-09-18	Bundle Adjustment in the Eager Mode	Zitong Zhan et.al.	2409.12190	null
2024-09-23	Uncertainty-Aware Visual-Inertial SLAM with Volumetric Occupancy Mapping	Jaehyung Jung et.al.	2409.12051	null
2024-09-18	Metric-Semantic Factor Graph Generation based on Graph Neural Networks	Jose Andres Millan-Romera et.al.	2409.11972	null
2024-09-18	Physically-Based Photometric Bundle Adjustment in Non-Lambertian Environments	Lei Cheng et.al.	2409.11854	null
2024-09-18	ORB-SfMLearner: ORB-Guided Self-supervised Visual Odometry with Selective Online Adaptation	Yanlin Jin et.al.	2409.11692	null
2024-09-18	SLAM assisted 3D tracking system for laparoscopic surgery	Jingwei Song et.al.	2409.11688	null
2024-09-17	GLC-SLAM: Gaussian Splatting SLAM with Efficient Loop Closure	Ziheng Xu et.al.	2409.10982	null
2024-09-17	Label-free correlative morpho-chemical tomography of 3D kidney mesangial cells	Ankit Butola et.al.	2409.10971	null
2024-09-17	Evaluating and Improving the Robustness of LiDAR-based Localization and Mapping	Bo Yang et.al.	2409.10824	link
2024-09-16	P2U-SLAM: A Monocular Wide-FoV SLAM System Based on Point Uncertainty and Pose Uncertainty	Yufan Zhang et.al.	2409.10143	link
2024-09-16	SHIRE: Enhancing Sample Efficiency using Human Intuition in REinforcement Learning	Amogh Joshi et.al.	2409.09990	null
2024-09-16	Enhancing Visual Inertial SLAM with Magnetic Measurements	Bharat Joshi et.al.	2409.09904	null
2024-09-15	Marginalizing and Conditioning Gaussians onto Linear Approximations of Smooth Manifolds with Applications in Robotics	Zi Cong Guo et.al.	2409.09871	link
2024-09-15	Range-SLAM: Ultra-Wideband-Based Smoke-Resistant Real-Time Localization and Mapping	Yi Liu et.al.	2409.09763	null
2024-09-15	High Definition Map Mapping and Update: A General Overview and Future Directions	Benny Wijaya et.al.	2409.09726	null
2024-09-14	MAC-VO: Metrics-aware Covariance for Learning-based Stereo Visual Odometry	Yuheng Qiu et.al.	2409.09479	null
2024-09-14	Distributed Invariant Kalman Filter for Object-level Multi-robot Pose SLAM	Haoying Li et.al.	2409.09410	null
2024-09-14	GEVO: Memory-Efficient Monocular Visual Odometry Using Gaussians	Dasong Gao et.al.	2409.09295	link
2024-09-14	Panoramic Direct LiDAR-assisted Visual Odometry	Zikang Yuan et.al.	2409.09287	link
2024-09-11	Object Depth and Size Estimation using Stereo-vision and Integration with SLAM	Layth Hamad et.al.	2409.07623	null
2024-09-11	Equivariant Filter for Tightly Coupled LiDAR-Inertial Odometry	Anbo Tao et.al.	2409.06948	null
2024-09-10	Technical Report of Mobile Manipulator Robot for Industrial Environments	Erfan Amoozad Khalili et.al.	2409.06693	null
2024-09-10	Heterogeneous LiDAR Dataset for Benchmarking Robust Localization in Diverse Degenerate Scenarios	Zhiqiang Chen et.al.	2409.04961	link
2024-09-08	FLAF: Focal Line and Feature-constrained Active View Planning for Visual Teach and Repeat	Changfei Fu et.al.	2409.03457	null
2024-09-03	Integration of Augmented Reality and Mobile Robot Indoor SLAM for Enhanced Spatial Awareness	Michael D. Friske et.al.	2409.01915	null
2024-09-03	Explicit Second-order LiDAR Bundle Adjustment Algorithm Using Mean Squared Group Metric	Tingchen Ma et.al.	2409.01856	null
2024-09-02	Saying goodbyes to rotating your phone: Magnetometer calibration during SLAM	Ilari Vallivaara et.al.	2409.01242	null
2024-09-02	Online One-Dimensional Magnetic Field SLAM with Loop-Closure Detection	Manon Kok et.al.	2409.01091	null
2024-09-02	Robust Vehicle Localization and Tracking in Rain using Street Maps	Yu Xiang Tan et.al.	2409.01038	link
2024-08-31	UDGS-SLAM : UniDepth Assisted Gaussian Splatting for Monocular SLAM	Mostafa Mansour et.al.	2409.00362	null
2024-09-04	Augmented Reality without Borders: Achieving Precise Localization Without Maps	Albert Gassol Puigjaner et.al.	2408.17373	null
2024-08-30	Efficient Camera Exposure Control for Visual Odometry via Deep Reinforcement Learning	Shuyang Zhang et.al.	2408.17005	link
2024-08-29	Creating a Segmented Pointcloud of Grapevines by Combining Multiple Viewpoints Through Visual Odometry	Michael Adlerstein et.al.	2408.16472	null
2024-08-28	Single-Photon 3D Imaging with Equi-Depth Photon Histograms	Kaustubh Sadekar et.al.	2408.16150	null
2024-08-28	BIM-SLAM: Integrating BIM Models in Multi-session SLAM for Lifelong Mapping using 3D LiDAR	Miguel Arturo Vega Torres et.al.	2408.15870	link
2024-08-30	Addressing the challenges of loop detection in agricultural environments	Nicolás Soncini et.al.	2408.15761	link
2024-08-28	ES-PTAM: Event-based Stereo Parallel Tracking and Mapping	Suman Ghosh et.al.	2408.15605	link
2024-08-28	PointEMRay: A Novel Efficient SBR Framework on Point Based Geometry	Kaiqiao Yang et.al.	2408.15583	null
2024-09-02	Active Semantic Mapping and Pose Graph Spectral Analysis for Robot Exploration	Rongge Zhang et.al.	2408.14726	link
2024-08-26	A Survey on Reinforcement Learning Applications in SLAM	Mohammad Dehghani Tezerjani et.al.	2408.14518	null
2024-08-28	FAST-LIVO2: Fast, Direct LiDAR-Inertial-Visual Odometry	Chunran Zheng et.al.	2408.14035	link
2024-08-21	Informed, Constrained, Aligned: A Field Analysis on Degeneracy-aware Point Cloud Registration in the Wild	Turcan Tuna et.al.	2408.11809	null
2024-08-21	LiFCal: Online Light Field Camera Calibration via Bundle Adjustment	Aymeric Fleith et.al.	2408.11682	null
2024-08-21	Enhanced Visual SLAM for Collision-free Driving with Lightweight Autonomous Cars	Zhihao Lin et.al.	2408.11582	null
2024-08-21	RaNDT SLAM: Radar SLAM Based on Intensity-Augmented Normal Distributions Transform	Maximilian Hilger et.al.	2408.11576	link
2024-08-21	Reflex-Based Open-Vocabulary Navigation without Prior Knowledge Using Omnidirectional Camera and Multiple Vision-Language Models	Kento Kawaharazuka et.al.	2408.11380	null
2024-08-20	LoopSplat: Loop Closure by Registering 3D Gaussian Splats	Liyuan Zhu et.al.	2408.10154	link
2024-08-19	Quantitative 3D Map Accuracy Evaluation Hardware and Algorithm for LiDAR(-Inertial) SLAM	Sanghyun Hahn et.al.	2408.09727	link
2024-08-17	GSLAMOT: A Tracklet and Query Graph-based Simultaneous Locating, Mapping, and Multiple Object Tracking System	Shuo Wang et.al.	2408.09191	null
2024-08-15	GOReloc: Graph-based Object-Level Relocalization for Visual SLAM	Yutong Wang et.al.	2408.07917	link
2024-08-14	Inverse k-visibility for RSSI-based Indoor Geometric Mapping	Junseo Kim et.al.	2408.07757	null
2024-08-14	Narrowing your FOV with SOLiD: Spatially Organized and Lightweight Global Descriptor for FOV-constrained LiDAR Place Recognition	Hogyun Kim et.al.	2408.07330	link
2024-08-12	CAD-Mesher: A Convenient, Accurate, Dense Mesh-based Mapping Module in SLAM for Dynamic Environments	Yanpeng Jia et.al.	2408.05981	null
2024-08-21	Visual SLAM with 3D Gaussian Primitives and Depth Priors Enabling Novel View Synthesis	Zhongche Qu et.al.	2408.05635	null
2024-08-10	TOSS: Real-time Tracking and Moving Object Segmentation for Static Scene Mapping	Seoyeon Jang et.al.	2408.05453	null
2024-08-08	Evaluating Modern Approaches in 3D Scene Reconstruction: NeRF vs Gaussian-Based Methods	Yiming Zhou et.al.	2408.04268	null
2024-08-07	Towards Real-Time Gaussian Splatting: Accelerating 3DGS through Photometric SLAM	Yan Song Hu et.al.	2408.03825	null
2024-08-07	AirSLAM: An Efficient and Illumination-Robust Point-Line Visual SLAM System	Kuan Xu et.al.	2408.03520	link
2024-08-06	BodySLAM: A Generalized Monocular Visual SLAM Framework for Surgical Applications	G. Manni et.al.	2408.03078	link
2024-08-04	SLAMS-Propelled Electron Acceleration at High-Mach Number Astrophysical Shocks	Vladimir Zeković et.al.	2408.02084	null
2024-08-03	Visual-Inertial SLAM for Agricultural Robotics: Benchmarking the Benefits and Computational Costs of Loop Closing	Fabian Schmidt et.al.	2408.01716	link
2024-08-03	Deep Patch Visual SLAM	Lahav Lipson et.al.	2408.01654	link
2024-08-02	Momentum Capture and Prediction System Based on Wimbledon Open2023 Tournament Data	Chang Liu et.al.	2408.01544	null
2024-08-07	IG-SLAM: Instant Gaussian SLAM	F. Aykut Sarikamis et.al.	2408.01126	null
2024-08-01	Collecting Larg-Scale Robotic Datasets on a High-Speed Mobile Platform	Yuxin Lin et.al.	2408.00545	null
2024-08-01	High-Quality, ROS Compatible Video Encoding and Decoding for High-Definition Datasets	Jian Li et.al.	2408.00538	link
2024-07-31	SuperVINS: A visual-inertial SLAM framework integrated deep learning features	Hongkun Luo et.al.	2407.21348	link
2024-07-30	NIS-SLAM: Neural Implicit Semantic RGB-D SLAM for 3D Consistent Scene Understanding	Hongjia Zhai et.al.	2407.20853	null
2024-07-29	A flexible framework for accurate LiDAR odometry, map manipulation, and localization	José Luis Blanco-Claraco et.al.	2407.20465	link
2024-07-28	Solving Short-Term Relocalization Problems In Monocular Keyframe Visual SLAM Using Spatial And Semantic Data	Azmyin Md. Kamal et.al.	2407.19518	null
2024-07-26	Real-time Uncertainty-Aware Motion Planning for Magnetic-based Navigation	Aditya Penumarti et.al.	2407.19046	null
2024-07-26	HERO-SLAM: Hybrid Enhanced Robust Optimization of Neural SLAM	Zhe Xin et.al.	2407.18813	null
2024-07-25	CodedVO: Coded Visual Odometry	Sachin Shah et.al.	2407.18240	null
2024-07-28	HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation	Zhenzhi Wang et.al.	2407.17438	link
2024-07-22	Memory Management for Real-Time Appearance-Based Loop Closure Detection	Mathieu Labbé et.al.	2407.15890	null
2024-07-22	Reinforcement Learning Meets Visual Odometry	Nico Messikommer et.al.	2407.15626	link
2024-07-22	Online Global Loop Closure Detection for Large-Scale Multi-Session Graph-Based SLAM	Mathieu Labbe et.al.	2407.15305	null
2024-07-21	Semi-Supervised Pipe Video Temporal Defect Interval Localization	Zhu Huang et.al.	2407.15170	null
2024-07-21	VoxDepth: Rectification of Depth Images on Edge Devices	Yashashwee Chakrabarty et.al.	2407.15067	null
2024-07-20	From Underground Mines to Offices: A Versatile and Robust Framework for Range-Inertial SLAM	Lorenzo Montano-Oliván et.al.	2407.14797	null
2024-07-19	MSSP : A Versatile Multi-Scenario Adaptable Intelligent Robot Simulation Platform Based on LIDAR-Inertial Fusion	Qiyan Li et.al.	2407.14102	null
2024-07-18	A New Tightly-Coupled Dual-VIO for a Mobile Manipulator With Dynamic Locomotion	Jianxiang Xu et.al.	2407.13878	link
2024-07-18	Learn to Memorize and to Forget: A Continual Learning Perspective of Dynamic SLAM	Baicheng Li et.al.	2407.13338	null
2024-07-18	Attenuation-Aware Weighted Optical Flow with Medium Transmission Map for Learning-based Visual Odometry in Underwater terrain	Bach Nguyen Gia et.al.	2407.13159	link
2024-07-17	Is That Rain? Understanding Effects on Visual Odometry Performance for Autonomous UAVs and Efficient DNN-based Rain Classification at the Edge	Andrea Albanese et.al.	2407.12663	null
2024-07-17	Towards Revisiting Visual Place Recognition for Joining Submaps in Multimap SLAM	Markus Weißflog et.al.	2407.12408	null
2024-07-19	Fisheye-Calib-Adapter: An Easy Tool for Fisheye Camera Model Conversion	Sangjun Lee et.al.	2407.12405	link
2024-07-17	Fusion LiDAR-Inertial-Encoder data for High-Accuracy SLAM	Manh Do Duc et.al.	2407.11870	null
2024-07-17	GV-Bench: Benchmarking Local Feature Matching for Geometric Verification of Long-term Loop Closure Detection	Jingwen Yu et.al.	2407.11736	link
2024-07-16	Snail-Radar: A large-scale diverse dataset for the evaluation of 4D-radar-based SLAM systems	Jianzhu Huai et.al.	2407.11705	null
2024-07-16	Batch SLAM with PMBM Data Association Sampling and Graph-Based Optimization	Yu Ge et.al.	2407.11643	null
2024-07-16	I $^2$ -SLAM: Inverting Imaging Process for Robust Photorealistic Dense SLAM	Gwangtak Bae et.al.	2407.11347	null
2024-07-16	FR-SLAM: A SLAM Improvement Method Based on Floor Plan Registration	Jiantao Feng et.al.	2407.11299	null
2024-07-15	Evaluating geometric accuracy of NeRF reconstructions compared to SLAM method	Adam Korycki et.al.	2407.11238	null
2024-07-12	An Adaptive Indoor Localization Approach Using WiFi RSSI Fingerprinting with SLAM-Enabled Robotic Platform and Deep Neural Networks	Seyed Alireza Rahimi Azghadi et.al.	2407.09242	null
2024-07-11	SGLC: Semantic Graph-Guided Coarse-Fine-Refine Full Loop Closing for LiDAR SLAM	Neng Wang et.al.	2407.08106	link
2024-07-09	Hyperion -- A fast, versatile symbolic Gaussian Belief Propagation framework for Continuous-Time SLAM	David Hug et.al.	2407.07074	link
2024-07-15	A Neurosymbolic Approach to Adaptive Feature Extraction in SLAM	Yasra Chandio et.al.	2407.06889	null
2024-07-08	Object-Oriented Material Classification and 3D Clustering for Improved Semantic Perception and Mapping in Mobile Robots	Siva Krishna Ravipati et.al.	2407.06077	link
2024-07-10	Co-RaL: Complementary Radar-Leg Odometry with 4-DoF Optimization and Rolling Contact	Sangwoo Jung et.al.	2407.05820	null
2024-07-07	Active Collaborative Visual SLAM exploiting ORB Features	Muhammad Farhan Ahmed et.al.	2407.05453	null
2024-07-06	VIPS-Odom: Visual-Inertial Odometry Tightly-coupled with Parking Slots for Autonomous Parking	Xuefeng Jiang et.al.	2407.05017	null
2024-07-06	Symmetric Linear Arc Monadic Datalog and Gadget Reductions	Manuel Bodirsky et.al.	2407.04924	null
2024-07-03	Ultra-Lightweight Collaborative Mapping for Robot Swarms	Vlad Niculescu et.al.	2407.03136	null
2024-07-01	RoDyn-SLAM: Robust Dynamic Dense RGB-D SLAM with Neural Radiance Fields	Haochen Jiang et.al.	2407.01303	link
2024-07-01	Preserving Relative Localization of FoV-Limited Drone Swarm via Active Mutual Observation	Lianjie Guo et.al.	2407.01292	link
2024-07-01	Collaborative Graph Exploration with Reduced Pose-SLAM Uncertainty via Submodular Optimization	Ruofei Bai et.al.	2407.01013	link
2024-06-30	Ego-to-Exo: Interfacing Third Person Visuals from Egocentric Views in Real-time for Improved ROV Teleoperation	Adnan Abdullah et.al.	2407.00848	null
2024-06-30	OfCaM: Global Human Mesh Recovery via Optimization-free Camera Motion Scale Calibration	Fengyuan Yang et.al.	2407.00574	null
2024-06-24	Compressing Search with Language Models	Thomas Mulc et.al.	2407.00085	null
2024-06-28	CLOi-Mapper: Consistent, Lightweight, Robust, and Incremental Mapper With Embedded Systems for Commercial Robot Services	DongKi Noh et.al.	2406.19634	null
2024-06-25	Benchmarking SLAM Algorithms in the Cloud: The SLAM Hive System	Xinzhe Liu et.al.	2406.17586	null
2024-07-02	SlideSLAM: Sparse, Lightweight, Decentralized Metric-Semantic SLAM for Multi-Robot Navigation	Xu Liu et.al.	2406.17249	link
2024-06-24	From Perfect to Noisy World Simulation: Customizable Embodied Multi-modal Perturbations for SLAM Robustness Benchmarking	Xiaohao Xu et.al.	2406.16850	link
2024-06-23	Imperative Learning: A Self-supervised Neural-Symbolic Learning Framework for Robot Autonomy	Chen Wang et.al.	2406.16087	null
2024-06-19	Simultaneous Map and Object Reconstruction	Nathaniel Chodosh et.al.	2406.13896	null
2024-06-14	Galibr: Targetless LiDAR-Camera Extrinsic Calibration Method via Ground Plane Initialization	Wonho Song et.al.	2406.11599	null
2024-06-16	Self-supervised Pretraining and Finetuning for Monocular Depth and Visual Odometry	Boris Chidlovskii et.al.	2406.11019	null
2024-06-15	Detection and Utilization of Reflections in LiDAR Scans Through Plane Optimization and Plane SLAM	Yinjie Li et.al.	2406.10494	link
2024-06-12	From Variance to Veracity: Unbundling and Mitigating Gradient Variance in Differentiable Bundle Adjustment Layers	Swaminathan Gurumurthy et.al.	2406.07785	link
2024-06-27	Notes on Kalman Filter (KF, EKF, ESKF, IEKF, IESKF)	Gyubeom Im et.al.	2406.06427	null
2024-06-10	Notes on Various Errors and Jacobian Derivations for SLAM	Gyubeom Im et.al.	2406.06422	null
2024-06-23	Multicam-SLAM: Non-overlapping Multi-camera SLAM for Indirect Visual Localization and Navigation	Shenghao Li et.al.	2406.06374	link
2024-06-15	Visual-Inertial SLAM as Simple as A, B, VINS	Nathaniel Merrill et.al.	2406.05969	null
2024-06-09	MAP-ADAPT: Real-Time Quality-Adaptive Semantic 3D Maps	Jianhao Zheng et.al.	2406.05849	null
2024-06-06	Open Problem: Active Representation Learning	Nikola Milosevic et.al.	2406.03845	null
2024-06-04	ProGEO: Generating Prompts through Image-Text Contrastive Learning for Visual Geo-localization	Chen Mao et.al.	2406.01906	link
2024-06-03	The Empirical Impact of Forgetting and Transfer in Continual Visual Odometry	Paolo Cudrano et.al.	2406.01797	null
2024-06-03	Self-Supervised Geometry-Guided Initialization for Robust Monocular Visual Odometry	Takayuki Kanai et.al.	2406.00929	null
2024-06-02	Visual place recognition for aerial imagery: A survey	Ivan Moskalenko et.al.	2406.00885	link
2024-05-30	Structure Gaussian SLAM with Manhattan World Hypothesis	Shuhong Liu et.al.	2405.20031	null
2024-05-30	Semantic Landmark Detection & Classification Using Neural Networks For 3D In-Air Sonar	Wouter Jansen et.al.	2405.19869	null
2024-05-30	SLAM-based Joint Calibration of Multiple Asynchronous Microphone Arrays and Sound Source Localization	Jiang Wang et.al.	2405.19813	link
2024-05-30	TAMBRIDGE: Bridging Frame-Centered Tracking and 3D Gaussian Splatting for Enhanced SLAM	Peifeng Jiang et.al.	2405.19614	null
2024-05-27	CudaSIFT-SLAM: multiple-map visual SLAM for full procedure mapping in real human endoscopy	Richard Elvira et.al.	2405.16932	null
2024-05-26	Splat-SLAM: Globally Optimized RGB-only SLAM with 3D Gaussians	Erik Sandström et.al.	2405.16544	link
2024-05-24	NeB-SLAM: Neural Blocks-based Salable RGB-D SLAM for Unknown Scenes	Lizhi Bai et.al.	2405.15151	null
2024-05-23	ETA-INIT: Enhancing the Translation Accuracy for Stereo Visual-Inertial SLAM Initialization	Han Song et.al.	2405.15082	null
2024-05-23	Synergistic Global-space Camera and Human Reconstruction from Videos	Yizhou Zhao et.al.	2405.14855	null
2024-05-23	CoPeD-Advancing Multi-Robot Collaborative Perception: A Comprehensive Dataset in Real-World Environments	Yang Zhou et.al.	2405.14731	link
2024-05-23	Efficient Robot Learning for Perception and Mapping	Niclas Vödisch et.al.	2405.14688	null
2024-05-22	Monocular Gaussian SLAM with Language Extended Loop Closure	Tian Lan et.al.	2405.13748	null
2024-05-26	NV-LIO: LiDAR-Inertial Odometry using Normal Vectors Towards Robust SLAM in Multifloor Environments	Dongha Chung et.al.	2405.12563	link
2024-05-20	EdgeLoc: A Communication-Adaptive Parallel System for Real-Time Localization in Infrastructure-Assisted Autonomous Driving	Boyi Liu et.al.	2405.12120	null
2024-05-24	Outlier-Robust Long-Term Robotic Mapping Leveraging Ground Segmentation	Hyungtae Lim et.al.	2405.11176	null
2024-05-18	MotionGS : Compact Gaussian Splatting SLAM by Motion Filter	Xinli Guo et.al.	2405.11129	link
2024-05-17	CCTNet: A Circular Convolutional Transformer Network for LiDAR-based Place Recognition Handling Movable Objects Occlusion	Gang Wang et.al.	2405.10793	null
2024-05-17	Occupancy-SLAM: Simultaneously Optimizing Robot Poses and Continuous Occupancy Map	Liang Zhao et.al.	2405.10743	null
2024-05-10	MGS-SLAM: Monocular Sparse Tracking and Gaussian Mapping with Depth Smooth Regularization	Pengcheng Zhu et.al.	2405.06241	null
2024-05-07	Bayesian Simultaneous Localization and Multi-Lane Tracking Using Onboard Sensors and a SD Map	Yuxuan Xia et.al.	2405.04290	null
2024-05-07	IMU-Aided Event-based Stereo Visual Odometry	Junkai Niu et.al.	2405.04071	link
2024-04-27	An Attention-Based Deep Learning Architecture for Real-Time Monocular Visual Odometry: Applications to GPS-free Drone Navigation	Olivier Brochu Dufour et.al.	2404.17745	null
2024-04-26	Camera Motion Estimation from RGB-D-Inertial Scene Flow	Samuel Cerezo et.al.	2404.17251	link
2024-04-23	Multi-Session SLAM with Differentiable Wide-Baseline Pose Optimization	Lahav Lipson et.al.	2404.15263	link
2024-04-18	SPOT: Point Cloud Based Stereo Visual Place Recognition for Similar and Opposing Viewpoints	Spencer Carmichael et.al.	2404.12339	null
2024-04-17	VBR: A Vision Benchmark in Rome	Leonardo Brizi et.al.	2404.11322	link
2024-04-14	Increasing SLAM Pose Accuracy by Ground-to-Satellite Image Registration	Yanhao Zhang et.al.	2404.09169	link
2024-04-06	Salient Sparse Visual Odometry With Pose-Only Supervision	Siyu Chen et.al.	2404.04677	null
2024-03-25	A Comparative Analysis of Visual Odometry in Virtual and Real-World Railways Environments	Gianluca D'Amico et.al.	2403.17084	null
2024-03-19	On Designing Consistent Covariance Recovery from a Deep Learning Visual Odometry Engine	Jagatpreet Singh Nir et.al.	2403.13170	null
2024-03-18	The POLAR Traverse Dataset: A Dataset of Stereo Camera Images Simulating Traverses across Lunar Polar Terrain under Extreme Lighting Conditions	Margaret Hansen et.al.	2403.12194	null
2024-03-18	An Accurate and Real-time Relative Pose Estimation from Triple Point-line Images by Decoupling Rotation and Translation	Zewen Xu et.al.	2403.11639	null
2024-03-16	Efficient Domain Adaptation for Endoscopic Visual Odometry	Junyang Wu et.al.	2403.10860	null
2024-03-14	Visual Inertial Odometry using Focal Plane Binary Features (BIT-VIO)	Matthew Lisondra et.al.	2403.09882	null
2024-03-02	Grid-based Fast and Structural Visual Odometry	Zhang Zhihe et.al.	2403.01110	null
2024-02-25	VOLoc: Visual Place Recognition by Querying Compressed Lidar Map	Xudong Cai et.al.	2402.15961	link
2024-02-22	Secure Navigation using Landmark-based Localization in a GPS-denied Environment	Ganesh Sapkota et.al.	2402.14280	null
2024-02-19	Landmark-based Localization using Stereo Vision and Deep Learning in GPS-Denied Battlefield Environment	Ganesh Sapkota et.al.	2402.12551	null
2024-02-07	Online and Certifiably Correct Visual Odometry and Mapping	Devansh R Agrawal et.al.	2402.05254	null
2024-02-06	YOLOPoint Joint Keypoint and Object Detection	Anton Backhaus et.al.	2402.03989	link
2024-01-19	Motion Consistency Loss for Monocular Visual Odometry with Attention-Based Deep Learning	André O. Françani et.al.	2401.10857	null
2024-01-17	Event-Based Visual Odometry on Non-Holonomic Ground Vehicles	Wanting Xu et.al.	2401.09331	link
2024-01-11	On State Estimation in Multi-Sensor Fusion Navigation: Optimization and Filtering	Feng Zhu et.al.	2401.05836	null
2023-12-19	Loss it right: Euclidean and Riemannian Metrics in Learning-based Visual Odometry	Olaya Álvarez-Tuñón et.al.	2401.05396	link
2024-01-07	Amirkabir campus dataset: Real-world challenges and scenarios of Visual Inertial Odometry (VIO) for visually impaired people	Ali Samadzadeh et.al.	2401.03604	link
2024-01-03	LEAP-VO: Long-term Effective Any Point Tracking for Visual Odometry	Weirong Chen et.al.	2401.01887	link
2023-12-28	SR-LIVO: LiDAR-Inertial-Visual Odometry and Mapping with Sweep Reconstruction	Zikang Yuan et.al.	2312.16800	link
2023-12-20	NeRF-VO: Real-Time Sparse Visual Odometry with Neural Radiance Fields	Jens Naumann et.al.	2312.13471	null
2023-12-22	Ternary-type Opacity and Hybrid Odometry for RGB-only NeRF-SLAM	Junru Lin et.al.	2312.13332	null
2023-12-20	Brain-Inspired Visual Odometry: Balancing Speed and Interpretability through a System of Systems Approach	Habib Boloorchi Tabrizi et.al.	2312.13162	link
2023-12-20	Trajectory Approximation of Video Based on Phase Correlation for Forward Facing Camera	Abdulkadhem A. Abdulkadhem et.al.	2312.12680	null
2023-12-15	Deep Event Visual Odometry	Simon Klenk et.al.	2312.09800	link
2023-12-10	SuperPrimitive: Scene Reconstruction at a Primitive Level	Kirill Mazur et.al.	2312.05889	null
2023-12-04	iMatching: Imperative Correspondence Learning	Zitong Zhan et.al.	2312.02141	link
2023-11-30	Event-based Visual Inertial Velometer	Xiuyuan Lu et.al.	2311.18189	null
2023-11-21	CoVOR-SLAM: Cooperative SLAM using Visual Odometry and Ranges for Multi-Robot Systems	Young-Hee Lee et.al.	2311.12580	null
2023-11-10	Dense Visual Odometry Using Genetic Algorithm	Slimane Djema et.al.	2311.06149	null
2023-11-07	Inertial Guided Uncertainty Estimation of Feature Correspondence in Visual-Inertial Odometry/SLAM	Seongwook Yoon et.al.	2311.03722	null
2023-10-23	Converting Depth Images and Point Clouds for Feature-based Pose Estimation	Robert Lösch et.al.	2310.14924	link
2023-10-17	Open-Structure: a Structural Benchmark Dataset for SLAM Algorithms	Yanyan Li et.al.	2310.10931	link
2023-10-12	Jointly Optimized Global-Local Visual Localization of UAVs	Haoling Li et.al.	2310.08082	null
2023-10-10	l-dyno: framework to learn consistent visual features using robot's motion	Kartikeya Singh et.al.	2310.06249	link
2023-10-08	XVO: Generalized Visual Odometry via Cross-Modal Self-Training	Lei Lai et.al.	2309.16772	null
2023-10-22	ObVi-SLAM: Long-Term Object-Visual SLAM	Amanda Adkins et.al.	2309.15268	link
2023-09-23	Tag-based Visual Odometry Estimation for Indoor UAVs Localization	Massimiliano Bertoni et.al.	2309.13311	null
2023-09-22	Exposing the Unseen: Exposure Time Emulation for Offline Benchmarking of Vision Algorithms	Olivier Gamache et.al.	2309.13139	link
2023-09-20	Conformalized Multimodal Uncertainty Regression and Reasoning	Domenico Parente et.al.	2309.11018	null
2023-09-20	OCC-VO: Dense Mapping via 3D Occupancy-Based Visual Odometry for Autonomous Driving	Heng Li et.al.	2309.11011	link
2023-09-19	LiDAR-Generated Images Derived Keypoints Assisted Point Cloud Registration Scheme in Odometry Estimation	Haizhou Zhang et.al.	2309.10436	link
2023-09-21	Dive Deeper into Rectifying Homography for Stereo Camera Online Self-Calibration	Hongbo Zhao et.al.	2309.10314	null
2023-09-18	End-to-End Learned Event- and Image-based Visual Odometry	Roberto Pellerito et.al.	2309.09947	link
2023-09-14	An Explicit Method for Fast Monocular Depth Recovery in Corridor Environments	Yehao Liu et.al.	2309.07408	null
2023-09-11	Evaluating Visual Odometry Methods for Autonomous Driving in Rain	Yu Xiang Tan et.al.	2309.05249	null
2023-09-08	Robot Localization and Mapping Final Report -- Sequential Adversarial Learning for Self-Supervised Deep Visual Odometry	Akankshya Kar et.al.	2309.04147	null
2023-09-04	EMR-MSF: Self-Supervised Recurrent Monocular Scene Flow Exploiting Ego-Motion Rigidity	Zijie Jiang et.al.	2309.01296	null
2023-08-27	Deep Learning for Visual Localization and Mapping: A Survey	Changhao Chen et.al.	2308.14039	null
2023-08-19	Enhancing State Estimation in Robots: A Data-Driven Approach with Differentiable Ensemble Kalman Filters	Xiao Liu et.al.	2308.09870	link
2023-08-12	4DRVO-Net: Deep 4D Radar-Visual Odometry Using Multi-Modal and Multi-Scale Adaptive Fusion	Guirong Zhuo et.al.	2308.06573	null
2023-08-10	Mono-hydra: Real-time 3D scene graph construction from monocular camera input with IMU	U. V. B. L. Udugama et.al.	2308.05515	null
2023-08-02	A Small Form Factor Aerial Research Vehicle for Pick-and-Place Tasks with Onboard Real-Time Object Detection and Visual Odometry	Cora A. Dimmig et.al.	2308.01398	null
2023-08-02	Stereo Visual Odometry with Deep Learning-Based Point and Line Feature Matching using an Attention Graph Neural Network	Shenbagaraj Kannapiran et.al.	2308.01125	null
2023-08-02	Preliminary Design of the Dragonfly Navigation Filter	Ben Schilling et.al.	2307.13513	null
2023-07-19	Optimizing the extended Fourier Mellin Transformation Algorithm	Wenqing Jiang et.al.	2307.10015	link
2023-07-15	Tightly-Coupled LiDAR-Visual SLAM Based on Geometric Features for Mobile Agents	Ke Cao et.al.	2307.07763	null
2023-07-26	Event-based Stereo Visual Odometry with Native Temporal Resolution via Continuous-time Gaussian Process Regression	Jianeng Wang et.al.	2306.01188	null
2023-07-06	OSPC: Online Sequential Photometric Calibration	Jawad Haidar et.al.	2305.17673	null
2023-05-15	Event Camera-based Visual Odometry for Dynamic Motion Tracking of a Legged Robot Using Adaptive Time Surface	Shifan Zhu et.al.	2305.08962	null
2023-05-10	Transformer-based model for monocular visual odometry: a video understanding approach	André O. Françani et.al.	2305.06121	link
2023-04-29	Modality-invariant Visual Odometry for Embodied Vision	Marius Memmel et.al.	2305.00348	link
2023-04-21	FSNet: Redesign Self-Supervised MonoDepth for Full-Scale Depth Prediction for Autonomous Driving	Yuxuan Liu et.al.	2304.10719	null
2023-07-08	Visual-LiDAR Odometry and Mapping with Monocular Scale Correction and Visual Bootstrapping	Hanyu Cai et.al.	2304.08978	null
2023-04-12	SiLK -- Simple Learned Keypoints	Pierre Gleize et.al.	2304.06194	link
2023-04-11	ClusterFusion: Real-time Relative Positioning and Dense Reconstruction for UAV Cluster	Yifei Dong et.al.	2304.04943	null
2023-03-21	Learning a Depth Covariance Function	Eric Dexheimer et.al.	2303.12157	null
2023-03-21	Online Learning of Wheel Odometry Correction for Mobile Robots with Attention-based Neural Network	Alessandro Navone et.al.	2303.11725	null
2023-03-20	VR-SLAM: A Visual-Range Simultaneous Localization and Mapping System using Monocular Camera and Ultra-wideband Sensors	Thien Hoang Nguyen et.al.	2303.10903	null
2023-03-17	CoVIO: Online Continual Learning for Visual-Inertial Odometry	Niclas Vödisch et.al.	2303.10149	link
2023-03-15	UMS-VINS: United Monocular-Stereo Features for Visual-Inertial Tightly Coupled Odometry	Chaoyang Jiang et.al.	2303.08550	null
2023-03-13	Discovering Multiple Algorithm Configurations	Leonid Keselman et.al.	2303.07434	null
2023-03-09	Virtual Inverse Perspective Mapping for Simultaneous Pose and Motion Estimation	Masahiro Hirano et.al.	2303.05192	null
2023-03-16	Stereo Event-based Visual-Inertial Odometry	Kunfeng Wang et.al.	2303.05086	link
2023-03-07	Long Distance GNSS-Denied Visual Inertial Navigation for Autonomous Fixed Wing Unmanned Air Vehicles: SO(3) Manifold Filter based on Virtual Vision Sensor	Eduardo Gallo et.al.	2303.03804	null
2023-03-03	Lightweight, Uncertainty-Aware Conformalized Visual Odometry	Alex C. Stutts et.al.	2303.02207	null
2023-02-24	FLSea: Underwater Visual-Inertial and Stereo-Vision Forward-Looking Datasets	Yelena Randall et.al.	2302.12772	null
2023-02-27	CP+: Camera Poses Augmentation with Large-scale LiDAR Maps	Jiadi Cui et.al.	2302.12198	null
2023-02-19	EdgeVO: An Efficient and Accurate Edge-based Visual Odometry	Hui Zhao et.al.	2302.09493	null
2023-01-27	HDPV-SLAM: Hybrid Depth-augmented Panoramic Visual SLAM for Mobile Mapping System with Tilted LiDAR and Panoramic Visual Camera	Mostafa Ahmadi et.al.	2301.11823	null
2023-01-26	Distributed Optimization Methods for Multi-Robot Systems: Part I -- A Tutorial	Ola Shorinwa et.al.	2301.11313	null
2023-01-24	Generalized Object Search	Kaiyu Zheng et.al.	2301.10121	null
2023-01-22	Improving Autonomous Vehicle Mapping and Navigation in Work Zones Using Crowdsourcing Vehicle Trajectories	Hanlin Chen et.al.	2301.09194	null
2023-01-21	Dense RGB SLAM with Neural Implicit Maps	Heng Li et.al.	2301.08930	null
2023-01-18	Extended FastSLAM Using Cellular Multipath Component Delays and Angular Information	Junshi Chen et.al.	2301.07560	null
2023-01-17	COVINS-G: A Generic Back-end for Collaborative Visual-Inertial SLAM	Manthan Patel et.al.	2301.07147	link
2023-01-31	Swarm-SLAM : Sparse Decentralized Collaborative Simultaneous Localization and Mapping Framework for Multi-Robot Systems	Pierre-Yves Lajoie et.al.	2301.06230	link
2023-01-13	A LiDAR-Inertial-Visual SLAM System with Loop Detection	Kangcheng Liu et.al.	2301.05604	null
2023-01-11	AdaptSLAM: Edge-Assisted Adaptive SLAM with Resource Constraints via Uncertainty Minimization	Ying Chen et.al.	2301.04620	link
2023-01-12	TBV Radar SLAM -- trust but verify loop candidates	Daniel Adolfsson et.al.	2301.04397	link
2022-12-31	Digital Twin-Enabled Domain Adaptation for Zero-Touch UAV Networks: Survey and Challenges	Maxwell McManus et.al.	2301.03359	null
2023-01-09	Motion Addition and Motion Optimization	Liqun Qi et.al.	2301.03174	null
2023-01-08	Towards Open World NeRF-Based SLAM	Daniil Lisus et.al.	2301.03102	null
2023-01-06	CyberLoc: Towards Accurate Long-term Visual Localization	Liu Liu et.al.	2301.02403	null
2023-01-03	LunarNav: Crater-based Localization for Long-range Autonomous Lunar Rover Navigation	Shreyansh Daftry et.al.	2301.01350	null
2022-12-31	4Seasons: Benchmarking Visual SLAM and Long-Term Localization for Autonomous Driving in Challenging Conditions	Patrick Wenzel et.al.	2301.01147	null
2023-01-03	BS3D: Building-scale 3D Reconstruction from RGB-D Images	Janne Mustaniemi et.al.	2301.01057	null
2023-01-10	An Event-based Algorithm for Simultaneous 6-DOF Camera Pose Tracking and Mapping	Masoud Dayani Najafabadi et.al.	2301.00618	link
2022-12-25	A Combined Approach Toward Consistent Reconstructions of Indoor Spaces Based on 6D RGB-D Odometry and KinectFusion	Nadia Figueroa et.al.	2212.14772	null
2022-12-29	An Enhanced LiDAR-Inertial SLAM System for Robotics Localization and Mapping	Kangcheng Liu et.al.	2212.14209	link
2022-12-27	Clock and Orientation-Robust Simultaneous Radio Localization and Mapping at Millimeter Wave Bands	Felipe Gómez-Cuba et.al.	2212.13477	link
2022-12-26	ESVIO: Event-based Stereo Visual Inertial Odometry	Peiyu Chen et.al.	2212.13184	link
2022-12-24	A Comprehensive Review on Autonomous Navigation	Saeid Nahavandi et.al.	2212.12808	null
2022-12-23	Radio SLAM for 6G Systems at THz Frequencies: Design and Experimental Validation	Marina Lotti et.al.	2212.12388	null
2022-12-23	Implementation of a Blind navigation method in outdoors/indoors areas	Mohammad Javadian Farzaneh et.al.	2212.12185	null
2022-12-22	S-Graphs+: Real-time Localization and Mapping leveraging Hierarchical Representations	Hriday Bavle et.al.	2212.11770	link
2022-12-22	Active SLAM: A Review On Last Decade	Muhammad Farhan Ahmed et.al.	2212.11654	null
2022-12-27	Motion, Unit Dual Quaternion and Motion Optimization	Liqun Qi et.al.	2212.11593	null
2022-12-22	Vision-Based Environmental Perception for Autonomous Driving	Fei Liu et.al.	2212.11453	null
2022-12-19	Mu $^{2}$ SLAM: Multitask, Multilingual Speech and Language Models	Yong Cheng et.al.	2212.09553	null
2022-12-16	Cartographer_glass: 2D Graph SLAM Framework using LiDAR for Glass Environments	Lasitha Weerakoon et.al.	2212.08633	null
2022-12-16	rWiFiSLAM: Effective WiFi Ranging based SLAM System in Ambient Environments	Bo Wei et.al.	2212.08418	null
2023-03-02	AirVO: An Illumination-Robust Point-Line Visual Odometry	Kuan Xu et.al.	2212.07595	link
2022-12-14	Autonomous Vehicle Navigation with LIDAR using Path Planning	Rahul M K et.al.	2212.07155	null
2022-12-14	RIS-Enabled and Access-Point-Free Simultaneous Radio Localization and Mapping	Hyowon Kim et.al.	2212.07141	null
2022-12-13	Know What You Don't Know: Consistency in Sliding Window Filtering with Unobservable States Applied to Visual-Inertial SLAM (Extended Version)	Daniil Lisus et.al.	2212.06923	null
2022-12-13	SST: Real-time End-to-end Monocular 3D Reconstruction via Sparse Spatial-Temporal Guidance	Chenyangguang Zhang et.al.	2212.06524	null
2022-12-13	Localization and Navigation System for Indoor Mobile Robot	Yanbaihui Liu et.al.	2212.06391	null
2022-12-12	Evaluation of RGB-D SLAM in Large Indoor Environments	Kirill Muravyev et.al.	2212.05980	null
2022-12-19	A Light-Weight LiDAR-Inertial SLAM System with Loop Closing	Kangcheng Liu et.al.	2212.05743	link
2022-12-12	An Integrated LiDAR-SLAM System for Complex Environment with Noisy Point Clouds	Kangcheng Liu et.al.	2212.05705	link
2022-12-09	SLAM for Visually Impaired People: A Survey	Marziyeh Bamdad et.al.	2212.04745	null
2022-12-09	Ego-Body Pose Estimation via Ego-Head Pose Estimation	Jiaman Li et.al.	2212.04636	null
2022-12-06	Receding Horizon Planning with Rule Hierarchies for Autonomous Vehicles	Sushant Veer et.al.	2212.03323	link
2022-12-06	PRISM: Probabilistic Real-Time Inference in Spatial World Models	Atanas Mirchev et.al.	2212.02988	null
2022-12-06	RGB-L: Enhancing Indirect Visual SLAM using LiDAR-based Dense Depth Maps	Florian Sauerbeck et.al.	2212.02085	link
2022-12-05	DL-SLOT: Dynamic LiDAR SLAM and object tracking based on collaborative graph optimization	Xuebo Tian et.al.	2212.02077	null
2022-12-05	ObjectMatch: Robust Registration using Canonical Object Correspondences	Can Gümeli et.al.	2212.01985	null
2022-12-02	Sparse SPN: Depth Completion from Sparse Keypoints	Yuqun Wu et.al.	2212.00987	null
2022-12-01	maplab 2.0 -- A Modular and Multi-Modal Mapping Framework	Andrei Cramariuc et.al.	2212.00654	link
2022-12-01	AstroSLAM: Autonomous Monocular Navigation in the Vicinity of a Celestial Small Body -- Theory and Experiments	Mehregan Dor et.al.	2212.00350	null
2022-11-30	MVRackLay: Monocular Multi-View Layout Estimation for Warehouse Racks and Shelves	Pranjali Pathre et.al.	2211.16882	null
2022-11-29	PatchMatch-Stereo-Panorama, a fast dense reconstruction from 360° video images	Hartmut Surmann et.al.	2211.16266	link
2022-11-29	MmWave Mapping and SLAM for 5G and Beyond	Yu Ge et.al.	2211.16024	null
2022-11-28	Safety-quantifiable Line Feature-based Monocular Visual Localization with 3D Prior Map	Xi Zheng et.al.	2211.15127	null
2022-11-29	BALF: Simple and Efficient Blur Aware Local Feature Detector	Zhenjun Zhao et.al.	2211.14731	null
2022-11-27	Development of a Modular Real-time Shared-control System for a Smart Wheelchair	Vaishanth Ramaraj et.al.	2211.14711	null
2022-11-26	A1 SLAM: Quadruped SLAM using the A1's Onboard Sensors	Jerred Chen et.al.	2211.14432	link
2022-11-23	ActiveRMAP: Radiance Field for Active Mapping And Planning	Huangying Zhan et.al.	2211.12656	null
2022-11-22	Vision-based localization methods under GPS-denied conditions	Zihao Lu et.al.	2211.11988	null
2022-11-21	Towards Live 3D Reconstruction from Wearable Video: An Evaluation of V-SLAM, NeRF, and Videogrammetry Techniques	David Ramirez et.al.	2211.11836	null
2022-11-21	ESLAM: Efficient Dense SLAM System Based on Hybrid Representation of Signed Distance Fields	Mohammad Mahdi Johari et.al.	2211.11704	null
2022-11-24	Data Fusion for Multipath-Based SLAM: Combing Information from Multiple Propagation Paths	Erik Leitinger et.al.	2211.09241	null
2022-11-16	Self-supervised Egomotion and Depth Learning via Bi-directional Coarse-to-Fine Scale Recovery	Hao Qu et.al.	2211.08904	null
2022-11-20	Detecting Line Segments in Motion-blurred Images with Events	Huai Yu et.al.	2211.07365	link
2022-11-13	Automatic Eye-in-Hand Calibration using EKF	Aditya Ramakrishnan et.al.	2211.06881	null
2022-11-12	Active View Planning for Visual SLAM in Outdoor Environments Based on Continuous Information Modeling	Zhihao Wang et.al.	2211.06557	link
2022-11-11	Multi-domain Cooperative SLAM: The Enabler for Integrated Sensing and Communications	Jie Yang et.al.	2211.05982	null
2022-11-10	Online Stochastic Variational Gaussian Process Mapping for Large-Scale SLAM in Real Time	Ignacio Torroba et.al.	2211.05601	link
2022-11-07	When Geometry is not Enough: Using Reflector Markers in Lidar SLAM	Gerhard Kurz et.al.	2211.03484	null
2022-11-07	Detecting Invalid Map Merges in Lifelong SLAM	Matthias Holoch et.al.	2211.03423	null
2022-11-06	Wheel-SLAM: Simultaneous Localization and Terrain Mapping Using One Wheel-mounted IMU	Yibin Wu et.al.	2211.03174	link
2022-11-07	Lidar-level localization with radar? The CFEAR approach to accurate, fast and robust large-scale radar odometry in diverse environments	Daniel Adolfsson et.al.	2211.02445	link
2022-11-03	DyOb-SLAM : Dynamic Object Tracking SLAM System	Rushmian Annoy Wadud et.al.	2211.01941	null
2022-11-03	Enhanced Visual Feedback with Decoupled Viewpoint Control in Immersive Humanoid Robot Teleoperation using SLAM	Yang Chen et.al.	2211.01749	null
2022-11-04	$D^2$ SLAM: Decentralized and Distributed Collaborative Visual-inertial SLAM System for Aerial Swarm	Hao Xu et.al.	2211.01538	link
2022-11-02	Semantic SuperPoint: A Deep Semantic Descriptor	Gabriel S. Gama et.al.	2211.01098	link
2022-11-02	Ambiguity-Aware Multi-Object Pose Optimization for Visually-Assisted Robot Manipulation	Myung-Hwan Jeon et.al.	2211.00960	link
2022-10-31	Mapping Extended Landmarks for Radar SLAM	Shuai Sun et.al.	2210.17207	null
2022-10-25	MAROAM: Map-based Radar SLAM through Two-step Feature Selection	Dequan Wang et.al.	2210.13797	null
2022-10-25	S3E: A Large-scale Multimodal Dataset for Collaborative SLAM	Dapeng Feng et.al.	2210.13723	link
2022-10-24	NeRF-SLAM: Real-Time Dense Monocular SLAM with Neural Radiance Fields	Antoni Rosinol et.al.	2210.13641	link
2022-10-24	Compact simultaneous label-free autofluorescence multi-harmonic (SLAM) microscopy for user-friendly photodamage-monitored imaging	Geng Wang et.al.	2210.13556	null
2022-10-28	VP-SLAM: A Monocular Real-time Visual SLAM with Points, Lines and Vanishing Points	Andreas Georgis et.al.	2210.12756	null
2022-10-22	SLAM: Semantic Learning based Activation Map for Weakly Supervised Semantic Segmentation	Junliang Chen et.al.	2210.12417	null
2022-10-21	DCL-SLAM: A Distributed Collaborative LiDAR SLAM Framework for a Robotic Swarm	Shipeng Zhong et.al.	2210.11978	link
2022-10-21	Motion Primitives Based Kinodynamic RRT for Autonomous Vehicle Navigation in Complex Environments	Shubham Kedia et.al.	2210.11652	null
2022-10-22	Visual SLAM: What are the Current Trends and What to Expect?	Ali Tourani et.al.	2210.10491	null
2022-10-18	Split-KalmanNet: A Robust Model-Based Deep Learning Approach for SLAM	Geon Choi et.al.	2210.09636	null
2022-10-16	D2SLAM: Semantic visual SLAM based on the influence of Depth for Dynamic environments	Ayman Beghdadi et.al.	2210.08647	null
2022-10-16	Indoor Smartphone SLAM with Learned Echoic Location Features	Wenjie Luo et.al.	2210.08493	null
2022-10-15	Self-Improving SLAM in Dynamic Environments: Learning When to Mask	Adrian Bojko et.al.	2210.08350	link
2022-10-13	Design and Evaluation of a Generic Visual SLAM Framework for Multi-Camera Systems	Pushyami Kaveti et.al.	2210.07315	link
2022-10-12	RING++: Roto-translation Invariant Gram for Global Localization on a Sparse Scan Map	Xuecheng Xu et.al.	2210.05984	link
2022-10-11	Observability Analysis of Graph SLAM-Based Joint Calibration of Multiple Microphone Arrays and Sound Source Localization	Yuanzheng He et.al.	2210.05600	null
2022-10-11	Autonomous Asteroid Characterization Through Nanosatellite Swarming	Kaitlin Dennison et.al.	2210.05518	null
2022-10-11	DeepMLE: A Robust Deep Maximum Likelihood Estimator for Two-view Structure from Motion	Yuxi Xiao et.al.	2210.05517	null
2022-10-11	Multi-Object Navigation with dynamically learned neural implicit representations	Pierre Marza et.al.	2210.05129	link
2022-10-12	Spectral Sparsification for Communication-Efficient Collaborative Rotation and Translation Estimation	Yulun Tian et.al.	2210.05020	null
2022-10-10	Using Detection, Tracking and Prediction in Visual SLAM to Achieve Real-time Semantic Mapping of Dynamic Scenarios	Xingyu Chen et.al.	2210.04562	null
2022-10-09	Fusing Event-based Camera and Radar for SLAM Using Spiking Neural Networks with Continual STDP Learning	Ali Safa et.al.	2210.04236	null
2022-10-06	SCORE: A Second-Order Conic Initialization for Range-Aided SLAM	Alan Papalia et.al.	2210.03177	link
2022-10-06	Feature-Realistic Neural Fusion for Real-Time, Open Set Scene Understanding	Kirill Mazur et.al.	2210.03043	null
2022-10-06	Feasibility on Detecting Door Slamming towards Monitoring Early Signs of Domestic Violence	Osian Morgan et.al.	2210.02642	null
2022-10-05	MOTSLAM: MOT-assisted monocular dynamic SLAM using single-view depth estimation	Hanwei Zhang et.al.	2210.02038	null
2022-10-04	O2S: Open-source open shuttle	Nwankwo Linus et.al.	2210.01627	null
2022-10-04	Wi-Closure: Reliable and Efficient Search of Inter-robot Loop Closures Using Wireless Sensing	Weiying Wang et.al.	2210.01320	null
2022-10-03	Probabilistic Volumetric Fusion for Dense Monocular SLAM	Antoni Rosinol et.al.	2210.01276	null
2022-10-03	DRACo-SLAM: Distributed Robust Acoustic Communication-efficient SLAM for Imaging Sonar Equipped Underwater Robot Teams	John McConnell et.al.	2210.00867	link
2022-10-03	A Benchmark for Multi-Modal Lidar SLAM with Ground Truth in GNSS-Denied Environments	Ha Sier et.al.	2210.00812	link
2022-10-01	Det-SLAM: A semantic visual SLAM for highly dynamic scenes using Detectron2	Ali Eslamian et.al.	2210.00278	null
2022-09-30	PyPose: A Library for Robot Learning with Physics-based Optimization	Chen Wang et.al.	2209.15428	link
2022-09-29	DirectTracker: 3D Multi-Object Tracking Using Direct Image Alignment and Photometric Bundle Adjustment	Mariia Gladkova et.al.	2209.14965	null
2022-09-28	Robust Incremental Smoothing and Mapping (riSAM)	Daniel McGann et.al.	2209.14359	null
2022-09-27	Orbeez-SLAM: A Real-time Monocular Visual SLAM with ORB Features and NeRF-realized Mapping	Chi-Ming Chung et.al.	2209.13274	link
2022-09-24	Graph Neural Networks for Multi-Robot Active Information Acquisition	Mariliza Tzes et.al.	2209.12091	null
2022-09-24	Closing the Loop: Graph Networks to Unify Semantic Objects and Visual Features for Multi-object Scenes	Jonathan J. Y. Kim et.al.	2209.11894	null
2022-09-23	involve-MI: Informative Planning with High-Dimensional Non-Parametric Beliefs	Gilad Rotman et.al.	2209.11591	null
2022-09-23	Automatic Sign Reading and Localization for Semantic Mapping with an Office Robot	David Balaban et.al.	2209.11432	null
2022-09-22	SQ-SLAM: Monocular Semantic SLAM Based on Superquadric Object Representation	Xiao Han et.al.	2209.10817	null
2022-09-22	Acoustic SLAM based on the Direction-of-Arrival and the Direct-to-Reverberant Energy Ratio	Wenhao Qiu et.al.	2209.10726	null
2022-09-21	Visual Localization and Mapping in Dynamic and Changing Environments	João Carlos Virgolino Soares et.al.	2209.10710	null
2022-09-20	Uncertainty-Aware Tightly-Coupled GPS Fused LIO-SLAM	Sabir Hossain et.al.	2209.10047	null
2022-09-20	WGICP: Differentiable Weighted GICP-Based Lidar Odometry	Sanghyun Son et.al.	2209.09777	null
2022-09-20	PADLoC: LiDAR-Based Deep Loop Closure Detection and Registration using Panoptic Attention	José Arce et.al.	2209.09699	link
2022-09-19	MeSLAM: Memory Efficient SLAM based on Neural Fields	Evgenii Kruzhkov et.al.	2209.09357	null
2022-09-19	LMBAO: A Landmark Map for Bundle Adjustment Odometry in LiDAR SLAM	Letian Zhang et.al.	2209.08810	null
2022-09-18	HGI-SLAM: Loop Closure With Human and Geometric Importance Features	Shuhul Mujoo et.al.	2209.08608	null
2022-09-18	Data-driven Loop Closure Detection in Bathymetric Point Clouds for Underwater SLAM	Jiarui Tan et.al.	2209.08578	link
2022-09-17	DytanVO: Joint Refinement of Visual Odometry and Motion Segmentation in Dynamic Environments	Shihao Shen et.al.	2209.08430	link
2022-09-17	OA-SLAM: Leveraging Objects for Camera Relocalization in Visual SLAM	Matthieu Zins et.al.	2209.08338	null
2022-09-17	PlaneSLAM: Plane-based LiDAR SLAM for Motion Planning in Structured 3D Environments	Adam Dai et.al.	2209.08248	link
2022-09-16	ViWiD: Leveraging WiFi for Robust and Resource-Efficient SLAM	Aditya Arun et.al.	2209.08091	null
2022-09-16	iDF-SLAM: End-to-End RGB-D SLAM with Neural Implicit Mapping and Deep Feature Tracking	Yuhang Ming et.al.	2209.07919	null
2022-09-16	TwistSLAM++: Fusing multiple modalities for accurate dynamic semantic SLAM	Mathieu Gonzalez et.al.	2209.07888	null
2022-09-15	Landmark Management in the Application of Radar SLAM	Shuai Sun et.al.	2209.07199	link
2022-09-15	PROB-SLAM: Real-time Visual SLAM Based on Probabilistic Graph Optimization	Xianwei Meng et.al.	2209.07061	null
2022-09-14	Semantic Visual Simultaneous Localization and Mapping: A Survey	Kaiqi Chen et.al.	2209.06428	null
2022-09-13	Optimizing SLAM Evaluation Footprint Through Dynamic Range Coverage Analysis of Datasets	Islam Ali et.al.	2209.06316	null
2022-09-12	A Review on Visual-SLAM: Advancements from Geometric Modelling to Learning-based Semantic Scene Understanding	Tin Lai et.al.	2209.05222	null
2022-09-12	Attitude-Guided Loop Closure for Cameras with Negative Plane	Ze Wang et.al.	2209.05167	link
2022-09-09	General Place Recognition Survey: Towards the Real-world Autonomy Age	Peng Yin et.al.	2209.04497	link
2022-09-08	ExplORB-SLAM: Active Visual SLAM Exploiting the Pose-graph Topology	Julio A. Placed et.al.	2209.03693	link
2022-09-08	R $^3$ LIVE++: A Robust, Real-time, Radiance reconstruction package with a tightly-coupled LiDAR-Inertial-Visual state Estimator	Jiarong Lin et.al.	2209.03666	link
2022-09-06	Group- $k$ Consistent Measurement Set Maximization for Robust Outlier Detection	Brendon Forsgren et.al.	2209.02658	link
2022-09-05	Neuromorphic Visual Odometry with Resonator Networks	Alpha Renner et.al.	2209.02000	null
2022-09-05	MuCaSLAM: CNN-Based Frame Quality Assessment for Mobile Robot with Omnidirectional Visual SLAM	Pavel Karpyshev et.al.	2209.01936	null
2022-09-05	ElasticROS: An Elastically Collaborative Robot Operation System for Fog and Cloud Robotics	Boyi Liu et.al.	2209.01774	null
2022-09-04	CloudVision: DNN-based Visual Localization of Autonomous Robots using Prebuilt LiDAR Point Cloud	Evgeny Yudin et.al.	2209.01605	null
2022-08-31	PFilter: Building Persistent Maps through Feature Filtering for Fast and Accurate LiDAR-based SLAM	Yifan Duan et.al.	2208.14848	null
2022-08-30	BioSLAM: A Bio-inspired Lifelong Memory System for General Place Recognition	Peng Yin et.al.	2208.14543	null
2022-08-27	Learning to SLAM on the Fly in Unknown Environments: A Continual Learning Approach for Drones in Visually Ambiguous Scenes	Ali Safa et.al.	2208.12997	null
2022-08-25	FusionPortable: A Multi-Sensor Campus-Scene Dataset for Evaluation of Localization and Mapping Accuracy on Diverse Platforms	Jianhao Jiao et.al.	2208.11865	null
2022-08-25	Lidar SLAM for Autonomous Driving Vehicles	Farhad Aghili et.al.	2208.11855	null
2022-08-24	DynaVINS: A Visual-Inertial SLAM for Dynamic Environments	Seungwon Song et.al.	2208.11500	link
2022-08-22	Doppler Exploitation in Bistatic mmWave Radio SLAM	Yu Ge et.al.	2208.10204	null
2022-08-21	Hilti-Oxford Dataset: A Millimetre-Accurate Benchmark for Simultaneous Localization and Mapping	Lintong Zhang et.al.	2208.09825	link
2022-08-26	JVLDLoc: a Joint Optimization of Visual-LiDAR Constraints and Direction Priors for Localization in Driving Scenario	Longrui Dong et.al.	2208.09777	null
2022-08-15	BoW3D: Bag of Words for Real-time Loop Closing in 3D LiDAR SLAM	Yunge Cui et.al.	2208.07473	link
2022-08-12	Handling Constrained Optimization in Factor Graphs for Autonomous Navigation	Barbara Bazzana et.al.	2208.06325	null
2022-08-11	RelPose: Predicting Probabilistic Relative Rotation for Single Objects in the Wild	Jason Y. Zhang et.al.	2208.05963	null
2022-08-08	Visual-Inertial Multi-Instance Dynamic SLAM with Object-level Relocalisation	Yifei Ren et.al.	2208.04274	link
2022-08-08	SLAM-TKA: Real-time Intra-operative Measurement of Tibial Resection Plane in Conventional Total Knee Arthroplasty	Shuai Zhang et.al.	2208.03945	link
2022-08-05	A Survey on Visual Map Localization Using LiDARs and Cameras	Elhousni Mahdi et.al.	2208.03376	null
2022-08-04	SROS2: Usable Cyber Security Tools for ROS 2	Victor Mayoral Vilches et.al.	2208.02615	link
2022-08-03	Evaluation and comparison of eight popular Lidar and Visual SLAM algorithms	Bharath Garigipati et.al.	2208.02063	null
2022-08-02	Present and Future of SLAM in Extreme Underground Environments	Kamak Ebadi et.al.	2208.01787	null
2022-08-01	Visual-Inertial SLAM with Tightly-Coupled Dropout-Tolerant GPS Fusion	Simon Boche et.al.	2208.00709	null
2022-07-29	Neural Density-Distance Fields	Itsuki Ueda et.al.	2207.14455	link
2022-07-25	DeepFusion: Real-Time Dense 3D Reconstruction for Monocular SLAM using Single-View Depth and Gradient Predictions	Tristan Laidlow et.al.	2207.12244	null
2022-07-25	Scalable Fiducial Tag Localization on a 3D Prior Map via Graph-Theoretic Global Tag-Map Registration	Kenji Koide et.al.	2207.11942	null
2022-07-22	NeurAR: Neural Uncertainty for Autonomous 3D Reconstruction	Yunlong Ran et.al.	2207.10985	null
2022-07-22	Dense RGB-D-Inertial SLAM with Map Deformations	Tristan Laidlow et.al.	2207.10940	null
2022-07-22	PLD-SLAM: A Real-Time Visual SLAM Using Points and Line Segments in Dynamic Scenes	BaoSheng Zhang et.al.	2207.10916	null
2022-07-21	Multi-Event-Camera Depth Estimation and Outlier Rejection by Refocused Events Fusion	Suman Ghosh et.al.	2207.10494	link
2022-07-21	Online Localisation and Colored Mesh Reconstruction Architecture for 3D Visual Feedback in Robotic Exploration Missions	Quentin Serdel et.al.	2207.10489	link
2022-07-21	On applicability of von Karman's momentum theory in predicting the water entry load of V-shaped structures with varying initial velocity	Yujin Lu et.al.	2207.10413	null
2022-07-19	Hybrid Belief Pruning with Guarantees for Viewpoint-Dependent Semantic SLAM	Tuvy Lemberg et.al.	2207.09103	null
2022-07-18	DeFlowSLAM: Self-Supervised Scene Motion Decomposition for Dynamic Dense SLAM	Weicai Ye et.al.	2207.08794	link
2022-07-18	Revisiting PatchMatch Multi-View Stereo for Urban 3D Reconstruction	Marco Orsingher et.al.	2207.08439	null
2022-07-18	ORB-based SLAM accelerator on SoC FPGA	Vibhakar Vemulapati et.al.	2207.08405	null
2022-07-14	Challenges of SLAM in extremely unstructured environments: the DLR Planetary Stereo, Solid-State LiDAR, Inertial Dataset	Riccardo Giubilato et.al.	2207.06815	null
2022-07-14	Semi-supervised Vector-Quantization in Visual SLAM using HGCN	Amir Zarringhalam et.al.	2207.06738	null
2022-07-14	Self-supervised Vector-Quantization in Visual SLAM using Deep Convolutional Autoencoders	Amir Zarringhalam et.al.	2207.06732	null
2022-07-13	SLAM: SLO-Aware Memory Optimization for Serverless Applications	Gor Safaryan et.al.	2207.06183	null
2022-07-19	Structure PLP-SLAM: Efficient Sparse Mapping and Localization using Point, Line and Plane for Monocular, RGB-D and Stereo Cameras	Fangwen Shu et.al.	2207.06058	link
2022-07-12	Accelerating Certifiable Estimation with Preconditioned Eigensolvers	David M. Rosen et.al.	2207.05257	null
2022-07-12	Robust Key-Frame Stereo Visual SLAM with low-threshold Point and Line Features	Meiyu Zhi et.al.	2207.05244	null
2022-07-14	SLAM Backends with Objects in Motion: A Unifying Framework and Tutorial	Chih-Yuan Chiu et.al.	2207.05043	null
2022-07-08	BlindSpotNet: Seeing Where We Cannot See	Taichi Fukuda et.al.	2207.03870	null
2022-07-08	Continuous Target-free Extrinsic Calibration of a Multi-Sensor System from a Sequence of Static Viewpoints	Philipp Glira et.al.	2207.03785	null
2022-07-08	Distributed Ranging SLAM for Multiple Robots with Ultra-WideBand and Odometry Measurements	Ran Liu et.al.	2207.03700	null
2022-07-07	RWT-SLAM: Robust Visual SLAM for Highly Weak-textured Environments	Qihao Peng et.al.	2207.03539	null
2022-07-06	VI-SLAM2tag: Low-Effort Labeled Dataset Collection for Fingerprinting-Based Indoor Localization	Marius Laska et.al.	2207.02668	null
2022-07-06	A Novel Hybrid Endoscopic Dataset for Evaluating Machine Learning-based Photometric Image Enhancement Models	Axel Garcia-Vega et.al.	2207.02396	null
2022-07-04	VECtor: A Versatile Event-Centric Benchmark for Multi-Sensor SLAM	Ling Gao et.al.	2207.01404	null
2022-07-04	VIP-SLAM: An Efficient Tightly-Coupled RGB-D Visual Inertial Planar SLAM	Danpeng Chen et.al.	2207.01158	null
2022-07-03	Wireless Channel Prediction in Partially Observed Environments	Mingsheng Yin et.al.	2207.00934	null
2022-07-01	A Survey on Active Simultaneous Localization and Mapping: State of the Art and New Frontiers	Julio A. Placed et.al.	2207.00254	null
2022-07-01	Keeping Less is More: Point Sparsification for Visual SLAM	Yeonsoo Park et.al.	2207.00225	null
2022-06-30	Controlled and impulsive compression of an entrapped air bubble during impact	Utkarsh Jain et.al.	2206.15297	null
2022-06-30	Neural Rendering for Stereo 3D Reconstruction of Deformable Tissues in Robotic Surgery	Yuehao Wang et.al.	2206.15255	link
2022-06-27	IBISCape: A Simulated Benchmark for multi-modal SLAM Systems Evaluation in Large-scale Dynamic Environments	Abanob Soliman et.al.	2206.13455	link
2022-06-26	An Efficient Global Optimality Certificate for Landmark-Based SLAM	Connor Holmes et.al.	2206.12961	link
2022-06-21	Object Structural Points Representation for Graph-based Semantic Monocular Localization and Mapping	Davide Tateo et.al.	2206.10263	link
2022-06-20	Data Fusion for Radio Frequency SLAM with Robust Sampling	Erik Leitinger et.al.	2206.09746	null
2022-06-19	RF-LIO: Removal-First Tightly-coupled Lidar Inertial Odometry in High Dynamic Environments	Chenglong Qian et.al.	2206.09463	null
2022-06-17	Efficient WiFi LiDAR SLAM for Autonomous Robots in Large Environments	Khairuldanial Ismail et.al.	2206.08733	null
2022-06-17	An Algorithm for the SE(3)-Transformation on Neural Implicit Maps for Remapping Functions	Yijun Yuan et.al.	2206.08712	link
2022-06-13	ICP Algorithm: Theory, Practice And Its SLAM-oriented Taxonomy	Hao Bai et.al.	2206.06435	null
2022-06-10	Experimental Evaluation of Visual-Inertial Odometry Systems for Arable Farming	Javier Cremona et.al.	2206.05066	link
2022-06-09	SparseFormer: Attention-based Depth Completion Network	Frederik Warburg et.al.	2206.04557	null
2022-06-07	Robot Self-Calibration Using Actuated 3D Sensors	Arne Peters et.al.	2206.03430	null
2022-06-07	Object Scan Context: Object-centric Spatial Descriptor for Place Recognition within 3D Point Cloud Map	Haodong Yuan et.al.	2206.03062	null
2022-06-05	DarkSLAM: GAN-assisted Visual SLAM for Reliable Operation in Low-light Conditions	Alena Savinykh et.al.	2206.02199	null
2022-06-04	C $^3$ Fusion: Consistent Contrastive Colon Fusion, Towards Deep SLAM in Colonoscopy	Erez Posner et.al.	2206.01961	null
2022-06-01	PaGO-LOAM: Robust Ground-Optimized LiDAR Odometry	Dong-Uk Seo et.al.	2206.00266	link
2022-05-27	A Look at Improving Robustness in Visual-inertial SLAM by Moment Matching	Arno Solin et.al.	2205.13821	null
2022-05-31	LAMP 2.0: A Robust Multi-Robot SLAM System for Operation in Challenging Large-Scale Underground Environments	Yun Chang et.al.	2205.13135	link
2022-05-25	Wildcat: Online Continuous-Time 3D Lidar-Inertial SLAM	Milad Ramezani et.al.	2205.12595	null
2022-05-24	Loop Closure Prioritization for Efficient and Scalable Multi-Robot SLAM	Christopher E. Denniston et.al.	2205.12402	link
2022-05-22	ALITA: A Large-scale Incremental Dataset for Long-term Autonomy	Peng Yin et.al.	2205.10737	link
2022-05-19	FogROS 2: An Adaptive and Extensible Platform for Cloud and Fog Robotics Using ROS 2	Jeffrey Ichnowski et.al.	2205.09778	link
2022-05-17	Global Data Association for SLAM with 3D Grassmannian Manifold Objects	Parker C. Lusk et.al.	2205.08556	null
2022-05-19	Cluster on Wheels	Yuanyuan Yang et.al.	2205.08151	null
2022-05-12	Dynamic Dense RGB-D SLAM using Learning-based Visual Odometry	Shihao Shen et.al.	2205.05916	link
2022-05-12	S3E-GNN: Sparse Spatial Scene Embedding with Graph Neural Networks for Camera Relocalization	Ran Cheng et.al.	2205.05861	null
2022-05-14	Multi-modal Semantic SLAM for Complex Dynamic Environments	Han Wang et.al.	2205.04300	link
2022-05-06	OROS: Orchestrating ROS-driven Collaborative Connected Robots in Mission-Critical Operations	Carmen Delgado et.al.	2205.03256	null
2022-05-05	CNN-Augmented Visual-Inertial SLAM with Planar Constraints	Pan Ji et.al.	2205.02940	null
2022-05-05	PMBM-based SLAM Filters in 5G mmWave Vehicular Networks	Hyowon Kim et.al.	2205.02502	null
2022-05-04	BodySLAM: Joint Camera Localisation, Mapping, and Human Motion Tracking	Dorian Henning et.al.	2205.02301	null
2022-05-04	A Global Asymptotic Convergent Observer for SLAM	Seyed Hamed Hashemi et.al.	2205.01953	null
2022-05-04	Symmetry and Uncertainty-Aware Object SLAM for 6DoF Object Pose Estimation	Nathaniel Merrill et.al.	2205.01823	link
2022-05-03	GeoRefine: Self-Supervised Online Depth Refinement for Accurate Dense Mapping	Pan Ji et.al.	2205.01656	null
2022-04-29	Struct-MDC: Mesh-Refined Unsupervised Depth Completion Leveraging Structural Regularities from Visual SLAM	Jinwoo Jeon et.al.	2204.13877	link
2022-04-27	The Revisiting Problem in Simultaneous Localization and Mapping: A Survey on Visual Loop Closure Detection	Konstantinos A. Tsintotas et.al.	2204.12831	null
2022-04-27	Dynamic Registration: Joint Ego Motion Estimation and 3D Moving Object Detection in Dynamic Environment	Wenyu Li et.al.	2204.12769	null
2022-04-29	MLO: Multi-Object Tracking and Lidar Odometry in Dynamic Environment	Tingchen Ma et.al.	2204.11621	null
2022-04-23	Indoor simultaneous localization and mapping based on fringe projection profilometry	Yang Zhao et.al.	2204.11020	null
2022-04-22	Enough is Enough: Towards Autonomous Uncertainty-driven Stopping Criteria	Julio A. Placed et.al.	2204.10631	null
2022-04-22	Fast Autonomous Robotic Exploration Using the Underlying Graph Structure	Julio A. Placed et.al.	2204.10610	null
2022-04-22	Making Parameterization and Constrains of Object Landmark Globally Consistent via SPD(3) Manifold and Improved Cost Functions	Yutong Hu et.al.	2204.10552	null
2022-04-22	Implicit Object Mapping With Noisy Data	Jad Abou-Chakra et.al.	2204.10516	link
2022-04-19	Photometric single-view dense 3D reconstruction in endoscopy	Victor M. Batlle et.al.	2204.09083	null
2022-04-18	Pulsar skips: Understanding variations in the regular periods of rotating neutron stars	Clayton Miller et.al.	2204.08449	null
2022-04-18	Tracking monocular camera pose and deformation for SLAM inside the human body	Juan J. Gomez Rodriguez et.al.	2204.08309	null
2022-04-18	Mapping While Following: 2D LiDAR SLAM in Indoor Dynamic Environments with a Person Tracker	Hanjing Ye et.al.	2204.08163	null
2022-04-14	ViViD++: Vision for Visibility Dataset	Alex Junho Lee et.al.	2204.06183	null
2022-04-12	HiTPR: Hierarchical Transformer for Place Recognition in Point Cloud	Zhixing Hou et.al.	2204.05481	null
2022-04-12	RGB-D Semantic SLAM for Surgical Robot Navigation in the Operating Room	Cong Gao et.al.	2204.05467	null
2022-04-11	Optimized SC-F-LOAM: Optimized Fast LiDAR Odometry and Mapping Using Scan Context	Lizhou Liao et.al.	2204.04932	link
2022-04-04	Monitoring social distancing with single image depth estimation	Alessio Mingozzi et.al.	2204.01693	null
2022-04-01	Bi-directional Loop Closure for Visual SLAM	Ihtisham Ali et.al.	2204.01524	null
2022-04-04	IMOT: General-Purpose, Fast and Robust Estimation for Spatial Perception Problems with Outliers	Lei Sun et.al.	2204.01324	link
2022-04-03	Indoor Navigation Assistance for Visually Impaired People via Dynamic SLAM and Panoptic Segmentation with an RGB-D Sensor	Wenyan Ou et.al.	2204.01154	null
2022-04-02	UrbanFly: Uncertainty-Aware Planning for Navigation Amongst High-Rises with Monocular Visual-Inertial SLAM Maps	Ayyappa Swamy Thatavarthy et.al.	2204.00865	link
2022-03-31	Curiosity Driven Self-supervised Tactile Exploration of Unknown Objects	Yujie Lu et.al.	2204.00035	null
2022-03-30	GTP-SLAM: Game-Theoretic Priors for Simultaneous Localization and Mapping in Multi-Agent Scenarios	Chih-Yuan Chiu et.al.	2203.16690	null
2022-03-29	Indoor SLAM Using a Foot-mounted IMU and the local Magnetic Field	Mostafa Osman et.al.	2203.15866	null
2022-03-29	Eventor: An Efficient Event-Based Monocular Multi-View Stereo Accelerator on FPGA Platform	Mingjun Li et.al.	2203.15439	null
2022-03-29	Sparse Image based Navigation Architecture to Mitigate the need of precise Localization in Mobile Robots	Pranay Mathur et.al.	2203.15272	null
2022-03-28	Are High-Resolution Event Cameras Really Needed?	Daniel Gehrig et.al.	2203.14672	null
2022-03-25	Spectral Measurement Sparsification for Pose-Graph SLAM	Kevin J. Doherty et.al.	2203.13897	link
2022-03-25	FD-SLAM: 3-D Reconstruction Using Features and Dense Matching	Xingrui Yang et.al.	2203.13861	null
2022-03-25	Gravity-constrained point cloud registration	Vladimír Kubelka et.al.	2203.13799	null
2022-03-24	MD-SLAM: Multi-cue Direct SLAM	Luca Di Giammarino et.al.	2203.13237	link
2022-03-24	Unsupervised Simultaneous Learning for Camera Re-Localization and Depth Estimation from Video	Shun Taguchi et.al.	2203.12804	null
2022-03-19	Hybrid Active and Passive Sensing for SLAM in Wireless Communication Systems	Jie Yang et.al.	2203.10267	null
2022-03-16	Any Way You Look At It: Semantic Crossview Localization and Mapping with LiDAR	Ian D. Miller et.al.	2203.08925	link
2022-03-15	Neural RF SLAM for unsupervised positioning and mapping with channel state information	Shreya Kadambi et.al.	2203.08264	null
2022-03-15	Simultaneous Localisation and Mapping with Quadric Surfaces	Tristan Laidlow et.al.	2203.08040	null
2022-03-14	Drift Reduced Navigation with Deep Explainable Features	Mohd Omama et.al.	2203.06897	link
2022-03-11	An Efficient Accelerator for Deep Learning-based Point Cloud Registration on FPGAs	Keisuke Sugiura et.al.	2203.05763	null
2022-03-10	High Definition, Inexpensive, Underwater Mapping	Bharat Joshi et.al.	2203.05640	link
2022-03-10	SelfTune: Metrically Scaled Monocular Depth Estimation through Self-Supervised Learning	Jaehoon Choi et.al.	2203.05332	null
2022-03-08	Tune your Place Recognition: Self-Supervised Domain Calibration via Robust SLAM	Pierre-Yves Lajoie et.al.	2203.04446	link
2022-03-08	SLAM-Supported Self-Training for 6D Object Pose Estimation	Ziqi Lu et.al.	2203.04424	link
2022-03-08	An Online Semantic Mapping System for Extending and Enhancing Visual SLAM	Thorsten Hempel et.al.	2203.03944	null
2022-03-07	Multi-Modal Lidar Dataset for Benchmarking General-Purpose Localization and Mapping Algorithms	Qingqing Li et.al.	2203.03454	link
2022-03-07	OverlapTransformer: An Efficient and Rotation-Invariant Transformer Network for LiDAR-Based Place Recognition	Junyi Ma et.al.	2203.03397	link
2022-03-06	Minimum Cost Multicuts for Incorrect Landmark Edge Detection in Pose-graph SLAM	Kazushi Aiba et.al.	2203.02887	null
2022-03-06	RGB-D SLAM in Indoor Planar Environments with Multiple Large Dynamic Objects	Ran Long et.al.	2203.02882	null
2022-03-03	STUN: Self-Teaching Uncertainty Estimation for Place Recognition	Kaiwen Cai et.al.	2203.01851	link
2022-03-03	Continual SLAM: Beyond Lifelong Simultaneous Localization and Mapping through Continual Learning	Niclas Vödisch et.al.	2203.01578	link
2022-03-02	FAST-LIVO: Fast and Tightly-coupled Sparse-Direct LiDAR-Inertial-Visual Odometry	Chunran Zheng et.al.	2203.00893	link
2022-03-02	Distributed Riemannian Optimization with Lazy Communication for Collaborative Geometric Estimation	Yulun Tian et.al.	2203.00851	null
2022-03-01	Descriptellation: Deep Learned Constellation Descriptors for SLAM	Chunwei Xing et.al.	2203.00567	null
2022-03-01	Collaborative Robot Mapping using Spectral Graph Analysis	Lukas Bernreiter et.al.	2203.00308	null
2022-02-26	RL-PGO: Reinforcement Learning-based Planar Pose-Graph Optimization	Nikolaos Kourtzanidis et.al.	2202.13221	link
2022-02-25	Probabilistic Data Association for Semantic SLAM at Scale	Elad Michael et.al.	2202.12802	link
2022-02-24	TwistSLAM: Constrained SLAM in Dynamic Environment	Mathieu Gonzalez et.al.	2202.12384	null
2022-02-24	Light Robust Monocular Depth Estimation For Outdoor Environment Via Monochrome And Color Camera Fusion	Hyeonsoo Jang et.al.	2202.12108	null
2022-02-23	MITI: SLAM Benchmark for Laparoscopic Surgery	Regine Hartwig et.al.	2202.11496	null
2022-02-23	DL-SLOT: Dynamic Lidar SLAM and Object Tracking Based On Graph Optimization	Xuebo Tian et.al.	2202.11431	null
2022-02-23	Are We Ready for Robust and Resilient SLAM? A Framework For Quantitative Characterization of SLAM Datasets	Islam Ali et.al.	2202.11312	null
2022-02-22	SAGE: SLAM with Appearance and Geometry Prior for Endoscopy	Xingtong Liu et.al.	2202.09487	link
2022-02-18	OKVIS2: Realtime Scalable Visual-Inertial SLAM with Loop Closure	Stefan Leutenegger et.al.	2202.09199	null
2022-02-18	MultiRes-NetVLAD: Augmenting Place Recognition Training with Low-Resolution Imagery	Ahmad Khaliq et.al.	2202.09146	link
2022-02-18	An Energy-Efficient and Runtime-Reconfigurable FPGA-Based Accelerator for Robotic Localization Systems	Qiang Liu et.al.	2202.08952	null
2022-02-17	Continuous-Time vs. Discrete-Time Vision-based SLAM: A Comparative Study	Giovanni Cioffi et.al.	2202.08894	link
2022-02-17	LiDAR-Inertial 3D SLAM with Plane Constraint for Multi-story Building	Jiashi Zhang et.al.	2202.08487	null
2022-02-16	Virtual Maps for Autonomous Exploration of Cluttered Underwater Environments	Jinkun Wang et.al.	2202.08359	null
2022-02-11	Overhead Image Factors for Underwater Sonar-based SLAM	John McConnell et.al.	2202.05811	null
2022-02-10	Scale Estimation with Dual Quadrics for Monocular Object SLAM	Shuangfu Song et.al.	2202.04816	null
2022-02-08	A Novel Image Descriptor with Aggregated Semantic Skeleton Representation for Long-term Visual Place Recognition	Nie Jiwei et.al.	2202.03677	null
2022-01-25	Autonomous Vehicles: Open-Source Technologies, Considerations, and Development	Oussama Saoudi et.al.	2202.03148	null
2022-02-07	Temporal Point Cloud Completion with Pose Disturbance	Jieqi Shi et.al.	2202.03084	null
2022-02-04	DYP-SLAM: A Real-time Visual SLAM Based on YOLO and Probability in Dynamic Environments	Xinggang Hu et.al.	2202.01938	null
2022-02-01	A Model for Multi-View Residual Covariances based on Perspective Deformation	Alejandro Fontan et.al.	2202.00765	null
2022-01-30	Joint Vehicular Localization and Reflective Mapping Based on Team Channel-SLAM	Xinghe Chu et.al.	2201.12726	null
2022-01-28	RGB-D SLAM Using Attention Guided Frame Association	Ali Caglayan et.al.	2201.12047	null
2022-02-04	Learning to Act with Affordance-Aware Multimodal Neural SLAM	Zhiwei Jia et.al.	2201.09862	link
2022-01-22	Phase-SLAM: Phase Based Simultaneous Localization and Mapping for Mobile Structured Light Illumination Systems	Xi Zheng et.al.	2201.09048	link
2022-01-17	SC-LiDAR-SLAM: a Front-end Agnostic Versatile LiDAR SLAM System	Giseop Kim et.al.	2201.06423	null
2022-01-14	SRVIO: Super Robust Visual Inertial Odometry for dynamic environments and challenging Loop-closure conditions	Ali Samadzadeh et.al.	2201.05386	link
2022-01-19	Multi-Hypothesis Scan Matching through Clustering	Giorgio Iavicoli et.al.	2201.03814	null
2022-01-11	Performance Guarantees for Spectral Initialization in Rotation Averaging and Pose-Graph SLAM	Kevin J. Doherty et.al.	2201.03773	null
2022-01-10	High-resolution Ecosystem Mapping in Repetitive Environments Using Dual Camera SLAM	Brian M. Hopkinson et.al.	2201.03364	link
2022-01-10	Why-So-Deep: Towards Boosting Previously Trained Models for Visual Place Recognition	M. Usman Maqbool Bhutta et.al.	2201.03212	link
2022-01-04	Formulations of Hydrodynamic Force in the Transition Stage of the Water Entry of Linear Wedges with Constant and Varying Speeds	Xueliang Wen et.al.	2201.00959	null
2021-12-29	Efficient Belief Space Planning in High-Dimensional State Spaces using PIVOT: Predictive Incremental Variable Ordering Tactic	Khen Elimelech et.al.	2112.14428	null
2021-12-19	M2DGR: A Multi-sensor and Multi-scenario SLAM Dataset for Ground Robots	Jie Yin et.al.	2112.13659	link
2021-12-27	UV-SLAM: Unconstrained Line-based SLAM Using Vanishing Points for Structural Mapping	Hyunjun Lim et.al.	2112.13515	link
2021-12-25	Simultaneous Location of Rail Vehicles and Mapping of Environment with Multiple LiDARs	Yusheng Wang et.al.	2112.13224	null
2021-12-25	Edge Robotics: Edge-Computing-Accelerated Multi-Robot Simultaneous Localization and Mapping	Peng Huang et.al.	2112.13222	null
2021-12-24	3D Point Cloud Reconstruction and SLAM as an Input	Ziyu Li et.al.	2112.12907	null
2021-12-22	NICE-SLAM: Neural Implicit Scalable Encoding for SLAM	Zihan Zhu et.al.	2112.12130	link
2021-12-18	Fast and Robust Registration of Partially Overlapping Point Clouds	Eduardo Arnold et.al.	2112.09922	link
2021-12-17	Symmetry-aware Neural Architecture for Embodied Visual Navigation	Shuang Liu et.al.	2112.09515	null
2021-12-27	Homography Decomposition Networks for Planar Object Tracking	Xinrui Zhan et.al.	2112.07909	link
2021-12-14	Autonomous Navigation System from Simultaneous Localization and Mapping	Micheal Caracciolo et.al.	2112.07723	link
2021-12-12	360-DFPE: Leveraging Monocular 360-Layouts for Direct Floor Plan Estimation	Bolivar Solarte et.al.	2112.06180	link
2021-12-11	Simultaneous Localization and Mapping: Through the Lens of Nonlinear Optimization	Amay Saxena et.al.	2112.05921	null
2021-12-07	Hybrid Visual SLAM for Underwater Vehicle Manipulator Systems	Gideon Billings et.al.	2112.03826	link
2021-12-05	Iterated Posterior Linearization PMB Filter for 5G SLAM	Yu Ge et.al.	2112.02575	null
2021-12-03	Fast Direct Stereo Visual SLAM	Jiawei Mo et.al.	2112.01890	link
2021-12-02	MegBA: A High-Performance and Distributed Library for Large-Scale Bundle Adjustment	Jie Ren et.al.	2112.01349	link
2021-12-01	Research on Event Accumulator Settings for Event-Based SLAM	Kun Xiao et.al.	2112.00427	link
2021-11-29	An in-depth experimental study of sensor usage and visual reasoning of robots navigating in real environments	Assem Sadek et.al.	2111.14666	null
2021-11-29	Deployment of Aerial Robots after a major fire of an industrial hall with hazardous substances, a report	Hartmut Surmann et.al.	2111.14542	null
2021-11-24	Automatic Mapping with Obstacle Identification for Indoor Human Mobility Assessment	V. Ayala-Alfaro et.al.	2111.12690	null
2021-11-24	Autonomous bot with ML-based reactive navigation for indoor environment	Yash Srivastava et.al.	2111.12542	null
2021-11-22	A General Framework for Lifelong Localization and Mapping in Changing Environment	Min Zhao et.al.	2111.10946	link
2021-11-17	Probabilistic Spatial Distribution Prior Based Attentional Keypoints Matching Network	Xiaoming Zhao et.al.	2111.09006	null
2021-11-10	Comparing dominance of tennis' big three via multiple-output Bayesian quantile regression models	Bruno Santos et.al.	2111.05631	null
2021-11-10	TomoSLAM: factor graph optimization for rotation angle refinement in microtomography	Mark Griguletskii et.al.	2111.05562	null
2021-11-07	Hierarchical Segment-based Optimization for SLAM	Yuxin Tian et.al.	2111.04101	null
2021-11-07	Online Mutual Adaptation of Deep Depth Prediction and Visual SLAM	Shing Yan Loo et.al.	2111.04096	null
2021-11-05	MSC-VO: Exploiting Manhattan and Structural Constraints for Visual Odometry	Joan P. Company-Corcoles et.al.	2111.03408	null
2021-10-31	Loop closure detection using local 3D deep descriptors	Youjie Zhou et.al.	2111.00440	link
2021-10-27	Millimeter Wave Wireless Assisted Robot Navigation with Link State Classification	Mingsheng Yin et.al.	2110.14789	link
2021-10-27	Efficient Placard Discovery for Semantic Mapping During Frontier Exploration	David Balaban et.al.	2110.14742	null
2021-10-26	Robust Multi-view Registration of Point Sets with Laplacian Mixture Model	Jin Zhang et.al.	2110.13744	null
2021-10-25	WOLF: A modular estimation framework for robotics based on factor graphs	Joan Sola et.al.	2110.12919	null
2021-10-21	Real-Time Ground-Plane Refined LiDAR SLAM	Fan Yang et.al.	2110.11517	null
2021-10-21	SymbioLCD: Ensemble-Based Loop Closure Detection using CNN-Extracted Objects and Visual Bag-of-Words	Jonathan J. Y. Kim et.al.	2110.11491	null
2021-10-21	InterpolationSLAM: A Novel Robust Visual SLAM System in Rotational Motion	Zhenkun Zhu et.al.	2110.11040	null
2021-10-20	SLAM: A Unified Encoder for Speech and Language Modeling via Speech-Text Joint Pre-Training	Ankur Bapna et.al.	2110.10329	null
2021-10-18	Enhancing exploration algorithms for navigation with visual SLAM	Kirill Muravyev et.al.	2110.09156	null
2021-10-18	Accurate and Robust Object-oriented SLAM with 3D Quadric Landmark Construction in Outdoor Environment	Rui Tian et.al.	2110.08977	null
2021-10-16	Partial Hierarchical Pose Graph Optimization for SLAM	Alexander Korovko et.al.	2110.08639	null
2021-10-14	Active SLAM over Continuous Trajectory and Control: A Covariance-Feedback Approach	Shumon Koga et.al.	2110.07546	null
2021-10-13	Collaborative Radio SLAM for Multiple Robots based on WiFi Fingerprint Similarity	Ran Liu et.al.	2110.06541	null
2021-10-12	Learning Efficient Multi-Agent Cooperative Visual Exploration	Chao Yu et.al.	2110.05734	null
2021-10-07	Self-Supervised Depth Completion for Active Stereo	Frederik Warburg et.al.	2110.03234	null
2021-10-06	InterpolationSLAM: A Novel Robust Visual SLAM System in Rotating Scenes	Zhenkun Zhu et.al.	2110.02593	null
2021-10-03	AEROS: Adaptive RObust least-Squares for Graph-Based SLAM	Milad Ramezani et.al.	2110.02018	null
2021-10-04	Fast Uncertainty Quantification for Active Graph SLAM	Julio A. Placed et.al.	2110.01289	link
2021-10-04	Geometry-based Graph Pruning for Lifelong SLAM	Gerhard Kurz et.al.	2110.01286	null
2021-10-03	Quadrotor Control on $SU(2)\times R^3$ with SLAM Integration	Marcus Greiff et.al.	2110.01099	null
2021-10-02	Online Incremental Non-Gaussian Inference for SLAM Using Normalizing Flows	Qiangqiang Huang et.al.	2110.00876	link

(back to top)

SFM

Publish Date	Title	Authors	PDF	Code
2025-12-04	Deep infant brain segmentation from multi-contrast MRI	Malte Hoffmann et.al.	2512.05114	null
2025-12-04	QKAN-LSTM: Quantum-inspired Kolmogorov-Arnold Long Short-term Memory	Yu-Chao Hsu et.al.	2512.05049	null
2025-12-04	Geometric Data Science	Olga D Anosova et.al.	2512.05040	null
2025-12-04	Internal superfluid response and torque evolution in the giant glitch of PSR J1718-3718	Peng Liu et.al.	2512.04972	null
2025-12-04	Canonical Rough Path over Tempered Fractional Brownian Motion: Existence, Construction, and Applications	Atef Lechiheb et.al.	2512.04646	null
2025-12-04	Refaçade: Editing Object with Given Reference Texture	Youze Huang et.al.	2512.04534	null
2025-12-04	Development of a 15-Degree-of-Freedom Bionic Hand with Cable-Driven Transmission and Distributed Actuation	Haoqi Han et.al.	2512.04399	null
2025-12-03	Gamma-from-Mono: Road-Relative, Metric, Self-Supervised Monocular Geometry for Vehicular Applications	Gasser Elazab et.al.	2512.04303	null
2025-12-03	Emergent Outlier View Rejection in Visual Geometry Grounded Transformers	Jisang Han et.al.	2512.04012	null
2025-12-03	DirectDrag: High-Fidelity, Mask-Free, Prompt-Free Drag-based Image Editing via Readout-Guided Feature Alignment	Sheng-Hao Liao et.al.	2512.03981	null
2025-11-26	TraceGen: World Modeling in 3D Trace Space Enables Learning from Cross-Embodiment Videos	Seungjae Lee et.al.	2511.21690	null
2025-11-26	UAVLight: A Benchmark for Illumination-Robust 3D Reconstruction in Unmanned Aerial Vehicle (UAV) Scenes	Kang Du et.al.	2511.21565	null
2025-11-26	From Observation to Action: Latent Action-based Primitive Segmentation for VLA Pre-training in Industrial Settings	Jiajie Zhang et.al.	2511.21428	null
2025-11-26	DeepRFTv2: Kernel-level Learning for Image Deblurring	Xintian Mao et.al.	2511.21132	null
2025-11-25	Hund-projected Kanamori model: an effective description of Hund's metals near the Mott insulating regime	Johan Carlström et.al.	2511.20788	null
2025-11-25	From Observations to Simulations: A Neural-Network Approach to Intracluster Medium Kinematics	E. Gatuzz et.al.	2511.20755	null
2025-11-25	Diverse Video Generation with Determinantal Point Process-Guided Policy Optimization	Tahira Kazimi et.al.	2511.20647	null
2025-11-25	Dance Style Classification using Laban-Inspired and Frequency-Domain Motion Features	Ben Hamscher et.al.	2511.20469	null
2025-11-25	AMB3R: Accurate Feed-forward Metric-scale 3D Reconstruction with Backend	Hengyi Wang et.al.	2511.20343	null
2025-11-25	Stochastic Dynamics of Skyrmions on a Racetrack: Impact of Equilibrium and Nonequilibrium Noise	Anton V. Hlushchenko et.al.	2511.20287	null
2025-11-24	Dynamic Multi-Species Bird Soundscape Generation with Acoustic Patterning and 3D Spatialization	Ellie L. Zhang et.al.	2511.19275	null
2025-11-24	A Deep-Learning-Based Framework for Focal Mechanism Determination and Its Application to the 2022 Luding Earthquake Sequence	Ziye Yu et.al.	2511.19185	null
2025-11-24	MetroGS: Efficient and Stable Reconstruction of Geometrically Accurate High-Fidelity Large-Scale Scenes	Kehua Chen et.al.	2511.19172	null
2025-11-24	The variability of blazars throughout the electromagnetic spectrum	Claudia M. Raiteri et.al.	2511.18975	null
2025-11-24	MagicWorld: Interactive Geometry-driven Video World Exploration	Guangyuan Li et.al.	2511.18886	null
2025-11-24	STCDiT: Spatio-Temporally Consistent Diffusion Transformer for High-Quality Video Super-Resolution	Junyang Chen et.al.	2511.18786	null
2025-11-24	On the role of fractional Brownian motion in models of chemotaxis and stochastic gradient ascent	Gustavo Cornejo-Olea et.al.	2511.18745	null
2025-11-23	C3Po: Cross-View Cross-Modality Correspondence by Pointmap Prediction	Kuan Wei Huang et.al.	2511.18559	null
2025-11-23	Non-Symplectic Deformations of Geometric Quantisation	Kerr Maxwell et.al.	2511.18549	null
2025-11-23	Zero-Shot Video Deraining with Video Diffusion Models	Tuomas Varanka et.al.	2511.18537	null
2025-11-23	Expanding the Workspace of Electromagnetic Navigation Systems Using Dynamic Feedback for Single- and Multi-agent Control	Jasan Zughaibi et.al.	2511.18486	null
2025-11-23	Escape from end-pinching in Herschel-Bulkley ligaments	Shu Yang et.al.	2511.18388	null
2025-11-23	EgoVITA: Learning to Plan and Verify for Egocentric Video Reasoning	Yogesh Kulkarni et.al.	2511.18242	null
2025-11-22	MotionDuet: Dual-Conditioned 3D Human Motion Generation with Video-Regularized Text Learning	Yi-Yang Zhang et.al.	2511.18209	null
2025-11-22	A Unified Multi-Dynamics Framework for Perception-Oriented Modeling in Tendon-Driven Continuum Robots	Ibrahim Alsarraj et.al.	2511.18088	null
2025-11-22	Plan-X: Instruct Video Generation via Semantic Planning	Lun Huang et.al.	2511.17986	null
2025-11-22	Dynamic Slowdown and Spatial Correlations in Viscous Silica Melt: Perspectives from Dynamic Disorder	Shubham Kumar et.al.	2511.17887	null
2025-11-21	Lane-Frame Quantum Multimodal Driving Forecasts for the Trajectory of Autonomous Vehicles	Navneet Singh et.al.	2511.17675	null
2025-11-18	Unified Low-Light Traffic Image Enhancement via Multi-Stage Illumination Recovery and Adaptive Noise Suppression	Siddiqua Namrah et.al.	2511.17612	null
2025-10-24	RadioMapMotion: A Dataset and Baseline for Proactive Spatio-Temporal Radio Environment Prediction	Honggang Jia et.al.	2511.17526	null
2025-11-21	TRAO Survey of the Nearby Filamentary Molecular Clouds, the Universal Nursery of Stars (TRAO-FUNS). IV. Filaments and Dense Cores in the W40 and Serpens South Regions of Aquila	Satyajeet Moharana et.al.	2511.16978	null
2025-11-21	One Walk is All You Need: Data-Efficient 3D RF Scene Reconstruction with Human Movements	Yiheng Bian et.al.	2511.16966	null
2025-11-20	TriDiff-4D: Fast 4D Generation through Diffusion-based Triplane Re-posing	Eddie Pokming Sheung et.al.	2511.16662	null
2025-11-20	Flow and Depth Assisted Video Prediction with Latent Transformer	Eliyas Suleyman et.al.	2511.16484	null
2025-11-20	Two Epochs of VLBI Observations of 8 KISSR Seyfert & LINER Galaxies: Suggestions of Fast and Filamentary Outflows	Preeti Kharb et.al.	2511.16159	null
2025-11-19	MambaIO: Global-Coordinate Inertial Odometry for Pedestrians via Multi-Scale Frequency-Decoupled Modeling	Shanshan Zhang et.al.	2511.15645	null
2025-11-19	Covariant Measures of Non-Markovianity in Curved Spacetime	Tushar Waghmare et.al.	2511.15365	null
2025-11-19	Generating Natural-Language Surgical Feedback: From Structured Representation to Domain-Grounded Evaluation	Firdavs Nasriddinov et.al.	2511.15159	null
2025-11-19	SceneEdited: A City-Scale Benchmark for 3D HD Map Updating via Image-Guided Change Detection	Chun-Jung Lin et.al.	2511.15153	null
2025-11-18	Gaussian See, Gaussian Do: Semantic 3D Motion Transfer from Multiview Video	Yarin Bekor et.al.	2511.14848	null
2025-11-18	Blur-Robust Detection via Feature Restoration: An End-to-End Framework for Prior-Guided Infrared UAV Target Detection	Xiaolin Wang et.al.	2511.14371	null
2025-11-18	Hubble Space Telescope proper motions of Large Magellanic Cloud star clusters -- II. Kinematic structure of young and intermediate-age clusters	F. Niederhofer et.al.	2511.14351	null
2025-11-18	Vortex stability in pseudo-Hermitian theories	R. A. Battye et.al.	2511.14300	null
2025-11-18	Model-Based Clustering of Football Event Sequences: A Marked Spatio-Temporal Point Process Mixture Approach	Koffi Amezouwui et.al.	2511.14297	null
2025-11-18	Newborn jet in the symbiotic system R Aquarii	T. Liimets et.al.	2511.14243	null
2025-11-18	FreeMusco: Motion-Free Learning of Latent Control for Morphology-Adaptive Locomotion in Musculoskeletal Characters	Minkwan Kim et.al.	2511.14205	null
2025-11-18	AsyncVLA: Asynchronous Flow Matching for Vision-Language-Action Models	Yuhua Jiang et.al.	2511.14148	null
2025-11-17	B2F: End-to-End Body-to-Face Motion Generation with Style Reference	Bokyung Jang et.al.	2511.13988	null
2025-11-17	Enabling Real-Time Volumetric Imaging in Interventional Radiology Suits via a Deep Learning Framework Robust to C-arm Tilt	Fawazilla Utomo et.al.	2511.13980	null
2025-11-17	Ultrafast electron diffractive imaging of the dissociation of pre-excited molecules	Yanwei Xiong et.al.	2511.13479	null
2025-11-17	An Automated Framework for Analyzing Structural Evolution in On-the-fly Non-adiabatic Molecular Dynamics Using Autoencoder and Multiple Molecular Descriptors	Hangxu Liu et.al.	2511.13364	null
2025-11-17	The Spontaneous Genesis of Solar Prominence Structures Driven by Supergranulation in Three-Dimensional Simulations	Huanxin Chen et.al.	2511.13252	null
2025-11-17	Infrared photometry and CaT spectroscopy of the most metal-poor in-situ globular cluster VVV-CL001	W. Haro Moya et.al.	2511.13161	null
2025-11-16	Kagome metals	Domenico Di Sante et.al.	2511.12731	null
2025-11-16	Examining Turbulence in Galactic Molecular Clouds - II: Continuity of Turbulence Cascading in a Portion of the Local Arm	Yuehui Ma et.al.	2511.12418	null
2025-11-16	Towards Rotation-only Imaging Geometry: Rotation Estimation	Xinrui Li et.al.	2511.12415	null
2025-11-14	Free3D: 3D Human Motion Emerges from Single-View 2D Supervision	Sheng Liu et.al.	2511.11368	null
2025-11-14	YCB-Ev SD: Synthetic event-vision dataset for 6DoF object pose estimation	Pavel Rojtberg et.al.	2511.11344	null
2025-11-14	The Spatial Evolution of Star Clusters in NGC 628 with JWST	Anne S. M. Buckner et.al.	2511.11115	null
2025-11-14	Discovery of an X-ray bridge between the comma-shaped gas and the main cluster in MCXC J0157.4-0550	Chong Yang et.al.	2511.10968	null
2025-11-14	DEFT-LLM: Disentangled Expert Feature Tuning for Micro-Expression Recognition	Ren Zhang et.al.	2511.10948	null
2025-11-14	A High-Precision Dynamical Model of Callisto: Incorporating Rotation Effects within Multi-Layer Internal Structure Models	Kai Huang et.al.	2511.10929	null
2025-11-14	Collaborative Multi-Robot Non-Prehensile Manipulation via Flow-Matching Co-Generation	Yorai Shaoul et.al.	2511.10874	null
2025-11-13	A validated lumped-element model for bioinspired acoustic flow sensing toward the performance limit	Wei Sun et.al.	2511.10830	null
2025-11-13	From Attention to Frequency: Integration of Vision Transformer and FFT-ReLU for Enhanced Image Deblurring	Syed Mumtahin Mahmud et.al.	2511.10806	null
2025-11-13	Towards Attribution of Generators and Emotional Manipulation in Cross-Lingual Synthetic Speech using Geometric Learning	Girish et.al.	2511.10790	null
2025-11-13	The Quiescent Merging Nature of the Coma Cluster Revealed by ICM Velocity Structure	E. Gatuzz et.al.	2511.10740	null
2025-11-13	From Fold to Function: Dynamic Modeling and Simulation-Driven Design of Origami Mechanisms	Tianhui Han et.al.	2511.10580	null
2025-11-13	M3Scope a 3D multimode multiplane microscope for imaging nanoscale dynamics in soft matter	Steven Huysecom et.al.	2511.10174	null
2025-11-13	Physics-informed Machine Learning for Static Friction Modeling in Robotic Manipulators Based on Kolmogorov-Arnold Networks	Yizheng Wang et.al.	2511.10079	null
2025-11-13	Mitigating Error Accumulation in Co-Speech Motion Generation via Global Rotation Diffusion and Multi-Level Constraints	Xiangyue Zhang et.al.	2511.10076	null
2025-11-13	PuffyBot: An Untethered Shape Morphing Robot for Multi-environment Locomotion	Shashwat Singh et.al.	2511.09885	null
2025-11-13	AHA! Animating Human Avatars in Diverse Scenes with Gaussian Splatting	Aymen Mir et.al.	2511.09827	null
2025-11-12	DreamPose3D: Hallucinative Diffusion with Prompt Learning for 3D Human Pose Estimation	Jerrin Bright et.al.	2511.09502	null
2025-11-12	SPIDER: Scalable Physics-Informed Dexterous Retargeting	Chaoyi Pan et.al.	2511.09484	null
2025-11-12	3D PIC simulation and theoretical modeling of RF Laser pulse in magnetized plasma for the generation of multidimensional relativistic Wakefields	A. A. Molavi Choobini et.al.	2511.09079	null
2025-11-12	Group-Theoretic Structure Governing Identifiability in Inverse Problems	Isshin Arai et.al.	2511.08995	null
2025-11-11	Resolving Thermospheric Vertical Wind Ambiguities and Energy Processes	Jeffrey P. Thayer et.al.	2511.08830	null
2025-11-11	Analytical Description of Baryonic Matter Fluctuations Using Jeans Filtering Functions in Second-Order Cosmological Perturbation Theory	Diego Fernando Fonseca et.al.	2511.08820	null
2025-11-11	3D MHD simulations of coronal loops heated via magnetic braiding I. Continuous driving	Gabriele Cozzo et.al.	2511.08726	null
2025-11-11	Coordinated Space- and Ground-based Monitoring of Accretion Bursts in a Protoplanetary Disk: The Orbital and Accretion Properties of DQ Tau	Hala Alqubelat et.al.	2511.08311	null
2025-11-11	Direction and speed selectivity properties for spatio-temporal receptive fields according to the generalized Gaussian derivative model for visual receptive fields	Tony Lindeberg et.al.	2511.08101	null
2025-11-17	Silicon-photonic optomechanical magnetometer	Fernando Gottardo et.al.	2511.07852	null
2025-11-11	Human Motion Synthesis in 3D Scenes via Unified Scene Semantic Occupancy	Gong Jingyu et.al.	2511.07819	null
2025-11-10	DIMO: Diverse 3D Motion Generation for Arbitrary Objects	Linzhan Mou et.al.	2511.07409	null
2025-11-10	Ultrafast Topological Transitions Driven by Permittivity Modulation in Non-Hermitian Multilayers	Giuseppina Simone et.al.	2511.06963	null
2025-11-10	Ambiguity-aware Truncated Flow Matching for Ambiguous Medical Image Segmentation	Fanding Li et.al.	2511.06857	null
2025-11-10	SDSS-ALMA Legacy Value Archival Gas Exploration (SALVAGE) -- I: global star formation is governed by central (not global) molecular gas	Scott Wilkinson et.al.	2511.06775	null
2025-11-08	Development and testing of novel soft sleeve actuators	Mohammed Abboodi et.al.	2511.06102	null
2025-11-08	Hybrid CNN-ViT Framework for Motion-Blurred Scene Text Restoration	Umar Rashid et.al.	2511.06087	null
2025-11-08	Equilibrium Portfolio Selection under Utility-Variance Analysis of Log Returns in Incomplete Markets	Yue Cao et.al.	2511.05861	null
2025-11-08	Supermassive Black Hole and Broad-line Region in NGC 5548: 2023 Reverberation Mapping Results	Wen-Zhe Xi et.al.	2511.05851	null
2025-11-07	A dual grid geometric electromagnetic particle in cell method	Katharina Kormann et.al.	2511.05032	null
2025-11-06	Kinematic and extinction analysis of a potential spiral arm beyond the Galactic bar	Simran Joharle et.al.	2511.04778	null
2025-11-06	Sub-Gyr variability around the SFMS and its contribution to the scatter	A. Camps-Fariña et.al.	2511.04745	null
2025-11-06	Dissecting coherent motions in extreme wall shear stress events within adverse pressure gradient turbulent boundary layers	Leandro J. O. Silva et.al.	2511.04620	null
2025-11-21	Disentangled Concepts Speak Louder Than Words: Explainable Video Action Recognition	Jongseo Lee et.al.	2511.03725	null
2025-11-05	Extreme-Mass-Ratio Inspirals Embedded in Dark Matter Halo I:Existence of Homoclinic Orbit and Near-Horizon Chaos	Surajit Das et.al.	2511.03657	null
2025-11-04	Comparative Investigations on Active and Passive Tails of Undulating Swimmers	Dev Pradeepkumar Nayak et.al.	2511.03057	null
2025-11-04	Distributions and evolution of the equatorial rotation velocities of 2937 BAF-type main-sequence stars from asteroseismology	Conny Aerts et.al.	2511.02909	null
2025-11-04	Cycle-Sync: Robust Global Camera Pose Estimation through Enhanced Cycle-Consistent Synchronization	Shaohan Li et.al.	2511.02329	null
2025-11-04	Characterizing the astrometric quality of AGNs in Gaia-CRF3	Shilong Liao et.al.	2511.02204	null
2025-11-03	Fractional Diffusion Bridge Models	Gabriel Nobis et.al.	2511.01795	null
2025-11-03	Phason-driven temperature-dependent transport in moiré graphene	Alex Boschi et.al.	2511.01691	null
2025-11-03	Apsidal motion in massive binaries	Sophie Rosu et.al.	2511.01522	null
2025-11-12	Robust topological invariants of timelike circular orbits for spinning test particles in black hole spacetimes	Yong Song et.al.	2511.01447	null
2025-11-04	Kinematify: Open-Vocabulary Synthesis of High-DoF Articulated Objects	Jiawei Wang et.al.	2511.01294	null
2025-11-03	Speech-DRAME: A Framework for Human-Aligned Benchmarks in Speech Role-Play	Jiatong Shi et.al.	2511.01261	null
2025-11-02	From Spray to Metric: The Geometric Construction of the Jacobi Metric	Zonghai Li et.al.	2511.01004	null
2025-11-02	The CatWISE2020 Quasar dipole: A Reassessment of the Cosmic Dipole Anomaly	Masroor Bashir et.al.	2511.00822	null
2025-11-02	Real-Time Learning of Predictive Dynamic Obstacle Models for Robotic Motion Planning	Stella Kombo et.al.	2511.00814	null
2025-11-01	Oitijjo-3D: Generative AI Framework for Rapid 3D Heritage Reconstruction from Street View Imagery	Momen Khandoker Ope et.al.	2511.00362	null
2025-11-17	Deep Chandra X-ray Observations of Abell 2029: the Merger History of a Relaxed, Strong Cool Core Cluster	Courtney B. Watson et.al.	2511.00250	null
2025-10-30	Comparing the magnetic Rayleigh-Taylor instability dynamics in two- and three-dimensions	Manohar Teja Kalluri et.al.	2510.27053	null
2025-10-30	HEIR: Learning Graph-Based Motion Hierarchies	Cheng Zheng et.al.	2510.26786	null
2025-10-30	Wrinkle-Induced Hexagonal Boron Nitride Nanochannels for Biomolecule Localization and Imaging	Xiliang Yang et.al.	2510.26370	null
2025-10-30	Ram pressure shaping HVC droplets -- FAST HI observations of HVC AC-III and theoretical interpretation	Xunchuan Liu et.al.	2510.26077	null
2025-10-29	Spherically Symmetric Quantum-Corrected Black Holes with String Clouds: A Multi-Observable Analysis	Faizuddin Ahmed et.al.	2510.25764	null
2025-10-29	Lost in Phonation: Voice Quality Variation as an Evaluation Dimension for Speech Foundation Models	Harm Lameris et.al.	2510.25577	null
2025-10-29	4-Doodle: Text to 3D Sketches that Move!	Hao Chen et.al.	2510.25319	null
2025-10-27	SFMS-ALR: Script-First Multilingual Speech Synthesis with Adaptive Locale Resolution	Dharma Teja Donepudi et.al.	2510.25178	null
2025-10-29	Magnetic Fields in Massive Star-forming Regions (MagMaR). VI. Magnetic Field Dragging in the Filamentary High-mass Star-forming Region G35.20--0.74N due to Gravity	Jihye Hwang et.al.	2510.25078	null
2025-10-28	The Binary Ballet: Mapping Local Expansion Around M81 & M82	Jenny Wagner et.al.	2510.24840	null
2025-10-29	Leveraging Scale Separation and Stochastic Closure for Data-Driven Prediction of Chaotic Dynamics	Ismaël Zighed et.al.	2510.24583	null
2025-10-28	Tracking the normal modes of an overpass highway bridge using Distributed Acoustic Sensing	E. Diego Mercerat et.al.	2510.24212	null
2025-10-28	High-energy droplet collisions in multi-interacting hollow cone sprays	Narendra Dev et.al.	2510.24207	null
2025-10-27	Adaptive Keyframe Selection for Scalable 3D Scene Reconstruction in Dynamic Environments	Raman Jha et.al.	2510.23928	null
2025-10-27	Non-Markovian quantum Mpemba effect in strongly correlated quantum dots	YuanDong Wang et.al.	2510.23445	null
2025-10-27	FlowCapX: Physics-Grounded Flow Capture with Long-Term Consistency	Ningxiao Tao et.al.	2510.23122	null
2025-10-27	EndoWave: Rational-Wavelet 4D Gaussian Splatting for Endoscopic Reconstruction	Taoyu Wu et.al.	2510.23087	null
2025-10-27	Adapting Speech Foundation Models with Large Language Models for Unified Speech Recognition	Jing-Xuan Zhang et.al.	2510.22961	null
2025-10-26	MAGIC-Talk: Motion-aware Audio-Driven Talking Face Generation with Customizable Identity Control	Fatemeh Nazarieh et.al.	2510.22810	null
2025-10-26	Kinematics of Acceleration-Induced Excitations in Confined Quantum Fields	Hemansh Shah et.al.	2510.22797	null
2025-10-25	Learning 3D Anisotropic Noise Distributions Improves Molecular Force Field Modeling	Xixian Liu et.al.	2510.22123	null
2025-10-21	Vertex and front-tracking methods for the modeling of microstructure evolution at the solid state: a brief review	Marc Bernacki et.al.	2510.21818	null
2025-10-14	Beyond mechanochromism: Programmable multimodal actuation in cholesteric liquid crystal elastomer hollow fibers	Jiazhe Ma et.al.	2510.21765	null
2025-10-24	Group Inertial Poser: Multi-Person Pose and Global Translation from Sparse Inertial Sensors and Ultra-Wideband Ranging	Ying Xue et.al.	2510.21654	null
2025-10-24	Magnetic Field Configuration of a Quiescent Prominence Revealed by Large-amplitude Longitudinal Oscillations in End-view Observations	Jun Dai et.al.	2510.21487	null
2025-10-23	Kinetics of Peierls dimerization transition: Machine learning force-field approach	Ho Jang et.al.	2510.20659	null
2025-10-23	RubbleSim: A Photorealistic Structural Collapse Simulator for Confined Space Mapping	Constantine Frost et.al.	2510.20529	null
2025-10-23	A simple model for PDFs and nPDFs	A. V. Kotikov et.al.	2510.20139	null
2025-10-22	Stochastic dynamics of quasiparticles in the hard rod gas	Seema Chahal et.al.	2510.19693	null
2025-10-22	Probing Accretion Disk Winds of Stratified Nature with Fe XXVI Doublet in Black Hole X-ray Binaries	Keigo Fukumura et.al.	2510.19539	null
2025-10-22	PRGCN: A Graph Memory Network for Cross-Sequence Pattern Reuse in 3D Human Pose Estimation	Zhuoyang Xie et.al.	2510.19475	null
2025-10-22	Advances in 4D Representation: Geometry, Motion, and Interaction	Mingrui Zhao et.al.	2510.19255	null
2025-10-21	The slope and scatter of the star forming main sequence at z~5 : reconciling observations with simulations	Claudia Di Cesare et.al.	2510.19044	null
2025-10-21	$\nabla$ -SDF: Learning Euclidean Signed Distance Functions Online with Gradient-Augmented Octree Interpolation and Neural Residual	Zhirui Dai et.al.	2510.18999	null
2025-10-21	Uniqueness of Angular Velocity Reconstruction in Parallel-Beam and Diffraction Tomography	Peter Elbau et.al.	2510.18829	null
2025-10-21	Nonthermal electron acceleration in turbulent post-flare coronal loops	Clarissa Mora et.al.	2510.18742	null
2025-10-21	Observational Tests of Regular Black Holes with Scalar Hair and their Stability	P. A. González et.al.	2510.18647	null
2025-10-21	Multiscale transitional flow in anisotropic nanoparticle suspensions revealed by time-resolved x-ray scatter microscopy	Kesavan Sekar et.al.	2510.18444	null
2025-10-21	MMRHP: A Miniature Mixed-Reality HIL Platform for Auditable Closed-Loop Evaluation	Mingxin Li et.al.	2510.18371	null
2025-10-21	The selection function of the Gaia DR3 open cluster census	Emily L. Hunt et.al.	2510.18343	null
2025-10-21	Hyperbolic Space Learning Method Leveraging Temporal Motion Priors for Human Mesh Recovery	Xiang Zhang et.al.	2510.18256	null
2025-10-20	Geometric Field Theory for Elastohydrodynamics of Cosserat Rods	Mingjia Yan et.al.	2510.18097	null
2025-10-20	Bifurcations of planar balanced configurations for the $n$-body problem in $\mathbb{R}^4$	Katharina Kormanna et.al.	2510.17749	null
2025-10-20	Initialize to Generalize: A Stronger Initialization Pipeline for Sparse-View 3DGS	Feng Zhou et.al.	2510.17479	null
2025-10-20	Segmenting infant brains across magnetic fields: Domain randomization and annotation curation in ultra-low field MRI	Vladyslav Zalevskyi et.al.	2510.17436	null
2025-10-21	Leveraging AV1 motion vectors for Fast and Dense Feature Matching	Julien Zouein et.al.	2510.17434	null
2025-10-21	DeepDetect: Learning All-in-One Dense Keypoints	Shaharyar Ahmed Khan Tareen et.al.	2510.17422	null
2025-10-20	Enhanced Motion Forecasting with Plug-and-Play Multimodal Large Language Models	Katie Luo et.al.	2510.17274	null
2025-10-20	Kinetically-induced bound states in a frustrated Rydberg tweezer array	Mu Qiao et.al.	2510.17183	null
2025-10-19	The Lorentz-Violating effects in charged particle systems	E. Maciel et.al.	2510.17055	null
2025-10-18	CryoDyna: Multiscale end-to-end modeling of cryo-EM macromolecule dynamics with physics-aware neural network	Chengwei Zhang et.al.	2510.16510	null
2025-10-18	HGC-Avatar: Hierarchical Gaussian Compression for Streamable Dynamic 3D Avatars	Haocheng Tang et.al.	2510.16463	null
2025-10-18	LightGlueStick: a Fast and Robust Glue for Joint Point-Line Matching	Aidyn Ubingazhibov et.al.	2510.16438	null
2025-10-18	Manual2Skill++: Connector-Aware General Robotic Assembly from Instruction Manuals via Vision-Language Models	Chenrui Tie et.al.	2510.16344	null
2025-10-18	XRISM-Subaru views of Abell 754: an off-axis, near-line-of-sight merging cluster	Nobuhiro Okabe et.al.	2510.16291	null
2025-10-17	DGME-T: Directional Grid Motion Encoding for Transformer-Based Historical Camera Movement Classification	Tingyu Lin et.al.	2510.15725	null
2025-10-17	A single optically detectable tumbling spin in silicon	Félix Cache et.al.	2510.15590	null
2025-10-17	Airway Mucus Rheology: Physical Insights for Navigating through Health to Pathology and Clinical Applications	Zhiwei Liu et.al.	2510.15562	null
2025-10-17	ClapperText: A Benchmark for Text Recognition in Low-Resource Archival Documents	Tingyu Lin et.al.	2510.15557	null
2025-10-17	MRASfM: Multi-Camera Reconstruction and Aggregation through Structure-from-Motion in Driving Scenes	Lingfeng Xuan et.al.	2510.15467	null
2025-10-17	Modeling and Dynamic Simulation of a Hybrid Wind-Wave System on a Hexagonal Semi-Submersible Platform	Saeid Bayat et.al.	2510.15285	null
2025-10-17	CuSfM: CUDA-Accelerated Structure-from-Motion	Jingrui Yu et.al.	2510.15271	null
2025-10-16	OmniMotion: Multimodal Motion Generation with Continuous Masked Autoregression	Zhe Li et.al.	2510.14954	null
2025-10-16	A Physics Prior-Guided Dual-Stream Attention Network for Motion Prediction of Elastic Bragg Breakwaters	Lianzi Jiang et.al.	2510.14250	null
2025-10-15	Is Gravity Truly Balanced? A Historical-Critical Journey Through the Equivalence Principle and the Genesis of Spacetime Geometry	Jaume de Haro et.al.	2510.13938	null
2025-10-15	Turbulent transport for wall shear stress fluctuations	Myoungkyu Lee et.al.	2510.13758	null
2025-10-15	Orbital dynamics and precession in magnetized Kerr spacetime	Karthik Iyer et.al.	2510.13569	null
2025-10-15	Learning Neural Parametric 3D Breast Shape Models for Metrical Surface Reconstruction From Monocular RGB Videos	Maximilian Weiherer et.al.	2510.13540	null
2025-10-15	InstantSfM: Fully Sparse and Parallel Structure-from-Motion	Jiankun Zhong et.al.	2510.13310	null
2025-10-15	Investigating Buoyant Plume Dynamics Induced by Localized Fire-Simulated Heating over Plant Canopies Using LES	Ajinkya Desai et.al.	2510.13196	null
2025-11-06	Dependency of the Bar Formation Timescale On The Halo Spin	Bin-Hui Chen et.al.	2510.13153	null
2025-10-15	Edit-Your-Interest: Efficient Video Editing via Feature Most-Similar Propagation	Yi Zuo et.al.	2510.13084	null
2025-10-14	Mapping the Perseus Galaxy Cluster with XRISM: Gas Kinematic Features and their Implications for Turbulence	Congyao Zhang et.al.	2510.12782	null
2025-10-14	PET Head Motion Estimation Using Supervised Deep Learning with Attention	Zhuotong Cai et.al.	2510.12758	null
2025-10-14	Widespread Hot Molecular Gas Heated by Shear-induced Turbulence in the Galactic Center	Juan Li et.al.	2510.12518	null
2025-10-14	M3D-skin: Multi-material 3D-printed Tactile Sensor with Hierarchical Infill Structures for Pressure Sensing	Shunnosuke Yoshimura et.al.	2510.12419	null
2025-10-14	Scene Coordinate Reconstruction Priors	Wenjing Bian et.al.	2510.12387	null
2025-10-14	Holographic Turbulence and the Fractal Dimension of the Turbulent Horizon	Jia Du et.al.	2510.12198	null
2025-10-14	VIDMP3: Video Editing by Representing Motion with Pose and Position Priors	Sandeep Mishra et.al.	2510.12069	null
2025-10-13	NaviGait: Navigating Dynamically Feasible Gait Libraries using Deep Reinforcement Learning	Neil C. Janwani et.al.	2510.11542	null
2025-10-13	Behavior of passive polymeric tracers of different topologies in a dilute bath of active Brownian particles	Ramanand Singh Yadav et.al.	2510.11337	null
2025-10-13	The chemodynamical memory of a major merger in a NIHAO-UHD Milky Way analogue I: A golden thread through time and space	Sven Buder et.al.	2510.11284	null
2025-10-13	High-Resolution Spatiotemporal Modeling with Global-Local State Space Models for Video-Based Human Pose Estimation	Runyang Feng et.al.	2510.11017	null
2025-10-12	Align2Act: Instruction-Tuned Models for Human-Aligned Autonomous Driving	Kanishkha Jaisankar et.al.	2510.10503	null
2025-10-12	Mesh-Gait: A Unified Framework for Gait Recognition Through Multi-Modal Representation Learning from 2D Silhouettes	Zhao-Yang Wang et.al.	2510.10406	null
2025-10-11	sqrtVINS: Robust and Ultrafast Square-Root Filter-based 3D Motion Tracking	Yuxiang Peng et.al.	2510.10346	null
2025-10-11	Ordinal Scale Traffic Congestion Classification with Multi-Modal Vision-Language and Motion Analysis	Yu-Hsuan Lin et.al.	2510.10342	null
2025-10-11	Detection of Quadruple Structure Near the ASCC 32 Region via Machine Learning Methods	Mohammad Noormohammadi et.al.	2510.10296	null
2025-10-11	Are Video Models Emerging as Zero-Shot Learners and Reasoners in Medical Imaging?	Yuxiang Lai et.al.	2510.10254	null
2025-10-11	BurstDeflicker: A Benchmark Dataset for Flicker Removal in Dynamic Scenes	Lishen Qu et.al.	2510.09996	null
2025-10-11	A no-contact result for a plate-fluid interaction system in dimension three	Mario Bukal et.al.	2510.09992	null
2025-10-13	Guiding Energy-Efficient Locomotion through Impact Mitigation Rewards	Chenghao Wang et.al.	2510.09543	null
2025-10-10	Two-Stage Gaussian Splatting Optimization for Outdoor Scene Reconstruction	Deborah Pintani et.al.	2510.09489	null
2025-10-10	What is the contribution of gravitational infall on the mass assembly of star-forming clouds? A case study in a numerical simulation of the interstellar medium	Noé Brucy et.al.	2510.09480	null
2025-10-11	The Visual Iconicity Challenge: Evaluating Vision-Language Models on Sign Language Form-Meaning Mapping	Onur Keleş et.al.	2510.08482	null
2025-10-09	Video-STAR: Reinforcing Open-Vocabulary Action Recognition with Tools	Zhenlong Yuan et.al.	2510.08480	null
2025-10-09	Scalar-tensor theories in the Lyra geometry: Invariance under local transformations of length units and the Jordan-Einstein frame conundrum	E. C. Valadão et.al.	2510.08433	null
2025-10-09	Beyond hospital reach: Autonomous lightweight ultrasound robot for liver sonography	Zihan Li et.al.	2510.08106	null
2025-10-09	Executable Analytic Concepts as the Missing Link Between VLM Insight and Precise Manipulation	Mingyang Sun et.al.	2510.07975	null
2025-10-08	XRISM/Resolve observations of Hercules X-1: vertical structure and kinematics of the disk wind	Peter Kosec et.al.	2510.07615	null
2025-10-08	Curve separation in supercritical half-space last passage percolation	Evgeni Dimitrov et.al.	2510.07508	null
2025-10-07	Learning from Limited Multi-Phase CT: Dual-Branch Prototype-Guided Framework for Early Recurrence Prediction in HCC	Hsin-Pei Yu et.al.	2510.07347	null
2025-10-08	Dispersion and the transport of exciton-polaritons in an optical conveyor belt	Xingran Xu et.al.	2510.07049	null
2025-10-08	The Star-forming Main Sequence and Bursty Star-formation Histories at $z>1.4$ in JADES and AURORA	Leonardo Clarke et.al.	2510.06681	null
2025-10-08	Classical Polymerization of the Bianchi I Model with Deformed Poisson Structure	Babak Vakili et.al.	2510.06628	null
2025-10-07	Text2Interact: High-Fidelity and Diverse Text-to-Two-Person Interaction Generation	Qingxuan Wu et.al.	2510.06504	null
2025-10-07	The first proper motion measurement of the acceleration regions in the large-scale jets of SS 433 powering the W50 nebula	Naomi Tsuji et.al.	2510.06431	null
2025-10-07	Gravitational deflection of charged massive particle around charged galactic wormhole	Md Khalid Hossain et.al.	2510.06294	null
2025-10-07	Cross-Embodiment Dexterous Hand Articulation Generation via Morphology-Aware Learning	Heng Zhang et.al.	2510.06068	null
2025-10-07	Midway Network: Learning Representations for Recognition and Motion from Latent Dynamics	Christopher Hoang et.al.	2510.05558	null
2025-10-06	The Prevalence of Bursty Star Formation in Low-Mass Galaxies at z=1-7 from Hα-to-UV Diagnostics	Marissa N. Perry et.al.	2510.05388	null
2025-10-06	StaMo: Unsupervised Learning of Generalizable Robot Motion from Compact State Representation	Mingyu Liu et.al.	2510.05057	null
2025-10-06	Thermal effects in fluid structure interactions	Sourav Mitra et.al.	2510.04801	null
2025-10-06	Equilibrium properties of strongly confined fluids	Ana M. Montero et.al.	2510.04546	null
2025-10-05	Physics-Inspired All-Pair Interaction Learning for 3D Dynamics Modeling	Kai Yang et.al.	2510.04233	null
2025-10-05	From Shadow to Light: Toward Safe and Efficient Policy Learning Across MPC, DeePC, RL, and LLM Agents	Amin Vahidi-Moghaddam et.al.	2510.04076	null
2025-10-04	Dissecting Larval Zebrafish Hunting using Deep Reinforcement Learning Trained RNN Agents	Raaghav Malik et.al.	2510.03699	null
2025-10-03	Bloch Oscillations and Landau-Zener Transitions in Flat-Band Lattices with Quadratic and Linear Band Touchings	Chenhaoyue Wang et.al.	2510.03530	null
2025-10-03	Selective disruption of reach-related saccade timing following a middle-cerebral artery stroke	Mahya Beheshti et.al.	2510.03076	null
2025-10-03	A Trajectory Generator for High-Density Traffic and Diverse Agent-Interaction Scenarios	Ruining Yang et.al.	2510.02627	null
2025-10-23	DragFlow: Unleashing DiT Priors with Region Based Supervision for Drag Editing	Zihan Zhou et.al.	2510.02253	null
2025-10-02	Non-Gaussian Rotational Diffusion and Swing Motion of Dumbbell Probes in Two Dimensional Colloids	Jeongmin Kim et.al.	2510.01847	null
2025-10-02	Non-Rigid Structure-from-Motion via Differential Geometry with Recoverable Conformal Scale	Yongbo Chen et.al.	2510.01665	null
2025-10-01	Depinning of KPZ Interfaces in Fractional Brownian Landscapes	Neda Valizadeh et.al.	2510.01103	null
2025-10-01	Can World Models Benefit VLMs for World Dynamics?	Kevin Zhang et.al.	2510.00855	null
2025-09-30	Learning Human Reaching Optimality Principles from Minimal Observation Inverse Reinforcement Learning	Sarmad Mehrdad et.al.	2510.00329	null
2025-09-30	JADES: An Abundance of Ultra-Distant T- and Y-Dwarfs in Deep Extragalactic Data	Kevin N. Hainline et.al.	2510.00111	null
2025-10-03	The warm outer layer of a Little Red Dot as the source of [Fe II] and collisional Balmer lines with scattering wings	Alberto Torralba et.al.	2510.00103	null
2025-09-30	Seeing Space and Motion: Enhancing Latent Actions with Spatial and Dynamic Awareness for VLA	Zhejia Cai et.al.	2509.26251	null
2025-09-30	Droplets sliding on single and multiple vertical fibers	Matteo Leonard et.al.	2509.25898	null
2025-09-30	Hierarchical Diffusion Motion Planning with Task-Conditioned Uncertainty-Aware Priors	Amelie Minji Kim et.al.	2509.25685	null
2025-09-30	On the shape of pancakes: catastrophe theory and Gaussian statistics in 2D	Abineet Parichha et.al.	2509.25608	null
2025-10-06	CoTaP: Compliant Task Pipeline and Reinforcement Learning of Its Controller with Compliance Modulation	Zewen He et.al.	2509.25443	null
2025-09-29	Data-Augmented Resolvent Analysis of Wall-Bounded High-Pressure Transcritical Flow	M. Bernades et.al.	2509.25398	null
2025-09-29	Seeking Kinematic Association of Known FU Orionis Stars with Young Clusters in Cygnus	Tamojeet Roychowdhury et.al.	2509.25341	null
2025-10-08	VGGT-X: When VGGT Meets Dense Novel View Synthesis	Yang Liu et.al.	2509.25191	null
2025-09-29	Fast Feature Field ( $\text{F}^3$ ): A Predictive Representation of Events	Richeek Das et.al.	2509.25146	null
2025-09-29	Impact of Atomic Substitution on Core-Hole Relaxation Dynamics: A Study of Br $_2$ and IBr	Nivedita Bhat et.al.	2509.24915	null
2025-09-29	Understanding Cognitive States from Head & Hand Motion Data	Kaiang Wen et.al.	2509.24255	null
2025-09-28	BOSfM: A View Planning Framework for Optimal 3D Reconstruction of Agricultural Scenes	Athanasios Bacharis et.al.	2509.24126	null
2025-09-28	RPG360: Robust 360 Depth Estimation with Perspective Foundation Models and Graph Optimization	Dongki Jung et.al.	2509.23991	null
2025-09-28	CrashSplat: 2D to 3D Vehicle Damage Segmentation in Gaussian Splatting	Dragoş-Andrei Chileban et.al.	2509.23947	null
2025-09-28	Witnessing Magnetic Reconnection in Tangled Superpenumbral Fibrils Around a Sunspot	Hechao Chen et.al.	2509.23636	null
2025-09-27	Orientation-anchored Hyper-Gaussian for 4D Reconstruction from Casual Videos	Junyi Wu et.al.	2509.23492	null
2025-09-27	Geometry-Aware Losses for Structure-Preserving Text-to-Sign Language Generation	Zetian Wu et.al.	2509.23011	null
2025-09-26	Scallop Theorem for Swimming in Anisotropic Fluids	Mojtaba Rajabi et.al.	2509.22249	null
2025-09-26	Taming Flow-based I2V Models for Creative Video Editing	Xianghao Kong et.al.	2509.21917	null
2025-09-25	First results from ALPPS: a sub-Alfvénic streamer in SVS13A	P. C. Cortes et.al.	2509.21701	null
2025-09-25	Multireference equation-of-motion driven similarity renormalization group for X-ray photoelectron spectra	Shuhang Li et.al.	2509.21646	null
2025-09-25	Taxonomy-aware Dynamic Motion Generation on Hyperbolic Manifolds	Luis Augenstein et.al.	2509.21281	null
2025-09-24	Pattern Formation in Agent-Based and PDE Models for Evolutionary Games with Payoff-Driven Motion	Tianyong Yao et.al.	2509.20538	null
2025-09-24	Glassy dynamics in two-dimensional ring polymers: size versus stiffness polydispersity	Rahul Nayak et.al.	2509.20066	null
2025-09-24	Modelling and Analysis of Non-Contacting Mechanical Face Seals with Axial Disturbances and Misalignment	Ben S Ashby et.al.	2509.19993	null
2025-09-24	Aerial-Ground Image Feature Matching via 3D Gaussian Splatting-based Intermediate View Rendering	Jiangxue Yu et.al.	2509.19898	null
2025-09-23	Probing the Origin of X-ray Flares in the Low-Hard State of GRS 1915+105 Using AstroSat and NuSTAR	Shahzada Akhter et.al.	2509.19546	null
2025-10-30	Reaction/Diffusion Competition Drives Anomalous Relaxation of Vitrimers	Makayla R. Branham-Ferrari et.al.	2509.19496	null
2025-09-23	Internal dynamics and structure of Cepheus OB4. The asymmetric expansion of Berkeley 59	Bruno Wiesneth et.al.	2509.19175	null
2025-09-23	DeblurSplat: SfM-free 3D Gaussian Splatting with Event Camera for Robust Deblurring	Pengteng Li et.al.	2509.18898	null
2025-09-23	Kinematics of the interstellar medium using Gaia: A catalogue of 102 YSO-MC associations within 3.5 kpc from the Sun with 3D velocities	Ji-Xuan Zhou et.al.	2509.18496	null
2025-09-22	Efficient Particle Acceleration in 2.5-Dimensional, Hybrid-Kinetic Simulations of Decaying, Supersonic, Plasma Turbulence	Keyan Gootkin et.al.	2509.18374	null
2025-09-22	Waves drive the rise and fall of 2D flows in rotating turbulence	Sébastien Gomé et.al.	2509.18323	null
2025-09-22	VideoFrom3D: 3D Scene Video Generation via Complementary Image and Video Diffusion Models	Geonung Kim et.al.	2509.17985	null
2025-09-22	Tensor-Based Self-Calibration of Cameras via the TrifocalCalib Method	Gregory Schroeder et.al.	2509.17620	null
2025-10-15	Energy Correlators Resolving Proton Spin	Jun Gao et.al.	2509.17596	null
2025-09-22	Learning Dexterous Manipulation with Quantized Hand State	Ying Feng et.al.	2509.17450	null
2025-09-21	Reference-aware SFM layers for intrusive intelligibility prediction	Hanlin Yu et.al.	2509.17270	null
2025-09-21	Beat on Gaze: Learning Stylized Generation of Gaze and Head Dynamics	Chengwei Shi et.al.	2509.17168	null
2025-11-19	Asymptotic Higher Spin Symmetries: Noether Realization & Algebraic Structure in Einstein-Yang-Mills Theory	Nicolas Cresto et.al.	2509.17137	null
2025-09-21	Insensitivity-induced potential non-uniqueness in system identification of Bouc-Wen models	Adrita Kundu et.al.	2509.17122	null
2025-09-21	Dynamics of the $N$ -body system in energy-momentum squared gravity: II. Existence of a Self-Acceleration	Elham Nazari et.al.	2509.17017	null
2025-09-21	VidCLearn: A Continual Learning Approach for Text-to-Video Generation	Luca Zanchetta et.al.	2509.16956	null
2025-09-27	HDMI: Learning Interactive Humanoid Whole-Body Control from Human Videos	Haoyang Weng et.al.	2509.16757	null
2025-09-19	On the application of refractive index matching to study the buoyancy-driven motion of spheres	Jibu Tom Jose et.al.	2509.16384	null
2025-09-19	Investigating Polyglot Speech Foundation Models for Learning Collective Emotion from Crowds	Orchid Chetia Phukan et.al.	2509.16329	null
2025-11-05	Modeling Elastic-Body Dynamics of Robotic Fish Using a Variational Framework	Zhiheng Chen et.al.	2509.16145	null
2025-10-09	Hierarchical Reinforcement Learning with Low-Level MPC for Multi-Agent Control	Max Studt et.al.	2509.15799	null
2025-09-19	Search for cosmic-ray induced gamma-ray emission from local galaxy clusters using Fermi-LAT data	Judit Pérez-Romero et.al.	2509.15720	null
2025-10-24	MS-GS: Multi-Appearance Sparse-View 3D Gaussian Splatting in the Wild	Deming Li et.al.	2509.15548	null
2025-10-21	SAMPO:Scale-wise Autoregression with Motion PrOmpt for generative world models	Sen Wang et.al.	2509.15536	null
2025-09-18	Dynamical Analysis of the HD 169142 Planet-Forming Disk: Twelve Years of High-Contrast Polarimetry	Miles Lucas et.al.	2509.15323	null
2025-09-18	Static AdS Black Holes Surrounded by Strings and Quintessence-like Field within Rastall Gravity Framework	Allan. R. P. Moreira et.al.	2509.15274	null
2025-09-27	WorldForge: Unlocking Emergent 3D/4D Generation in Video Diffusion Model via Training-Free Guidance	Chenxi Song et.al.	2509.15130	null
2025-09-17	Repulsive Trajectory Modification and Conflict Resolution for Efficient Multi-Manipulator Motion Planning	Junhwa Hong et.al.	2509.13882	null
2025-09-18	MapAnything: Universal Feed-Forward Metric 3D Reconstruction	Nikhil Keetha et.al.	2509.13414	null
2025-09-16	Optimal Annuitization with stochastic mortality: Piecewise Deterministic Mortality Force	Matteo Buttarazzi et.al.	2509.13091	null
2025-09-16	Spatiotemporal graph neural process for reconstruction, extrapolation, and classification of cardiac trajectories	Jaume Banus et.al.	2509.12953	null
2025-09-18	A-TDOM: Active TDOM via On-the-Fly 3DGS	Yiwei Xu et.al.	2509.12759	null
2025-10-21	Neural 3D Object Reconstruction with Small-Scale Unmanned Aerial Vehicles	Àlmos Veres-Vitàlyos et.al.	2509.12458	null
2025-09-15	DYNAMO: Dependency-Aware Deep Learning Framework for Articulated Assembly Motion Prediction	Mayank Patel et.al.	2509.12430	null
2025-11-20	End-to-End 4D Heart Mesh Recovery Across Full-Stack and Sparse Cardiac MRI	Yihong Chen et.al.	2509.12090	null
2025-11-18	Segmentation-Driven Initialization for Sparse-view 3D Gaussian Splatting	Yi-Hsin Li et.al.	2509.11853	null
2025-09-15	WAFER: A new method to retrieve sun-induced fluorescence based on spectral wavelet decompositions	Veronika Oehl et.al.	2509.11829	null
2025-09-14	Understanding the effect of wall elasticity in turbulent channel flows	M. Koseki et.al.	2509.11142	null
2025-09-14	3DAeroRelief: The first 3D Benchmark UAV Dataset for Post-Disaster Assessment	Nhut Le et.al.	2509.11097	null
2025-09-13	Space Astrometry with Gaia: Advances in Understanding our Galaxy	Michael Perryman et.al.	2509.10883	null
2025-11-04	Stable Part Diffusion 4D: Multi-View RGB and Kinematic Parts Video Generation	Hao Zhang et.al.	2509.10687	null
2025-09-12	Nanosculpting lateral weak link junctions in superconducting Fe(Te,Se)/Bi2Te3 with focused Si++ ions and implications on vortex pinning	Debarghya Mallick et.al.	2509.10606	null
2025-09-17	DECAMP: Towards Scene-Consistent Multi-Agent Motion Prediction with Disentangled Context-Aware Pre-Training	Jianxin Shi et.al.	2509.10426	null
2025-09-12	*Breakdown of the critical state in the ferromagnetic superconductor EuFe $2$(As${1-x}$P$_x$)$_2$*	William Robert Fern et.al.	2509.10339	null
2025-09-12	A MeerKAT view of the parsec-scale jets in the black-hole X-ray binary GRS 1758-258	I. Mariani et.al.	2509.10275	null
2025-09-12	Robustness and Diagnostic Performance of Super-Resolution Fetal Brain MRI	Ema Masterl et.al.	2509.10257	null
2025-09-12	Cluster Ages to Reconstruct the Milky Way Assembly (CARMA) IV. Chrono-dynamics of seven old star clusters in the Large Magellanic Cloud and the peculiar origin of NGC 1841	F. Niederhofer et.al.	2509.10144	null
2025-09-11	Initial conditions for tidal synchronisation of a planet by its moon	Valeri V. Makarov et.al.	2509.09858	null
2025-09-09	Australian Supermarket Object Set (ASOS): A Benchmark Dataset of Physical Objects and 3D Models for Robotics and Computer Vision	Akansel Cosgun et.al.	2509.09720	null
2025-09-11	MOFU: Development of a MOrphing Fluffy Unit with Expansion and Contraction Capabilities and Evaluation of the Animacy of Its Movements	Taisei Mogi et.al.	2509.09613	null
2025-09-11	DualTrack: Sensorless 3D Ultrasound needs Local and Global Context	Paul F. R. Wilson et.al.	2509.09530	null
2025-09-11	BagIt! An Adaptive Dual-Arm Manipulation of Fabric Bags for Object Bagging	Peng Zhou et.al.	2509.09484	null
2025-09-11	A Hybrid Hinge-Beam Continuum Robot with Passive Safety Capping for Real-Time Fatigue Awareness	Tongshun Chen et.al.	2509.09404	null
2025-09-11	Video Understanding by Design: How Datasets Shape Architectures and Insights	Lei Wang et.al.	2509.09151	null
2025-09-11	Exploration on the Two-stream Instability in the Polar Cusp Under Solar Storm Disturbances and its Potential Impacts on Spacecraft	Jikai Sun et.al.	2509.09126	null
2025-09-11	Propulsive transitions and scaling relations of a heaving flexible foil in a cylinder wake	Guojun Li et.al.	2509.09102	null
2025-10-18	Kinetostatics and Particle-Swarm Optimization of Vehicle-Mounted Underactuated Metamorphic Loading Manipulators	Nan Mao et.al.	2509.09093	null
2025-10-04	A comprehensive view of nuclear shapes, rotations and vibrations from fully quantum mechanical perspectives	Takaharu Otsuka et.al.	2509.08552	null
2025-09-10	The GECKOS survey: Jeans anisotropic models of edge-on discs uncover the impact of dust and kinematic structures	T. H. Rutherford et.al.	2509.08371	null
2025-08-26	Analog-based ensembles to characterize turbulent dynamics from observed data	Carlos Granero-Belinchon et.al.	2509.07992	null
2025-09-09	Graph-Fused Vision-Language-Action for Policy Reasoning in Multi-Arm Robotic Manipulation	Shunlei Li et.al.	2509.07957	null
2025-09-09	Mode-coupling theory of the glass transition for a liquid in a periodic potential	Abolfazl Ahmadirahmat et.al.	2509.07697	null
2025-09-09	Beyond Motion Cues and Structural Sparsity: Revisiting Small Moving Target Detection	Guoyi Zhang et.al.	2509.07654	null
2025-09-10	VIM-GS: Visual-Inertial Monocular Gaussian Splatting via Object-level Guidance in Large Scenes	Shengkai Zhang et.al.	2509.06685	null
2025-09-08	From Skin to Skeleton: Towards Biomechanically Accurate 3D Digital Humans	Marilyn Keller et.al.	2509.06607	null
2025-09-08	Nonlinear planar Hall effect from superconducting vortex motion	Mio Hashimoto et.al.	2509.06313	null
2025-11-11	Limiting distribution of the chemical distance in high dimensional critical percolation	Shirshendu Chatterjee et.al.	2509.06236	null
2025-09-07	Micro-Expression Recognition via Fine-Grained Dynamic Perception	Zhiwen Shao et.al.	2509.06015	null
2025-09-07	Modeling Magnetoelastic Wave Interactions in Magnetic Films and Heterostructures: A finite-difference approach	Peter Flauger et.al.	2509.06007	null
2025-09-07	Skyrmion manipulation and logic gate functionality in transition metal multilayers	Tamali Mukherjee et.al.	2509.05951	null
2025-09-06	Depth Profiling of Oxygen Migration in Ta/HfO2 Stacks During Ionic Liquid Gating	Beatrice Bednarz et.al.	2509.05748	null
2025-09-05	Resolving Tangling in Multi-Conformer Refinement via Iterative Projections	Avinash Mandaiya et.al.	2509.05189	null
2025-09-04	Disentangling Multiple Gas Kinematic Drivers in the Perseus Galaxy Cluster	XRISM Collaboration et.al.	2509.04421	null
2025-09-07	Hyperuniformity and conservation laws in non-equilibrium systems	Raphaël Maire et.al.	2509.04242	null
2025-09-03	Exploiting correlations in multi-coincidence Coulomb explosion patterns for differentiating molecular structures using machine learning	Anbu Selvam Venkatachalam et.al.	2509.03776	null
2025-09-03	Beyond the Clouds: S3 as the most distant extended Milky Way stream, not of LMC origin	Ó. Jiménez-Arranz et.al.	2509.03424	null
2025-09-02	Voter Model stability with respect to conservative noises	Gideon Amir et.al.	2509.02717	null
2025-09-02	Doctoral Thesis: Geometric Deep Learning For Camera Pose Prediction, Registration, Depth Estimation, and 3D Reconstruction	Xueyang Kang et.al.	2509.01873	null
2025-09-01	Optimal information injection and transfer mechanisms for active matter reservoir computing	Mario U. Gaimann et.al.	2509.01799	null
2025-09-01	An Accurate Comprehensive Approach to Substructure: IV. Dynamical Friction	Eduard Salvador-Solé et.al.	2509.01553	null
2025-08-31	Origin and control of pseudo-rotating spiral jets	Karol Wawrzak et.al.	2509.00763	null
2025-09-30	Intramolecular Singlet Fission Through a Coherently Coupled Excimer-like Intermediate	Sanjoy Patra et.al.	2508.21568	null
2025-08-28	Coherent motions to predict Lagrangian trajectories	Ali R Khojasteh et.al.	2508.21191	null
2025-08-28	First-Order Viscous Relativistic Hydrodynamics on the Two-Sphere	Lennox S. Keeble et.al.	2508.20998	null
2025-08-28	Scaling Fabric-Based Piezoresistive Sensor Arrays for Whole-Body Tactile Sensing	Curtis C. Johnson et.al.	2508.20959	null
2025-08-28	Language-Enhanced Mobile Manipulation for Efficient Object Search in Indoor Environments	Liding Zhang et.al.	2508.20899	null
2025-08-28	On W-algebras and ODE/IM correspondence	Matěj Kudrna et.al.	2508.20793	null
2025-08-28	AvatarBack: Back-Head Generation for Complete 3D Avatars from Front-View Images	Shiqi Xin et.al.	2508.20623	null
2025-08-26	PRISM: A Framework Harnessing Unsupervised Visual Representations and Textual Prompts for Explainable MACE Survival Prediction from Cardiac Cine MRI	Haoyang Su et.al.	2508.19325	null
2025-08-26	Thermoelectric evidence of the electronic structure changes from the charge-density-wave transition in FeGe	Kaila Jenkins et.al.	2508.19116	null
2025-08-26	WIde Separation Planets In Time (WISPIT): A Gap-clearing Planet in a Multi-ringed Disk around the Young Solar-type Star WISPIT 2	Richelle F. van Capelleveen et.al.	2508.19053	null
2025-08-27	Striking Similarities in Dynamics and Vibrations of 2D Quasicrystals and Supercooled Liquids	Edwin A. Bedolla-Montiel et.al.	2508.18856	null
2025-08-26	Locally tuned hydrodynamics of active polymer chains	Lisa Sappl et.al.	2508.18789	null
2025-08-26	Chemical control of polymorphism and ferroelectricity in PbTiO3 and SrTiO3 monolayers and bilayers	Shaowen Xu et.al.	2508.18777	null
2025-08-26	A New Evidence of Interplay Between Tetrahedral and Octahedral Symmetries and Symmetry Breaking: Exotic Rotational Bands in $^{152}$ Sm	S. Basak et.al.	2508.18686	null
2025-11-24	Warm Chat: Diffuse Emotion-aware Interactive Talking Head Avatar with Tree-Structured Guidance	Haijie Yang et.al.	2508.18337	null
2025-08-25	Cellular Flow Architecture Exposes the Hidden Mechanics of Biological Matter	Tianxiang Ma et.al.	2508.17974	null
2025-08-25	SAIL-Recon: Large SfM by Augmenting Scene Regression with Localization	Junyuan Deng et.al.	2508.17972	null
2025-08-25	On the complexity of parametrized motion planning algorithms	Navnath Daundkar et.al.	2508.17629	null
2025-10-07	MoSA: Motion-Coherent Human Video Generation via Structure-Appearance Decoupling	Haoyu Wang et.al.	2508.17404	null
2025-08-24	Mimicking the Physicist's Eye:A VLM-centric Approach for Physics Formula Discovery	Jiaqi Liu et.al.	2508.17380	null
2025-08-23	A fluxonium qubit-based hybrid electromechanical system	Roson Nongthombam et.al.	2508.17105	null
2025-08-27	A Black Hole Solution in Kalb-Ramond Gravity with Quintessence Field: From Geodesic Dynamics to Thermal Criticality	Ahmad Al-Badawi et.al.	2508.16693	null
2025-11-10	Stable black holes in lower dimensional $f(\mathbb{Q})$ non-metric gravity	G. G. L. Nashed et.al.	2508.16679	null
2025-08-07	Thermal convection in huddling emperor penguins	Dmitry Bratsun et.al.	2508.16586	null
2025-08-22	Seeing Clearly, Forgetting Deeply: Revisiting Fine-Tuned Video Generators for Driving Simulation	Chun-Peng Chang et.al.	2508.16512	null
2025-08-25	HOSt3R: Keypoint-free Hand-Object 3D Reconstruction from RGB images	Anilkumar Swamy et.al.	2508.16465	null
2025-08-26	Prompting with Sign Parameters for Low-resource Sign Language Instruction Generation	Md Tariquzzaman et.al.	2508.16076	null
2025-08-22	NeuralMeshing: Complete Object Mesh Extraction from Casual Captures	Floris Erich et.al.	2508.16026	null
2025-08-21	WorldWeaver: Generating Long-Horizon Video Worlds via Rich Perception	Zhiheng Liu et.al.	2508.15720	null
2025-08-21	Enhancing Novel View Synthesis from extremely sparse views with SfM-free 3D Gaussian Splatting Framework	Zongqi He et.al.	2508.15457	null
2025-09-21	DriveSplat: Decoupled Driving Scene Reconstruction with Geometry-enhanced Partitioned Neural Gaussians	Cong Wang et.al.	2508.15376	null
2025-09-04	A Spectroscopic Hunt for Post-Red Supergiants in the Large Magellanic Cloud II: Turbulent Line Broadening in the Spectra of LMC Yellow Supergiants	Trevor Z. Dorn-Wallenstein et.al.	2508.14971	null
2025-08-22	The Alma catalogue of OB stars. III. A cross-match with Gaia DR3 and an extension based on new spectral classifications	M. Pantaleoni González et.al.	2508.14875	null
2025-08-20	Probing the farthest star clusters to the Small Magellanic Cloud	A. E. Piatti et.al.	2508.14701	null
2025-08-20	GeMS: Efficient Gaussian Splatting for Extreme Motion Blur	Gopi Raju Matta et.al.	2508.14682	null
2025-08-20	Identifying Monochromatic Signals in LISA and Taiji via Spectral Split: Gravitational Waves versus Ultralight Dark Matter	Yue-Hui Yao et.al.	2508.14655	null
2025-08-20	From Slices to Structures: Unsupervised 3D Reconstruction of Female Pelvic Anatomy from Freehand Transvaginal Ultrasound	Max Krähenmann et.al.	2508.14552	null
2025-08-20	Singularity of the axisymmetric stagnation-point-like solution within a cylinder of the 3D Euler incompressible fluid equations	Yinshen Xu et.al.	2508.14550	null
2025-08-20	Anisotropic Neutrino Emission from Spinning, Moving, and Charged Primordial Black Holes	Arnab Chaudhuri et.al.	2508.14510	null
2025-08-19	Gravitational Influence from Planets on the Measured Rates of Period Change of Pulsating White Dwarfs	Ling Xuan Yao et.al.	2508.14195	null
2025-08-20	Properties of the temporal transfer matrix in integrable Floquet circuits	Ilya Vilkoviskiy et.al.	2508.13883	null
2025-10-31	Smooth Flow Matching	Jianbin Tan et.al.	2508.13831	null
2025-08-18	Towards Routine Condensed Phase Simulations with Delta-Learned Coupled Cluster Accuracy: Application to Liquid Water	Niamh O'Neill et.al.	2508.13391	null
2025-08-18	Dynamic stall of a hydrofoil with tubercles in surface gravity waves	Guillaume Ricard et.al.	2508.13329	null
2025-08-18	MaskSem: Semantic-Guided Masking for Learning 3D Hybrid High-Order Motion Representation	Wei Wei et.al.	2508.12948	null
2025-10-20	Visual-Neural-Inspired Image Inpainting for Specific Objects-of-Interest Imaging	Yonghao Wu et.al.	2508.12808	null
2025-08-18	Discerning and quantifying high frequency activities in EEG under normal and epileptic conditions	Jyotiraj Nath et.al.	2508.12670	null
2025-08-17	HuBERT-VIC: Improving Noise-Robust Automatic Speech Recognition of Speech Foundation Model via Variance-Invariance-Covariance Regularization	Hyebin Ahn et.al.	2508.12292	null
2025-08-17	What do Speech Foundation Models Learn? Analysis and Applications	Ankita Pasad et.al.	2508.12255	null
2025-08-16	KP-INR: A Dual-Branch Implicit Neural Representation Model for Cardiac Cine MRI Reconstruction	Donghang Lyu et.al.	2508.12147	null
2025-08-16	Applied causality to infer protein dynamics and kinetics	Akashnathan Aranganathan et.al.	2508.12060	null
2025-09-15	WiseLVAM: A Novel Framework For Left Ventricle Automatic Measurements	Durgesh Kumar Singh et.al.	2508.12023	null
2025-08-19	Colloidal hydrodynamic interactions in viscoelastic fluids	Dae Yeon Kim et.al.	2508.11948	null
2025-08-16	Mapping feedback signatures in 3C 297: A quasar-host merger at Cosmic Noon	Chetna Duggal et.al.	2508.11926	null
2025-09-08	Deformation Driven Suction Cups: A Mechanics-Based Approach to Wearable Electronics	Seola Lee et.al.	2508.11838	null
2025-08-01	Multimodal Quantitative Measures for Multiparty Behaviour Evaluation	Ojas Shirekar et.al.	2508.10916	null
2025-08-14	Reduction of motion artifacts from photoplethysmography signals using learned convolutional sparse coding	Giulio Basso et.al.	2508.10805	null
2025-08-14	Snap-through time of arches is controlled by slenderness and imperfections	William Simpkins et.al.	2508.10802	null
2025-08-14	On the Derivation of Equations of Motion from Symmetries in Quantum-Mechanical Systems via Heisenberg's Uncertainty	Enrique Casanova et.al.	2508.10661	null
2025-08-14	EgoMusic-driven Human Dance Motion Estimation with Skeleton Mamba	Quang Nguyen et.al.	2508.10522	null
2025-08-13	Coulomb excitation of $^{124}$Te: Emerging collectivity and persisting seniority structure in the $6_1^+$ level	M. Reece et.al.	2508.09643	null
2025-08-12	A Galactic Interloper: A Study of the Cam OB1 Association's Clusters and its Visitor from the Perseus Arm	Joseph Mullen et.al.	2508.09393	null
2025-08-12	CLF-RL: Control Lyapunov Function Guided Reinforcement Learning	Kejun Li et.al.	2508.09354	null
2025-08-12	Quadrupolar gyration of a Brownian particle in a confining ring	Iman Abdoli et.al.	2508.08792	null
2025-08-11	Weak solutions and incompressible limit of a quasi-incompressible Navier--Stokes/Cahn--Hilliard model for viscous two-phase flows	Mingwen Fei et.al.	2508.08090	null
2025-08-11	Joint Transcription of Acoustic Guitar Strumming Directions and Chords	Sebastian Murgul et.al.	2508.07973	null
2025-08-12	Mem4D: Decoupling Static and Dynamic Memory for Dynamic Scene Reconstruction	Xudong Cai et.al.	2508.07908	null
2025-08-11	Tracking Any Point Methods for Markerless 3D Tissue Tracking in Endoscopic Stereo Images	Konrad Reuter et.al.	2508.07851	null
2025-08-11	Optimization of a Nonlinear Acoustics -- Structure Interaction Model	Barbara Kaltenbacher et.al.	2508.07728	null
2025-08-10	GS4Buildings: Prior-Guided Gaussian Splatting for 3D Building Reconstruction	Qilin Zhang et.al.	2508.07355	null
2025-11-17	Understanding Dynamic Scenes in Ego Centric 4D Point Clouds	Junsheng Huang et.al.	2508.07251	null
2025-08-27	From Imitation to Optimization: A Comparative Study of Offline Learning for Autonomous Driving	Antonio Guillen-Perez et.al.	2508.07029	null
2025-08-09	Evaluating Fisheye-Compatible 3D Gaussian Splatting Methods on Real Images Beyond 180 Degree Field of View	Ulas Gunes et.al.	2508.06968	null
2025-08-08	Restage4D: Reanimating Deformable 3D Reconstruction from a Single Video	Jixuan He et.al.	2508.06715	null
2025-08-08	Low temperature jet spectra of (DFE)2, DFE-He, DFE-He2 and DFE in the 2210-3105 cm-1 region (DFE = 1,1 difluoroethylene)	A. J. Barclay et.al.	2508.06629	null
2025-08-08	V: An Efficient Motion Planning Algorithm for Autonomous Vehicles*	Abdullah Zareh Andaryan et.al.	2508.06404	null
2025-08-08	Topological edge states and amplitude-dependent delocalization in quasiperiodic elliptically geared lattices	Shuaifeng Li et.al.	2508.06286	null
2025-08-07	CleanUpBench: Embodied Sweeping and Grasping Benchmark	Wenbo Li et.al.	2508.05543	null
2025-08-07	F2PASeg: Feature Fusion for Pituitary Anatomy Segmentation in Endoscopic Surgery	Lumin Chen et.al.	2508.05465	null
2025-08-07	Information-Theoretic Graph Fusion with Vision-Language-Action Model for Policy Reasoning and Dual Robotic Control	Shunlei Li et.al.	2508.05342	null
2025-10-08	Regular black hole's impact on the gravitational waveforms from periodic orbits	Mirzabek Alloqulov et.al.	2508.05245	null
2025-08-07	EndoMatcher: Generalizable Endoscopic Image Matcher via Multi-Domain Pre-training for Robot-Assisted Surgery	Bingyu Yang et.al.	2508.05205	null
2025-08-07	Refining Gaussian Splatting: A Volumetric Densification Approach	Mohamed Abdul Gafoor et.al.	2508.05187	null
2025-09-02	XRISM/Resolve View of Abell 2319: Turbulence, Sloshing, and ICM Dynamics	XRISM Collaboration et.al.	2508.05067	null
2025-11-04	Bursting at the seams: the star-forming main sequence and its scatter at z=3-9 using NIRCam photometry from JADES	C. Simmonds et.al.	2508.04410	null
2025-09-19	Variational mode decomposition analysis of the relationship between low-frequency shock-wave oscillations and buffet cells	Yuya Ohmichi et.al.	2508.04250	null
2025-08-06	PIS3R: Very Large Parallax Image Stitching via Deep 3D Reconstruction	Muhua Zhu et.al.	2508.04236	null
2025-08-06	SplitGaussian: Reconstructing Dynamic Scenes via Visual Geometry Decomposition	Jiahui Li et.al.	2508.04224	null
2025-08-06	Probing globular clusters using modulated gravitational waves from binary black holes	Jie Wu et.al.	2508.04021	null
2025-10-21	Can Large Language Models Adequately Perform Symbolic Reasoning Over Time Series?	Zewen Liu et.al.	2508.03963	null
2025-09-26	Next Generation Equation-Free Multiscale Modelling of Crowd Dynamics via Machine Learning	Hector Vargas Alvarez et.al.	2508.03926	null
2025-08-05	High-Resolution Dynamic Full-Field Optical Coherence Microscopy: Illuminating Intracellular Activity in Deep Tissue	Erikas Tarvydas et.al.	2508.03657	null
2025-08-05	WaMo: Wavelet-Enhanced Multi-Frequency Trajectory Analysis for Fine-Grained Text-Motion Retrieval	Junlong Ren et.al.	2508.03343	null
2025-08-04	A fluid--peridynamic structure model of deformation and damage of microchannels	Ziyu Wang et.al.	2508.02875	null
2025-08-04	Text2Lip: Progressive Lip-Synced Talking Face Generation from Text via Viseme-Guided Rendering	Xu Wang et.al.	2508.02362	null
2025-08-04	Newtons First Law Is Not a Special Case of the Second Law	Indresh Yadav et.al.	2508.02246	null
2025-08-04	IMoRe: Implicit Program-Guided Reasoning for Human Motion Q&A	Chen Li et.al.	2508.01984	null
2025-08-03	CVD-SfM: A Cross-View Deep Front-end Structure-from-Motion System for Sparse Localization in Multi-Altitude Scenes	Yaxuan Li et.al.	2508.01936	null
2025-10-16	Orbital angular momentum of entangled photons as a probe for relativistic effects	Fazilah Nothlawala et.al.	2508.01716	null
2025-08-02	Rim destabilization and re-formation upon severance from its expanding sheet	M. Kharbedia et.al.	2508.01308	null
2025-10-16	UniEgoMotion: A Unified Model for Egocentric Motion Reconstruction, Forecasting, and Generation	Chaitanya Patel et.al.	2508.01126	null
2025-08-01	Counting topological interface modes using simplicial characteristic classes	N. Bohlsen et.al.	2508.01063	null
2025-08-01	3D Reconstruction via Incremental Structure From Motion	Muhammad Zeeshan et.al.	2508.01019	null
2025-08-01	GeoMoE: Divide-and-Conquer Motion Field Modeling with Mixture-of-Experts for Two-View Geometry	Jiajun Le et.al.	2508.00592	null
2025-08-01	TopoDiffuser: A Diffusion-Based Multimodal Trajectory Prediction Model with Topometric Maps	Zehui Xu et.al.	2508.00303	null
2025-07-30	X-NeMo: Expressive Neural Motion Reenactment via Disentangled Latent Attention	Xiaochen Zhao et.al.	2507.23143	null
2025-07-30	Eddy population based model for the wall-pressure spectrum at high Reynolds number	Jonathan M. O. Massey et.al.	2507.23098	null
2025-08-01	Segment Anything for Video: A Comprehensive Review of Video Object Segmentation and Tracking from Past to Future	Guoping Xu et.al.	2507.22792	null
2025-08-14	A Linear N-Point Solver for Structure and Motion from Asynchronous Tracks	Hang Su et.al.	2507.22733	null
2025-07-29	Probing Turbulence, Gravity, Supernovae, and Magnetic Field Effects with the 6D Kinematics of Young Stars in Milky Way Star-Forming Regions	Benjamin N. Velguth et.al.	2507.22107	null
2025-07-28	Projecting the New Body: How Body Image Evolves During Learning to Walk with a Wearable Robot	I-Chieh Lee et.al.	2507.21384	null
2025-07-28	FED-PsyAU: Privacy-Preserving Micro-Expression Recognition via Psychological AU Coordination and Dynamic Facial Motion Modeling	Jingting Li et.al.	2507.20557	null
2025-07-27	Model-Structured Neural Networks to Control the Steering Dynamics of Autonomous Race Cars	Mattia Piccinini et.al.	2507.20427	null
2025-07-27	Motion-example-controlled Co-speech Gesture Generation Leveraging Large Language Models	Bohong Chen et.al.	2507.20220	null
2025-07-27	Unveiling the Sagittarius Dwarf Spheroidal Galaxy Core with Gaia DR3	Ellie K. H. Toguchi-Tani et.al.	2507.20212	null
2025-07-27	PUMPS: Skeleton-Agnostic Point-based Universal Motion Pre-Training for Synthesis in Human Motion Tasks	Clinton Ansun Mo et.al.	2507.20170	null
2025-10-04	RESCUE: Crowd Evacuation Simulation via Controlling SDM-United Characters	Xiaolin Liu et.al.	2507.20117	null
2025-07-26	Nonlinear causality of Israel-Stewart theory with diffusion	Ian Cordeiro et.al.	2507.20064	null
2025-07-26	TrackAny3D: Transferring Pretrained 3D Models for Category-unified 3D Point Cloud Tracking	Mengmeng Wang et.al.	2507.19908	null
2025-11-08	RaGS: Unleashing 3D Gaussian Splatting from 4D Radar and Monocular Cues for 3D Object Detection	Xiaokai Bai et.al.	2507.19856	null
2025-07-25	The phase spiral's origin and evolution: indications from its varying properties across the Milky Way disk	Axel Widmark et.al.	2507.19579	null
2025-08-02	GS-Occ3D: Scaling Vision-only Occupancy Reconstruction with Gaussian Splatting	Baijun Ye et.al.	2507.19451	null
2025-11-10	A multi-dynamic low-rank deep image prior (ML-DIP) for 3D real-time cardiovascular MRI	Chong Chen et.al.	2507.19404	null
2025-07-25	NerT-CA: Efficient Dynamic Reconstruction from Sparse-view X-ray Coronary Angiography	Kirsten W. H. Maas et.al.	2507.19328	null
2025-07-31	MVG4D: Image Matrix-Based Multi-View and Motion Generation for 4D Content Creation from a Single Image	DongFu Yin et.al.	2507.18371	null
2025-07-23	Zero-Shot Dynamic Concept Personalization with Grid-Based LoRA	Rameen Abdal et.al.	2507.17963	null
2025-07-23	MCM: Mamba-based Cardiac Motion Tracking using Sequential Images in MRI	Jiahui Yin et.al.	2507.17678	null
2025-07-23	Constraints on Axion Dark Matter by Spin-Dependent Macroscopic Force	Dongyi Yang et.al.	2507.17148	null
2025-10-01	A Tutorial on MRI Reconstruction: From Modern Methods to Clinical Implications	Tolga Çukur et.al.	2507.16715	null
2025-07-22	Dyna3DGR: 4D Cardiac Motion Tracking with Dynamic 3D Gaussian Representation	Xueming Fu et.al.	2507.16608	null
2025-07-22	Sparse-View 3D Reconstruction: Recent Advances and Open Challenges	Tanveer Younis et.al.	2507.16406	null
2025-07-22	MotionShot: Adaptive Motion Transfer across Arbitrary Objects for Text-to-Video Generation	Yanchen Liu et.al.	2507.16310	null
2025-07-22	Universal Wavelet Units in 3D Retinal Layer Segmentation	An D. Le et.al.	2507.16119	null
2025-09-24	Interpretable Embeddings of Speech Enhance and Explain Brain Encoding Performance of Audio Models	Riki Shimizu et.al.	2507.16080	null
2025-07-21	Relationship between Structure and Dynamics of an Icosahedral Quasicrystal using Unsupervised Machine Learning	Edwin A. Bedolla-Montiel et.al.	2507.15731	null
2025-07-21	Hi^2-GSLoc: Dual-Hierarchical Gaussian-Specific Visual Relocalization for Remote Sensing	Boni Hu et.al.	2507.15683	null
2025-08-28	Edge-effects in the turbulent flow over flexible aquatic vegetation	Giulio Foggi Rota et.al.	2507.15477	null
2025-07-21	Low-Latency Event-Based Velocimetry for Quadrotor Control in a Narrow Pipe	Leonard Bauersfeld et.al.	2507.15444	null
2025-07-21	Few-Shot Object Detection via Spatial-Channel State Space Model	Zhimeng Xin et.al.	2507.15308	null
2025-10-11	TinyIO: Lightweight Reparameterized Inertial Odometry	Shanshan Zhang et.al.	2507.15293	null
2025-10-24	An Evaluation of DUSt3R/MASt3R/VGGT 3D Reconstruction on Photogrammetric Aerial Blocks	Xinyi Wu et.al.	2507.14798	null
2025-07-20	Flow Equivariant Recurrent Neural Networks	T. Anderson Keller et.al.	2507.14793	null
2025-07-19	The Serpent Eating Its Own Tail: Dust Destruction in the Apep Colliding-Wind Nebula	Ryan M. T. White et.al.	2507.14610	null
2025-07-19	BT-TL-DMPs: A Novel Robot TAMP Framework Combining Behavior Tree, Temporal Logic and Dynamical Movement Primitives	Zezhi Liu et.al.	2507.14582	null
2025-07-19	Motion Segmentation and Egomotion Estimation from Event-Based Normal Flow	Zhiyuan Hua et.al.	2507.14500	null
2025-07-18	DUSTrack: Semi-automated point tracking in ultrasound videos	Praneeth Namburi et.al.	2507.14368	null
2025-07-18	Efficient Variational Dynamics of Open Quantum Bosonic Systems via Automatic Differentiation	Jacopo Tosca et.al.	2507.14076	null
2025-07-29	DreamScene: 3D Gaussian-based End-to-end Text-to-3D Scene Generation	Haoran Li et.al.	2507.13985	null
2025-07-18	Gaussian kernel-based motion measurement	Hongyi Liu et.al.	2507.13693	null
2025-10-20	Moving Object Detection from Moving Camera Using Focus of Expansion Likelihood and Segmentation	Masahiro Ogawa et.al.	2507.13628	null
2025-07-16	Enhancing In-Domain and Out-Domain EmoFake Detection via Cooperative Multilingual Speech Foundation Models	Orchid Chetia Phukan et.al.	2507.12595	null
2025-07-16	BRUM: Robust 3D Vehicle Reconstruction from 360 Sparse Images	Davide Di Nucci et.al.	2507.12095	null
2025-07-16	Spatial Frequency Modulation for Semantic Segmentation	Linwei Chen et.al.	2507.11893	null
2025-07-14	Supporting SENĆOTEN Language Documentation Efforts with Automatic Speech Recognition	Mengzhe Geng et.al.	2507.10827	null
2025-07-11	Review of Feed-forward 3D Reconstruction: From DUSt3R to VGGT	Wei Zhang et.al.	2507.08448	null
2025-07-04	MGSfM: Multi-Camera Geometry Driven Global Structure-from-Motion	Peilin Tao et.al.	2507.03306	null
2025-06-30	Towards Initialization-free Calibrated Bundle Adjustment	Carl Olsson et.al.	2506.23808	null
2025-06-30	AttentionGS: Towards Initialization-Free 3D Gaussian Splatting via Structural Attention	Ziao Liu et.al.	2506.23611	null
2025-06-27	Single-Scanline Relative Pose Estimation for Rolling Shutter Cameras	Petr Hruby et.al.	2506.22069	null
2025-06-24	ICP-3DGS: SfM-free 3D Gaussian Splatting for Large-scale Unbounded Scenes	Chenhao Zhang et.al.	2506.21629	null
2025-07-08	Wild refitting for black box prediction	Martin J. Wainwright et.al.	2506.21460	null
2025-06-24	Experimental Assessment of Neural 3D Reconstruction for Small UAV-based Applications	Genís Castillo Gómez-Raya et.al.	2506.19491	null
2025-06-23	ViDAR: Video Diffusion-Aware 4D Reconstruction From Monocular Inputs	Michal Nazarczuk et.al.	2506.18792	null
2025-06-23	Room temperature spin injection into commercial VCSELs at non-resonant wavelengths	Timur Almabetov et.al.	2506.18376	null
2025-06-11	OWSM-Biasing: Contextualizing Open Whisper-Style Speech Models for Automatic Speech Recognition with Dynamic Vocabulary	Yui Sudo et.al.	2506.09448	null
2025-06-06	SurGSplat: Progressive Geometry-Constrained Gaussian Splatting for Surgical Scene Reconstruction	Yuchao Zheng et.al.	2506.05935	null
2025-06-05	On-the-fly Reconstruction for Large-Scale Novel View Synthesis from Unposed Images	Andreas Meuleman et.al.	2506.05558	null
2025-06-05	SupeRANSAC: One RANSAC to Rule Them All	Daniel Barath et.al.	2506.04803	link
2025-06-04	Voyager: Long-Range and World-Consistent Video Diffusion for Explorable 3D Scene Generation	Tianyu Huang et.al.	2506.04225	null
2025-06-04	Accelerating SfM-based Pose Estimation with Dominating Set	Joji Joseph et.al.	2506.03667	null
2025-06-03	Nearby dwarf galaxies with extreme star formation rates: a window into dwarf-galaxy evolution in the early Universe	S. Kaviraj et.al.	2506.03265	null
2025-06-02	Fast and Robust Rotation Averaging with Anisotropic Coordinate Descent	Yaroslava Lochman et.al.	2506.01940	null
2025-06-03	Improving Multilingual Speech Models on ML-SUPERB 2.0: Fine-tuning with Data Augmentation and LID-Aware CTC	Qingzheng Wang et.al.	2505.24200	null
2025-05-29	Rooms from Motion: Un-posed Indoor 3D Object Detection as Localization and Mapping	Justin Lazarow et.al.	2505.23756	null
2025-05-30	FAMA: The First Large-Scale Open-Science Speech Foundation Model for English and Italian	Sara Papi et.al.	2505.22759	link
2025-05-28	UAVPairs: A Challenging Benchmark for Match Pair Retrieval of Large-scale UAV Images	Junhuan Liu et.al.	2505.22098	null
2025-05-28	Fast Feature Matching of UAV Images via Matrix Band Reduction-based GPU Data Schedule	San Jiang et.al.	2505.22089	null
2025-05-30	Towards Robust Assessment of Pathological Voices via Combined Low-Level Descriptors and Foundation Model Representations	Whenty Ariyanti et.al.	2505.21356	null
2025-05-27	Intern-GS: Vision Model Guided Sparse-View 3D Gaussian Splatting	Xiangyu Sun et.al.	2505.20729	null
2025-05-26	Robust fine-tuning of speech recognition models via model merging: application to disordered speech	Alexandre Ducorroy et.al.	2505.20477	null
2025-05-29	Sparse2DGS: Sparse-View Surface Reconstruction using 2D Gaussian Splatting with Dense Point Cloud	Natsuki Takama et.al.	2505.19854	null
2025-05-25	Improving Novel view synthesis of 360 $^\circ$ Scenes in Extremely Sparse Views by Jointly Training Hemisphere Sampled Synthetic Images	Guangan Chen et.al.	2505.19264	link
2025-05-24	Token-Level Logits Matter: A Closer Look at Speech Foundation Models for Ambiguous Emotion Recognition	Jule Valendo Halim et.al.	2505.18484	null
2025-05-22	Tracking the Flight: Exploring a Computational Framework for Analyzing Escape Responses in Plains Zebra (Equus quagga)	Isla Duporge et.al.	2505.16882	link
2025-05-21	A Taxonomy of Structure from Motion Methods	Federica Arrigoni et.al.	2505.15814	null
2025-05-18	Shallow Flow Matching for Coarse-to-Fine Text-to-Speech Synthesis	Dong Yang et.al.	2505.12226	null
2025-05-15	Mapping Semantic Segmentation to Point Clouds Using Structure from Motion for Forest Analysis	Francisco Raverta Capua et.al.	2505.10751	link
2025-05-13	Unveiling the Best Practices for Applying Speech Foundation Models to Speech Intelligibility Prediction for Hearing-Impaired People	Haoshuai Zhou et.al.	2505.08215	null
2025-05-12	RDD: Robust Feature Detector and Descriptor using Deformable Transformer	Gonglin Chen et.al.	2505.08013	null
2025-05-12	Geometric Prior-Guided Neural Implicit Surface Reconstruction in the Wild	Lintao Xiang et.al.	2505.07373	null
2025-05-11	Symmetry in Fundamental Parameters of Galaxies on the Star-forming Main Sequence	Zhicheng He et.al.	2505.06868	null
2025-05-10	TPK: Trustworthy Trajectory Prediction Integrating Prior Knowledge For Interpretability and Kinematic Feasibility	Marius Baden et.al.	2505.06743	null
2025-05-08	DiffusionSfM: Predicting Structure and Motion via Ray Origin and Endpoint Diffusion	Qitao Zhao et.al.	2505.05473	null
2025-05-20	FastMap: Revisiting Dense and Scalable Structure from Motion	Jiahao Li et.al.	2505.04612	link
2025-05-15	Estimating the Diameter at Breast Height of Trees in a Forest With a Single 360 Camera	Siming He et.al.	2505.03093	null
2025-05-03	AquaGS: Fast Underwater Scene Reconstruction with SfM-Free Gaussian Splatting	Junhao Shi et.al.	2505.01799	null
2025-05-03	PosePilot: Steering Camera Pose for Generative World Models with Self-supervised Depth	Bu Jin et.al.	2505.01729	null
2025-05-01	Are Minimal Radial Distortion Solvers Really Necessary for Relative Pose Estimation?	Viktor Kocur et.al.	2505.00866	link
2025-04-29	Large-scale visual SLAM for in-the-wild videos	Shuo Sun et.al.	2504.20496	null
2025-04-29	Sparse2DGS: Geometry-Prioritized Gaussian Splatting for Surface Reconstruction from Sparse Views	Jiang Wu et.al.	2504.20378	link
2025-04-28	MP-SfM: Monocular Surface Priors for Robust Structure-from-Motion	Zador Pataki et.al.	2504.20040	link
2025-04-24	Dynamic Camera Poses and Where to Find Them	Chris Rockwell et.al.	2504.17788	null
2025-04-24	EdgePoint2: Compact Descriptors for Superior Efficiency and Accuracy	Haodi Yao et.al.	2504.17280	null
2025-04-23	A Low-Cost Photogrammetry System for 3D Plant Modeling and Phenotyping	Joe Hrzich et.al.	2504.16840	null
2025-04-23	PRaDA: Projective Radial Distortion Averaging	Daniil Sinitsyn et.al.	2504.16499	null
2025-04-21	Traversing the Star-Forming Main Sequence with Molecular Gas Stacks of z~1.6 Cluster Galaxies	Alex Pigarelli et.al.	2504.15381	null
2025-04-21	Towards Understanding Camera Motions in Any Video	Zhiqiu Lin et.al.	2504.15376	null
2025-04-21	StableQuant: Layer Adaptive Post-Training Quantization for Speech Foundation Models	Yeona Hong et.al.	2504.14915	null
2025-04-17	Volume Encoding Gaussians: Transfer Function-Agnostic 3D Gaussians for Volume Rendering	Landon Dyken et.al.	2504.13339	null
2025-04-15	EDGS: Eliminating Densification for Efficient Convergence of 3DGS	Dmytro Kotovenko et.al.	2504.13204	null
2025-04-15	Deep Learning-based Bathymetry Retrieval without In-situ Depths using Remote Sensing Imagery and SfM-MVS DSMs with Data Gaps	Panagiotis Agrafiotis et.al.	2504.11416	link
2025-04-12	A Constrained Optimization Approach for Gaussian Splatting from Coarsely-posed Images and Noisy Lidar Point Clouds	Jizong Peng et.al.	2504.09129	null
2025-04-11	Stereophotoclinometry Revisited	Travis Driver et.al.	2504.08252	null
2025-04-08	Implementation of a Zed 2i Stereo Camera for High-Frequency Shoreline Change and Coastal Elevation Monitoring	José A. Pilartes-Congo et.al.	2504.06464	null
2025-04-07	Decoding the variability in the star-formation histories of z ~ 0.8 galaxies	Jenny T. Wan et.al.	2504.05281	null
2025-04-05	3R-GS: Best Practice in Optimizing Camera Poses Along with 3DGS	Zhisheng Huang et.al.	2504.04294	null
2025-04-04	An Algebraic Geometry Approach to Viewing Graph Solvability	Federica Arrigoni et.al.	2504.03637	null
2025-04-04	Endo3R: Unified Online Reconstruction from Dynamic Monocular Endoscopic Video	Jiaxin Guo et.al.	2504.03198	null
2025-04-03	Adaptive Frequency Enhancement Network for Remote Sensing Image Semantic Segmentation	Feng Gao et.al.	2504.02647	link
2025-04-09	FIORD: A Fisheye Indoor-Outdoor Dataset with LIDAR Ground Truth for 3D Scene Reconstruction and Benchmarking	Ulas Gunes et.al.	2504.01732	null
2025-03-31	LITA-GS: Illumination-Agnostic Novel View Synthesis via Reference-Free 3D Gaussian Splatting and Physical Priors	Han Zhou et.al.	2504.00219	null
2025-03-30	AnyCam: Learning to Recover Camera Poses and Intrinsics from Casual Videos	Felix Wimbauer et.al.	2503.23282	link
2025-03-24	Ground Penetrating Radar-Assisted Multimodal Robot Odometry Using Subsurface Feature Matrix	Haifeng Li et.al.	2503.18301	null
2025-03-22	3D Modeling: Camera Movement Estimation and path Correction for SFM Model using the Combination of Modified A-SIFT and Stereo System	Usha Kumari et.al.	2503.17668	null
2025-03-25	ProtoGS: Efficient and High-Quality Rendering with 3D Gaussian Prototypes	Zhengqing Gao et.al.	2503.17486	null
2025-03-21	ColabSfM: Collaborative Structure-from-Motion by Point Cloud Registration	Johan Edstedt et.al.	2503.17093	link
2025-03-20	From Monocular Vision to Autonomous Action: Guiding Tumor Resection via 3D Reconstruction	Ayberk Acar et.al.	2503.16263	null
2025-03-22	Euclid Quick Data Release (Q1). A first view of the star-forming main sequence in the Euclid Deep Fields	Euclid Collaboration et.al.	2503.15314	null
2025-03-18	Multi-view Reconstruction via SfM-guided Monocular Depth Estimation	Haoyu Guo et.al.	2503.14483	null
2025-03-18	A-SCoRe: Attention-based Scene Coordinate Regression for wide-ranging scenarios	Huy-Hoang Bui et.al.	2503.13982	link
2025-03-17	Improving Geometric Consistency for 360-Degree Neural Radiance Fields in Indoor Scenarios	Iryna Repinetska et.al.	2503.13710	null
2025-03-17	Gaussian On-the-Fly Splatting: A Progressive Framework for Robust Near Real-Time 3DGS Optimization	Yiwei Xu et.al.	2503.13086	null
2025-03-15	SFMNet: Sparse Focal Modulation for 3D Object Detection	Oren Shrout et.al.	2503.12093	null
2025-03-11	A Framework for Reducing the Complexity of Geometric Vision Problems and its Application to Two-View Triangulation with Approximation Bounds	Felix Rydell et.al.	2503.08142	null
2025-03-11	DaD: Distilled Reinforcement Learning for Diverse Keypoint Detection	Johan Edstedt et.al.	2503.07347	link
2025-03-18	Endo-FASt3r: Endoscopic Foundation model Adaptation for Structure from motion	Mona Sheikh Zeinoddin et.al.	2503.07204	null
2025-03-10	VidBot: Learning Generalizable 3D Actions from In-the-Wild 2D Human Videos for Zero-Shot Robotic Manipulation	Hanzhi Chen et.al.	2503.07135	null
2025-03-09	AxisPose: Model-Free Matching-Free Single-Shot 6D Object Pose Estimation via Axis Generation	Yang Zou et.al.	2503.06660	null
2025-03-07	LiDAR-enhanced 3D Gaussian Splatting Mapping	Jian Shen et.al.	2503.05425	null
2025-03-06	PLMP -- Point-Line Minimal Problems for Projective SfM	Kim Kiehn et.al.	2503.04351	null
2025-03-03	MUSt3R: Multi-view Network for Stereo 3D Reconstruction	Yohann Cabon et.al.	2503.01661	link
2025-03-03	ecg2o: A Seamless Extension of g2o for Equality-Constrained Factor Graph Optimization	Anas Abdelkarim et.al.	2503.01311	link
2025-03-05	A Multi-Sensor Fusion Approach for Rapid Orthoimage Generation in Large-Scale UAV Mapping	Jialei He et.al.	2503.01202	null
2025-03-02	MTReD: 3D Reconstruction Dataset for Fly-over Videos of Maritime Domain	Rui Yi Yong et.al.	2503.00853	null
2025-03-02	PSRGS:Progressive Spectral Residual of 3D Gaussian for High-Frequency Recovery	BoCheng Li et.al.	2503.00848	null
2025-03-02	Multi-Cali Anything: Dense Feature Multi-Frame Structure-from-Motion for Large-Scale Camera Array Calibration	Jinjiang You et.al.	2503.00737	link
2025-02-28	The THESAN-ZOOM project: Burst, quench, repeat -- unveiling the evolution of high-redshift galaxies along the star-forming main sequence	William McClymont et.al.	2503.00106	null
2025-02-27	Best Foot Forward: Robust Foot Reconstruction in-the-wild	Kyle Fogarty et.al.	2502.20511	null
2025-02-26	SLAM in the Dark: Self-Supervised Learning of Pose, Depth and Loop-Closure from Thermal Images	Yangfan Xu et.al.	2502.18932	null
2025-03-04	Unposed Sparse Views Room Layout Reconstruction in the Age of Pretrain Model	Yaxuan Huang et.al.	2502.16779	null
2025-02-20	CDGS: Confidence-Aware Depth Regularization for 3D Gaussian Splatting	Qilin Zhang et.al.	2502.14684	link
2025-02-19	Structure-from-Sherds++: Robust Incremental 3D Reassembly of Axially Symmetric Pots from Unordered and Mixed Fragment Collections	Seong Jong Yoo et.al.	2502.13986	null
2025-02-19	IM360: Textured Mesh Reconstruction for Large-scale Indoor Mapping with 360 $^\circ$ Cameras	Dongki Jung et.al.	2502.12545	null
2025-02-12	Causal Analysis of ASR Errors for Children: Quantifying the Impact of Physiological, Cognitive, and Extrinsic Factors	Vishwanath Pratap Singh et.al.	2502.08587	null
2025-02-10	FOCUS -- Multi-View Foot Reconstruction From Synthetically Trained Dense Correspondences	Oliver Boyne et.al.	2502.06367	link
2025-02-09	Audio-Visual Representation Learning via Knowledge Distillation from Speech Foundation Models	Jing-Xuan Zhang et.al.	2502.05766	link
2025-02-10	Building Rome with Convex Optimization	Haoyu Han et.al.	2502.04640	null
2025-02-04	SiLVR: Scalable Lidar-Visual Radiance Field Reconstruction with Uncertainty Quantification	Yifu Tao et.al.	2502.02657	null
2025-02-05	GP-GS: Gaussian Processes for Enhanced Gaussian Splatting	Zhihao Guo et.al.	2502.02283	link
2025-02-03	XR-VIO: High-precision Visual Inertial Odometry with Fast Initialization for XR Applications	Shangjin Zhai et.al.	2502.01297	null
2025-01-29	Segmentation-Aware Generative Reinforcement Network (GRN) for Tissue Layer Segmentation in 3-D Ultrasound Images for Chronic Low-back Pain (cLBP) Assessment	Zixue Zeng et.al.	2501.17690	link
2025-01-28	Automatic Calibration of a Multi-Camera System with Limited Overlapping Fields of View for 3D Surgical Scene Reconstruction	Tim Flückiger et.al.	2501.16221	null
2025-01-25	Towards Better Robustness: Progressively Joint Pose-3DGS Learning for Arbitrarily Long Videos	Zhen-Hui Dong et.al.	2501.15096	null
2025-01-24	MATCHA:Towards Matching Anything	Fei Xue et.al.	2501.14945	null
2025-01-24	Light3R-SfM: Towards Feed-forward Structure-from-Motion	Sven Elflein et.al.	2501.14914	null
2025-01-24	Dense-SfM: Structure from Motion with Dense Consistent Matching	JongMin Lee et.al.	2501.14277	null
2025-01-21	Theory of quantum-geometric charge and spin Josephson diode effects in strongly spin-polarized hybrid structures with noncoplanar spin textures	Niklas L. Schulz et.al.	2501.12232	null
2025-01-14	Selective Attention Merging for low resource tasks: A case study of Child ASR	Natarajan Balaji Shankar et.al.	2501.08468	link
2025-01-14	SplatMAP: Online Dense Monocular SLAM with 3D Gaussian Splatting	Yue Hu et.al.	2501.07015	null
2025-02-02	CULTURE3D: Cultural Landmarks and Terrain Dataset for 3D Applications	Xinyi Zheng et.al.	2501.06927	link
2025-01-11	Aug3D: Augmenting large scale outdoor datasets for Generalizable Novel View Synthesis	Aditya Rauniyar et.al.	2501.06431	null
2025-01-09	Existence of dynamical fluctuation in AMPT generated data for Au+Au collisions at 10 AGeV	Somen Gope et.al.	2501.05175	null
2025-01-06	Targetless Intrinsics and Extrinsic Calibration of Multiple LiDARs and Cameras with IMU using Continuous-Time Estimation	Yuezhang Lv et.al.	2501.02821	null
2025-01-02	On Unifying Video Generation and Camera Pose Estimation	Chun-Hao Paul Huang et.al.	2501.01409	null
2025-01-02	EasySplat: View-Adaptive Learning makes 3D Gaussian Splatting Easy	Ao Gao et.al.	2501.01003	null
2024-12-30	KeyGS: A Keyframe-Centric Gaussian Splatting Method for Monocular Image Sequences	Keng-Wei Chang et.al.	2412.20767	null
2024-12-27	Dust to Tower: Coarse-to-Fine Photo-Realistic Scene Reconstruction from Sparse Uncalibrated Images	Xudong Cai et.al.	2412.19518	null
2024-12-25	Structured Speaker-Deficiency Adaptation of Foundation Models for Dysarthric and Elderly Speech Recognition	Shujie Hu et.al.	2412.18832	null
2024-12-23	Reconstructing People, Places, and Cameras	Lea Müller et.al.	2412.17806	link
2024-12-18	Foundation Models Meet Low-Cost Sensors: Test-Time Adaptation for Rescaling Disparity for Zero-Shot Metric Depth Estimation	Rémi Marsal et.al.	2412.14103	null
2024-12-16	Speech Foundation Models and Crowdsourcing for Efficient, High-Quality Data Collection	Beomseok Lee et.al.	2412.11978	null
2024-12-18	SplineGS: Robust Motion-Adaptive Spline for Real-Time Dynamic 3D Gaussians from Monocular Video	Jongmin Park et.al.	2412.09982	null
2024-12-12	CoDTS: Enhancing Sparsely Supervised Collaborative Perception with a Dual Teacher-Student Framework	Yushan Han et.al.	2412.08344	null
2024-12-10	Deep Non-rigid Structure-from-Motion Revisited: Canonicalization and Sequence Modeling	Hui Deng et.al.	2412.07230	null
2024-12-08	Unveiling True Talent: The Soccer Factor Model for Skill Evaluation	Alexandre Andorra et.al.	2412.05911	null
2024-12-08	Doppelgangers++: Improved Visual Disambiguation with Geometric 3D Features	Yuanbo Xiangli et.al.	2412.05826	null
2024-12-06	MegaSaM: Accurate, Fast, and Robust Structure and Motion from Casual Dynamic Videos	Zhengqi Li et.al.	2412.04463	null
2024-12-03	ASANet: Asymmetric Semantic Aligning Network for RGB and SAR image land cover classification	Pan Zhang et.al.	2412.02044	link
2024-12-02	SfM-Free 3D Gaussian Splatting via Hierarchical Training	Bo Ji et.al.	2412.01553	link
2024-12-02	MVImgNet2.0: A Larger-scale Dataset of Multi-view Images	Xiaoguang Han et.al.	2412.01430	null
2024-12-02	TAS-TsC: A Data-Driven Framework for Estimating Time of Arrival Using Temporal-Attribute-Spatial Tri-space Coordination of Truck Trajectories	Mengran Li et.al.	2412.01122	null
2024-12-02	Look Ma, No Ground Truth! Ground-Truth-Free Tuning of Structure from Motion and Visual SLAM	Alejandro Fontan et.al.	2412.01116	null
2024-11-27	RoMo: Robust Motion Segmentation Improves Structure from Motion	Lily Goli et.al.	2411.18650	null
2024-11-26	The MAGPI Survey: radial trends in star formation across different cosmological simulations in comparison with observations at $z \sim$ 0.3	Marcie Mun et.al.	2411.17882	null
2024-11-25	Characterizing Stellar and Gas Properties in NGC 628: Spatial Distributions, Radial Gradients, and Resolved Scaling Relations	Peng Wei et.al.	2411.16150	null
2024-11-24	ZeroGS: Training 3D Gaussian Splatting from Unposed Images	Yu Chen et.al.	2411.15779	null
2024-11-20	DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild	Weicai Ye et.al.	2411.13291	null
2024-11-15	SPARS3R: Semantic Prior Alignment and Regularization for Sparse 3D Reconstruction	Yutao Tang et.al.	2411.12592	link
2024-11-15	The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual Localisation, Reconstruction and Radiance Field Methods	Yifu Tao et.al.	2411.10546	null
2024-11-13	4D Gaussian Splatting in the Wild with Uncertainty-Aware Regularization	Mijeong Kim et.al.	2411.08879	null
2024-11-13	Biomass phenotyping of oilseed rape through UAV multi-view oblique imaging with 3DGS and SAM model	Yutao Shen et.al.	2411.08453	null
2024-11-08	From Transparent to Opaque: Rethinking Neural Implicit Surfaces with $α$ -NeuS	Haoran Zhang et.al.	2411.05362	link
2024-10-29	A Cascade Approach for APT Campaign Attribution in System Event Logs: Technique Hunting and Subgraph Matching	Yi-Ting Huang et.al.	2410.22602	null
2024-10-29	LiVisSfM: Accurate and Robust Structure-from-Motion with LiDAR and Visual Cues	Hanqing Jiang et.al.	2410.22213	null
2024-10-17	Stochastic Flow Matching for Resolving Small-Scale Physics	Stathi Fotiadis et.al.	2410.19814	null
2024-10-25	A Robust and Efficient Visual-Inertial Initialization with Probabilistic Normal Epipolar Constraint	Changshi Mu et.al.	2410.19473	link
2024-10-30	Large Spatial Model: End-to-end Unposed Images to Semantic 3D	Zhiwen Fan et.al.	2410.18956	link
2024-10-23	CO-CAVITY project: Molecular gas and star formation in void galaxies	M. I. Rodríguez et.al.	2410.18078	null
2024-10-23	PLGS: Robust Panoptic Lifting with 3D Gaussian Splatting	Yu Wang et.al.	2410.17505	null
2024-10-20	Neural Active Structure-from-Motion in Dark and Textureless Environment	Kazuto Ichimaru et.al.	2410.15378	null
2024-10-17	SemSim: Revisiting Weak-to-Strong Consistency from a Semantic Similarity Perspective for Semi-supervised Medical Image Segmentation	Shiao Xie et.al.	2410.13486	null
2024-10-16	Multi-View Multi-Task Modeling with Speech Foundation Models for Speech Forensic Tasks	Orchid Chetia Phukan et.al.	2410.12947	null
2024-10-16	Gravity-aligned Rotation Averaging with Circular Regression	Linfei Pan et.al.	2410.12763	link
2024-10-16	Beyond Speech and More: Investigating the Emergent Ability of Speech Foundation Models for Classifying Physiological Time-Series Signals	Orchid Chetia Phukan et.al.	2410.12645	null
2024-10-15	SplatPose+: Real-time Image-Based Pose-Agnostic 3D Anomaly Detection	Yizhe Liu et.al.	2410.12080	link
2024-10-15	LoGS: Visual Localization via Gaussian Splatting with Fewer Training Images	Yuzhou Cheng et.al.	2410.11505	null
2024-10-15	Multiview Scene Graph	Juexiao Zhang et.al.	2410.11187	link
2024-10-12	Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence	Felipe Cadar et.al.	2410.09533	link
2024-10-09	Surgical Depth Anything: Depth Estimation for Surgical Scenes using Foundation Models	Ange Lou et.al.	2410.07434	null
2024-10-09	Deep HI Mapping of M 106 Group with FAST	Yao Liu et.al.	2410.07038	null
2024-10-09	MaD-Scientist: AI-based Scientist solving Convection-Diffusion-Reaction Equations Using Massive PINN-Based Prior Data	Mingu Kang et.al.	2410.06442	null
2024-10-08	Are Minimal Radial Distortion Solvers Necessary for Relative Pose Estimation?	Charalambos Tzamos et.al.	2410.05984	link
2024-10-04	Refinement of Monocular Depth Maps via Multi-View Differentiable Rendering	Laura Fink et.al.	2410.03861	link
2024-10-01	MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages	Marco Gaido et.al.	2410.01036	link
2024-10-01	Seamless Augmented Reality Integration in Arthroscopy: A Pipeline for Articular Reconstruction and Guidance	Hongchao Shu et.al.	2410.00386	null
2024-09-29	Robust Incremental Structure-from-Motion with Hybrid Features	Shaohui Liu et.al.	2409.19811	null
2024-09-27	MASt3R-SfM: a Fully-Integrated Solution for Unconstrained Structure-from-Motion	Bardienus Duisterhof et.al.	2409.19152	null
2024-09-27	Exploiting Motion Prior for Accurate Pose Estimation of Dashboard Cameras	Yipeng Lu et.al.	2409.18673	null
2024-09-26	BlinkTrack: Feature Tracking over 100 FPS via Events and Images	Yichen Shen et.al.	2409.17981	null
2024-09-25	How to Connect Speech Foundation Models and Large Language Models? What Matters and What Does Not	Francesco Verdini et.al.	2409.17044	null
2024-09-24	Frequency-based View Selection in Gaussian Splatting Reconstruction	Monica M. Q. Li et.al.	2409.16470	null
2024-10-07	Initialization of Monocular Visual Navigation for Autonomous Agents Using Modified Structure from Small Motion	Juan-Diego Florez et.al.	2409.16465	null
2024-09-24	Exploring the potential of collaborative UAV 3D mapping in Kenyan savanna for wildlife research	Vandita Shukla et.al.	2409.15914	null
2024-09-23	Assessment of Submillimeter Precision via Structure from Motion Technique in Close-Range Capture Environments	Francisco Roza de Moraes et.al.	2409.15602	null
2024-09-23	Evaluating Robot Influence on Pedestrian Behavior Models for Crowd Simulation and Benchmarking	Subham Agrawal et.al.	2409.14844	null
2024-09-21	Are Music Foundation Models Better at Singing Voice Deepfake Detection? Far-Better Fuse them with Speech Foundation Models	Orchid Chetia Phukan et.al.	2409.14131	null
2024-09-17	GS-Net: Generalizable Plug-and-Play 3D Gaussian Splatting Module	Yichen Zhang et.al.	2409.11307	null
2024-09-13	Dense Point Clouds Matter: Dust-GS for Scene Reconstruction from Sparse Viewpoints	Shan Chen et.al.	2409.08613	null
2024-09-09	KRONC: Keypoint-based Robust Camera Optimization for 3D Car Reconstruction	Davide Di Nucci et.al.	2409.05407	null
2024-09-06	The Arizona Molecular ISM Survey with the SMT: Variations in the CO(2-1)/CO(1-0) Line Ratio Across the Galaxy Population	Ryan P. Keenan et.al.	2409.03963	null
2024-09-05	Active Galactic Nuclei in the Green Valley at z $\sim$ 0.7	Charity Woodrum et.al.	2409.03197	null
2024-09-04	Object Gaussian for Monocular 6D Pose Estimation from Sparse Views	Luqing Luo et.al.	2409.02581	null
2024-09-11	Geometry-aware Feature Matching for Large-Scale Structure from Motion	Gonglin Chen et.al.	2409.02310	null
2024-09-04	The study of strongly intensive observables for $π^{\pm,0}$ in $pp$ collisions at LHC energy in the framework of PYTHIA model	Tumpa Biswas et.al.	2409.00525	null
2024-09-04	Augmented Reality without Borders: Achieving Precise Localization Without Maps	Albert Gassol Puigjaner et.al.	2408.17373	null
2024-09-15	Mismatched: Evaluating the Limits of Image Matching Approaches and Benchmarks	Sierra Bonilla et.al.	2408.16445	link
2024-08-21	Visual Localization in 3D Maps: Comparing Point Cloud, Mesh, and NeRF Representations	Lintong Zhang et.al.	2408.11966	null
2024-08-20	TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks	Jinjie Mai et.al.	2408.10739	null
2024-08-16	Correspondence-Guided SfM-Free 3D Gaussian Splatting for NVS	Wei Sun et.al.	2408.08723	null
2024-08-15	CorrAdaptor: Adaptive Local Context Learning for Correspondence Pruning	Wei Zhu et.al.	2408.08134	link
2024-08-13	A Miniature Vision-Based Localization System for Indoor Blimps	Shicong Ma et.al.	2408.06648	null
2024-08-07	Towards Real-Time Gaussian Splatting: Accelerating 3DGS through Photometric SLAM	Yan Song Hu et.al.	2408.03825	null
2024-08-05	Context-aware Mamba-based Reinforcement Learning for social robot navigation	Syed Muhammad Mustafa et.al.	2408.02661	null
2024-08-04	Birational geometry of critical loci in Algebraic Vision	Marina Bertolini et.al.	2408.02067	null
2024-08-04	PanicleNeRF: low-cost, high-precision in-field phenotypingof rice panicles with smartphone	Xin Yang et.al.	2408.02053	null
2024-08-02	Structure from Motion-based Motion Estimation and 3D Reconstruction of Unknown Shaped Space Debris	Kentaro Uno et.al.	2408.01035	null
2024-08-01	LoopSparseGS: Loop Based Sparse-View Friendly Gaussian Splatting	Zhenyu Bao et.al.	2408.00254	null
2024-07-29	Global Structure-from-Motion Revisited	Linfei Pan et.al.	2407.20219	link
2024-08-06	Revisit Self-supervised Depth Estimation with Local Structure-from-Motion	Shengjie Zhu et.al.	2407.19166	null
2024-07-23	The Hidden Variables: Harnessing Half-Shell Potentials for Enhanced Precision in Nuclear Reaction Calculations	Hao Liu et.al.	2407.16452	null
2024-07-22	Enhancement of 3D Gaussian Splatting using Raw Mesh for Photorealistic Recreation of Architectures	Ruizhe Wang et.al.	2407.15435	null
2024-07-16	NeuSurfEmb: A Complete Pipeline for Dense Correspondence-based 6D Object Pose Estimation without CAD Models	Francesco Milano et.al.	2407.12207	link
2024-07-15	LVCP: LiDAR-Vision Tightly Coupled Collaborative Real-time Relative Positioning	Zhuozhu Jian et.al.	2407.10782	null
2024-07-15	Towards Scale-Aware Full Surround Monodepth with Transformers	Yuchen Yang et.al.	2407.10406	null
2024-07-14	3DEgo: 3D Editing on the Go!	Umar Khalid et.al.	2407.10102	null
2024-07-10	Hybrid Structure-from-Motion and Camera Relocalization for Enhanced Egocentric Localization	Jinjie Mai et.al.	2407.08023	link
2024-07-10	Euclid preparation. Forecasting the recovery of galaxy physical properties and their relations with template-fitting and machine-learning methods	Euclid Collaboration et.al.	2407.07940	null
2024-07-10	Controlling Space and Time with Diffusion Models	Daniel Watson et.al.	2407.07860	null
2024-07-09	Computer vision tasks for intelligent aerospace missions: An overview	Huilin Chen et.al.	2407.06513	null
2024-07-08	Enhancing Neural Radiance Fields with Depth and Normal Completion Priors from Sparse Views	Jiawei Guo et.al.	2407.05666	null
2024-07-05	Efficient Detection of Long Consistent Cycles and its Application to Distributed Synchronization	Shaohan Li et.al.	2407.04260	null
2024-07-15	SfM on-the-fly: Get better 3D from What You Capture	Zongqian Zhan et.al.	2407.03939	null
2024-07-03	Free-SurGS: SfM-Free 3D Gaussian Splatting for Surgical Scene Reconstruction	Jiaxin Guo et.al.	2407.02918	link
2024-07-02	Indoor 3D Reconstruction with an Unknown Camera-Projector Pair	Zhaoshuai Qi et.al.	2407.01945	null
2024-06-27	SALVe: Semantic Alignment Verification for Floorplan Reconstruction from Sparse Panoramas	John Lambert et.al.	2406.19390	link
2024-06-27	STAL3D: Unsupervised Domain Adaptation for 3D Object Detection via Collaborating Self-Training and Adversarial Learning	Yanan Zhang et.al.	2406.19362	null
2024-06-26	VDG: Vision-Only Dynamic Gaussian for Driving Simulation	Hao Li et.al.	2406.18198	null
2024-06-25	Consensus Learning with Deep Sets for Essential Matrix Estimation	Dror Moran et.al.	2406.17414	link
2024-06-24	Crowd-Sourced NeRF: Collecting Data from Production Vehicles for 3D Street View Reconstruction	Tong Qin et.al.	2406.16289	null
2024-06-21	The importance of stochasticity in determining galaxy emissivities and UV LFs during cosmic dawn and reionization	Ivan Nikolić et.al.	2406.15237	link
2024-06-19	MVSBoost: An Efficient Point Cloud-based 3D Reconstruction	Umair Haroon et.al.	2406.13515	null
2024-06-17	MegaScenes: Scene-Level View Synthesis at Scale	Joseph Tung et.al.	2406.11819	link
2024-06-15	Benchmarking Children's ASR with Supervised and Self-supervised Speech Foundation Models	Ruchao Fan et.al.	2406.10507	link
2024-06-14	On the Evaluation of Speech Foundation Models for Spoken Language Understanding	Siddhant Arora et.al.	2406.10083	null
2024-06-12	Self-supervised Learning of Neural Implicit Feature Fields for Camera Pose Refinement	Maxime Pietrantoni et.al.	2406.08463	null
2024-06-12	SVSNet+: Enhancing Speaker Voice Similarity Assessment Models with Representations from Speech Foundation Models	Chun Yin et.al.	2406.08445	null
2024-06-10	Lighting Every Darkness with 3DGS: Fast Training and Real-Time Rendering for HDR View Synthesis	Xin Jin et.al.	2406.06216	link
2024-06-07	The Star-Forming Main Sequence in JADES and CEERS at $z>1.4$ : Investigating the Burstiness of Star Formation	Leonardo Clarke et.al.	2406.05178	null
2024-06-13	Gaussian Splatting with Localized Points Management	Haosen Yang et.al.	2406.04251	null
2024-06-05	L-PR: Exploiting LiDAR Fiducial Marker for Unordered Low Overlap Multiview Point Cloud Registration	Yibo Liu et.al.	2406.03298	link
2024-06-04	CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation	Dejia Xu et.al.	2406.02509	null
2024-05-29	Neural Radiance Fields for Novel View Synthesis in Monocular Gastroscopy	Zijie Jiang et.al.	2405.18863	null
2024-05-29	3D Reconstruction with Fast Dipole Sums	Hanyu Chen et.al.	2405.16788	null
2024-05-26	MCGMapper: Light-Weight Incremental Structure from Motion and Visual Localization With Planar Markers and Camera Groups	Yusen Xie et.al.	2405.16599	null
2024-05-26	Categorical Flow Matching on Statistical Manifolds	Chaoran Cheng et.al.	2405.16441	link
2024-05-22	Exploring Galaxy Properties of eCALIFA with Contrastive Learning	G. Martínez-Solaeche et.al.	2405.13471	null
2024-05-23	Switched Flow Matching: Eliminating Singularities via Switching ODEs	Qunxi Zhu et.al.	2405.11605	null
2024-05-28	NeRO: Neural Road Surface Reconstruction	Ruibo Wang et.al.	2405.10554	link
2024-05-15	Three Dimensional Spatial Cognition: Bees and Bats	Robert Worden et.al.	2405.09413	null
2024-05-09	Similarity Guided Multimodal Fusion Transformer for Semantic Location Prediction in Social Media	Zhizhen Zhang et.al.	2405.05760	null
2024-05-09	Power Variable Projection for Initialization-Free Large-Scale Bundle Adjustment	Simon Weber et.al.	2405.05079	link
2024-05-07	Novel View Synthesis with Neural Radiance Fields for Industrial Robot Applications	Markus Hillemann et.al.	2405.04345	null
2024-05-07	Non-rigid Structure-from-Motion: Temporally-smooth Procrustean Alignment and Spatially-variant Deformation Modeling	Jiawei Shi et.al.	2405.04309	null
2024-05-06	Transformer-based RGB-T Tracking with Channel and Spatial Feature Fusion	Yunfeng Li et.al.	2405.03177	link
2024-05-03	HoloGS: Instant Depth-based 3D Gaussian Splatting with Microsoft HoloLens 2	Miriam Jäger et.al.	2405.02005	null
2024-04-25	The MAGPI Survey: Evolution of radial trends in star formation activity across cosmic time	Marcie Mun et.al.	2404.16319	null
2024-04-22	Scene Coordinate Reconstruction: Posing of Image Collections via Incremental Learning of a Relocalizer	Eric Brachmann et.al.	2404.14351	null
2024-04-22	RESFM: Robust Equivariant Multiview Structure from Motion	Fadi Khatib et.al.	2404.14280	null
2024-04-22	Does Gaussian Splatting need SFM Initialization?	Yalda Foroutan et.al.	2404.12547	null
2024-05-07	A Subspace-Constrained Tyler's Estimator and its Applications to Structure from Motion	Feng Yu et.al.	2404.11590	link
2024-04-18	DeblurGS: Gaussian Splatting for Camera Motion Blur	Jeongtaek Oh et.al.	2404.11358	null
2024-05-21	LetsGo: Large-Scale Garage Modeling and Rendering via LiDAR-Assisted Gaussian Primitives	Jiadi Cui et.al.	2404.09748	null
2024-04-12	MonoPatchNeRF: Improving Neural Radiance Fields with Patch-based Monocular Guidance	Yuqun Wu et.al.	2404.08252	null
2024-04-11	Boosting Self-Supervision for Single-View Scene Completion via Knowledge Distillation	Keonhee Han et.al.	2404.07933	null
2024-04-07	NeRF2Points: Large-Scale Point Cloud Generation From Street Views' Radiance Field Optimization	Peng Tu et.al.	2404.04875	null
2024-04-04	GaSpCT: Gaussian Splatting for Novel CT Projection View Synthesis	Emmanouil Nikolakakis et.al.	2404.03126	null
2024-03-29	InstantSplat: Unbounded Sparse-view Pose-free Gaussian Splatting in 40 Seconds	Zhiwen Fan et.al.	2403.20309	link
2024-03-29	HO-Gaussian: Hybrid Optimization of 3D Gaussian Splatting for Urban Scenes	Zhuopeng Li et.al.	2403.20032	null
2024-03-26	NeRF-HuGS: Improved Neural Radiance Fields in Non-static Scenes Using Heuristics-Guided Segmentation	Jiahao Chen et.al.	2403.17537	null
2024-03-25	INPC: Implicit Neural Point Clouds for Radiance Field Rendering	Florian Hahlbohm et.al.	2403.16862	null
2024-03-18	An Accurate and Real-time Relative Pose Estimation from Triple Point-line Images by Decoupling Rotation and Translation	Zewen Xu et.al.	2403.11639	null
2024-03-14	Relaxing Accurate Initialization Constraint for 3D Gaussian Splatting	Jaewoo Jung et.al.	2403.09413	link
2024-03-13	Refractive COLMAP: Refractive Structure-from-Motion Revisited	Mengkun She et.al.	2403.08640	null
2024-03-13	NeRF-Supervised Feature Point Detection and Description	Ali Youssef et.al.	2403.08156	link
2024-03-11	SiLVR: Scalable Lidar-Visual Reconstruction with Neural Radiance Fields for Robotic Inspection	Yifu Tao et.al.	2403.06877	null
2024-03-24	BAGS: Blur Agnostic Gaussian Splatting through Multi-Scale Kernel Modeling	Cheng Peng et.al.	2403.04926	link
2024-02-22	GaussianPro: 3D Gaussian Splatting with Progressive Propagation	Kai Cheng et.al.	2402.14650	null
2024-02-25	A Robust Error-Resistant View Selection Method for 3D Reconstruction	Shaojie Zhang et.al.	2402.11431	null
2024-02-17	Dense Matchers for Dense Tracking	Tomáš Jelínek et.al.	2402.11287	null
2024-03-11	Local Feature Matching Using Deep Learning: A Survey	Shibiao Xu et.al.	2401.17592	link
2024-01-22	HG3-NeRF: Hierarchical Geometric, Semantic, and Photometric Guided Neural Radiance Fields for Sparse View Inputs	Zelin Gao et.al.	2401.11711	null
2024-01-19	SCENES: Subpixel Correspondence Estimation With Epipolar Supervision	Dominik A. Kloepfer et.al.	2401.10886	null
2024-01-15	3DMASC: Accessible, explainable 3D point clouds classification. Application to Bi-spectral Topo-bathymetric lidar data	Mathilde Letard et.al.	2401.09481	link
2024-01-17	3D Scene Geometry Estimation from 360 $^\circ$ Imagery: A Survey	Thiago Lopes Trugillo da Silveira et.al.	2401.09252	null
2024-01-17	ICON: Incremental CONfidence for Joint Pose and Radiance Field Optimization	Weiyao Wang et.al.	2401.08937	null
2024-01-16	Cross-Modal Semi-Dense 6-DoF Tracking of an Event Camera in Challenging Conditions	Yi-Fan Zuo et.al.	2401.08043	link
2024-01-10	Structure from Duplicates: Neural Inverse Graphics from a Pile of Objects	Tianhang Cheng et.al.	2401.05236	link
2024-01-07	A Classification of Critical Configurations for any Number of Projective Views	Martin Bråtelund et.al.	2401.03450	link
2023-12-24	Residual Learning for Image Point Descriptors	Rashik Shrestha et.al.	2312.15471	null
2023-12-16	Transformers in Unsupervised Structure-from-Motion	Hemang Chawla et.al.	2312.10529	link
2023-12-14	HeadRecon: High-Fidelity 3D Head Reconstruction from Monocular Video	Xueying Wang et.al.	2312.08863	null
2023-12-14	CF-NeRF: Camera Parameter Free Neural Radiance Fields with Incremental Learning	Qingsong Yan et.al.	2312.08760	null
2023-12-11	Keypoint-based Stereophotoclinometry for Characterizing and Navigating Small Bodies: A Factor Graph Approach	Travis Driver et.al.	2312.06865	link
2023-12-11	Gaussian Splatting SLAM	Hidenobu Matsuki et.al.	2312.06741	null
2023-12-10	SuperPrimitive: Scene Reconstruction at a Primitive Level	Kirill Mazur et.al.	2312.05889	null
2023-12-07	Visual Geometry Grounded Deep Structure From Motion	Jianyuan Wang et.al.	2312.04563	null
2023-11-30	Distributed Global Structure-from-Motion with a Deep Front-End	Ayush Baid et.al.	2311.18801	link
2023-11-21	Robot Hand-Eye Calibration using Structure-from-Motion	Nicolas Andreff et.al.	2311.11808	null
2023-11-18	LOSTU: Fast, Scalable, and Uncertainty-Aware Triangulation	Sébastien Henry et.al.	2311.11171	null
2023-11-10	MonoProb: Self-Supervised Monocular Depth Estimation with Interpretable Uncertainty	Rémi Marsal et.al.	2311.06137	link
2023-11-08	VET: Visual Error Tomography for Point Cloud Completion and High-Quality Neural Rendering	Linus Franke et.al.	2311.04634	link
2023-10-22	A Quantitative Evaluation of Dense 3D Reconstruction of Sinus Anatomy from Monocular Endoscopic Video	Jan Emily Mangulabnan et.al.	2310.14364	null
2023-10-20	FMRT: Learning Accurate Feature Matching with Reconciliatory Transformer	Xinyu Zhang et.al.	2310.13605	null
2023-10-09	Colmap-PCD: An Open-source Tool for Fine Image-to-point cloud Registration	Chunge Bai et.al.	2310.05504	link
2023-10-08	LocoNeRF: A NeRF-based Approach for Local Structure from Motion for Precise Localization	Artem Nenashev et.al.	2310.05134	null
2023-11-29	Pose-Free Generalizable Rendering Transformer	Zhiwen Fan et.al.	2310.03704	link
2023-10-02	Leveraging Cutting Edge Deep Learning Based Image Matching for Reconstructing a Large Scene from Sparse Images	Georg Bökman et.al.	2310.01092	null
2023-10-01	Propagating Semantic Labels in Video Data	David Balaban et.al.	2310.00783	null
2023-09-22	Scalable Semantic 3D Mapping of Coral Reefs with Deep Learning	Jonathan Sauder et.al.	2309.12804	null
2023-09-21	On-the-Fly SfM: What you capture is What you get	Zongqian Zhan et.al.	2309.11883	link
2023-09-19	Using an Uncrewed Surface Vehicle to Create a Volumetric Model of Non-Navigable Rivers and Other Shallow Bodies of Water	Jayesh Tripathi et.al.	2309.10269	null
2023-09-16	DynaMoN: Motion-Aware Fast And Robust Camera Localization for Dynamic NeRF	Mert Asim Karaoglu et.al.	2309.08927	link
2023-09-08	Robot Localization and Mapping Final Report -- Sequential Adversarial Learning for Self-Supervised Deep Visual Odometry	Akankshya Kar et.al.	2309.04147	null
2023-09-01	SQLdepth: Generalizable Self-Supervised Fine-Structured Monocular Depth Estimation	Youhong Wang et.al.	2309.00526	null
2023-09-01	Dense Voxel 3D Reconstruction Using a Monocular Event Camera	Haodong Chen et.al.	2309.00385	null
2023-08-30	Learning Structure-from-Motion with Graph Attention Networks	Lucas Brynte et.al.	2308.15984	link
2023-08-26	Disjoint Pose and Shape for 3D Face Reconstruction	Raja Kumar et.al.	2308.13903	null
2023-08-30	CamP: Camera Preconditioning for Neural Radiance Fields	Keunhong Park et.al.	2308.10902	null
2023-08-18	Unsupervised 3D Pose Estimation with Non-Rigid Structure-from-Motion Modeling	Haorui Ji et.al.	2308.10705	null
2023-08-14	Large-scale environment mapping and immersive human-robot interaction for agricultural mobile robot teleoperation	Tao Liu et.al.	2308.07231	link
2023-08-11	Efficient Large-scale AUV-based Visual Seafloor Mapping	Mengkun She et.al.	2308.06147	null
2023-08-04	EDI: ESKF-based Disjoint Initialization for Visual-Inertial SLAM Systems	Weihan Wang et.al.	2308.02670	null
2023-08-15	Tirtha -- An Automated Platform to Crowdsource Images and Create 3D Models of Heritage Sites	Jyotirmaya Shivottam et.al.	2308.01246	link
2023-08-02	Stereo Visual Odometry with Deep Learning-Based Point and Line Feature Matching using an Attention Graph Neural Network	Shenbagaraj Kannapiran et.al.	2308.01125	null
2023-07-27	PointOdyssey: A Large-Scale Synthetic Dataset for Long-Term Point Tracking	Yang Zheng et.al.	2307.15055	link
2023-07-28	SACReg: Scene-Agnostic Coordinate Regression for Visual Localization	Jerome Revaud et.al.	2307.11702	null
2023-07-19	Lazy Visual Localization via Motion Averaging	Siyan Dong et.al.	2307.09981	null
2023-07-10	Efficient Match Pair Retrieval for Large-scale UAV Images via Graph Indexed Global Descriptor	San Jiang et.al.	2307.04520	null
2023-07-07	RGB-D Mapping and Tracking in a Plenoxel Radiance Field	Andreas L. Teigen et.al.	2307.03404	link
2023-06-29	The Drunkard's Odometry: Estimating Camera Motion in Deforming Scenes	David Recasens et.al.	2306.16917	link
2023-06-27	Detector-Free Structure from Motion	Xingyi He et.al.	2306.15669	link
2023-06-28	PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment	Jianyuan Wang et.al.	2306.15667	null
2023-06-24	3D Reconstruction of Spherical Images based on Incremental Structure from Motion	San Jiang et.al.	2306.12770	link
2023-06-15	NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations	Varun Jampani et.al.	2306.09109	link
2023-06-15	Yes, we CANN: Constrained Approximate Nearest Neighbors for local feature-based visual localization	Dror Aiger et.al.	2306.09012	link
2023-06-10	3D reconstruction using Structure for Motion	Kshitij Karnawat et.al.	2306.06360	link
2023-06-02	Self-supervised Interest Point Detection and Description for Fisheye and Perspective Images	Marcela Mera-Trujillo et.al.	2306.01938	null
2023-05-31	FlowCam: Training Generalizable 3D Radiance Fields without Camera Poses via Pixel-Aligned Scene Flow	Cameron Smith et.al.	2306.00180	null
2023-05-19	SIDAR: Synthetic Image Dataset for Alignment & Restoration	Monika Kwiatkowski et.al.	2305.12036	link
2023-05-09	Eiffel Tower: A Deep-Sea Underwater Dataset for Long-Term Visual Localization	Clémentin Boittiaux et.al.	2305.05301	link
2023-05-09	Rotation Synchronization via Deep Matrix Factorization	Gk Tejus et.al.	2305.05268	link
2023-04-20	A Comparative Neural Radiance Field (NeRF) 3D Analysis of Camera Poses from HoloLens Trajectories and Structure from Motion	Miriam Jäger et.al.	2304.10664	null
2023-04-14	Fusing Structure from Motion and Simulation-Augmented Pose Regression from Optical Flow for Challenging Indoor Environments	Felix Ott et.al.	2304.07250	null
2023-04-12	Visual Localization using Imperfect 3D Models from the Internet	Vojtech Panek et.al.	2304.05947	link
2023-04-08	Photometric Correction for Infrared Sensors	Jincheng Zhang et.al.	2304.03930	null
2023-04-07	DualRefine: Self-Supervised Depth and Pose Estimation Through Iterative Epipolar Sampling and Refinement Toward Equilibrium	Antyanta Bangunharcana et.al.	2304.03560	link
2023-04-05	Semantic Validation in Structure from Motion	Joseph Rowell et.al.	2304.02420	link
2023-03-31	Learning Internal Representations of 3D Transformations from 2D Projected Inputs	Marissa Connor et.al.	2303.17776	null
2023-03-30	3D Line Mapping Revisited	Shaohui Liu et.al.	2303.17504	link
2023-03-27	TMO: Textured Mesh Acquisition of Objects with a Mobile Device by using Differentiable Rendering	Jaehoon Choi et.al.	2303.15060	null
2023-03-26	On the Importance of Accurate Geometry Data for Dense 3D Vision Tasks	HyunJun Jung et.al.	2303.14840	link
2023-03-24	Seeing Through the Glass: Neural 3D Reconstruction of Object Inside a Transparent Container	Jinguang Tong et.al.	2303.13805	link
2023-03-24	Progressively Optimized Local Radiance Fields for Robust View Synthesis	Andreas Meuleman et.al.	2303.13791	null
2023-03-15	RefiNeRF: Modelling dynamic neural radiance fields with inconsistent or missing camera parameters	Shuja Khalid et.al.	2303.08695	null
2023-03-09	Revisiting Rotation Averaging: Uncertainties and Robust Losses	Ganlin Zhang et.al.	2303.05195	link
2023-02-28	Nonlinear Intensity, Scale and Rotation Invariant Matching for Multimodal Images	Zhongli Fan et.al.	2302.14239	link
2023-03-25	BLiRF: Bandlimited Radiance Fields for Dynamic Scene Modeling	Sameera Ramasinghe et.al.	2302.13543	null
2023-02-21	EC-SfM: Efficient Covisibility-based Structure-from-Motion for Both Sequential and Unordered Images	Zhichao Ye et.al.	2302.10544	link
2023-02-18	Bridge Damage Cause Estimation Using Multiple Images Based on Visual Question Answering	Tatsuro Yamane et.al.	2302.09208	null
2023-02-12	Uncertainty-Driven Dense Two-View Structure from Motion	Weirong Chen et.al.	2302.00523	null
2023-01-28	AdaSfM: From Coarse Global to Fine Incremental Adaptive Structure from Motion	Yu Chen et.al.	2301.12135	null
2023-01-20	A vision-based autonomous UAV inspection framework for unknown tunnel construction sites with dynamic obstacles	Zhefan Xu et.al.	2301.08422	link
2023-03-21	Robust Dynamic Radiance Fields	Yu-Lun Liu et.al.	2301.02239	link
2022-12-24	Polarimetric Multi-View Inverse Rendering	Jinyu Zhao et.al.	2212.12721	null
2022-12-13	Accidental Turntables: Learning 3D Pose by Watching Objects Turn	Zezhou Cheng et.al.	2212.06300	null
2022-12-04	3D Object Aided Self-Supervised Monocular Depth Estimation	Songlin Wei et.al.	2212.01768	null
2022-12-02	High-Res Facial Appearance Capture from Polarized Smartphone Images	Dejan Azinović et.al.	2212.01160	null
2022-11-28	FeatureBooster: Boosting Feature Descriptors with a Lightweight Neural Network	Xinjiang Wang et.al.	2211.15069	link
2022-11-24	JigsawPlan: Room Layout Jigsaw Puzzle Extreme Structure from Motion using Diffusion Models	Sepidehsadat Hosseini et.al.	2211.13785	null
2022-11-24	SfM-TTR: Using Structure from Motion for Test-Time Refinement of Single-View Depth Networks	Sergio Izquierdo et.al.	2211.13551	link
2022-11-22	Level-S $^2$ fM: Structure from Motion on Neural Level Set of Implicit Surfaces	Yuxi Xiao et.al.	2211.12018	link
2022-11-21	Towards Live 3D Reconstruction from Wearable Video: An Evaluation of V-SLAM, NeRF, and Videogrammetry Techniques	David Ramirez et.al.	2211.11836	null
2022-11-14	Controllable GAN Synthesis Using Non-Rigid Structure-from-Motion	René Haas et.al.	2211.07195	null
2022-10-13	Quantifying and analyzing rock trait distributions of rocky fault scarps using a deep learning approach	Zhiang Chen et.al.	2210.07349	null
2022-10-11	DeepMLE: A Robust Deep Maximum Likelihood Estimator for Two-view Structure from Motion	Yuxi Xiao et.al.	2210.05517	null
2022-10-07	Leveraging Structure from Motion to Localize Inaccessible Bus Stops	Indu Panigrahi et.al.	2210.03646	link
2022-10-01	Structure-Aware NeRF without Posed Camera via Epipolar Constraint	Shu Chen et.al.	2210.00183	link
2022-10-05	FAST-LIO, Then Bayesian ICP, Then GTSFM	Jerred Chen et.al.	2210.00146	null
2022-09-20	BuFF: Burst Feature Finder for Light-Constrained 3D Reconstruction	Ahalya Ravendran et.al.	2209.09470	null
2022-09-19	A Hybrid Cable-Driven Robot for Non-Destructive Leafy Plant Monitoring and Mass Estimation using Structure from Motion	Gerry Chen et.al.	2209.08690	null
2022-09-14	End-to-End Multi-View Structure-from-Motion with Hypercorrelation Volumes	Qiao Chen et.al.	2209.06926	null
2022-09-07	Deployment of Aerial Robots during the Flood Disaster in Erftstadt / Blessem in July 2021	Hartmut Surmann et.al.	2209.03084	null
2022-08-27	Weakly and Semi-Supervised Detection, Segmentation and Tracking of Table Grapes with Limited and Noisy Data	Thomas A. Ciarfuglia et.al.	2208.13001	null
2022-08-12	Handling Constrained Optimization in Factor Graphs for Autonomous Navigation	Barbara Bazzana et.al.	2208.06325	null
2022-08-04	Globally Consistent Video Depth and Pose Estimation with Efficient Test-Time Training	Yao-Chih Lee et.al.	2208.02709	link
2022-07-31	One Object at a Time: Accurate and Robust Structure From Motion for Robots	Aravind Battaje et.al.	2208.00487	null
2022-07-23	Detection and Initial Assessment of Lunar Landing Sites Using Neural Networks	Daniel Posada et.al.	2207.11413	null
2022-07-25	MeshLoc: Mesh-Based Visual Localization	Vojtech Panek et.al.	2207.10762	link
2022-07-19	ParticleSfM: Exploiting Dense Point Trajectories for Localizing Moving Cameras in the Wild	Wang Zhao et.al.	2207.09137	link
2022-07-16	Organic Priors in Non-Rigid Structure from Motion	Suryansh Kumar et.al.	2207.06262	null
2022-07-06	A Novel Hybrid Endoscopic Dataset for Evaluating Machine Learning-based Photometric Image Enhancement Models	Axel Garcia-Vega et.al.	2207.02396	null
2022-06-24	Parallel Structure from Motion for UAV Images via Weighted Connected Dominating Set	San Jiang et.al.	2206.11499	null
2022-06-13	TC-SfM: Robust Track-Community-Based Structure-from-Motion	Lei Wang et.al.	2206.05866	null
2022-06-10	EigenFairing: 3D Model Fairing using Image Coherence	Pragyana Mishra et.al.	2206.05309	null
2022-06-01	Semantic Room Wireframe Detection from a Single View	David Gillsjö et.al.	2206.00491	link
2022-05-31	Geo-Neus: Geometry-Consistent Neural Implicit Surfaces Learning for Multi-view Reconstruction	Qiancheng Fu et.al.	2205.15848	null
2022-05-09	Is my Depth Ground-Truth Good Enough? HAMMER -- Highly Accurate Multi-Modal Dataset for DEnse 3D Scene Regression	HyunJun Jung et.al.	2205.04565	null
2022-05-07	Optimizing Terrain Mapping and Landing Site Detection for Autonomous UAVs	Pedro F. Proença et.al.	2205.03522	null
2022-05-06	EVIMO2: An Event Camera Dataset for Motion Segmentation, Optical Flow, Structure from Motion, and Visual Inertial Odometry in Indoor Scenes with Monocular or Stereo Algorithms	Levi Burner et.al.	2205.03467	null
2022-04-20	Learned Monocular Depth Priors in Visual-Inertial Initialization	Yunwen Zhou et.al.	2204.09171	null
2022-04-10	Deep Non-rigid Structure-from-Motion: A Sequence-to-Sequence Translation Perspective	Hui Deng et.al.	2204.04730	null
2022-04-08	Constrained Bundle Adjustment for Structure From Motion Using Uncalibrated Multi-Camera Systems	Debao Huang et.al.	2204.04145	null
2022-04-07	SurroundDepth: Entangling Surrounding Views for Self-Supervised Multi-Camera Depth Estimation	Yi Wei et.al.	2204.03636	link
2022-04-06	Georeferencing of Photovoltaic Modules from Aerial Infrared Videos using Structure-from-Motion	Lukas Bommes et.al.	2204.02733	link
2022-04-05	Depth-Guided Sparse Structure-from-Motion for Movies and TV Shows	Sheng Liu et.al.	2204.02509	link
2022-03-31	Fast, Accurate and Memory-Efficient Partial Permutation Synchronization	Shaohan Li et.al.	2203.16505	null
2022-03-28	Visual Odometry for RGB-D Cameras	Afonso Fontes et.al.	2203.15119	null
2022-03-28	Optimizing Elimination Templates by Greedy Parameter Search	Evgeniy Martyushev et.al.	2203.14901	link
2022-03-23	Event-Based Dense Reconstruction Pipeline	Kun Xiao et.al.	2203.12270	null
2022-03-21	DiffPoseNet: Direct Differentiable Camera Pose Estimation	Chethan M. Parameshwara et.al.	2203.11174	null
2022-03-02	Asynchronous Optimisation for Event-based Visual Odometry	Daqi Liu et.al.	2203.01037	null
2022-03-02	Distributed Riemannian Optimization with Lazy Communication for Collaborative Geometric Estimation	Yulun Tian et.al.	2203.00851	null
2022-02-18	MultiRes-NetVLAD: Augmenting Place Recognition Training with Low-Resolution Imagery	Ahmad Khaliq et.al.	2202.09146	link
2022-01-20	GeoFill: Reference-Based Image Inpainting of Scenes with Complex Geometry	Yunhan Zhao et.al.	2201.08131	null
2022-01-13	Scalable Cluster-Consistency Statistics for Robust Multi-Object Matching	Yunpeng Shi et.al.	2201.04797	link
2022-01-10	High-resolution Ecosystem Mapping in Repetitive Environments Using Dual Camera SLAM	Brian M. Hopkinson et.al.	2201.03364	link
2022-01-06	De-rendering 3D Objects in the Wild	Felix Wimbauer et.al.	2201.02279	link
2021-12-29	On the Instability of Relative Pose Estimation and RANSAC's Role	Hongyi Fan et.al.	2112.14651	null
2021-12-16	Road-aware Monocular Structure from Motion and Homography Estimation	Wei Sui et.al.	2112.08635	null
2021-12-10	Critical configurations for three projective views	Martin Bråtelund et.al.	2112.05478	null
2021-12-09	Critical configurations for two projective views, a new approach	Martin Bråtelund et.al.	2112.05074	null
2021-12-06	Dense Depth Priors for Neural Radiance Fields from Sparse Input Views	Barbara Roessle et.al.	2112.03288	link
2021-12-10	MegBA: A High-Performance and Distributed Library for Large-Scale Bundle Adjustment	Jie Ren et.al.	2112.01349	link
2021-11-11	Multi-Resolution Elevation Mapping and Safe Landing Site Detection with Applications to Planetary Rotorcraft	Pascal Schoppmann et.al.	2111.06271	null
2021-11-10	Damage Estimation and Localization from Sparse Aerial Imagery	Rene Garcia Franceschini et.al.	2111.03708	null
2021-11-03	Event and Activity Recognition in Video Surveillance for Cyber-Physical Systems	Swarnabja Bhaumik et.al.	2111.02064	null
2021-10-14	Modeling dynamic target deformation in camera calibration	Annika Hagemann et.al.	2110.07322	null
2021-10-13	Hyperspectral 3D Mapping of Underwater Environments	Maxime Ferrera et.al.	2110.06571	null
2021-09-24	Automatic Map Update Using Dashcam Videos	Aziza Zhanabatyrova et.al.	2109.12131	null
2021-09-16	Rotation Averaging in a Split Second: A Primal-Dual Method and a Closed-Form for Cycle Graphs	Gabriel Moreira et.al.	2109.08046	link
2021-09-06	Single-Camera 3D Head Fitting for Mixed Reality Clinical Applications	Tejas Mane et.al.	2109.02740	null
2021-09-02	Dynamic Scene Novel View Synthesis via Deferred Spatio-temporal Consistency	Beatrix-Emőke Fülöp-Balogh et.al.	2109.01018	null
2021-09-01	On the Limits of Pseudo Ground Truth in Visual Camera Re-localisation	Eric Brachmann et.al.	2109.00524	link
2021-08-31	DensePose 3D: Lifting Canonical Surface Maps of Articulated Objects to the Third Dimension	Roman Shapovalov et.al.	2109.00033	null
2021-08-29	Solving Viewing Graph Optimization for Simultaneous Position and Rotation Registration	Seyed-Mahdi Nasiri et.al.	2108.12876	null
2021-08-23	Burst Imaging for Light-Constrained Structure-From-Motion	Ahalya Ravendran et.al.	2108.09895	null

(back to top)

Visual Localization

Publish Date	Title	Authors	PDF	Code
2025-12-04	ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning	Shengyuan Ding et.al.	2512.05111	null
2025-12-04	Visual Reasoning Tracer: Object-Level Grounded Reasoning Benchmark	Haobo Yuan et.al.	2512.05091	null
2025-12-04	Semantic-Guided Two-Stage GAN for Face Inpainting with Hybrid Perceptual Encoding	Abhigyan Bhattacharya et.al.	2512.05039	null
2025-12-04	Revealing stimulus-dependent dynamics through statistical complexity	Edson V. de Paula et.al.	2512.05007	null
2025-12-04	Influence of Object Affordance on Action Language Understanding: Evidence from Dynamic Causal Modeling Analysis	Supriya Bordoloi et.al.	2512.04989	null
2025-12-04	LiteVGGT: Boosting Vanilla VGGT via Geometry-aware Cached Token Merging	Zhijian Shu et.al.	2512.04939	null
2025-12-04	Terahertz Fourier Ptychographic Imaging	Pitambar Mukherjee et.al.	2512.04783	null
2025-12-04	TEMPO-VINE: A Multi-Temporal Sensor Fusion Dataset for Localization and Mapping in Vineyards	Mauro Martini et.al.	2512.04772	null
2025-12-04	MemLoRA: Distilling Expert Adapters for On-Device Memory Systems	Massimo Bini et.al.	2512.04763	null
2025-12-04	Spectral micro-CT for quantitative analysis of calcification in fibrocartilage	Vittoria Mazzini et.al.	2512.04662	null
2025-11-26	Attention-Guided Patch-Wise Sparse Adversarial Attacks on Vision-Language-Action Models	Naifu Zhang et.al.	2511.21663	null
2025-11-26	Fast 3D Ultrasound Localization Microscopy via Projection-based Processing Framework	Jingke Zhang et.al.	2511.21647	null
2025-11-26	Qwen3-VL Technical Report	Shuai Bai et.al.	2511.21631	null
2025-11-26	Harmony: Harmonizing Audio and Video Generation through Cross-Task Synergy	Teng Hu et.al.	2511.21579	null
2025-11-26	FITRep: Attention-Guided Item Representation via MLLMs	Guoxiao Zhang et.al.	2511.21389	null
2025-11-26	Thinking With Bounding Boxes: Enhancing Spatio-Temporal Video Grounding via Reinforcement Fine-Tuning	Xin Gu et.al.	2511.21375	null
2025-11-26	HTTM: Head-wise Temporal Token Merging for Faster VGGT	Weitian Wang et.al.	2511.21317	null
2025-11-26	Low-dose Chemically Specific Bioimaging via Deep-UV Lensless Holographic Microscopy on a Standard Camera	Piotr Arcab et.al.	2511.21311	null
2025-11-26	Adaptive Lighting Control in Visible Light Systems: An Integrated Sensing, Communication, and Illumination Framework	Xinyan Xie et.al.	2511.21271	null
2025-11-26	Towards an Effective Action-Region Tracking Framework for Fine-grained Video Action Recognition	Baoli Sun et.al.	2511.21202	null
2025-11-24	Wigner and Gabor phase-space analysis of propagators for evolution equations	Elena Cordero et.al.	2511.19400	null
2025-11-24	Real-Time Object Tracking with On-Device Deep Learning for Adaptive Beamforming in Dynamic Acoustic Environments	Jorge Ortigoso-Narro et.al.	2511.19396	null
2025-11-24	In-vivo imaging with a low-cost MRI scanner and cloud data processing in low-resource settings	Teresa Guallart-Naval et.al.	2511.19226	null
2025-11-24	Can Modern Vision Models Understand the Difference Between an Object and a Look-alike?	Itay Cohen et.al.	2511.19200	null
2025-11-24	From Pixels to Posts: Retrieval-Augmented Fashion Captioning and Hashtag Generation	Moazzam Umer Gondal et.al.	2511.19149	null
2025-11-24	Graph-based 3D Human Pose Estimation using WiFi Signals	Jichao Chen et.al.	2511.19105	null
2025-11-24	Towards Generalizable Deepfake Detection via Forgery-aware Audio-Visual Adaptation: A Variational Bayesian Approach	Fan Nie et.al.	2511.19080	null
2025-11-24	LAA3D: A Benchmark of Detecting and Tracking Low-Altitude Aircraft in 3D Space	Hai Wu et.al.	2511.19057	null
2025-11-24	Multi-Agent Monocular Dense SLAM With 3D Reconstruction Priors	Haihang Wu et.al.	2511.19031	null
2025-11-24	Dynamic Granularity Matters: Rethinking Vision Transformers Beyond Fixed Patch Splitting	Qiyang Yu et.al.	2511.19021	null
2025-11-24	AuViRe: Audio-visual Speech Representation Reconstruction for Deepfake Temporal Localization	Christos Koutlis et.al.	2511.18993	null
2025-11-24	Zero-shot segmentation of skin tumors in whole-slide images with vision-language foundation models	Santiago Moreno et.al.	2511.18978	null
2025-11-24	MagicWorld: Interactive Geometry-driven Video World Exploration	Guangyuan Li et.al.	2511.18886	null
2025-11-24	SP-VINS: A Hybrid Stereo Visual Inertial Navigation System based on Implicit Environmental Map	Xueyu Du et.al.	2511.18756	null
2025-11-24	Seeing What Matters: Visual Preference Policy Optimization for Visual Generation	Ziqi Ni et.al.	2511.18719	null
2025-11-24	CNN-Based Camera Pose Estimation and Localisation of Scan Images for Aircraft Visual Inspection	Xueyan Oh et.al.	2511.18702	null
2025-11-24	Stable Multi-Drone GNSS Tracking System for Marine Robots	Shuo Wen et.al.	2511.18694	null
2025-11-23	Shape-Adapting Gated Experts: Dynamic Expert Routing for Colonoscopic Lesion Segmentation	Gia Huy Thai et.al.	2511.18493	null
2025-11-23	Gaze Beyond the Frame: Forecasting Egocentric 3D Visual Span	Heeseung Yun et.al.	2511.18470	null
2025-11-23	LungX: A Hybrid EfficientNet-Vision Transformer Architecture with Multi-Scale Attention for Accurate Pneumonia Detection	Mansur Yerzhanuly et.al.	2511.18425	null
2025-11-23	4D-VGGT: A General Foundation Model with SpatioTemporal Awareness for Dynamic Scene Geometry Estimation	Haonan Wang et.al.	2511.18416	null
2025-11-23	NSTR: Neural Spectral Transport Representation for Space-Varying Frequency Fields	Plein Versace et.al.	2511.18384	null
2025-11-23	Learning Visually Interpretable Oscillator Networks for Soft Continuum Robots from Video	Henrik Krauss et.al.	2511.18322	null
2025-11-23	Table Comprehension in Building Codes using Vision Language Models and Domain-Specific Fine-Tuning	Mohammad Aqib et.al.	2511.18306	null
2025-11-23	AIA-UltraNeRF:Acoustic-Impedance-Aware Neural Radiance Field with Hash Encodings for Robotic Ultrasound Reconstruction and Localization	Shuai Zhang et.al.	2511.18293	null
2025-11-23	SwiftVGGT: A Scalable Visual Geometry Grounded Transformer for Large-Scale Scenes	Jungho Lee et.al.	2511.18290	null
2025-11-22	AFT: Appearance-Based Feature Tracking for Markerless and Training-Free Shape Reconstruction of Soft Robots	Shangyuan Yuan et.al.	2511.18215	null
2025-11-22	ProHD: Projection-Based Hausdorff Distance Approximation	Jiuzhou Fu et.al.	2511.18207	null
2025-11-22	ARIAL: An Agentic Framework for Document VQA with Precise Answer Localization	Ahmad Mohammadshirazi et.al.	2511.18192	null
2025-11-22	Bias Is a Subspace, Not a Coordinate: A Geometric Rethinking of Post-hoc Debiasing in Vision-Language Models	Dachuan Zhao et.al.	2511.18123	null
2025-11-22	PromptMoE: Generalizable Zero-Shot Anomaly Detection via Visually-Guided Prompt Mixtures	Yuheng Shao et.al.	2511.18116	null
2025-11-22	Spotlight: Identifying and Localizing Video Generation Errors Using VLMs	Aditya Chinchure et.al.	2511.18102	null
2025-11-22	VK-Det: Visual Knowledge Guided Prototype Learning for Open-Vocabulary Aerial Object Detection	Jianhang Yao et.al.	2511.18075	null
2025-11-22	HyM-UNet: Synergizing Local Texture and Global Context via Hybrid CNN-Mamba Architecture for Medical Image Segmentation	Haodong Chen et.al.	2511.17988	null
2025-11-22	Signal: Selective Interaction and Global-local Alignment for Multi-Modal Object Re-Identification	Yangyang Liu et.al.	2511.17965	null
2025-11-22	MambaTAD: When State-Space Models Meet Long-Range Temporal Action Detection	Hui Lu et.al.	2511.17929	null
2025-11-22	MGA-VQA: Secure and Interpretable Graph-Augmented Visual Question Answering with Memory-Guided Protection Against Unauthorized Knowledge Use	Ahmad Mohammadshirazi et.al.	2511.17881	null
2025-11-21	AEGIS: Preserving privacy of 3D Facial Avatars with Adversarial Perturbations	Dawid Wolkiewicz et.al.	2511.17747	null
2025-11-21	Dual-Path Knowledge-Augmented Contrastive Alignment Network for Spatially Resolved Transcriptomics	Wei Zhang et.al.	2511.17685	null
2025-11-18	Unified Low-Light Traffic Image Enhancement via Multi-Stage Illumination Recovery and Adaptive Noise Suppression	Siddiqua Namrah et.al.	2511.17612	null
2025-11-18	3D Ground Truth Reconstruction from Multi-Camera Annotations Using UKF	Linh Van Ma et.al.	2511.17609	null
2025-11-21	REMSA: An LLM Agent for Foundation Model Selection in Remote Sensing	Binger Chen et.al.	2511.17442	null
2025-11-21	IndustryNav: Exploring Spatial Reasoning of Embodied Agents in Dynamic Industrial Navigation	Yifan Li et.al.	2511.17384	null
2025-11-21	SVRecon: Sparse Voxel Rasterization for Surface Reconstruction	Seunghun Oh et.al.	2511.17364	null
2025-11-21	NoPe-NeRF++: Local-to-Global Optimization of NeRF with No Pose Prior	Dongbo Shi et.al.	2511.17322	null
2025-11-21	MolSight: Optical Chemical Structure Recognition with SMILES Pretraining, Multi-Granularity Learning and Reinforcement Learning	Wenrui Zhang et.al.	2511.17300	null
2025-11-21	Where Culture Fades: Revealing the Cultural Gap in Text-to-Image Generation	Chuancheng Shi et.al.	2511.17282	null
2025-11-21	A Little More Like This: Text-to-Image Retrieval with Vision-Language Models Using Relevance Feedback	Bulat Khaertdinov et.al.	2511.17255	null
2025-11-21	Mixed Reality Scenic Live Streaming for Cultural Heritage: Visual Interactions in a Historic Landscape	Zeyu Huang et.al.	2511.17246	null
2025-11-21	SING3R-SLAM: Submap-based Indoor Monocular Gaussian SLAM with 3D Reconstruction Priors	Kunyi Li et.al.	2511.17207	null
2025-11-21	Navigating in the Dark: A Multimodal Framework and Dataset for Nighttime Traffic Sign Recognition	Aditya Mishra et.al.	2511.17183	null
2025-11-21	Reflection-Based Relative Localization for Cooperative UAV Teams Using Active Markers	Tim Lakemann et.al.	2511.17166	null
2025-11-21	Progress-Think: Semantic Progress Reasoning for Vision-Language Navigation	Shuo Wang et.al.	2511.17097	null
2025-11-21	Spanning Tree Autoregressive Visual Generation	Sangkyu Lee et.al.	2511.17089	null
2025-11-24	ReBrain: Brain MRI Reconstruction from Sparse CT Slice via Retrieval-Augmented Diffusion	Junming Liu et.al.	2511.17068	null
2025-11-21	Stable Offline Hand-Eye Calibration for any Robot with Just One Mark	Sicheng Xie et.al.	2511.17001	null
2025-11-21	VLM-Augmented Degradation Modeling for Image Restoration Under Adverse Weather Conditions	Qianyi Shao et.al.	2511.16998	null
2025-11-21	DReX: Pure Vision Fusion of Self-Supervised and Convolutional Representations for Image Complexity Prediction	Jonathan Skaza et.al.	2511.16991	null
2025-11-21	The Finer the Better: Towards Granular-aware Open-set Domain Generalization	Yunyun Wang et.al.	2511.16979	null
2025-11-21	Single-Axis Ptychographic Coherent Diffractive Imaging for Spectroscopic and Wavefront Retrieval	Qijun You et.al.	2511.16950	null
2025-11-20	SAM 3: Segment Anything with Concepts	Nicolas Carion et.al.	2511.16719	null
2025-11-24	PairHuman: A High-Fidelity Photographic Dataset for Customized Dual-Person Generation	Ting Pan et.al.	2511.16712	null
2025-11-20	Thinking-while-Generating: Interleaving Textual Reasoning throughout Visual Generation	Ziyu Guo et.al.	2511.16671	null
2025-11-23	Comparison of Text-Based and Image-Based Retrieval in Multimodal Retrieval Augmented Generation Large Language Model Systems	Elias Lumer et.al.	2511.16654	null
2025-11-20	SurvAgent: Hierarchical CoT-Enhanced Case Banking and Dichotomy-Based Multi-Agent System for Multimodal Survival Prediction	Guolin Huang et.al.	2511.16635	null
2025-11-21	POMA-3D: The Point Map Way to 3D Scene Understanding	Ye Mao et.al.	2511.16567	null
2025-11-20	NutriScreener: Retrieval-Augmented Multi-Pose Graph Attention Network for Malnourishment Screening	Misaal Khan et.al.	2511.16566	null
2025-11-20	Contrastive vision-language learning with paraphrasing and negation	Kwun Ho Ngan et.al.	2511.16527	null
2025-11-20	BoxingVI: A Multi-Modal Benchmark for Boxing Action Recognition and Localization	Rahul Kumar et.al.	2511.16524	null
2025-11-20	YOWO: You Only Walk Once to Jointly Map An Indoor Scene and Register Ceiling-mounted Cameras	Fan Yang et.al.	2511.16521	null
2025-11-20	TOFA: Training-Free One-Shot Federated Adaptation for Vision-Language Models	Li Zhang et.al.	2511.16423	null
2025-11-20	CRISTAL: Real-time Camera Registration in Static LiDAR Scans using Neural Rendering	Joni Vanherck et.al.	2511.16349	null
2025-11-20	Real-Time Inference for Distributed Multimodal Systems under Communication Delay Uncertainty	Victor Croisfelt et.al.	2511.16225	null
2025-11-20	Rad-GS: Radar-Vision Integration for 3D Gaussian Splatting SLAM in Outdoor Environments	Renxiang Xiao et.al.	2511.16091	null
2025-11-20	AMS-KV: Adaptive KV Caching in Multi-Scale Visual Autoregressive Transformers	Boxun Xu et.al.	2511.16047	null
2025-11-19	EfficientSAM3: Progressive Hierarchical Distillation for Video Concept Segmentation from SAM1, 2, and 3	Chengxi Zeng et.al.	2511.15833	null
2025-11-19	IMACT-CXR - An Interactive Multi-Agent Conversational Tutoring System for Chest X-Ray Interpretation	Tuan-Anh Le et.al.	2511.15825	null
2025-11-19	Multidimensional scaling of two-mode three-way asymmetric dissimilarities: finding archetypal profiles and clustering	Aleix Alcacer et.al.	2511.15813	null
2025-11-19	GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization	Yikun Wang et.al.	2511.15705	null
2025-11-19	First Frame Is the Place to Go for Video Content Customization	Jingxi Chen et.al.	2511.15700	null
2025-11-19	Hierarchical Semantic Tree Anchoring for CLIP-Based Class-Incremental Learning	Tao Hu et.al.	2511.15633	null
2025-11-19	Multi-Text Guided Few-Shot Semantic Segmentation	Qiang Jiao et.al.	2511.15515	null
2025-11-19	SIGMMA: Hierarchical Graph-Based Multi-Scale Multi-modal Contrastive Alignment of Histopathology Image and Spatial Transcriptome	Dabin Jeong et.al.	2511.15464	null
2025-11-19	HV-Attack: Hierarchical Visual Attack for Multimodal Retrieval Augmented Generation	Linyin Luo et.al.	2511.15435	null
2025-11-19	The Empowerment of Science of Science by Large Language Models: New Tools and Methods	Guoqiang Liang et.al.	2511.15370	null
2025-11-19	C2F-Space: Coarse-to-Fine Space Grounding for Spatial Instructions using Vision-Language Models	Nayoung Oh et.al.	2511.15333	null
2025-11-19	Towards Unbiased Cross-Modal Representation Learning for Food Image-to-Recipe Retrieval	Qing Wang et.al.	2511.15201	null
2025-11-19	Unbiased Semantic Decoding with Vision Foundation Models for Few-shot Segmentation	Jin Wang et.al.	2511.15118	null
2025-11-19	BBox DocVQA: A Large Scale Bounding Box Grounded Dataset for Enhancing Reasoning in Document Visual Question Answer	Wenhan Yu et.al.	2511.15090	null
2025-11-18	FarSLIP: Discovering Effective CLIP Adaptation for Fine-Grained Remote Sensing Understanding	Zhenshi Li et.al.	2511.14901	null
2025-11-18	Quantum Transport Spectroscopy of Pseudomagnetic Field in Graphene	Divya Sahani et.al.	2511.14888	null
2025-09-16	Image-Seeking Intent Prediction for Cross-Device Product Search	Mariya Hendriksen et.al.	2511.14764	null
2025-11-18	FreeSwim: Revisiting Sliding-Window Attention Mechanisms for Training-Free Ultra-High-Resolution Video Generation	Yunfeng Wu et.al.	2511.14712	null
2025-11-18	Overcoming global sensitivity limitations: using active subspaces to explore discrepancies between global and local parameter sensitivities	Huiyan Zou et.al.	2511.14687	null
2025-11-18	A Specialized Large Language Model for Clinical Reasoning and Diagnosis in Rare Diseases	Tao Yang et.al.	2511.14638	null
2025-11-18	Mind the Gaps: Measuring Visual Artifacts in Dimensionality Reduction	Jaume Ros et.al.	2511.14544	null
2025-11-18	D-PerceptCT: Deep Perceptual Enhancement for Low-Dose CT Images	Taifour Yousra Nabila et.al.	2511.14518	null
2025-11-18	Aerial Assistance System for Automated Firefighting during Turntable Ladder Operations	Jan Quenzel et.al.	2511.14504	null
2025-11-18	DIR-TIR: Dialog-Iterative Refinement for Text-to-Image Retrieval	Zongwei Zhen et.al.	2511.14449	null
2025-11-18	Agentic Video Intelligence: A Flexible Framework for Advanced Video Exploration and Understanding	Hong Gao et.al.	2511.14446	null
2025-11-19	Cheating Stereo Matching in Full-scale: Physical Adversarial Attack against Binocular Depth Estimation in Autonomous Driving	Kangqiao Zhao et.al.	2511.14386	null
2025-11-18	O3SLM: Open Weight, Open Data, and Open Vocabulary Sketch-Language Model	Rishi Gupta et.al.	2511.14368	null
2025-11-23	Simultaneous Localization and 3D-Semi Dense Mapping for Micro Drones Using Monocular Camera and Inertial Sensors	Jeryes Danial et.al.	2511.14335	null
2025-11-18	Statistically controllable microstructure reconstruction framework for heterogeneous materials using sliced-Wasserstein metric and neural networks	Zhenchuan Ma et.al.	2511.14268	null
2025-11-18	EBind: a practical approach to space binding	Jim Broadbent et.al.	2511.14229	null
2025-11-18	LLM-Aligned Geographic Item Tokenization for Local-Life Recommendation	Hao Jiang et.al.	2511.14221	null
2025-11-19	Orion: A Unified Visual Agent for Multimodal Perception, Advanced Visual Reasoning and Execution	N Dinesh Reddy et.al.	2511.14210	null
2025-11-19	PAVE: An End-to-End Dataset for Production Autonomous Vehicle Evaluation	Xiangyu Li et.al.	2511.14185	null
2025-11-18	SMART: Shot-Aware Multimodal Video Moment Retrieval with Audio-Enhanced MLLM	An Yu et.al.	2511.14143	null
2025-11-18	$A^2$GC: $A$symmetric $A$ ggregation with Geometric Constraints for Locally Aggregated Descriptors	Zhenyu Li et.al.	2511.14109	null
2025-11-18	SMGeo: Cross-View Object Geo-Localization with Grid-Level Mixture-of-Experts	Fan Zhang et.al.	2511.14093	null
2025-11-18	HiEAG: Evidence-Augmented Generation for Out-of-Context Misinformation Detection	Junjie Wu et.al.	2511.14027	null
2025-11-17	EchoAgent: Guideline-Centric Reasoning Agent for Echocardiography Measurement and Interpretation	Matin Daghyani et.al.	2511.13948	null
2025-11-17	Start Small, Think Big: Curriculum-based Relative Policy Optimization for Visual Grounding	Qingyang Yan et.al.	2511.13924	null
2025-11-17	GRLoc: Geometric Representation Regression for Visual Localization	Changyang Li et.al.	2511.13864	null
2025-11-17	Hierarchical Prompt Learning for Image- and Text-Based Person Re-Identification	Linhan Zhou et.al.	2511.13575	null
2025-11-17	Language-Guided Invariance Probing of Vision-Language Models	Jae Joong Lee et.al.	2511.13494	null
2025-11-17	Attention Grounded Enhancement for Visual Document Retrieval	Wanqing Cui et.al.	2511.13415	null
2025-11-17	Stray Light Correction for the Helioseismic and Magnetic Imager	A. A. Norton et.al.	2511.13348	null
2025-11-17	Uncovering and Mitigating Transient Blindness in Multimodal Model Editing	Xiaoqi Han et.al.	2511.13243	null
2025-11-17	GaRLILEO: Gravity-aligned Radar-Leg-Inertial Enhanced Odometry	Chiyun Noh et.al.	2511.13216	null
2025-11-17	Large Language Models Meet Extreme Multi-label Classification: Scaling and Multi-modal Framework	Diego Ortego et.al.	2511.13189	null
2025-11-17	THIR: Topological Histopathological Image Retrieval	Zahra Tabatabaei et.al.	2511.13170	null
2025-11-17	SOMA: Feature Gradient Enhanced Affine-Flow Matching for SAR-Optical Registration	Haodong Wang et.al.	2511.13168	null
2025-11-17	MM-Telco: Benchmarks and Multimodal Large Language Models for Telecom Applications	Gagan Raj Gupta et.al.	2511.13131	null
2025-11-17	Region-Point Joint Representation for Effective Trajectory Similarity Learning	Hao Long et.al.	2511.13125	null
2025-11-17	Semantics and Content Matter: Towards Multi-Prior Hierarchical Mamba for Image Deraining	Zhaocheng Yu et.al.	2511.13113	null
2025-11-17	uCLIP: Parameter-Efficient Multilingual Extension of Vision-Language Models with Unpaired Data	Dahyun Chung et.al.	2511.13036	null
2025-11-17	Angular Gradient Sign Method: Uncovering Vulnerabilities in Hyperbolic Networks	Minsoo Jo et.al.	2511.12985	null
2025-11-17	MCAQ-YOLO: Morphological Complexity-Aware Quantization for Efficient Object Detection with Curriculum Learning	Yoonjae Seo et.al.	2511.12976	null
2025-11-16	Enhancing Neuro-Oncology Through Self-Assessing Deep Learning Models for Brain Tumor Unified Model for MRI Segmentation	Andrew Zhou et.al.	2511.12801	null
2025-11-16	Predicting upcoming visual features during eye movements yields scene representations aligned with human visual cortex	Sushrut Thorat et.al.	2511.12715	null
2025-11-16	FLClear: Visually Verifiable Multi-Client Watermarking for Federated Learning	Chen Gu et.al.	2511.12663	null
2025-11-16	D $^{2}$ -VPR: A Parameter-efficient Visual-foundation-model-based Visual Place Recognition Method via Knowledge Distillation and Deformable Aggregation	Zheyuan Zhang et.al.	2511.12528	null
2025-11-16	Visible Structure Retrieval for Lightweight Image-Based Relocalisation	Fereidoon Zangeneh et.al.	2511.12503	null
2025-11-16	CoTBox-TTT: Grounding Medical VQA with Visual Chain-of-Thought Boxes During Test-time Training	Jiahe Qian et.al.	2511.12446	null
2025-11-15	Calibrated Decomposition of Aleatoric and Epistemic Uncertainty in Deep Features for Inference-Time Adaptation	Divake Kumar et.al.	2511.12389	null
2025-11-15	SpaceVLM: Sub-Space Modeling of Negation in Vision-Language Models	Sepehr Kazemi Ranjbar et.al.	2511.12331	null
2025-11-15	A Disease-Aware Dual-Stage Framework for Chest X-ray Report Generation	Puzhen Wu et.al.	2511.12259	null
2025-11-21	Model Inversion Attack Against Deep Hashing	Dongdong Zhao et.al.	2511.12233	null
2025-11-15	FaNe: Towards Fine-Grained Cross-Modal Contrast with False-Negative Reduction and Text-Conditioned Sparse Attention	Peng Zhang et.al.	2511.12215	null
2025-11-18	OmniSparse: Training-Aware Fine-Grained Sparse Attention for Long-Video MLLMs	Feng Chen et.al.	2511.12201	null
2025-11-15	MAVIS: A Benchmark for Multimodal Source Attribution in Long-form Visual Question Answering	Seokwon Song et.al.	2511.12142	null
2025-11-15	Look As You Think: Unifying Reasoning and Visual Evidence Attribution for Verifiable Document RAG via Reinforcement Learning	Shuochen Liu et.al.	2511.12003	null
2025-11-21	Seeing the Forest and the Trees: Query-Aware Tokenizer for Long-Video Multimodal Language Models	Siyou Li et.al.	2511.11910	null
2025-11-14	TopoPerception: A Shortcut-Free Evaluation of Global Visual Perception in Large Vision-Language Models	Wenhao Zhou et.al.	2511.11831	null
2025-11-14	Lessons Learned from Developing a Privacy-Preserving Multimodal Wearable for Local Voice-and-Vision Inference	Yonatan Tussa et.al.	2511.11811	null
2025-11-12	Task-Aware 3D Affordance Segmentation via 2D Guidance and Geometric Refinement	Lian He et.al.	2511.11702	null
2025-11-12	Doubly Debiased Test-Time Prompt Tuning for Vision-Language Models	Fei Song et.al.	2511.11690	null
2025-11-10	A Deep Learning Model to Predicting Changes in Consumer Attributes for New Line-extended Products	Li Yinxing et.al.	2511.11646	null
2025-11-14	DocLens : A Tool-Augmented Multi-Agent Framework for Long Visual Document Understanding	Dawei Zhu et.al.	2511.11552	null
2025-11-14	STEM EBIC as a Quantitative Probe of Semiconductor Devices	Sebastian Schneider et.al.	2511.11528	null
2025-11-14	Bridging Hidden States in Vision-Language Models	Benjamin Fein-Ashley et.al.	2511.11526	null
2025-11-14	Comprehension of Multilingual Expressions Referring to Target Objects in Visual Inputs	Francisco Nogueira et.al.	2511.11427	null
2025-11-14	Shrinking the Teacher: An Adaptive Teaching Paradigm for Asymmetric EEG-Vision Alignment	Lukun Wu et.al.	2511.11422	null
2025-11-14	Bidimensional measurements of photon statistics within a multimodal temporal framework	C. Hainaut et.al.	2511.11403	null
2025-11-18	GRANITE: High-Resolution Imaging and Electrical Qualification of Large-Area TPC Electrodes	Shumit A. Mitra et.al.	2511.11401	null
2025-11-14	StochEP: Stochastic Equilibrium Propagation for Spiking Convergent Recurrent Neural Networks	Jiaqi Lin et.al.	2511.11320	null
2025-11-21	DocSLM: A Small Vision-Language Model for Long Multimodal Document Understanding	Tanveer Hannan et.al.	2511.11313	null
2025-11-18	MOON Embedding: Multimodal Representation Learning for E-commerce Search Advertising	Chenghan Fu et.al.	2511.11305	null
2025-11-14	3D Stokes polarimetric imaging at nanoscales	Isael Herrera et.al.	2511.11222	null
2025-11-14	Positional Bias in Multimodal Embedding Models: Do They Favor the Beginning, the Middle, or the End?	Kebin Wu et.al.	2511.11216	null
2025-11-21	Draft and Refine with Visual Experts	Sungheon Jeong et.al.	2511.11005	null
2025-11-14	ERMoE: Eigen-Reparameterized Mixture-of-Experts for Stable Routing and Interpretable Specialization	Anzhe Cheng et.al.	2511.10971	null
2025-11-13	From Attention to Frequency: Integration of Vision Transformer and FFT-ReLU for Enhanced Image Deblurring	Syed Mumtahin Mahmud et.al.	2511.10806	null
2025-11-13	Semantic Property Maps for Driving Applications	Marcus Greiff et.al.	2511.10798	null
2025-11-13	Fast Data Attribution for Text-to-Image Models	Sheng-Yu Wang et.al.	2511.10721	null
2025-11-18	CARScenes: Semantic VLM Dataset for Safe Autonomous Driving	Yuankai He et.al.	2511.10701	null
2025-11-12	DualVision ArthroNav: Investigating Opportunities to Enhance Localization and Reconstruction in Image-based Arthroscopy Navigation via External Cameras	Hongchao Shu et.al.	2511.10699	null
2025-11-12	$π$ -Attention: Periodic Sparse Transformers for Efficient Long-Context Modeling	Dong Liu et.al.	2511.10696	null
2025-11-13	Mined Prompting and Metadata-Guided Generation for Wound Care Visual Question Answering	Bavana Durgapraveen et.al.	2511.10591	null
2025-11-13	SemanticVLA: Semantic-Aligned Sparsification and Enhancement for Efficient Robotic Manipulation	Wei Li et.al.	2511.10518	null
2025-11-13	Domain Adaptation for Camera-Specific Image Characteristics using Shallow Discriminators	Maximiliane Gruber et.al.	2511.10424	null
2025-11-16	MonkeyOCR v1.5 Technical Report: Unlocking Robust Document Parsing for Complex Patterns	Jiarui Zhang et.al.	2511.10390	null
2025-11-17	Physics informed Transformer-VAE for biophysical parameter estimation: PROSAIL model inversion in Sentinel-2 imagery	Prince Mensah et.al.	2511.10387	null
2025-11-13	Rethinking Visual Information Processing in Multimodal LLMs	Dongwan Kim et.al.	2511.10301	null
2025-11-13	H3Former: Hypergraph-based Semantic-Aware Aggregation via Hyperbolic Hierarchical Contrastive Loss for Fine-Grained Visual Classification	Yongji Zhang et.al.	2511.10260	null
2025-11-20	TubeRMC: Tube-conditioned Reconstruction with Mutual Constraints for Weakly-supervised Spatio-Temporal Video Grounding	Jinxuan Li et.al.	2511.10241	null
2025-11-13	Next-Frame Feature Prediction for Multimodal Deepfake Detection and Temporal Localization	Ashutosh Anshul et.al.	2511.10212	null
2025-11-13	Beyond the Black Box: Demystifying Multi-Turn LLM Reasoning with VISTA	Yiran Zhang et.al.	2511.10182	null
2025-11-13	GEA: Generation-Enhanced Alignment for Text-to-Image Person Retrieval	Hao Zou et.al.	2511.10154	null
2025-11-13	Explicit Temporal-Semantic Modeling for Dense Video Captioning via Context-Aware Cross-Modal Interaction	Mingda Jia et.al.	2511.10134	null
2025-11-13	GridPrune: From "Where to Look" to "What to Select" in Visual Token Pruning for MLLMs	Yuxiang Duan et.al.	2511.10081	null
2025-11-13	Radiology Workflow-Guided Hierarchical Reinforcement Fine-Tuning for Medical Report Generation	Bodong Du et.al.	2511.10065	null
2025-11-13	Trapped by Their Own Light: Deployable and Stealth Retroreflective Patch Attacks on Traffic Sign Recognition Systems	Go Tsuruoka et.al.	2511.10050	null
2025-11-13	AffordBot: 3D Fine-grained Embodied Reasoning via Multimodal Large Language Models	Xinyi Wang et.al.	2511.10017	null
2025-11-13	Learning phase diversity for solving ill-posed inverse problems in imaging	Jasleen Birdi et.al.	2511.09952	null
2025-11-13	MosaicDoc: A Large-Scale Bilingual Benchmark for Visually Rich Document Understanding	Ketong Chen et.al.	2511.09919	null
2025-11-12	From Street to Orbit: Training-Free Cross-View Retrieval via Location Semantics and LLM Guidance	Jeongho Min et.al.	2511.09820	null
2025-11-12	PALMS+: Modular Image-Based Floor Plan Localization Leveraging Depth Foundation Model	Yunqian Cheng et.al.	2511.09724	null
2025-11-12	SliderEdit: Continuous Image Editing with Fine-Grained Instruction Control	Arman Zarei et.al.	2511.09715	null
2025-11-12	IFG: Internet-Scale Guidance for Functional Grasping Generation	Ray Muxin Liu et.al.	2511.09558	null
2025-11-12	Warped Disk Galaxies: Statistical Properties from DESI Legacy Imaging Surveys DR8	Yiheng Wang et.al.	2511.09518	null
2025-11-12	A general framework for adaptive nonparametric dimensionality reduction	Antonio Di Noia et.al.	2511.09486	null
2025-11-12	BronchOpt : Vision-Based Pose Optimization with Fine-Tuned Foundation Models for Accurate Bronchoscopy Navigation	Hongchao Shu et.al.	2511.09443	null
2025-11-12	NeuroCLIP: Brain-Inspired Prompt Tuning for EEG-to-Image Multimodal Contrastive Learning	Jiyuan Wang et.al.	2511.09250	null
2025-11-12	SMF-VO: Direct Ego-Motion Estimation via Sparse Motion Fields	Sangheon Yang et.al.	2511.09072	null
2025-11-12	ROI-based Deep Image Compression with Implicit Bit Allocation	Kai Hu et.al.	2511.08918	null
2025-11-12	Negative Entity Suppression for Zero-Shot Captioning with Synthetic Images	Zimao Lu et.al.	2511.08909	null
2025-11-13	LLM-Guided Probabilistic Fusion for Label-Efficient Document Layout Analysis	Ibne Farabi Shihab et.al.	2511.08903	null
2025-11-11	SIFT-Graph: Benchmarking Multimodal Defense Against Image Adversarial Attacks With Robust Feature Graph	Jingjie He et.al.	2511.08810	null
2025-11-11	Decoupling Composition and Band Gap in $κ$-Ga$_2$O$_3$ Heterostructures via STEM-EELS	Annett Thøgersen et.al.	2511.08728	null
2025-11-11	Spatio-Temporal Cluster-Triggered Encoding for Spiking Neural Networks	Lingyun Ke et.al.	2511.08469	null
2025-11-11	Isolated massive star candidates in NGC 4242 with GULP	Pietro Facchini et.al.	2511.08447	null
2025-11-11	Text-based Aerial-Ground Person Retrieval	Xinyu Zhou et.al.	2511.08369	null
2025-11-11	VLMDiff: Leveraging Vision-Language Models for Multi-Class Anomaly Detection with Diffusion	Samet Hicsonmez et.al.	2511.08173	null
2025-11-11	Multi-Granularity Mutual Refinement Network for Zero-Shot Learning	Ning Wang et.al.	2511.08163	null
2025-11-11	Direction and speed selectivity properties for spatio-temporal receptive fields according to the generalized Gaussian derivative model for visual receptive fields	Tony Lindeberg et.al.	2511.08101	null
2025-11-11	Multi-modal Deepfake Detection and Localization with FPN-Transformer	Chende Zheng et.al.	2511.08031	null
2025-11-12	EAGLE: Episodic Appearance- and Geometry-aware Memory for Unified 2D-3D Visual Query Localization in Egocentric Vision	Yifei Cao et.al.	2511.08007	null
2025-11-11	Towards Fine-Grained Interpretability: Counterfactual Explanations for Misclassification with Saliency Partition	Lintong Zhang et.al.	2511.07974	null
2025-11-11	Exploring the Underwater World Segmentation without Extra Training	Bingyu Li et.al.	2511.07923	null
2025-11-11	Visual Bridge: Universal Visual Perception Representations Generating	Yilin Gao et.al.	2511.07877	null
2025-11-11	MonoCLUE : Object-Aware Clustering Enhances Monocular 3D Object Detection	Sunghun Yang et.al.	2511.07862	null
2025-11-11	Semantic-Consistent Bidirectional Contrastive Hashing for Noisy Multi-Label Cross-Modal Retrieval	Likang Peng et.al.	2511.07780	null
2025-11-14	Multistep Quasimetric Learning for Scalable Goal-conditioned Reinforcement Learning	Bill Chunyuan Zheng et.al.	2511.07730	null
2025-11-11	Operational machine learning for remote spectroscopic detection of CH $_{4}$ point sources	Vít Růžička et.al.	2511.07719	null
2025-11-19	Cross Modal Fine-Grained Alignment via Granularity-Aware and Region-Uncertain Modeling	Jiale Liu et.al.	2511.07710	null
2025-11-10	Designing and Evaluating Malinowski's Lens: An AI-Native Educational Game for Ethnographic Learning	Michael Hoffmann et.al.	2511.07682	null
2025-11-10	CAVER: Curious Audiovisual Exploring Robot	Luca Macesanu et.al.	2511.07619	null
2025-11-08	Multivariate Variational Autoencoder	Mehmet Can Yavuz et.al.	2511.07472	null
2025-11-20	AudAgent: Automated Auditing of Privacy Policy Compliance in AI Agents	Ye Zheng et.al.	2511.07441	null
2025-11-10	TwinOR: Photorealistic Digital Twins of Dynamic Operating Rooms for Embodied AI Research	Han Zhang et.al.	2511.07412	null
2025-11-10	YoNoSplat: You Only Need One Model for Feedforward 3D Gaussian Splatting	Botao Ye et.al.	2511.07321	null
2025-11-10	VADER: Towards Causal Video Anomaly Understanding with Relation-Aware Large Language Models	Ying Cheng et.al.	2511.07299	null
2025-11-10	Direct imaging of magnetotransport at graphene-metal interfaces with a single-spin quantum sensor	C. Ding et.al.	2511.07181	null
2025-11-10	LeCoT: revisiting network architecture for two-view correspondence pruning	Luanyuan Dai et.al.	2511.07078	null
2025-11-10	Integration of Visual SLAM into Consumer-Grade Automotive Localization	Luis Diener et.al.	2511.06919	null
2025-11-10	Mono3DVG-EnSD: Enhanced Spatial-aware and Dimension-decoupled Text Encoding for Monocular 3D Visual Grounding	Yuzhen Li et.al.	2511.06908	null
2025-11-10	NeuroBridge: Bio-Inspired Self-Supervised EEG-to-Image Decoding via Cognitive Priors and Bidirectional Semantic Alignment	Wenjiang Zhang et.al.	2511.06836	null
2025-11-10	Semi-distributed Cross-modal Air-Ground Relative Localization	Weining Lu et.al.	2511.06749	null
2025-11-10	AnoStyler: Text-Driven Localized Anomaly Generation via Lightweight Style Transfer	Yulim So et.al.	2511.06687	null
2025-11-10	HiMo-CLIP: Modeling Semantic Hierarchy and Monotonicity in Vision-Language Alignment	Ruijia Wu et.al.	2511.06653	null
2025-11-09	DiffusionUavLoc: Visually Prompted Diffusion for Cross-View UAV Localization	Tao Liu et.al.	2511.06422	null
2025-11-09	A generalization bound for exit wave reconstruction via deep unfolding	Moussa Atwi et.al.	2511.06413	null
2025-11-09	CINEMAE: Leveraging Frozen Masked Autoencoders for Cross-Generator AI Image Detection	Minsuk Jang et.al.	2511.06325	null
2025-11-09	ALIGN: A Vision-Language Framework for High-Accuracy Accident Location Inference through Geo-Spatial Neural Reasoning	MD Thamed Bin Zaman Chowdhury et.al.	2511.06316	null
2025-11-11	Robust Nearest Neighbour Retrieval Using Targeted Manifold Manipulation	B. Ghosh et.al.	2511.06261	null
2025-11-09	ExpReS-VLA: Specializing Vision-Language-Action Models Through Experience Replay and Retrieval	Shahram Najam Syed et.al.	2511.06202	null
2025-11-08	Real-Time Bundle Adjustment for Ultra-High-Resolution UAV Imagery Using Adaptive Patch-Based Feature Tracking	Selim Ahmet Iz et.al.	2511.06152	null
2025-11-11	When Object-Centric World Models Meet Policy Learning: From Pixels to Policies, and Where It Breaks	Stefano Ferraro et.al.	2511.06136	null
2025-11-08	Hybrid CNN-ViT Framework for Motion-Blurred Scene Text Restoration	Umar Rashid et.al.	2511.06087	null
2025-11-08	Visual Exploration of Feature Relationships in Sparse Autoencoders with Curated Concepts	Xinyuan Yan et.al.	2511.06048	null
2025-11-08	S2ML: Spatio-Spectral Mutual Learning for Depth Completion	Zihui Zhao et.al.	2511.06033	null
2025-11-08	Towards Implicit Aggregation: Robust Image Representation for Place Recognition in the Transformer Era	Feng Lu et.al.	2511.06024	null
2025-11-08	Dissecting the Perseus-Pisces supercluster observed with CFHT-MegaCam: Investigating environmental effects on galaxy morphology	M. Mondelin et.al.	2511.05925	null
2025-11-08	Open-World 3D Scene Graph Generation for Retrieval-Augmented Reasoning	Fei Yu et.al.	2511.05894	null
2025-11-08	HAPS Communication Networks: A Tutorial-cum-Survey on Integration with Optical Atmospheric Sensing	Ali Elkhazraji et.al.	2511.05877	null
2025-11-07	SARCH: Multimodal Search for Archaeological Archives	Nivedita Sinha et.al.	2511.05667	null
2025-11-05	Beyond Softmax: Dual-Branch Sigmoid Architecture for Accurate Class Activation Maps	Yoojin Oh et.al.	2511.05590	null
2025-11-07	Multi-modal Loop Closure Detection with Foundation Models in Severely Unstructured Environments	Laura Alejandra Encinar Gonzalez et.al.	2511.05404	null
2025-11-07	PreResQ-R1: Towards Fine-Grained Rank-and-Score Reinforcement Learning for Visual Quality Assessment via Preference-Response Disentangled Policy Optimization	Zehui Feng et.al.	2511.05393	null
2025-11-07	Turning Adversaries into Allies: Reversing Typographic Attacks for Multimodal E-Commerce Product Retrieval	Janet Jenq et.al.	2511.05325	null
2025-11-07	On the possibility of using decayless kink oscillations of coronal loops to forecast powerful solar flares and coronal mass ejections	A. B. Nechaeva et.al.	2511.05175	null
2025-11-07	Real-World Adverse Weather Image Restoration via Dual-Level Reinforcement Learning with High-Quality Cold Start	Fuyang Liu et.al.	2511.05095	null
2025-11-07	Dynamic Residual Encoding with Slide-Level Contrastive Learning for End-to-End Whole Slide Image Representation	Jing Jin et.al.	2511.05034	null
2025-11-07	DAFM: Dynamic Adaptive Fusion for Multi-Model Collaboration in Composed Image Retrieval	Yawei Cai et.al.	2511.05020	null
2025-11-07	Nuclear Ptychoscopy: A Ptychographic Framework for Nuclear Spectroscopy	Ziyang Yuan et.al.	2511.04924	null
2025-11-06	Learning to reason about rare diseases through retrieval-augmented agents	Ha Young Kim et.al.	2511.04720	null
2025-11-06	PixCLIP: Achieving Fine-grained Visual Language Understanding via Any-granularity Pixel-Text Alignment Learning	Yicheng Xiao et.al.	2511.04601	null
2025-11-06	Multi-Task Learning for Visually Grounded Reasoning in Gastrointestinal VQA	Itbaan Safwan et.al.	2511.04384	null
2025-11-06	High-Resolution Forest Mapping from L-Band Interferometric SAR Time Series using Deep Learning over Northern Spain	Chiara Telli et.al.	2511.04362	null
2025-11-06	Probing the Probes: Methods and Metrics for Concept Alignment	Jacob Lysnæs-Larsen et.al.	2511.04312	null
2025-11-06	DINOv2 Driven Gait Representation Learning for Video-Based Visible-Infrared Person Re-identification	Yujie Yang et.al.	2511.04281	null
2025-11-07	On the Brittleness of CLIP Text Encoders	Allie Tran et.al.	2511.04247	null
2025-11-06	An Efficient Algorithm for Learning-Based Visual Localization	Jindi Zhong et.al.	2511.04232	null
2025-11-06	GraspView: Active Perception Scoring and Best-View Optimization for Robotic Grasping in Cluttered Environments	Shenglin Wang et.al.	2511.04199	null
2025-11-06	Learning to Land Anywhere: Transferable Generative Models for Aircraft Trajectories	Olav Finne Praesteng Larsen et.al.	2511.04155	null
2025-11-06	Learning from Online Videos at Inference Time for Computer-Use Agents	Yujian Liu et.al.	2511.04137	null
2025-11-06	SpatialLock: Precise Spatial Control in Text-to-Image Synthesis	Biao Liu et.al.	2511.04112	null
2025-11-06	Caption Injection for Optimization in Generative Search Engine	Xiaolu Chen et.al.	2511.04080	null
2025-11-06	CaRF: Enhancing Multi-View Consistency in Referring 3D Gaussian Splatting Segmentation	Yuwen Tao et.al.	2511.03992	null
2025-11-05	SILVI: Simple Interface for Labeling Video Interactions	Ozan Kanbertay et.al.	2511.03819	null
2025-11-05	Expert Evaluation of LLM World Models: A High- $T_c$ Superconductivity Case Study	Haoyu Guo et.al.	2511.03782	null
2025-11-05	The OpenHands Software Agent SDK: A Composable and Extensible Foundation for Production Agents	Xingyao Wang et.al.	2511.03690	null
2025-11-10	Coherent Differential Imaging of high-contrast extended sources with VLT/SPHERE	Axel Potier et.al.	2511.03518	null
2025-11-05	Performance Evaluation of a Position-Sensitive SiPM-based Gamma Camera for Intraoperative Imaging	Aramis Raiola et.al.	2511.03493	null
2025-11-05	Lightwave Power Transfer-Enabled Underwater Optical ISAC Systems under Ship Attitude Variation	Kapila W. S. Palitharathna et.al.	2511.03366	null
2025-11-05	Accelerating Physical Property Reasoning for Augmented Visual Cognition	Hongbo Lan et.al.	2511.03126	null
2025-11-04	The Curved Spacetime of Transformer Architectures	Riccardo Di Sipio et.al.	2511.03060	null
2025-11-04	SLIP: Structural-aware Language-Image Pretraining for Vision-Language Alignment	Wenbo Lu et.al.	2511.03019	null
2025-11-04	Unsupervised Learning for Industrial Defect Detection: A Case Study on Shearographic Data	Jessica Plassmann et.al.	2511.02541	null
2025-11-04	Object Detection as an Optional Basis: A Graph Matching Network for Cross-View UAV Localization	Tao Liu et.al.	2511.02489	null
2025-11-04	LUMA-RAG: Lifelong Multimodal Agents with Provably Stable Streaming Alignment	Rohan Wandre et.al.	2511.02371	null
2025-11-04	Learning Spatial Awareness for Laparoscopic Surgery with AI Assisted Visual Feedback	Songyang Liu et.al.	2511.02233	null
2025-11-03	AlloyLens: A Visual Analytics Tool for High-throughput Alloy Screening and Inverse Design	Suyang Li et.al.	2511.02133	null
2025-11-10	Enhancing Multimodal Recommendations with Vision-Language Models and Information-Aware Fusion	Hai-Dang Kieu et.al.	2511.02113	null
2025-11-03	TurboMap: GPU-Accelerated Local Mapping for Visual SLAM	Parsa Hosseininejad et.al.	2511.02036	null
2025-11-03	Topological Expansion of Boehm's Brushes via Structured Light	Dmitry A. Pushin et.al.	2511.01841	null
2025-11-05	TIR-Bench: A Comprehensive Benchmark for Agentic Thinking-with-Images Reasoning	Ming Li et.al.	2511.01833	null
2025-11-03	3EED: Ground Everything Everywhere in 3D	Rong Li et.al.	2511.01755	null
2025-11-03	Progressive Translation of H&E to IHC with Enhanced Structural Fidelity	Yuhang Kang et.al.	2511.01698	null
2025-11-03	Vote-in-Context: Turning VLMs into Zero-Shot Rank Fusers	Mohamed Eltahir et.al.	2511.01617	null
2025-11-03	Wave-Particle (Continuous-Discrete) Dualistic Visual Tokenization for Unified Understanding and Generation	Yizhu Chen et.al.	2511.01593	null
2025-11-03	Floor Plan-Guided Visual Navigation Incorporating Depth and Directional Cues	Wei Huang et.al.	2511.01493	null
2025-11-03	UniSOT: A Unified Framework for Multi-Modality Single Object Tracking	Yinchao Ma et.al.	2511.01427	null
2025-11-03	Semantic BIM enrichment for firefighting assets: Fire-ART dataset and panoramic image-based 3D reconstruction	Ya Wen et.al.	2511.01399	null
2025-11-03	SEPS: Semantic-enhanced Patch Slimming Framework for fine-grained cross-modal alignment	Xinyu Mao et.al.	2511.01390	null
2025-11-03	MIQ-SAM3D: From Single-Point Prompt to Multi-Instance Segmentation via Competitive Query Refinement	Jierui Qu et.al.	2511.01345	null
2025-11-03	Direct Mapping of Intrinsic Topology of Bound States in the Continuum via Nonlinear Emission	Shuzheng Chen et.al.	2511.01337	null
2025-11-03	MotionStream: Real-Time Video Generation with Interactive Motion Controls	Joonghyuk Shin et.al.	2511.01266	null
2025-11-03	A Saddle Point Remedy: Power of Variable Elimination in Non-convex Optimization	Min Gan et.al.	2511.01234	null
2025-11-02	Efficient Test-Time Retrieval Augmented Generation	Hailong Yin et.al.	2511.01059	null
2025-11-02	Integrating Visual and X-Ray Machine Learning Features in the Study of Paintings by Goya	Hassan Ugail et.al.	2511.01000	null
2025-11-02	Dynamic Multi-level Weighted Alignment Network for Zero-shot Sketch-based Image Retrieval	Hanwen Su et.al.	2511.00925	null
2025-11-02	GraphGeo: Multi-Agent Debate Framework for Visual Geo-localization with Heterogeneous Graph Neural Networks	Heng Zheng et.al.	2511.00908	null
2025-11-02	Enhancing Adversarial Transferability in Visual-Language Pre-training Models via Local Shuffle and Sample-based Attack	Xin Liu et.al.	2511.00831	null
2025-11-01	Applying Medical Imaging Tractography Techniques to Painterly Rendering of Images	Alberto Di Biase et.al.	2511.00702	null
2025-11-01	Multi-Mapcher: Loop Closure Detection-Free Heterogeneous LiDAR Multi-Session SLAM Leveraging Outlier-Robust Registration for Autonomous Vehicles	Hyungtae Lim et.al.	2511.00635	null
2025-11-05	Text-guided Fine-Grained Video Anomaly Detection	Jihao Gu et.al.	2511.00524	null
2025-11-01	OmniTrack++: Omnidirectional Multi-Object Tracking by Learning Large-FoV Trajectory Feedback	Kai Luo et.al.	2511.00510	null
2025-11-09	VinDr-CXR-VQA: A Visual Question Answering Dataset for Explainable Chest X-Ray Analysis with Multi-Task Learning	Dang H. Nguyen et.al.	2511.00504	null
2025-11-01	FedMGP: Personalized Federated Learning with Multi-Group Text-Visual Prompts	Weihao Bo et.al.	2511.00480	null
2025-11-20	Weakly Supervised Pneumonia Localization from Chest X-Rays Using Deep Neural Network and Grad-CAM Explanations	Kiran Shahi et.al.	2511.00456	null
2025-11-01	ToxicTextCLIP: Text-Based Poisoning and Backdoor Attacks on CLIP Pre-training	Xin Yao et.al.	2511.00446	null
2025-11-01	Leveraging Hierarchical Image-Text Misalignment for Universal Fake Image Detection	Daichi Zhang et.al.	2511.00427	null
2025-11-01	VinciCoder: Unifying Multimodal Code Generation via Coarse-to-fine Visual Reinforcement Learning	Xuanle Zhao et.al.	2511.00391	null
2025-11-19	Spot The Ball: A Benchmark for Visual Social Inference	Neha Balamurugan et.al.	2511.00261	null
2025-10-31	Generative Modeling Enables Molecular Structure Retrieval from Coulomb Explosion Imaging	Xiang Li et.al.	2511.00179	null
2025-10-31	Integrating ConvNeXt and Vision Transformers for Enhancing Facial Age Estimation	Gaby Maroun et.al.	2511.00123	null
2025-11-03	Image Hashing via Cross-View Code Alignment in the Age of Foundation Models	Ilyass Moummad et.al.	2510.27584	null
2025-10-31	DP-FedPGN: Finding Global Flat Minima for Differentially Private Federated Learning via Penalizing Gradient Norm	Junkang Liu et.al.	2510.27504	null
2025-10-31	ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use	Mengjie Deng et.al.	2510.27363	null
2025-10-31	RzenEmbed: Towards Comprehensive Multimodal Retrieval	Weijian Jian et.al.	2510.27350	null
2025-11-24	FOCUS: Efficient Keyframe Selection for Long Video Understanding	Zirui Zhu et.al.	2510.27280	null
2025-10-31	Approximate Diverse $k$ -nearest Neighbor Search in Vector Database	Jiachen Zhao et.al.	2510.27243	null
2025-11-04	Dual-level Progressive Hardness-Aware Reweighting for Cross-View Geo-Localization	Guozheng Zheng et.al.	2510.27181	null
2025-10-31	M^3Detection: Multi-Frame Multi-Level Feature Fusion for Multi-Modal 3D Object Detection with Camera and 4D Imaging Radar	Xiaozhi Li et.al.	2510.27166	null
2025-10-31	AFM-Net: Advanced Fusing Hierarchical CNN Visual Priors with Global Sequence Modeling for Remote Sensing Image Scene Classification	Yuanhao Tang et.al.	2510.27155	null
2025-10-31	WildfireX-SLAM: A Large-scale Low-altitude RGB-D Dataset for Wildfire SLAM and Beyond	Zhicong Sun et.al.	2510.27133	null
2025-11-04	NaviTrace: Evaluating Embodied Navigation of Vision-Language Models	Tim Windecker et.al.	2510.26909	null
2025-10-30	Do Vision-Language Models Measure Up? Benchmarking Visual Measurement Reading with MeasureBench	Fenfen Lin et.al.	2510.26865	null
2025-11-03	Evaluating Perspectival Biases in Cross-Modal Retrieval	Teerapol Saengsukhiran et.al.	2510.26861	null
2025-10-29	Audio-Visual Speech Enhancement In Complex Scenarios With Separation And Dereverberation Joint Modeling	Jiarong Du et.al.	2510.26825	null
2025-10-30	Are Video Models Ready as Zero-Shot Reasoners? An Empirical Study with the MME-CoF Benchmark	Ziyu Guo et.al.	2510.26802	null
2025-10-30	Scaling Image Geo-Localization to Continent Level	Philipp Lindenberger et.al.	2510.26795	null
2025-11-03	ChartAB: A Benchmark for Chart Grounding & Dense Alignment	Aniruddh Bansal et.al.	2510.26781	null
2025-10-30	STaMP: Sequence Transformation and Mixed Precision for Low-Precision Activation Quantization	Marco Federici et.al.	2510.26771	null
2025-10-30	Fire Behavior Monitoring using MeteoSat Third Generation, FCI-FireDyn algorithm: Rate Of Spread and Burnt Area Dynamics for large fire event	Ronan Paugam et.al.	2510.26677	null
2025-10-30	Towards Fine-Grained Vision-Language Alignment for Few-Shot Anomaly Detection	Yuanting Fan et.al.	2510.26464	null
2025-10-30	CorVS: Person Identification via Video Trajectory-Sensor Correspondence in a Real-World Warehouse	Kazuma Kano et.al.	2510.26369	null
2025-10-30	Weak-Lensing Detection of Intercluster Filaments in Three Nearby Cluster Systems	Rahul Shinde et.al.	2510.26318	null
2025-10-30	Self-localization on a 3D map by fusing global and local features from a monocular camera	Satoshi Kikuch et.al.	2510.26170	null
2025-10-30	CRAG-MM: Multi-modal Multi-turn Comprehensive RAG Benchmark	Jiaqi Wang et.al.	2510.26160	null
2025-10-30	Exploring Object-Aware Attention Guided Frame Association for RGB-D SLAM	Ali Caglayan et.al.	2510.26131	null
2025-10-30	Josephson effect with periodic order parameter	Klaus Ziegler et.al.	2510.26128	null
2025-10-30	OracleAgent: A Multimodal Reasoning Agent for Oracle Bone Script Research	Caoshuo Li et.al.	2510.26114	null
2025-10-29	RADRON: Cooperative Localization of Ionizing Radiation Sources by MAVs with Compton Cameras	Petr Stibinger et.al.	2510.26018	null
2025-10-29	DARTS: A Drone-Based AI-Powered Real-Time Traffic Incident Detection System	Bai Li et.al.	2510.26004	null
2025-10-31	Larger Hausdorff Dimension in Scanning Pattern Facilitates Mamba-Based Methods in Low-Light Image Enhancement	Xinhua Wang et.al.	2510.26001	null
2025-10-29	Brain-IT: Image Reconstruction from fMRI via Brain-Interaction Transformer	Roman Beliy et.al.	2510.25976	null
2025-10-26	Towards Piece-by-Piece Explanations for Chess Positions with SHAP	Francesco Spinnato et.al.	2510.25775	null
2025-10-29	Retrieval-Augmented Search for Large-Scale Map Collections with ColPali	Jamie Mahowald et.al.	2510.25718	null
2025-10-29	Instance-Level Composed Image Retrieval	Bill Psomas et.al.	2510.25387	null
2025-10-29	Prompt Estimation from Prototypes for Federated Prompt Tuning of Vision Transformers	M Yashwanth et.al.	2510.25372	null
2025-10-29	Development of a new phase-retrieval algorithm from a single-shot image for X-ray schlieren microscopy	Ryutaro Nishimura et.al.	2510.25264	null
2025-10-29	Spectral analysis of the stiffness matrix sequence in the approximated Stokes equation	Samuele Ferri et.al.	2510.25252	null
2025-10-29	Hybrid Vision Servoing with Depp Alignment and GRU-Based Occlusion Recovery	Jee Won Lee et.al.	2510.25233	null
2025-10-29	MMM-Fact: A Multimodal, Multi-Domain Fact-Checking Dataset with Multi-Level Retrieval Difficulty	Wenyan Xu et.al.	2510.25120	null
2025-10-29	Visual Diversity and Region-aware Prompt Learning for Zero-shot HOI Detection	Chanhyeong Yang et.al.	2510.25094	null
2025-10-28	Defect Mitigation for Robot Arm-based Additive Manufacturing Utilizing Intelligent Control and IOT	Matsive Ali et.al.	2510.24994	null
2025-10-28	DualCap: Enhancing Lightweight Image Captioning via Dual Retrieval with Similar Scenes Visual Prompts	Binbin Li et.al.	2510.24813	null
2025-10-28	Generative AI for Healthcare: Fundamentals, Challenges, and Perspectives	Gang Chen et.al.	2510.24551	null
2025-10-28	GeVI-SLAM: Gravity-Enhanced Stereo Visua Inertial SLAM for Underwater Robots	Yuan Shen et.al.	2510.24533	null
2025-10-28	Fast and accurate neural reflectance transformation imaging through knowledge distillation	Tinsae G. Dulecha et.al.	2510.24486	null
2025-10-28	Deeply-Conditioned Image Compression via Self-Generated Priors	Zhineng Zhao et.al.	2510.24437	null
2025-10-28	Half-Light Radius Measurements of Andromeda Dwarf Satellites from the Isaac Newton Telescope Survey Using Exponential, Plummer, and Sérsic Fits	Hedieh Abdollahi et.al.	2510.24377	null
2025-10-28	Decoupling What to Count and Where to See for Referring Expression Counting	Yuda Zou et.al.	2510.24374	null
2025-10-28	Sound Source Localization for Spatial Mapping of Surgical Actions in Dynamic Scenes	Jonas Hein et.al.	2510.24332	null
2025-10-28	CLFSeg: A Fuzzy-Logic based Solution for Boundary Clarity and Uncertainty Reduction in Medical Image Segmentation	Anshul Kaushal et.al.	2510.24202	null
2025-10-28	LagMemo: Language 3D Gaussian Splatting Memory for Multi-modal Open-vocabulary Multi-goal Visual Navigation	Haotian Zhou et.al.	2510.24118	null
2025-10-27	Explainable Detection of AI-Generated Images with Artifact Localization Using Faster-Than-Lies and Vision-Language Models for Edge Devices	Aryan Mathur et.al.	2510.23775	null
2025-10-27	EgoThinker: Unveiling Egocentric Reasoning with Spatio-Temporal CoT	Baoqi Pei et.al.	2510.23569	null
2025-10-27	MDReID: Modality-Decoupled Learning for Any-to-Any Multi-Modal Object Re-Identification	Yingying Feng et.al.	2510.23301	null
2025-10-27	Learning from Frustration: Torsor CNNs on Graphs	Daiyuan Li et.al.	2510.23288	null
2025-10-27	Moderating Role of Presence in EEG Responses to Visuo-haptic Prediction Error in Virtual Reality	Lukas Gehrke et.al.	2510.23262	null
2025-10-27	Accurate and Scalable Multimodal Pathology Retrieval via Attentive Vision-Language Alignment	Hongyi Wang et.al.	2510.23224	null
2025-10-27	The Sun as an X-ray star V.: A new method to retrieve coronal filling factors	Wilhelmina Maryann Joseph et.al.	2510.23161	null
2025-10-27	Reliable Robotic Task Execution in the Face of Anomalies	Bharath Santhanam et.al.	2510.23121	null
2025-10-27	Multi-Stage Field Extraction of Financial Documents with OCR and Compact Vision-Language Models	Yichao Jin et.al.	2510.23066	null
2025-10-26	Seeing the Unseen: Towards Zero-Shot Inspection for Wind Turbine Blades using Knowledge-Augmented Vision Language Models	Yang Zhang et.al.	2510.22868	null
2025-10-26	Semantic Surgery: Zero-Shot Concept Erasure in Diffusion Models	Lexiang Xiong et.al.	2510.22851	null
2025-10-26	Analytical Swarm Chemistry: Characterization and Analysis of Emergent Swarm Behaviors	Ricardo Vega et.al.	2510.22821	null
2025-10-26	VEHME: A Vision-Language Model For Evaluating Handwritten Mathematics Expressions	Thu Phuong Nguyen et.al.	2510.22798	null
2025-11-01	Jarvis: Towards Personalized AI Assistant via Personal KV-Cache Retrieval	Binxiao Xu et.al.	2510.22765	null
2025-10-26	TWC-SLAM: Multi-Agent Cooperative SLAM with Text Semantics and WiFi Features Integration for Similar Indoor Environments	Chunyu Li et.al.	2510.22754	null
2025-10-30	Cross-view Localization and Synthesis -- Datasets, Challenges and Opportunities	Ningli Xu et.al.	2510.22736	null
2025-10-26	S-Chain: Structured Visual Chain-of-Thought For Medicine	Khai Le-Duc et.al.	2510.22728	null
2025-10-26	SpoofTrackBench: Interpretable AI for Spoof-Aware UAV Tracking and Benchmarking	Van Le et.al.	2510.22726	null
2025-10-26	LRW-Persian: Lip-reading in the Wild Dataset for Persian Language	Zahra Taghizadeh et.al.	2510.22716	null
2025-10-26	SARCLIP: A Vision Language Foundation Model for Semantic Understanding and Target Recognition in SAR Imagery	Qiwei Ma et.al.	2510.22665	null
2025-10-26	CLIN-LLM: A Safety-Constrained Hybrid Framework for Clinical Diagnosis and Treatment Generation	Md. Mehedi Hasan et.al.	2510.22609	null
2025-10-26	SWAN: Self-supervised Wavelet Neural Network for Hyperspectral Image Unmixing	Yassh Ramchandani et.al.	2510.22607	null
2025-10-26	RoGER-SLAM: A Robust Gaussian Splatting SLAM System for Noisy and Low-light Environment Resilience	Huilin Yin et.al.	2510.22600	null
2025-10-26	STATUS Bench: A Rigorous Benchmark for Evaluating Object State Understanding in Vision-Language Models	Mahiro Ukai et.al.	2510.22571	null
2025-10-26	Structure Aware Image Downscaling	G B Kevin Arjun et.al.	2510.22551	null
2025-10-26	Low-Light Image Enhancement Using Gamma Learning And Attention-Enabled Encoder-Decoder Networks	Bibhabasu Debnath et.al.	2510.22547	null
2025-10-26	Bag-of-Word-Groups (BoWG): A Robust and Efficient Loop Closure Detection Method Under Perceptual Aliasing	Xiang Fei et.al.	2510.22529	null
2025-10-26	Open Multimodal Retrieval-Augmented Factual Image Generation	Yang Tian et.al.	2510.22521	null
2025-10-25	Moving Beyond Diffusion: Hierarchy-to-Hierarchy Autoregression for fMRI-to-Image Reconstruction	Xu Zhang et.al.	2510.22335	null
2025-10-25	From Slides to Chatbots: Enhancing Large Language Models with University Course Materials	Tu Anh Dinh et.al.	2510.22272	null
2025-10-25	Scaling Non-Parametric Sampling with Representation	Vincent Lu et.al.	2510.22196	null
2025-10-24	Earth Analogs in Reflected Light: Insights from Early Spectral Characterization in Unconstrained Orbits	Arnaud Salvador et.al.	2510.21973	null
2025-10-23	TernaryCLIP: Efficiently Compressing Vision-Language Models with Ternary Weights and Distilled Knowledge	Shu-Hao Zhang et.al.	2510.21879	null
2025-10-22	SCoPE VLM: Selective Context Processing for Efficient Document Navigation in Vision-Language Models	Gyubeum Lim et.al.	2510.21850	null
2025-10-24	Modest-Align: Data-Efficient Alignment for Vision-Language Models	Jiaxiang Liu et.al.	2510.21606	null
2025-10-23	GranViT: A Fine-Grained Vision Model With Autoregressive Perception For MLLMs	Guanghao Zheng et.al.	2510.21501	null
2025-10-24	MUVR: A Multi-Modal Untrimmed Video Retrieval Benchmark with Multi-Level Visual Correspondence	Yue Feng et.al.	2510.21406	null
2025-10-24	Dynamic Semantic-Aware Correlation Modeling for UAV Tracking	Xinyu Zhou et.al.	2510.21351	null
2025-10-24	CT-CLIP: A Multi-modal Fusion Framework for Robust Apple Leaf Disease Recognition in Complex Environments	Lemin Liu et.al.	2510.21346	null
2025-10-24	FineRS: Fine-grained Reasoning and Segmentation of Small Objects with Reinforcement Learning	Lu Zhang et.al.	2510.21311	null
2025-10-24	Underwater Visual-Inertial-Acoustic-Depth SLAM with DVL Preintegration for Degraded Environments	Shuoshuo Ding et.al.	2510.21215	null
2025-10-24	A visual big data system for the prediction of weather-related variables: Jordan-Spain case study	Shadi Aljawarneh et.al.	2510.21176	null
2025-10-24	MedAlign: A Synergistic Framework of Multimodal Preference Optimization and Federated Meta-Cognitive Reasoning	Siyong Chen et.al.	2510.21093	null
2025-10-27	LayerComposer: Interactive Personalized T2I via Spatially-Aware Layered Canvas	Guocheng Gordon Qian et.al.	2510.20820	null
2025-10-23	Small Drafts, Big Verdict: Information-Intensive Visual Reasoning via Speculation	Yuhan Liu et.al.	2510.20812	null
2025-10-23	Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence	Jiahao Meng et.al.	2510.20579	null
2025-10-23	Deep Learning-Powered Visual SLAM Aimed at Assisting Visually Impaired Navigation	Marziyeh Bamdad et.al.	2510.20549	null
2025-10-24	Robust Preference Alignment via Directional Neighborhood Consensus	Ruochen Mao et.al.	2510.20498	null
2025-10-23	Degradation-Aware Cooperative Multi-Modal GNSS-Denied Localization Leveraging LiDAR-Based Robot Detections	Václav Pritzl et.al.	2510.20480	null
2025-11-20	Conan: Progressive Learning to Reason Like a Detective over Multi-Scale Visual Evidence	Kun Ouyang et.al.	2510.20470	null
2025-10-23	Mitigating Cross-modal Representation Bias for Multicultural Image-to-Recipe Retrieval	Qing Wang et.al.	2510.20393	null
2025-10-25	DB-FGA-Net: Dual Backbone Frequency Gated Attention Network for Multi-Class Brain Tumor Classification with Grad-CAM Interpretability	Saraf Anzum Shreya et.al.	2510.20299	null
2025-10-23	A Parameter-Efficient Mixture-of-Experts Framework for Cross-Modal Geo-Localization	LinFeng Li et.al.	2510.20291	null
2025-10-23	Multimedia-Aware Question Answering: A Review of Retrieval and Cross-Modal Reasoning Architectures	Rahul Raja et.al.	2510.20193	null
2025-10-23	PathFormer: A Transformer with 3D Grid Constraints for Digital Twin Robot-Arm Trajectory Generation	Ahmed Alanazi et.al.	2510.20161	null
2025-10-27	"Learning Together": AI-Mediated Support for Parental Involvement in Everyday Learning	Yao Li et.al.	2510.20123	null
2025-10-24	BioCAP: Exploiting Synthetic Captions Beyond Labels in Biological Foundation Models	Ziheng Zhang et.al.	2510.20095	null
2025-10-22	Exposing Blindspots: Cultural Bias Evaluation in Generative Image Models	Huichan Seo et.al.	2510.20042	null
2025-10-22	Automating Iconclass: LLMs and RAG for Large-Scale Classification of Religious Woodcuts	Drew B. Thomas et.al.	2510.19986	null
2025-10-22	Compressing Biology: Evaluating the Stable Diffusion VAE for Phenotypic Drug Discovery	Télio Cropsal et.al.	2510.19887	null
2025-10-22	Multilayer Perceptron Neural Network Model: A Novel Approach for LFP Contrast Sensitivity Tuning	Sahar Maleki et.al.	2510.19636	null
2025-10-22	XBench: A Comprehensive Benchmark for Visual-Language Explanations in Chest Radiography	Haozhe Luo et.al.	2510.19599	null
2025-10-22	Decomposed Attention Fusion in MLLMs for Training-Free Video Reasoning Segmentation	Su Ho Han et.al.	2510.19592	null
2025-10-22	AegisRF: Adversarial Perturbations Guided with Sensitivity for Protecting Intellectual Property of Neural Radiance Fields	Woo Jae Kim et.al.	2510.19371	null
2025-10-22	Exploring Scale Shift in Crowd Localization under the Context of Domain Generalization	Juncheng Wang et.al.	2510.19330	null
2025-10-22	Step-Aware Residual-Guided Diffusion for EEG Spatial Super-Resolution	Hongjun Liu et.al.	2510.19166	null
2025-10-21	UniHPR: Unified Human Pose Representation via Singular Value Contrastive Learning	Zhongyu Jiang et.al.	2510.19078	null
2025-10-21	Macroscopic EEG Reveals Discriminative Low-Frequency Oscillations in Plan-to-Grasp Visuomotor Tasks	Anna Cetera et.al.	2510.19057	null
2025-10-21	Visually Comparing Graph Vertex Ordering Algorithms through Geometrical and Topological Approaches	Karelia Salinas et.al.	2510.19009	null
2025-10-21	Underwater Dense Mapping with the First Compact 3D Sonar	Chinmay Burgul et.al.	2510.18991	null
2025-10-18	Small Language Models Offer Significant Potential for Science Community	Jian Zhang et.al.	2510.18890	null
2025-10-21	FedDEAP: Adaptive Dual-Prompt Tuning for Multi-Domain Federated Learning	Yubin Zheng et.al.	2510.18837	null
2025-10-21	UltraGen: High-Resolution Video Generation with Hierarchical Attention	Teng Hu et.al.	2510.18775	null
2025-10-21	Topoformer: brain-like topographic organization in Transformer language models through spatial querying and reweighting	Taha Binhuraib et.al.	2510.18745	null
2025-10-21	SSD: Spatial-Semantic Head Decoupling for Efficient Autoregressive Image Generation	Siyong Jian et.al.	2510.18716	null
2025-10-21	Exploring a Unified Vision-Centric Contrastive Alternatives on Multi-Modal Web Documents	Yiqi Lin et.al.	2510.18703	null
2025-10-21	CovMatch: Cross-Covariance Guided Multimodal Dataset Distillation with Trainable Text Encoder	Yongmin Lee et.al.	2510.18583	null
2025-11-12	Large deviations in the many-body localization transition: The case of the random-field XXZ chain	Greivin Alfaro Miranda et.al.	2510.18545	null
2025-10-21	RayPose: Ray Bundling Diffusion for Template Views in Unseen 6D Object Pose Estimation	Junwen Huang et.al.	2510.18521	null
2025-10-21	Zero-Shot Vehicle Model Recognition via Text-Based Retrieval-Augmented Generation	Wei-Chia Chang et.al.	2510.18502	null
2025-10-21	Beyond Single Images: Retrieval Self-Augmented Unsupervised Camouflaged Object Detection	Ji Du et.al.	2510.18437	null
2025-10-21	ImageGem: In-the-wild Generative Image Interaction Dataset for Generative Model Personalization	Yuanhe Guo et.al.	2510.18433	null
2025-10-21	Med-VRAgent: A Framework for Medical Visual Reasoning-Enhanced Agents	Guangfu Guo et.al.	2510.18424	null
2025-10-21	Proactive Reasoning-with-Retrieval Framework for Medical Multimodal Large Language Models	Lehan Wang et.al.	2510.18303	null
2025-10-22	Text or Pixels? It Takes Half: On the Token Efficiency of Visual Text Inputs in Multimodal LLMs	Yanhong Li et.al.	2510.18279	null
2025-10-21	TreeFedDG: Alleviating Global Drift in Federated Domain Generalization for Medical Image Segmentation	Yucheng Song et.al.	2510.18268	null
2025-10-21	UWBench: A Comprehensive Vision-Language Benchmark for Underwater Understanding	Da Zhang et.al.	2510.18262	null
2025-10-21	DualHash: A Stochastic Primal-Dual Algorithm with Theoretical Guarantee for Deep Hashing	Luxuan Li et.al.	2510.18218	null
2025-10-20	AION-1: Omnimodal Foundation Model for Astronomical Sciences	Liam Parker et.al.	2510.17960	null
2025-10-13	Pre to Post-Treatment Glioblastoma MRI Prediction using a Latent Diffusion Model	Alexandre G. Leclercq et.al.	2510.17851	null
2025-09-30	Micromechanical characterisation of osteoarthritic subchondral bone by micropillar compression	Samuel McPhee et.al.	2510.17824	null
2025-10-20	SparseVILA: Decoupling Visual Sparsity for Efficient VLM Inference	Samir Khaki et.al.	2510.17777	null
2025-10-20	Seeing but Not Believing: Probing the Disconnect Between Visual Attention and Answer Correctness in VLMs	Zhining Liu et.al.	2510.17771	null
2025-10-20	Joint Multi-Condition Representation Modelling via Matrix Factorisation for Visual Place Recognition	Timur Ismagilov et.al.	2510.17739	null
2025-10-20	Multilingual Text-to-Image Person Retrieval via Bidirectional Relation Reasoning and Aligning	Min Cao et.al.	2510.17685	null
2025-10-20	MIRAGE: Agentic Framework for Multimodal Misinformation Detection with Web-Grounded Reasoning	Mir Nafis Sharear Shopnil et.al.	2510.17590	null
2025-10-20	BenCao: An Instruction-Tuned Large Language Model for Traditional Chinese Medicine	Jiacheng Xie et.al.	2510.17415	null
2025-10-20	Model Metamers Reveal Invariances in Graph Neural Networks	Wei Xu et.al.	2510.17378	null
2025-10-20	Towards Mixed-Modal Retrieval for Universal Retrieval-Augmented Generation	Chenghao Zhang et.al.	2510.17354	null
2025-10-21	LongInsightBench: A Comprehensive Benchmark for Evaluating Omni-Modal Models on Human-Centric Long-Video Understanding	ZhaoYang Han et.al.	2510.17305	null
2025-10-20	Performance Evaluation of an Integrated System for Visible Light Communication and Positioning Using an Event Camera	Ryota Soga et.al.	2510.17203	null
2025-10-20	Generation then Reconstruction: Accelerating Masked Autoregressive Models via Two-Stage Sampling	Feihong Yan et.al.	2510.17171	null
2025-10-22	OmniVIC: A Self-Improving Variable Impedance Controller with Vision-Language In-Context Learning for Safe Robotic Manipulation	Heng Zhang et.al.	2510.17150	null
2025-10-19	Person Re-Identification via Generalized Class Prototypes	Md Ahmed Al Muzaddid et.al.	2510.17043	null
2025-10-19	A Low-Complexity View Synthesis Distortion Estimation Method for 3D Video with Large Baseline Considerations	Chongyuan Bi et.al.	2510.17037	null
2025-10-19	SAKE: Towards Editing Auditory Attribute Knowledge of Large Audio-Language Models	Chih-Kai Yang et.al.	2510.16917	null
2025-10-19	ArmFormer: Lightweight Transformer Architecture for Real-Time Multi-Class Weapon Segmentation and Classification	Akhila Kambhatla et.al.	2510.16854	null
2025-11-24	ReefNet: A Large scale, Taxonomically Enriched Dataset and Benchmark for Hard Coral Classification	Yahia Battach et.al.	2510.16822	null
2025-10-19	An Efficient Framework for Whole-Page Reranking via Single-Modal Supervision	Zishuai Zhang et.al.	2510.16803	null
2025-10-19	Region in Context: Text-condition Image editing with Human-like semantic reasoning	Thuy Phuong Vu et.al.	2510.16772	null
2025-10-19	See or Say Graphs: Agent-Driven Scalable Graph Understanding with Vision-Language Models	Shuo Han et.al.	2510.16769	null
2025-10-19	Exact Nearest-Neighbor Search on Energy-Efficient FPGA Devices	Patrizio Dazzi et.al.	2510.16736	null
2025-10-27	UKANFormer: Noise-Robust Semantic Segmentation for Coral Reef Mapping via a Kolmogorov-Arnold Network-Transformer Hybrid	Tianyang Dou et.al.	2510.16730	null
2025-10-18	Safire: Similarity Framework for Visualization Retrieval	Huyen N. Nguyen et.al.	2510.16662	null
2025-10-18	A Deep Learning Framework for Real-Time Image Processing in Medical Diagnostics: Enhancing Accuracy and Speed in Clinical Applications	Melika Filvantorkaman et.al.	2510.16611	null
2025-10-18	Image Categorization and Search via a GAT Autoencoder and Representative Models	Duygu Sap et.al.	2510.16514	null
2025-10-18	RefAtomNet++: Advancing Referring Atomic Video Action Recognition using Semantic Retrieval based Multi-Trajectory Mamba	Kunyu Peng et.al.	2510.16444	null
2025-10-18	RL makes MLLMs see better than SFT	Junha Song et.al.	2510.16333	null
2025-10-17	Out-of-Equilibrium Dynamics in a U(1) Lattice Gauge Theory via Local Information Flows: Scattering and String Breaking	Claudia Artiaco et.al.	2510.16101	null
2025-10-14	Frequency domain laser ultrasound microscopy for nanometric layer thickness imaging with GHz elastic plate resonances	Martin Ryzy et.al.	2510.16000	null
2025-10-27	ESCA: Contextualizing Embodied Agents via Scene-Graph Generation	Jiani Huang et.al.	2510.15963	null
2025-10-17	Memory-SAM: Human-Prompt-Free Tongue Segmentation via Retrieval-to-Prompt	Joongwon Chae et.al.	2510.15849	null
2025-10-17	FlexiReID: Adaptive Mixture of Expert for Multi-Modal Person Re-Identification	Zhen Sun et.al.	2510.15595	null
2025-10-17	MCA: Modality Composition Awareness for Robust Composed Multimodal Retrieval	Qiyu Wu et.al.	2510.15543	null
2025-10-17	DPTrack:Directional Kernel-Guided Prompt Learning for Robust Nighttime Aerial Tracking	Zhiqiang Zhu et.al.	2510.15449	null
2025-10-17	Select Less, Reason More: Prioritizing Evidence Purity for Video Reasoning	Xuchen Li et.al.	2510.15440	null
2025-10-17	Semantic4Safety: Causal Insights from Zero-shot Street View Imagery Segmentation for Urban Road Safety	Huan Chen et.al.	2510.15434	null
2025-11-07	Fine-Tuning MedGemma for Clinical Captioning to Enhance Multimodal RAG over Malaysia CPGs	Lee Qi Zun et.al.	2510.15418	null
2025-10-17	PFGS: Pose-Fused 3D Gaussian Splatting for Complete Multi-Pose Object Reconstruction	Ting-Yu Yen et.al.	2510.15386	null
2025-10-17	WebGen-V Bench: Structured Representation for Enhancing Visual Design in LLM-based Web Generation and Evaluation	Kuang-Da Wang et.al.	2510.15306	null
2025-10-17	Post-Processing Methods for Improving Accuracy in MRI Inpainting	Nishad Kulkarni et.al.	2510.15282	null
2025-10-17	CuSfM: CUDA-Accelerated Structure-from-Motion	Jingrui Yu et.al.	2510.15271	null
2025-11-02	Experience-Driven Exploration for Efficient API-Free AI Agents	Chenwei Tang et.al.	2510.15259	null
2025-10-17	LVI-Q: Robust LiDAR-Visual-Inertial-Kinematic Odometry for Quadruped Robots Using Tightly-Coupled and Efficient Alternating Optimization	Kevin Christiansen Marsim et.al.	2510.15220	null
2025-10-16	TGT: Text-Grounded Trajectories for Locally Controlled Video Generation	Guofeng Zhang et.al.	2510.15104	null
2025-10-16	Comprehensive language-image pre-training for 3D medical image understanding	Tassilo Wald et.al.	2510.15042	null
2025-10-16	NANO3D: A Training-Free Approach for Efficient 3D Editing Without Masks	Junliang Ye et.al.	2510.15019	null
2025-10-16	ChangingGrounding: 3D Visual Grounding in Changing Scenes	Miao Hu et.al.	2510.14965	null
2025-10-16	RainDiff: End-to-end Precipitation Nowcasting Via Token-wise Attention Diffusion	Thao Nguyen et.al.	2510.14962	null
2025-10-16	CoT-PL: Visual Chain-of-Thought Reasoning Meets Pseudo-Labeling for Open-Vocabulary Object Detection	Hojun Choi et.al.	2510.14792	null
2025-10-16	Improving Cybercrime Detection and Digital Forensics Investigations with Artificial Intelligence	Silvia Lucia Sanna et.al.	2510.14638	null
2025-10-16	Multimodal RAG for Unstructured Data:Leveraging Modality-Aware Knowledge Graphs with Hybrid Retrieval	Rashmi R et.al.	2510.14592	null
2025-10-16	Talking Points: Describing and Localizing Pixels	Matan Rusanovsky et.al.	2510.14583	null
2025-10-16	Acquisition of interpretable domain information during brain MR image harmonization for content-based image retrieval	Keima Abe et.al.	2510.14535	null
2025-11-24	Structured Random Models for Phase Retrieval with Optical Diffusers	Zhiyuan Hu et.al.	2510.14490	null
2025-10-16	Spatial Preference Rewarding for MLLMs Spatial Understanding	Han Qiu et.al.	2510.14374	null
2025-10-14	K-frames: Scene-Driven Any-k Keyframe Selection for long video understanding	Yifeng Yao et.al.	2510.13891	null
2025-10-12	Multimodal Retrieval-Augmented Generation with Large Language Models for Medical VQA	A H M Rezaul Karim et.al.	2510.13856	null
2025-09-19	GQVis: A Dataset of Genomics Data Questions and Visualizations for Generative AI	Skylar Sargent Walters et.al.	2510.13816	null
2025-10-15	Adaptive Visual Conditioning for Semantic Consistency in Diffusion-Based Story Continuation	Seyed Mohammad Mousavi et.al.	2510.13787	null
2025-10-16	NExT-OMNI: Towards Any-to-Any Omnimodal Foundation Models with Discrete Flow Matching	Run Luo et.al.	2510.13721	null
2025-10-15	Jacobian-Based Interpretation of Nonlinear Neural Encoding Model	Xiaohui Gao et.al.	2510.13688	null
2025-11-11	AVAR-Net: A Lightweight Audio-Visual Anomaly Recognition Framework with a Benchmark Dataset	Amjid Ali et.al.	2510.13630	null
2025-10-15	Characterizing Lidar Point-Cloud Adversities Using a Vector Field Visualization	Daniel Choate et.al.	2510.13619	null
2025-10-15	Accelerated Feature Detectors for Visual SLAM: A Comparative Study of FPGA vs GPU	Ruiqi Ye et.al.	2510.13546	null
2025-10-15	Through the Lens of Doubt: Robust and Efficient Uncertainty Estimation for Visual Place Recognition	Emily Miller et.al.	2510.13464	null
2025-10-15	Improving Visual Recommendation on E-commerce Platforms Using Vision-Language Models	Yuki Yada et.al.	2510.13359	null
2025-10-15	UniVector: Unified Vector Extraction via Instance-Geometry Interaction	Yinglong Yan et.al.	2510.13234	null
2025-10-15	OS-HGAdapter: Open Semantic Hypergraph Adapter for Large Language Models Assisted Entropy-Enhanced Image-Text Alignment	Rongjun Chen et.al.	2510.13131	null
2025-10-23	Epistemic-aware Vision-Language Foundation Model for Fetal Ultrasound Interpretation	Xiao He et.al.	2510.12953	null
2025-10-14	DeepMMSearch-R1: Empowering Multimodal LLMs in Multimodal Web Search	Kartik Narayan et.al.	2510.12801	null
2025-10-14	SRUM: Fine-Grained Self-Rewarding for Unified Multimodal Models	Weiyang Jin et.al.	2510.12784	null
2025-10-24	E-MoFlow: Learning Egomotion and Optical Flow from Event Data via Implicit Regularization	Wenpu Li et.al.	2510.12753	null
2025-10-14	A Text-Image Fusion Method with Data Augmentation Capabilities for Referring Medical Image Segmentation	Shurong Chai et.al.	2510.12482	null
2025-10-14	SMEC: Rethinking Matryoshka Representation Learning for Retrieval Embedding Compression	Biao Zhang et.al.	2510.12474	null
2025-10-14	SpineBench: Benchmarking Multimodal LLMs for Spinal Pathology Analysis	Chenghanyu Zhang et.al.	2510.12267	null
2025-10-14	Local Background Features Matter in Out-of-Distribution Detection	Jinlun Ye et.al.	2510.12259	null
2025-10-14	SDGraph: Multi-Level Sketch Representation Learning by Sparse-Dense Graph Architecture	Xi Cheng et.al.	2510.12192	null
2025-10-14	ImageSentinel: Protecting Visual Datasets from Unauthorized Retrieval-Augmented Image Generation	Ziyuan Luo et.al.	2510.12119	null
2025-10-13	Embedding the Teacher: Distilling vLLM Preferences for Scalable Image Retrieval	Eric He et.al.	2510.12014	null
2025-10-11	Benefits and Limitations of Using GenAI for Political Education and Municipal Elections	Raphael Fischer et.al.	2510.11749	null
2025-10-13	High-resolution Photo Enhancement in Real-time: A Laplacian Pyramid Network	Feng Zhang et.al.	2510.11613	null
2025-10-14	Massive Activations are the Key to Local Detail Synthesis in Diffusion Transformers	Chaofan Gan et.al.	2510.11538	null
2025-10-13	A Modular AIoT Framework for Low-Latency Real-Time Robotic Teleoperation in Smart Cities	Shih-Chieh Sun et.al.	2510.11421	null
2025-10-13	MMAP: A Multi-Magnification and Prototype-Aware Architecture for Predicting Spatial Gene Expression	Hai Dang Nguyen et.al.	2510.11344	null
2025-10-13	A Large-Language-Model Assisted Automated Scale Bar Detection and Extraction Framework for Scanning Electron Microscopic Images	Yuxuan Chen et.al.	2510.11260	null
2025-10-13	PhysHSI: Towards a Real-World Generalizable and Natural Humanoid-Scene Interaction System	Huayi Wang et.al.	2510.11072	null
2025-10-13	Impact of elastic inhomogeneity on collective dynamical properties investigated by field theoretical description in real space	Cunyuan Jiang et.al.	2510.10928	null
2025-10-13	SceneTextStylizer: A Training-Free Scene Text Style Transfer Framework with Diffusion Model	Honghui Yuan et.al.	2510.10910	null
2025-10-13	Spatial Correlation of Superconducting and Pseudogap Dynamics in a Bi-based Cuprate	T. Shimizu et.al.	2510.10906	null
2025-10-13	Where on Earth? A Vision-Language Benchmark for Probing Model Geolocation Skills Across Scales	Zhaofang Qian et.al.	2510.10880	null
2025-10-12	OmniVideoBench: Towards Audio-Visual Understanding Evaluation for Omni MLLMs	Caorui Li et.al.	2510.10689	null
2025-10-12	A Simple and Better Baseline for Visual Grounding	Jingchao Wang et.al.	2510.10587	null
2025-10-12	BitMar: Low-Bit Multimodal Fusion with Episodic Memory for Edge Devices	Euhid Aman et.al.	2510.10560	null
2025-10-12	Taming a Retrieval Framework to Read Images in Humanlike Manner for Augmenting Generation of MLLMs	Suyang Xi et.al.	2510.10426	null
2025-10-11	B2N3D: Progressive Learning from Binary to N-ary Relationships for 3D Object Grounding	Feng Xiao et.al.	2510.10194	null
2025-10-11	TCMA: Text-Conditioned Multi-granularity Alignment for Drone Cross-Modal Text-Video Retrieval	Zixu Zhao et.al.	2510.10180	null
2025-10-11	ViConEx-Med: Visual Concept Explainability via Multi-Concept Token Transformer for Medical Image Analysis	Cristiano Patrício et.al.	2510.10174	null
2025-10-11	Cooperative Pseudo Labeling for Unsupervised Federated Classification	Kuangpu Guo et.al.	2510.10100	null
2025-10-11	Think Twice to See More: Iterative Visual Reasoning in Medical VLMs	Kaitao Chen et.al.	2510.10052	null
2025-10-11	Complementary and Contrastive Learning for Audio-Visual Segmentation	Sitong Gong et.al.	2510.10051	null
2025-10-11	Scaling Traffic Insights with AI and Language Model-Powered Camera Systems for Data-Driven Transportation Decision Making	Fan Zuo et.al.	2510.09981	null
2025-10-14	J-RAS: Enhancing Medical Image Segmentation via Retrieval-Augmented Joint Training	Salma J. Ahmed et.al.	2510.09953	null
2025-10-15	Egocentric Visual Navigation through Hippocampal Sequences	Xiao-Xiong Lin et.al.	2510.09951	null
2025-10-10	The Geometry of Reasoning: Flowing Logics in Representation Space	Yufa Zhou et.al.	2510.09782	null
2025-10-10	VisRAG 2.0: Evidence-Guided Multi-Image Reasoning in Visual Retrieval-Augmented Generation	Yubo Sun et.al.	2510.09733	null
2025-10-07	Semantic-Cohesive Knowledge Distillation for Deep Cross-modal Hashing	Changchang Sun et.al.	2510.09664	null
2025-10-10	MRMR: A Realistic and Expert-Level Multidisciplinary Benchmark for Reasoning-Intensive Multimodal Retrieval	Siyue Zhang et.al.	2510.09510	null
2025-10-10	Diagonal Artifacts in Samsung Images: PRNU Challenges and Solutions	David Vázquez-Padín et.al.	2510.09509	null
2025-10-10	Dynamic Weight-based Temporal Aggregation for Low-light Video Enhancement	Ruirui Lin et.al.	2510.09450	null
2025-10-10	Mono4DEditor: Text-Driven 4D Scene Editing from Monocular Video via Point-Level Localization of Language-Embedded Gaussians	Jin-Chuan Shi et.al.	2510.09438	null
2025-10-10	Sub-Diffraction Chromatin Domains: Architecture, Regulation, and Functional Roles in Nuclear Organization	Vinayak Vinayak et.al.	2510.09375	null
2025-10-10	Hybrid-grained Feature Aggregation with Coarse-to-fine Language Guidance for Self-supervised Monocular Depth Estimation	Wenyao Zhang et.al.	2510.09320	null
2025-10-10	Instance-Level Generation for Representation Learning	Yankun Wu et.al.	2510.09171	null
2025-10-10	Robust Visual Teach-and-Repeat Navigation with Flexible Topo-metric Graph Map Representation	Jikai Wang et.al.	2510.09089	null
2025-10-10	Visual Anomaly Detection for Reliable Robotic Implantation of Flexible Microelectrode Array	Yitong Chen et.al.	2510.09071	null
2025-10-10	HandEval: Taking the First Step Towards Hand Quality Evaluation in Generated Images	Zichuan Wang et.al.	2510.08978	null
2025-10-10	Hierarchical Scheduling for Multi-Vector Image Retrieval	Maoliang Li et.al.	2510.08976	null
2025-11-19	FATHOMS-RAG: A Framework for the Assessment of Thinking and Observation in Multimodal Systems that use Retrieval Augmented Generation	Samuel Hildebrand et.al.	2510.08945	null
2025-10-09	Identifying Video Game Debugging Bottlenecks: An Industry Perspective	Carlos Pinto Gomez et.al.	2510.08834	null
2025-10-09	Whole Body Model Predictive Control for Spin-Aware Quadrupedal Table Tennis	David Nguyen et.al.	2510.08754	null
2025-10-08	Into the Rabbit Hull: From Task-Relevant Concepts in DINO to Minkowski Geometry	Thomas Fel et.al.	2510.08638	null
2025-10-11	MultiCOIN: Multi-Modal COntrollable Video INbetweening	Maham Tanveer et.al.	2510.08561	null
2025-10-09	X2Video: Adapting Diffusion Models for Multimodal Controllable Neural Video Rendering	Zhitong Huang et.al.	2510.08530	null
2025-10-09	Observation of electromagnons in a monolayer multiferroic	Mohammad Amini et.al.	2510.08253	null
2025-10-09	DarkHash: A Data-Free Backdoor Attack Against Deep Hashing	Ziqi Zhou et.al.	2510.08094	null
2025-10-09	CIR-CoT: Towards Interpretable Composed Image Retrieval via End-to-End Chain-of-Thought Reasoning	Weihuang Lin et.al.	2510.08003	null
2025-10-09	MARC: Memory-Augmented RL Token Compression for Efficient Video Understanding	Peiran Wu et.al.	2510.07915	null
2025-10-09	RePainter: Empowering E-commerce Object Removal via Spatial-matting Reinforcement Learning	Zipeng Guo et.al.	2510.07721	null
2025-10-09	Multimodal Safety Evaluation in Generative Agent Social Simulations	Alhim Vera et.al.	2510.07709	null
2025-10-09	Mutual Learning for Hashing: Unlocking Strong Hash Functions from Weak Supervision	Xiaoxu Ma et.al.	2510.07703	null
2025-10-16	Ctrl-VI: Controllable Video Synthesis via Variational Inference	Haoyi Duan et.al.	2510.07670	null
2025-10-08	SpecGuard: Spectral Projection-based Advanced Invisible Watermarking	Inzamamul Alam et.al.	2510.07302	null
2025-10-10	DPL: Depth-only Perceptive Humanoid Locomotion via Realistic Depth Synthesis and Cross-Attention Terrain Reconstruction	Jingkai Sun et.al.	2510.07152	null
2025-10-08	ELMUR: External Layer Memory with Update/Rewrite for Long-Horizon RL	Egor Cherepanov et.al.	2510.07151	null
2025-11-14	Concept Retrieval -- What and How?	Ori Nizan et.al.	2510.07058	null
2025-10-08	High-Performance Imaging in a Dilution Refrigerator	Timo Eikelmann et.al.	2510.07054	null
2025-10-08	Introspection in Learned Semantic Scene Graph Localisation	Manshika Charvi Bissessur et.al.	2510.07053	null
2025-10-08	IAR2: Improving Autoregressive Visual Generation with Semantic-Detail Associated Token Prediction	Ran Yi et.al.	2510.06928	null
2025-10-08	M3Retrieve: Benchmarking Multimodal Retrieval for Medicine	Arkadeep Acharya et.al.	2510.06888	null
2025-10-08	Versatile 3D reconstruction framework for hard X-ray grazing incidence imaging of nanostructures	Luke Besley et.al.	2510.06877	null
2025-10-08	Multi-hop Deep Joint Source-Channel Coding with Deep Hash Distillation for Semantically Aligned Image Retrieval	Didrik Bergström et.al.	2510.06868	null
2025-10-08	Efficient Discriminative Joint Encoders for Large Scale Vision-Language Reranking	Mitchell Keren Taraday et.al.	2510.06820	null
2025-10-08	Capture and Interact: Rapid 3D Object Acquisition and Rendering with Gaussian Splatting in Unity	Islomjon Shukhratov et.al.	2510.06802	null
2025-10-08	DeRainMamba: A Frequency-Aware State Space Model with Detail Enhancement for Image Deraining	Zhiliang Zhu et.al.	2510.06746	null
2025-10-08	ToolMem: Enhancing Multimodal Agents with Learnable Tool Capability Memory	Yunzhong Xiao et.al.	2510.06664	null
2025-11-15	Implicit-Knowledge Visual Question Answering with Structured Reasoning Traces	Zhihao Wen et.al.	2510.06638	null
2025-10-07	TDiff: Thermal Plug-And-Play Prior with Patch-Based Diffusion	Piyush Dashpute et.al.	2510.06460	null
2025-10-07	Vi-TacMan: Articulated Object Manipulation via Vision and Touch	Leiyao Cui et.al.	2510.06339	null
2025-10-05	A Mixed-Methods Analysis of Repression and Mobilization in Bangladesh's July Revolution Using Machine Learning and Statistical Modeling	Md. Saiful Bari Siddiqui et.al.	2510.06264	null
2025-10-09	A Multimodal GUI Architecture for Interfacing with LLM-Based Conversational Assistants	Hans G. W. van Dam et.al.	2510.06223	null
2025-10-07	Human3R: Everyone Everywhere All at Once	Yue Chen et.al.	2510.06219	null
2025-10-07	DYMO-Hair: Generalizable Volumetric Dynamics Modeling for Robot Hair Manipulation	Chengyang Zhao et.al.	2510.06199	null
**2025-10-0

Name		Name	Last commit message	Last commit date
Latest commit History 2,313 Commits
.github		.github
assets		assets
docs		docs
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE		LICENSE
README.md		README.md
config.yaml		config.yaml
daily_arxiv.py		daily_arxiv.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Updated on 2025.12.06

SLAM

SFM

Visual Localization

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 3

Uh oh!

Languages

License

Vincentqyw/cv-arxiv-daily

Folders and files

Latest commit

History

Repository files navigation

Updated on 2025.12.06

SLAM

SFM

Visual Localization

About

Topics

Resources

License

Code of conduct

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 3

Uh oh!

Languages

Packages