@Gi_T This is really interesting work! I have a somewhat related project where I’m training a neural network to perform navigation using a single camera, aiming to improve upon encoder/IMU-based odometry. The goal is to use low-cost hobby robot components and a single ESP32-based camera.

Deep reinforcement learning (DRL) for navigation often relies on LiDAR, which increases costs. Do you think it’s possible to extend DRL-based navigation using only a monocular camera? Monocular SLAM results seem a bit noisy. (GitHub - weixr18/Pi-SLAM: Implementing full visual SLAM on the Raspberry Pi 4B) . Seems do-able as these depth estimation models are getting really good. Latency might be the bottleneck. Im still a little new to DRL techniques in general.