My name is Alex, and alongside my work in the industry, I am pursuing a Ph.D. in Robotics/Computer Vision at a Brazilian university.
We are working on an autonomous vessel project, a USV (Unmanned Surface Vehicle). I am currently dedicated to implementing VSLAM (Visual Simultaneous Localization and Mapping) for the boat, which is currently a sailboat but could potentially be another type of vessel, such as a catamaran.
I need your help. I am looking for a scientific contribution that my work could address. If anyone is an expert in the field of VSLAM applied to USV or has experience in this area, could you please provide insights on where you think we can improve upon the current solutions available in ROS (Robot Operating System) and how this could be an innovation?
The idea is to create something new and contribute for ROS community.
VSLAM on the water? That sounds hard. I’m no expert on VSLAM, but doesn’t it rely on identifying visual features that can be recognized between frames? On the water, you wouldn’t expect many such features to exist, especially since the water itself keeps moving, too, no?
I agree. Perhaps they are assuming the vessel will be in clear or shallow water, and then perform VSLAM on the sea/lake bed below? Or they are navigating rivers/waterways, where there are constant nearby landmarks on the shore. But even still, this seems challenging (for the reasons you’ve mentioned, but also because of the up-and-down bobbing of the camera). It’s also not clear what advantages this offers over GPS+RTK: if you’re on the water, you’ve got a great line-of-sight to satellites basically all the time.
I think this is the backwards way to do it: rather than building a solution and then finding a problem to apply it to, perhaps you should be focusing on finding the problem and then solving that? For example, perhaps there’s a specific edge-scenario where this is applicable (like navigating around a dock). Alternatively, focusing on the problem of VSLAM in highly noisy areas or with hard-to-model camera movement would be useful beyond just USVs.
@Alex_Salgado Could you provide some additional detail on what your use-case is here? That might make it easier to see what you have in mind.
The idea is to use VSLAM for USVs in coastal waters, whether or not fixed landmarks on land are used as reference points.
I’ve talked to some sailor friends, and they mentioned that despite a vessel being equipped with various sensors, in port or dock areas, they often rely on visual navigation for the practicality of reaction time (as seen in the example of the iceberg with the Titanic).
So, I would like to create a system that replicates this behavior. They refer to it as “bearing problems navigation in restricted waters.”
As mentioned, some challenges I already foresee are:
A scarcity of reference landmarks.
Various and often compromised visibility conditions.
Typically unstable observation points.
The teleportation effect due to sensor signal loss.
We would also incorporate an IMU to enhance and adjust the trajectory.
Additionally, we plan to incorporate an Inertial Measurement Unit (IMU) to enhance and adjust the trajectory.
I apologize if my earlier expression was unclear, but the main idea is to consult the community to see if anyone has undertaken a similar endeavor and can identify an unresolved issue that would be valuable for someone to dedicate their efforts to and contribute to the community. This could serve as my starting point as a kind of initial survey.
I think to contribute something useful here you’d have to come up with something pretty novel, since traditional vslam won’t really perform well in the general maritime use case. Additionally, there’s already a ton of tried and true sensors in use for the same purpose, and navigation/localization of surface vessels is well studied and sort of solved (except for edge cases, which you should try to identify!).
These methods/sensors are generally quite robust, like traditional gnss with RTK, INS systems and DVLs (doppler velocity log). You could argue a use case for ASVs in gps-denied environments, but these are rare for your typical asv application. Maybe if you’re travelling through tunnels, or in defense.
I’ve seen a couple papers where they used monocular cameras for autonomous docking. This makes sense, since you have clear, static physical features. I would try to find a concrete scenario where VSLAM specifically would perform better than current methods.
Lastly, are you just implementing traditional vslam on a boat? or are you looking to design a new method (or tailor an existing one)? Maybe this calls for a neural network