# OV²SLAM

## A Fully Online and Versatile Visual SLAM for Real-Time Applications

**Paper**: [[arXiv]](https://arxiv.org/pdf/2102.04060.pdf)

**Videos**: [[video #1]](https://www.youtube.com/watch?v=N4LFD4WKHyg), [[video #2]](https://www.youtube.com/watch?v=N5O0-0339fU), [[video #3]](https://www.youtube.com/watch?v=zNevDT12cKI), [[video #4]](https://www.youtube.com/watch?v=xhLZGDdb0FU), [[video #5]](https://www.youtube.com/watch?v=ITE1yYA5B78), [[video #6]](https://www.youtube.com/watch?v=9D66qpzBvi4)

**Authors:** Maxime Ferrera, Alexandre Eudes, Julien Moras, Martial Sanfourche, Guy Le Besnerais (maxime.ferrera@gmail.com / first.last@onera.fr).

---

**April 2024 update**: A ROS2 branch of OV2SLAM is now available [here](https://github.com/ov2slam/ov2slam/tree/ros2) thanks to [@MatPiech](https://github.com/MatPiech)

---

**OV²SLAM** is a fully real-time **Visual SLAM** algorithm for **Stereo** and **Monocular** cameras. A complete SLAM pipeline is implemented with a carefully designed multi-threaded architecture that performs Tracking, Mapping, Bundle Adjustment and Loop Closing in real-time.

The Tracking is based on an indirect Lucas-Kanade optical-flow formulation and provides camera pose estimates at the camera's frame-rate. The Mapping runs at the keyframes' rate and ensures continuous localization by populating the sparse 3D map and minimizing drift through a local map tracking step. Bundle Adjustment is applied with an anchored inverse depth formulation, reducing the parametrization of each 3D map point from 3 parameters to 1.

Loop Closing is performed through an **Online Bag of Words** method thanks to [iBoW-LCD](https://github.com/emiliofidalgo/ibow-lcd). Unlike classical offline BoW methods, no pre-trained vocabulary tree is required. Instead, the vocabulary tree is computed online from the descriptors extracted from the incoming video stream, so it is always suited to the currently explored environment.
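For intuition, here is a minimal sketch of the anchored inverse-depth idea, assuming Sophus types; the helper below is hypothetical and not taken from the OV²SLAM sources:

```
#include <Eigen/Core>
#include <sophus/se3.hpp>

// Hypothetical helper for illustration: a map point is stored as a single
// scalar, its inverse depth along the bearing vector of its first
// observation in an anchor keyframe, instead of 3 XYZ coordinates.
Eigen::Vector3d pointFromInverseDepth(
    const Sophus::SE3d& T_world_anchor,  // pose of the anchor keyframe
    const Eigen::Vector3d& bearing,      // normalized bearing of the first observation
    double inv_depth)                    // the single optimized parameter
{
  // Back-project in the anchor camera frame, then move to the world frame.
  return T_world_anchor * (bearing / inv_depth);
}
```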
## Related Paper

If you use OV²SLAM in your work, please cite it as:

```
@article{fer2021ov2slam,
  title={{OV$^{2}$SLAM} : A Fully Online and Versatile Visual {SLAM} for Real-Time Applications},
  author={Ferrera, Maxime and Eudes, Alexandre and Moras, Julien and Sanfourche, Martial and {Le Besnerais}, Guy.},
  journal={IEEE Robotics and Automation Letters},
  year={2021}
}
```

## License

OV²SLAM is released under the [GPLv3 license](https://www.gnu.org/licenses/gpl-3.0.txt).

For a closed-source version of OV²SLAM for commercial purposes, please contact [ONERA](https://www.onera.fr/en/contact-us) or the authors.

Copyright (C) 2020 [ONERA](https://www.onera.fr/en)

## 1. Prerequisites

The library has been tested with **Ubuntu 16.04 and 18.04**, **ROS Kinetic and Melodic** and **OpenCV 3**. It should also work with **ROS Noetic and OpenCV 4**, but this configuration has not been fully tested.

### 1.0 C++11 or Higher

OV²SLAM makes use of C++11 features and should thus be compiled with a C++11 (or higher) flag.

### 1.1 ROS

ROS is used for reading the video images through bag files and for visualization purposes in Rviz: [ROS Installation](http://wiki.ros.org/ROS/Installation)

Make sure that the pcl_ros package is installed (replace `<distro>` with your ROS distribution, e.g. `melodic`):

```
sudo apt install ros-<distro>-pcl-ros
```

or, alternatively:

```
rosdep install ov2slam
```

### 1.2 Eigen3

[Eigen3](http://eigen.tuxfamily.org/index.php?title=Main_Page) is used throughout OV²SLAM. It should work with any version >= 3.3.0; lower versions have not been tested.

### 1.3 OpenCV

OpenCV 3 has been used for the development of OV²SLAM. OpenCV 4 might be supported as well, but it has not been tested.

(Optional) The use of the BRIEF descriptor requires **opencv_contrib** to be installed. If it is not, ORB will be used instead, computed without its scale and rotation invariance properties (which makes it essentially equivalent to BRIEF).

**WATCH OUT!** By default, the CMakeLists.txt file assumes that opencv_contrib is installed; __set the OPENCV_CONTRIB flag to OFF in CMakeLists.txt if it is not the case__.

### 1.4 iBoW-LCD

A modified version of [iBoW-LCD](https://github.com/emiliofidalgo/ibow-lcd) is included in the Thirdparty folder. It has been turned into a shared lib and is not a catkin package anymore. The same goes for [OBIndex2](https://github.com/emiliofidalgo/obindex2), the required dependency of iBoW-LCD. Check the lcdetector.h and lcdetector.cc files to see the modifications w.r.t. the original code.

### 1.5 Sophus

[Sophus](https://github.com/strasdat/Sophus) is used for the representation of _*SE(3), SO(3)*_ elements. For convenience, a copy of Sophus has been included in the Thirdparty folder.

### 1.6 Ceres Solver

[Ceres](https://github.com/ceres-solver/ceres-solver) is used for optimization-related operations such as PnP, Bundle Adjustment or Pose Graph Optimization. For convenience, a copy of Ceres has been included in the Thirdparty folder. Note that the [Ceres dependencies](http://ceres-solver.org/installation.html) are still required.
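To give an idea of how such optimization problems are fed to Ceres, here is a minimal, hypothetical reprojection-error functor (a simple angle-axis pose with an idealized pinhole projection; OV²SLAM's actual cost functions differ):

```
#include <ceres/ceres.h>
#include <ceres/rotation.h>

// Hypothetical pinhole reprojection residual, for illustration only.
struct ReprojectionError {
  ReprojectionError(double u, double v, double fx, double fy, double cx, double cy)
      : u_(u), v_(v), fx_(fx), fy_(fy), cx_(cx), cy_(cy) {}

  template <typename T>
  bool operator()(const T* const pose, const T* const point, T* residuals) const {
    // pose[0..2]: angle-axis rotation, pose[3..5]: translation (world -> camera).
    T p_cam[3];
    ceres::AngleAxisRotatePoint(pose, point, p_cam);
    p_cam[0] += pose[3]; p_cam[1] += pose[4]; p_cam[2] += pose[5];

    // Project with the pinhole model and compare to the measured keypoint.
    residuals[0] = T(fx_) * p_cam[0] / p_cam[2] + T(cx_) - T(u_);
    residuals[1] = T(fy_) * p_cam[1] / p_cam[2] + T(cy_) - T(v_);
    return true;
  }

  double u_, v_, fx_, fy_, cx_, cy_;
};

// Usage sketch: 2 residuals, 6 pose parameters, 3 point parameters.
// problem.AddResidualBlock(
//     new ceres::AutoDiffCostFunction<ReprojectionError, 2, 6, 3>(
//         new ReprojectionError(u, v, fx, fy, cx, cy)),
//     new ceres::HuberLoss(1.0), pose, point);
```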
### 1.7 (Optional) OpenGV

[OpenGV](https://github.com/laurentkneip/opengv) can be used for Multi-View-Geometry (MVG) operations. The results reported in the paper were obtained using OpenGV. For convenience, if OpenGV is not installed, alternatives to the MVG operations are provided through OpenCV functions. **Note** that performance might be lower without OpenGV.

## 2. Installation

### 2.0 Clone

Clone the git repository into your catkin workspace:

```
cd ~/catkin_ws/src/
git clone https://github.com/ov2slam/ov2slam.git
```

### 2.1 Build Thirdparty libs

For convenience, we provide a script to build the Thirdparty libs:

```
cd ~/catkin_ws/src/ov2slam
chmod +x build_thirdparty.sh
./build_thirdparty.sh
```

**WATCH OUT!** By default, the script builds obindex2, ibow-lcd, sophus and ceres. If you want to use your own version of Sophus or Ceres, you can comment out the related lines in the script. Regarding Ceres, note that OV²SLAM is compiled by default with the "-march=native" flag, so the Ceres lib linked to OV²SLAM must be compiled with this flag as well, which is not the default case (at least since Ceres 2.0). The _*build_thirdparty.sh*_ script ensures that Ceres builds with the "-march=native" flag. If you are not interested in the Loop Closing feature of OV²SLAM, you can also comment out the lines related to obindex2 and ibow-lcd.

**(Optional)** Install OpenGV:

```
cd your_path/
git clone https://github.com/laurentkneip/opengv
cd opengv
mkdir build
cd build/
cmake ..
sudo make -j4 install
```

### 2.2 Build OV²SLAM

Build the OV²SLAM package with your favorite catkin tool:

```
cd ~/catkin_ws/src/ov2slam
catkin build --this
source ~/catkin_ws/devel/setup.bash
```

OR

```
cd ~/catkin_ws/
catkin_make --pkg ov2slam
source ~/catkin_ws/devel/setup.bash
```

## 3. Usage

Run OV²SLAM with:

```
rosrun ov2slam ov2slam_node parameter_file.yaml
```

Visualize the OV²SLAM outputs in Rviz by loading the provided configuration file: ov2slam_visualization.rviz.

## 4. Miscellaneous

### Supported Camera Models

Both the Pinhole Rad-Tan and Fisheye camera models are supported. The models are OpenCV-based. If you use [Kalibr](https://github.com/ethz-asl/kalibr) for camera calibration, the equivalences are:

- OpenCV "Pinhole" -> Kalibr "Pinhole Radtan"
- OpenCV "Fisheye" -> Kalibr "Pinhole Equidistant"
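As a concrete illustration of the two model families, the following sketch undistorts a keypoint with both OpenCV models; the intrinsic and distortion values are made up for the example:

```
#include <vector>
#include <opencv2/calib3d.hpp>
#include <opencv2/core.hpp>

int main() {
  // Hypothetical intrinsics for illustration; use your own calibration values.
  cv::Matx33d K(458.0, 0.0, 367.0,
                0.0, 457.0, 248.0,
                0.0, 0.0, 1.0);
  std::vector<cv::Point2f> raw{{100.f, 200.f}}, undist;

  // Pinhole Rad-Tan (Kalibr "Pinhole Radtan"): k1, k2, p1, p2 coefficients.
  cv::Vec4d radtan(-0.28, 0.07, 0.0002, 0.00002);
  cv::undistortPoints(raw, undist, K, radtan, cv::noArray(), K);

  // Fisheye (Kalibr "Pinhole Equidistant"): k1..k4 equidistant coefficients.
  cv::Vec4d equi(-0.01, 0.02, -0.02, 0.005);
  cv::fisheye::undistortPoints(raw, undist, K, equi, cv::noArray(), K);
  return 0;
}
```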
### Extrinsic Calibration

The stereo extrinsic parameters in the parameter files are expected to represent the transformation from the camera frame to the body frame (**T_body_cam**, i.e. **X_body = T_body_cam * X_cam**). Therefore, if **T_body_camleft** is set to the Identity transformation, for the right camera we have: **T_body_camright** = **T_camleft_camright**.

In Kalibr, the inverse transformation is provided (i.e. **T_cam_body**). Yet, Kalibr also provides the extrinsic transformation of each camera w.r.t. the previous one through the field **T_cn_cnm1**. This transformation can be used directly in OV²SLAM by setting **T_body_camleft** = **T_cn_cnm1** and **T_body_camright** = **I_4x4**.
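The following sketch illustrates these conventions with Sophus, using made-up extrinsics (the left camera is taken as the body frame and the right camera sits 11 cm to its right):

```
#include <Eigen/Core>
#include <Eigen/Geometry>
#include <sophus/se3.hpp>

int main() {
  // Hypothetical stereo extrinsics, for illustration only.
  Sophus::SE3d T_body_camleft;  // Identity: body frame = left camera frame
  Sophus::SE3d T_body_camright(Eigen::Quaterniond::Identity(),
                               Eigen::Vector3d(0.11, 0.0, 0.0));

  // Convention used in the parameter files: X_body = T_body_cam * X_cam.
  Eigen::Vector3d X_camleft(1.0, 2.0, 5.0);
  Eigen::Vector3d X_body = T_body_camleft * X_camleft;

  // The left-to-right transformation follows by composition:
  // T_camleft_camright = T_body_camleft^-1 * T_body_camright.
  Sophus::SE3d T_camleft_camright =
      T_body_camleft.inverse() * T_body_camright;
  Eigen::Vector3d X_left = T_camleft_camright * Eigen::Vector3d(0.0, 0.0, 5.0);

  (void)X_body; (void)X_left;
  return 0;
}
```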
### Parameters File Description

Three directories are provided within the parameter_files folder: _*accurate*_, _*average*_ and _*fast*_. They all store the parameter files to be used with KITTI, EuRoC and TartanAir.

* The _*accurate*_ folder provides the parameters as used in the paper for the full method (i.e. OV²SLAM w. LC).
* The _*fast*_ folder provides the parameters as used in the paper for the Fast version of OV²SLAM.
* The _*average*_ folder is provided for convenience as an in-between mode.

Parameters details:

* debug: whether to display debugging information
* log_timings: whether to log and display the timings of the main functions
* mono: set to 1 if you are in a monocular configuration
* stereo: set to 1 if you are in a stereo configuration
* force_realtime: set to 1 to enforce real-time processing (i.e. only process the last received image, even if it means dropping not-yet-processed images)
* slam_mode: must be set to 1
* buse_loop_closer: set to 1 to use Loop Closing
* bdo_stereo_rect: set to 1 to apply stereo rectification (and use epipolar lines for stereo matching)
* alpha: to be set between 0 and 1; 0: rectified images contain only valid pixels / 1: rectified images contain all original pixels (see the OpenCV doc for more details)
* bdo_undist: set to 1 to process undistorted images (the alpha parameter is used in this case too)
* finit_parallax: amount of parallax expected for creating new keyframes (should be set between 15. and 40.)
* use_shi_tomasi: set to 1 to use the OpenCV GFTT keypoint detector
* use_fast: set to 1 to use the OpenCV FAST keypoint detector
* use_brief: set to 1 to extract BRIEF descriptors from the detected keypoints (must be set to 1 to apply local map matching, see below)
* use_singlescale_detector: set to 1 to use our keypoint detector based on the OpenCV cornerMinEigenVal function
* nmaxdist: size of the image cells used for extracting keypoints (the bigger, the fewer keypoints you will get)
* nfast_th: FAST detector threshold (the lower, the more sensitive the detector)
* dmaxquality: GFTT and cornerMinEigenVal detector threshold (the lower, the more sensitive the detector)
* use_clahe: set to 1 to apply CLAHE to the processed images
* fclahe_val: strength of the CLAHE effect
* do_klt: must be set to 1
* klt_use_prior: if set to 1, keypoints that are observations of 3D map points are initialized with a constant velocity motion model to get a prior before applying KLT tracking
* btrack_keyframetoframe: if set to 1, KLT is applied between the previous keyframe and the current frame instead of between the previous frame and the current frame (setting it to 0 usually leads to better accuracy)
* nklt_win_size: size of the pixel patch used in KLT tracking (the CLAHE and KLT parameters are illustrated in the sketch after this list)
* nklt_pyr_lvl: number of pyramid levels used by KLT in addition to the full-resolution image (i.e. if set to 1, two levels are used: half-resolution and full-resolution)
* nmax_iter: maximum number of iterations for the KLT optimization
* fmax_px_precision: precision sought by KLT (i.e. the solution does not vary by more than this value)
* fmax_fbklt_dist: maximum allowed error in the backward KLT tracking
* nklt_err: maximum allowed error between KLT tracks
* bdo_track_localmap: set to 1 to use local map tracking, based on the descriptors computed at each keyframe
* fmax_desc_dist: distance ratio w.r.t. the descriptor size for considering a match as good (to be set between 0 and 1)
* fmax_proj_pxdist: maximum distance in pixels between a map point projection and a keypoint for considering it as a matching candidate
* doepipolar: set to 1 to apply 2D-2D epipolar-based filtering
* dop3p: set to 1 to use a P3P-RANSAC pose estimation
* bdo_random: set to 1 to randomize RANSAC
* nransac_iter: maximum number of RANSAC iterations allowed
* fransac_err: maximum error in pixels for RANSAC
* fmax_reproj_err: maximum reprojection error in pixels when triangulating new map points
* buse_inv_depth: set to 1 to use an anchored inverse depth parametrization in Bundle Adjustment, set to 0 to use an XYZ parametrization
* robust_mono_th: threshold used for the robust Huber cost function in Bundle Adjustment
* use_sparse_schur: set to 1 to use sparse Schur (recommended) (see the Ceres doc)
* use_dogleg: set to 1 to apply Dogleg optimization (see the Ceres doc)
* use_subspace_dogleg: set to 1 to apply subspace Dogleg optimization (see the Ceres doc)
* use_nonmonotic_step: set to 1 to allow nonmonotonic steps in the optimization (see the Ceres doc)
* apply_l2_after_robust: set to 1 to re-optimize without the Huber loss after the removal of detected outliers in Bundle Adjustment
* nmin_covscore: minimum covisibility score w.r.t. the current keyframe for adding a keyframe as a state to optimize in Bundle Adjustment
* fkf_filtering_ratio: ratio of 3D map points co-observed by 4 other keyframes above which a keyframe is considered redundant and removed from the map
* do_full_ba: if set to 1, a final full Bundle Adjustment is applied once the sequence has been entirely processed
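As a rough illustration, the sketch below shows how some of the CLAHE and KLT parameters above could map to OpenCV calls; it is a hypothetical front-end snippet with example values, not the actual OV²SLAM code:

```
#include <vector>
#include <opencv2/imgproc.hpp>
#include <opencv2/video/tracking.hpp>

// Hypothetical front-end sketch: CLAHE pre-processing followed by pyramidal
// KLT tracking. Parameter values are examples only.
void trackFrontend(const cv::Mat& prev_img, const cv::Mat& cur_img,
                   std::vector<cv::Point2f>& prev_pts,
                   std::vector<cv::Point2f>& cur_pts)
{
  // use_clahe / fclahe_val: contrast-limited adaptive histogram equalization.
  cv::Ptr<cv::CLAHE> clahe = cv::createCLAHE(/*fclahe_val=*/3.0);
  cv::Mat prev_eq, cur_eq;
  clahe->apply(prev_img, prev_eq);
  clahe->apply(cur_img, cur_eq);

  // nmax_iter / fmax_px_precision: stopping criteria of the KLT optimization.
  cv::TermCriteria crit(cv::TermCriteria::COUNT + cv::TermCriteria::EPS,
                        /*nmax_iter=*/30, /*fmax_px_precision=*/0.01);

  // nklt_win_size: patch size; nklt_pyr_lvl: extra pyramid levels.
  // OPTFLOW_USE_INITIAL_FLOW assumes cur_pts holds a prior (klt_use_prior).
  std::vector<uchar> status;
  std::vector<float> err;
  cv::calcOpticalFlowPyrLK(prev_eq, cur_eq, prev_pts, cur_pts, status, err,
                           /*winSize=*/cv::Size(9, 9), /*maxLevel=*/3, crit,
                           cv::OPTFLOW_USE_INITIAL_FLOW);

  // fmax_fbklt_dist would then be enforced by tracking backward from cur_pts
  // to prev_pts and rejecting tracks whose round-trip error is too large.
}
```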
### Note on "-march=native"

If you experience issues when running OV²SLAM (segfaults, exceptions, ...), they might be related to the "-march=native" flag. By default, OpenGV and OV²SLAM come with this flag enabled, but Ceres does not. Making sure that all of them are built either with or without this flag might solve your problem.