ARES-Recruitment

SENSOR FUSION FOR DRIFT CORRECTION

There is a 9-DOF IMU which consisits of a 6-DOF( 3-axis accelerometer combined with a 3-axis gyroscope ) combined with a magnetometer to find and measure the location of the rover and the drift correction variables namely pitch, yaw and roll.

Kalman Filter

What is Kalman Filter? A kalman filter is an optimal estimation algorithm. One can use a kalman filter in any place where you have some uncertain information about some dynamic system and you have to make an educated guess about what will the system is going to do next. Kalman filters are ideal for systems which are continiously changing.The advantange of using kalman filters is that they are light on memory as they don't have to keep any history of the previous state, they are very fast and therefore well suited for realtime and embedded systems.

We have the current state of the system and we predict the future state of the system using maths. We take in account all the factors that can affect the system whether it be internal factors or external uncertainty.

We have one gaussian blob we got from our predictions and others from the sensors. We multiply them and the resultant is a new gaussian blob with it's own mean and covariance matrix. we find out the kalman factor/ kalman gain.

Kalman filter is used to model linear systems and for nonlinear systems we have Extended Kalman Filter.

Extended Kalman Filter

In real life most of the problems involve non linear functions.If we feed a Gaussian with a linear function then the output is also gaussian but in case of a non linear(angles,sine and cosine functions) function the output is a non gaussian distribution.

We use approximation to make a non linear funtion linear with the help of Taylor series. We use the first derivative of the linear function obtained by the Taylor series and feed it to the Gaussian function. Then we just do the same series of operations we do with a normal kalman filter.

So in our case we use 2 Extended kalman filters. One is used to find out the pitch,yawn and roll of the rover,the extended kalman filter is feeded with the output given by the 9-DOF IMU and is upated with the help of output of the stereo visual odometer from the localization thread. The output is feeded to the local_map_optimization unit of the localization thread.

The other filter is used to find out the velocity and cooridnates of the rover, it is feeded with the same output from the 9-DOF IMU in addition with the longitude and latitude given by the GPS module. The output is feeded to the global_map_representation unit of the localization thread.

STEREO SETUP

A stereo camera is a type of camera with two or more image sensors. This allows the camera to simulate human binocular vision and therefore gives it the ability to perceive depth.

Human binocular vision

The human binocular vision perceives depth by using Stereo disparity which refers to the difference in image location of an object seen by the left and right eyes, resulting from the eyes’ horizontal separation.

The brain uses this binocular disparity to extract depth information from the two-dimensional retinal images which are known as stereopsis.

The rover is generating a 3D image if the surrounding using the 2 cameras which is then feeded to the localization thread, Obstacle detection and motion planning thread and ball detection unit.

LOCALISATION THREAD

Stereo Visual odometry

What is Visual odometry?

VO is the process of estimating the camera’s relative motion by analyzing a sequence of camera images. VO is defined as the process of estimating the robot’s motion (translation and rotation with respect to a reference frame) by observing a sequence of images of its environment. VO is a particular case of a technique known as Structure From Motion (SFM) that tackles the problem of 3D reconstruction of both the structure of the environment and camera poses from sequentially ordered or unordered image sets.

The output from the cameras is feeded to stereo_visual_odometry unit and a number of operations are perfored to convert the 3D to 2D motion estimation.

1. Feature detection: The 2 images from the stereo camera are feeded to this OpenCV function which detects features from the images.The output from this function is then passed to TriangulatePoints().

2. Triangulate Points: This function triangulates the 3d position of 2d correspondences between the two images.

3. RANSAC(): RANSAC is abbreviation of RANdom SAmple Consensus, in computer vision, we use it as a method to calculate homography between two images.

The output from this is passed to the local_map_optimization function.

Local Map Optimization

The general idea of graph optimization is to express the SLAM problem as a graph structure.

G2o: g2o, short for General (Hyper) Graph Optimization, is a C++ framework for performing the optimization of nonlinear least squares problems that can be embedded as a graph or in a hyper-graph.

Two functions of the G2o library are used. Sparse Optimization adn SE3 Quat. These functions give us the optimized local coordinates and optimized 3D coordinates.

The Visual point cloud function is used to create a map of the points.

The local cooridnates are passed to the global map representation unit.

Global Map Representation

Here the local states are used to find the current states of the rover like the position, velocity and direction. The local and global states represent the rover position on the terrain as well as on the global map.

The current global states and dynamic constraints of the rover are passed to the Obstacle detection and motion planning thread.

OBSTACLE DETECTION AND MOTION PLANNING THREAD

Detect Obstacles

What is a Disparity map?

Disparity refers to the difference in location of an object in corresponding two (left and right) images as seen by the left and right eye which is created due to parallax (eyes’ horizontal separation). The brain uses this disparity to calculate depth information from the two dimensional images. The disparity of a pixel is equal to the shift value that leads to minimum sum-of-squared-differences for that pixel.

StereoSGBM

Stereo SGBM stands for semi block matching algorithm. StereoSGBM is used to get the disparity map.Then the disparity map is passed to the calc_v_disp() function.

V-disparity

Each row of the V-disparity image is a histogram of the various values of disparity that appeared on that row in the disparity map.

When done right, the disparities of the points on the ground plane will appear as a strong line in the V-disparity map.V disparity map is used to calculate the road area.

Hough Transformation

The Hough transform is a technique which can be used to isolate features of a particular shape within an image.

So we get the disparity map from StereoSGBM and the road map after hough transformation.Then we subtract the road map from the disparity map to get the Obstacle and then we project them to in the 3D point cloud.

Motion Planner

Get required variables

The values of all the required variables are passed from the previous steps. The speed and direction of the rover are passed from the Localization thread and the obstacle point cloud from the obstale detection function.

OMPL, the Open Motion Planning Library, consists of many state-of-the-art sampling-based motion planning algorithms.

We use OMPL library's function to find the shortest path avoiding all the obstacles considering rovers constraints. Then commands are generated for each of the motor.

These commands are passed to the motors and the rover works accordingly.

ROS NODE FOR MOTION CONTROL

ROS stands for Robot Operating system. So the commands generated from the obstacle detection and motion planning thread are passed to this unit where it gives instructions to the rover and according to the instructions the rover moves.

In our rover there is a seperate motor for each wheel. So seperate instructions are generated for each wheel.

BALL DETECTION

What are contours?

Contours can be explained simply as a curve joining all the continuous points (along the boundary), having same color or intensity. The contours are a useful tool for shape analysis and object detection and recognition.

So Contour_detection() is used to detect the objects in the image which are passes by the Stereo Setup.

Then we use color detetction for green color and we get the desired output.

Resources used:

Kalman and extended kalman filter:
Stereo Vision:
- https://www.e-consystems.com/blog/camera/what-is-a-stereo-vision-camera/
- http://www.cs.cmu.edu/~kaess/vslam_cvpr14/media/VSLAM-Tutorial-CVPR14-A12-StereoVO.pdf
Localisation thread:
G2o:
- https://fzheng.me/2016/03/15/g2o-demo/
- https://www.cct.lsu.edu/~kzhang/papers/g2o.pdf
Disparity Maps:
- https://jayrambhia.com/blog/disparity-mpas
V-disparity:
- https://pdfs.semanticscholar.org/1e1f/3aa8c1d2adeeda6b7c2ca08f2d4a63609bae.pdf
Hough Transformation:
- http://homepages.inf.ed.ac.uk/rbf/HIPR2/hough.htm
Contours:
- https://docs.opencv.org/3.3.1/d4/d73/tutorial_py_contours_begin.html

chaharnishant11 / ares-recruitment Goto Github PK