
Normal Distribution Transform

Like the ICP algorithm, the Normal Distribution Transform (NDT) [4] computes a geometrical transformation between a given point cloud and an existing model, which is often a pre-generated map or a point cloud collection built during SLAM, as in fig. 3.9.

The algorithm, however, introduces the idea of simplifying the model point cloud by transforming it into a set of normal distributions in 2D space.

First, the space is divided into a fixed-size 2D grid. Given a point cloud, each point is assigned to a cell of the grid according to its (x, y) coordinates. Then, for every cell containing at least three points, a 2D normal distribution is computed.

The distribution is simply computed by fitting the 2D coordinates of the points within the cell:

$$\text{mean:}\quad q = \frac{1}{n}\sum_{i} x_i \qquad\qquad \text{covariance matrix:}\quad \Sigma = \frac{1}{n}\sum_{i} (x_i - q)(x_i - q)^T$$
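For illustration only, a minimal NumPy sketch of this step might look as follows; the function name build_ndt_grid and the cell_size parameter are our own, not taken from any particular implementation:

```python
import numpy as np
from collections import defaultdict

def build_ndt_grid(points, cell_size=1.0):
    """Bin 2D points into a fixed-size grid and fit a Gaussian per cell.

    points: (N, 2) array of (x, y) coordinates from the model point cloud.
    Returns a dict mapping a cell index to its (mean, covariance) pair.
    """
    cells = defaultdict(list)
    for p in points:
        idx = (int(np.floor(p[0] / cell_size)), int(np.floor(p[1] / cell_size)))
        cells[idx].append(p)

    grid = {}
    for idx, pts in cells.items():
        if len(pts) < 3:                 # too few points to estimate a covariance
            continue
        pts = np.asarray(pts)
        q = pts.mean(axis=0)             # cell mean
        d = pts - q
        sigma = d.T @ d / len(pts)       # cell covariance with 1/n normalization
        grid[idx] = (q, sigma)
    return grid
```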

At this point we have a collection of normal distributions, one per cell; since the normal density is differentiable, the overall model is a piecewise differentiable function of the point coordinates.

Figure 3.9: The normal distribution transform allows tracking the position of the vehicle by matching the point cloud coming from the sensor (colored points) against an existing point cloud map (grayscale), using the ndt_matching implementation from Autoware. The map is generated using the same process described in section 7.3.

Given a new point cloud, it is possible to choose the optimal rotation and translation parameters to match the cloud with the model by using the standard Newton's method (see Appendix 1). In more detail, the parameters $p = (\phi, t_x, t_y)^T$ define a transform from a 2D point $(x, y)$ in reference frame A to a point $(x', y')$ in reference frame B:

$$T:\quad \begin{pmatrix} x' \\ y' \end{pmatrix} = \begin{pmatrix} \cos\phi & -\sin\phi \\ \sin\phi & \cos\phi \end{pmatrix} \begin{pmatrix} x \\ y \end{pmatrix} + \begin{pmatrix} t_x \\ t_y \end{pmatrix}$$

Optimal values for the parameters $p$ are sought that align a new scan to the model, which is summarized by the grid of normal distributions. Starting from an initial estimate $p \leftarrow p_0 = (\phi_0, t_{x,0}, t_{y,0})^T$, we calculate a score of the match between the model and the scan:

$$\mathrm{score}(p) = \sum_{i} \exp\left(-\frac{(x'_i - q_i)^T \, \Sigma_i^{-1} \, (x'_i - q_i)}{2}\right)$$

where $x'_i$ is the $i$-th point of the scan transformed by $T$, and $q_i$, $\Sigma_i$ are the mean and covariance of the grid cell that contains it.

In other words, the "goodness" of the match is computed as the sum of the probabilities yielded by the grid-of-Gaussians model, evaluated at each point of the new point cloud. The value of $p$ that maximizes the score is sought by successive approximations. As said before, this is possible since the objective function is differentiable. The exact expressions of the Jacobian and Hessian matrices, used in Newton's method, can be found in the original paper.
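Continuing the sketch above, the score and its maximization could be written roughly as follows; for brevity this uses a generic derivative-free optimizer in place of the Newton iteration with the analytic Jacobian and Hessian described in the paper, and the register helper is again hypothetical:

```python
import numpy as np
from scipy.optimize import minimize

def transform(points, p):
    """Apply the 2D rigid transform T with parameters p = (phi, tx, ty)."""
    phi, tx, ty = p
    R = np.array([[np.cos(phi), -np.sin(phi)],
                  [np.sin(phi),  np.cos(phi)]])
    return points @ R.T + np.array([tx, ty])

def ndt_score(p, scan, grid, cell_size=1.0):
    """Sum of the Gaussian responses of the transformed scan under the grid model."""
    score = 0.0
    for x in transform(scan, p):
        idx = (int(np.floor(x[0] / cell_size)), int(np.floor(x[1] / cell_size)))
        if idx not in grid:              # scan point falls outside the model
            continue
        q, sigma = grid[idx]
        d = x - q
        score += np.exp(-0.5 * d @ np.linalg.solve(sigma, d))
    return score

def register(scan, grid, p0=np.zeros(3)):
    """Estimate p by maximizing the score (i.e. minimizing its negative)."""
    res = minimize(lambda p: -ndt_score(p, scan, grid), p0, method="Nelder-Mead")
    return res.x
```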

6 ROS implementation: robot_localization

Localization is an essential and ubiquitous task in robotics, as most robots need to localize themselves or parts of themselves (such as a manipulator), even if the robot does not move around an environment. As such, ROS naturally provides many high-quality implementations of localization and SLAM algorithms, catering to a number of different use cases. Among these, some notable options are:

• amcl [47], which is included in the standard ROS navigation stack [46], and provides a probabilistic 2D localization system based on the KLD-sampling Monte Carlo localization approach [15];

• robot_localization [33], which is a generalized Extended Kalman Filter that supports a wide range of sensor inputs;

• Google Cartographer [19], a real-time SLAM system by Google, mainly intended for LIDAR (point cloud) and inertial sensor input, both in 2D and 3D;

• ORB-SLAM [36], a feature-based visual SLAM system, which we discussed in more detail in section 4.3;

• LSD-SLAM [14], a direct (full-image) visual SLAM system.

The first choice we made was whether to use a full SLAM system (see section 3) or a more traditional filtering-based localization solution. We opted for the latter, mainly on the grounds that we easily get two different types of estimates out of the algorithm: (1) a local estimate, akin to the result of performing odometry, which is continuous and can be considered accurate in the immediate surroundings of the vehicle, but accumulates error when measured against the car's starting location; (2) a global estimate, which may be subject to discontinuities (i.e. the vehicle seems to "teleport" itself a few feet away every once in a while), but is generally accurate with respect to a global (geographic-scale) map.

We then chose the implementation based on a few criteria, the most important being:

• Compatibility with many sensor types. Our experimental vehicle is equipped with many sensors, including cameras, LIDARs, inertial platforms, GPS and speed meters, some being part of the vehicle's standard equipment and some that we installed ourselves. In theory, localization can be aided by the data from all those sensors, so we were interested in implementations compatible with them.

• Implementation quality. ROS is not just made of concrete components (such as software and collaboration infrastructure on the Internet), but also of conventions. A node that does not adhere to ROS' conventions poses many costs and challenges: it is harder to integrate with the rest of the system, harder to build tools for, and may require modifications to its source code (and therefore integration into our build system). These costs are non-essential, and are ideally avoided by using software with high implementation standards.

• Documentation quality. Implementation quality would be largely useless without good documentation that guides the user through the configuration and tuning of the ROS nodes.

Of the projects we are aware of, the one that best matches our requirements is robot_localization.

The software includes two complete, high-quality, well-documented implementations of filtering-based localization: one is based on an Extended Kalman Filter (ROS node ekf_localization_node), the other on an Unscented Kalman Filter (ukf_localization_node). Both nodes have identical interfaces with regards to ROS parameters, topics and services, so they can be swapped with almost no additional work (other than adapting the part of the configuration that is specific to the type of filter).

Importantly, the package also includes tools to convert the estimate from the vehicle's linear frame of reference to the WGS84 geographic frame of reference (which is significantly non-linear over a large enough scale) and vice versa. In particular, a node named navsat_transform_node allowed us to easily bring the geographical coordinates provided by the GPS into a form compatible with the other inputs, and use them alongside the other sensors.
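As a toy illustration of what "bringing GPS coordinates into a compatible form" involves (this is a crude local approximation written by us, not what navsat_transform_node actually does), one could project fixes onto a local metric plane:

```python
import numpy as np

EARTH_RADIUS = 6378137.0  # WGS84 equatorial radius, in meters

def geodetic_to_local(lat, lon, lat0, lon0):
    """Project a GPS fix onto a local metric plane centered at (lat0, lon0).

    This is an equirectangular approximation, adequate only over short
    distances; the real node performs a proper map projection and also
    handles heading and frame offsets.
    """
    x = np.radians(lon - lon0) * EARTH_RADIUS * np.cos(np.radians(lat0))
    y = np.radians(lat - lat0) * EARTH_RADIUS
    return x, y
```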

We evaluated our results by visualizing the location estimate overlaid on a geographical map using the Mapviz visualization tool [31]. Mapviz uses a different method, from the swri_transform_util package, to make localization estimates compatible with geographical data. Both are published as open source by the Southwest Research Institute.

robot_localization complies strictly with REP-103 [42] and REP-105 [43], ROS' standardized conventions for units of measure and for the communication of odometry through messages and frames of reference (in the sense of TF frames, see section 1.1), respectively. For the purpose of localization, the most important convention is that the TF tree is rooted in a frame named map, and that it contains the chain of transforms map → odom → base_link, where:

• map is the fixed frame, conventionally rigidly attached to “the world”. Any map of the global environment will be expressed in this frame.

• odom is also a world-fixed frame, but it differs from map by the accumulated odometry error. The discontinuities ("teleports") of the car's trajectory will also be visible in the map → odom transform.

• base_link is the vehicle-fixed frame. It represents the current location of the robot.

Somewhat non-intuitively, even though we have two independently computed localization estimates (global and local) that relate map and odom to the same base_link frame, base_link cannot be a direct child of both frames of reference (the TF structure has to be a tree, and a node cannot have two parents).

Instead, there is a map → odom transform, computed from the difference between the two estimates.
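A minimal 2D sketch of this composition, with made-up poses and a hypothetical helper, could look like this (robot_localization performs the equivalent operation in full 3D):

```python
import numpy as np

def pose_to_matrix(x, y, yaw):
    """Homogeneous matrix for a planar pose (a 2D simplification of a TF transform)."""
    c, s = np.cos(yaw), np.sin(yaw)
    return np.array([[c, -s, x],
                     [s,  c, y],
                     [0., 0., 1.]])

# Hypothetical, made-up poses of base_link as seen from the two frames:
T_map_base = pose_to_matrix(10.0, 5.0, 0.30)   # global estimate (map -> base_link)
T_odom_base = pose_to_matrix(9.2, 5.4, 0.28)   # local estimate (odom -> base_link)

# The published map -> odom transform is whatever closes the chain
# map -> odom -> base_link, i.e. the "difference" between the two estimates.
T_map_odom = T_map_base @ np.linalg.inv(T_odom_base)
```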

Another useful topic covered by the REP-105 specification is the convention for representing geographical data (such as GPS). navsat_transform_node requires GPS messages to follow this convention in order to translate them into the same form as other inputs.
