Region-Growing Planar Segmentation for Robot Action Planning

The latest version is ...

[Relevant Paper]

This page is under construction...
The complete source code will be available soon. (It is partially available now.)

Description

Object detection, classification and manipulation are some of the capabilities required by autonomous robots. The main steps in object recognition and object classification are: segmentation, feature extraction, object representation and learning. To address the problem of learning object classification using multi-view range data, we used a relational approach (Farid et al., 2012a; Farid et al., 2012b; Farid et al., 2014a; Farid et al., 2014b). The first step of our object classification approach was to decompose a scene into shape primitives such as planes, followed by extracting a set of higher-level, relational features from the segmented regions. In this project, we compare our plane segmentation algorithm with state-of-the-art plane segmentation algorithms which are publicly available. We show that our segmentation outperforms and also produces better results especially for the robot action planning purposes.

Data formats

Range Image

A range image is similar to a 2D grey scale image. However, each pixel's value represents the distance from the sensor to the surface of an object in a scene from a particular viewpoint (Gächter, 2005; Gächter et al., 2006). This can be used to infer the shape of the object (Hegazy and Denzler, 2009).
The following pictures show a range image of a staircase with four steps, taken by a robot positioned in front of the staircase. In this grey scale image, the darker colour represents closer surfaces. For clarity, a colour-mapped version is also presented.

Point Cloud

A range image can be transformed into a set of points in 3D space, producing a 3D point cloud (Rabbani Shah, 2006). The following pictures show the corresponding point cloud for the recently shown range images. Because using just one view, such as the front view is ambiguous, another view of the same point cloud is also shown. The point cloud has been segmented into planes (using the plane segmentation algorithm developed in this project) that are identified by unique colours. Some points in this point cloud are removed because they are far away, such as the points in the yellow region of the colour-mapped version, previously shown.

Front View

Top View

After this conversion, an ordered (or structured) point cloud is available. Due to the 2D nature of the range image, the neighbourhood of each point in the ordered point cloud can be easily and quickly found. This fact has a great impact on the preprocessing speed. However, it should not be considered a limitation. Many algorithms create a neighbourhood structure when the point cloud is unordered. Therefore, the input is not be limited by range images. Any point cloud representing a partial view of an object can be used similarly.

TOP OF PAGE

Plane Segmentation Methods

Several methods claim to provide good plane segmentation such as two algorithms provided by PCL (Rusu et al., 2011) using the RANSAC algorithm. We use these two state-of-the-art plane segmentation methods especially because they are publicly available:

One of the PCL algorithms, setting the model type as SACMODEL_PLANE, uses 3D points belonging to the point cloud (URL) without considering normals or any additional constraints. We call it as SP.

The other algorithm, using the model type as the SACMODEL_NORMAL_PLANE, has an additional constraint similar to the method used in our research. We name it as SNP. It assumes the normal of each point must be parallel to the output plane normal within a maximum angular difference (URL). The use of SNP has been shown as a part of PCL's tutorial for cylinder model segmentation (URL).

Our Region growing plane segmentation algorithm using normal vectors. (URL).

TOP OF PAGE

Dataset

We captured data during RoboCup Rescue competitions, as well as from rescue laboratories and other indoor locations. In this paper, we use a subset (45 images) of such data which we used for learning classes such as box (12 images), stairs (15 images) and pitch/roll ramp in a maze(18 images).

Since it is difficult to comprehend the range image, the corresponding colour (RGB) image of the scene will be shown in the rest of the project. For each class, different multi-view data are chosen. For example, the following figures show one view of some of the examples in this research which previously were used for training box, stairs and pitch/roll ramp classes respectively.

Box class
Stairs class
Pitch/roll ramp class

Before applying PCL plane segmentation algorithms, we need to prepare our data. Since our range images have 640 × 480 pixels, we sub-sample them to 160 × 120, while we converting them to point clouds.

RGB version of the Dataset is available as [RGBs.tar.gz] (8 MB)

Point Cloud version of the Dataset (resolution: 160 × 120) is available as [PCDs.tar.gz] (3 MB)

TOP OF PAGE

Experimental Evaluation

SP

Visual segmentations using distance threshold as 0.005, 0.01, 0.03 and 0.05: [sp_pngs.tar.gz] (22 MB)

Example visual output:

Stairs

Analysis: [SP_analysis] (104 KB)

SNP

Visual segmentations using distance threshold as 0.01, 0.03 and 0.05: [snp_pngs.tar.gz] (3 MB)

Example visual output:

Stairs

Analysis: [SNP_analysis] (111 KB)

Our Region growing plane segmentation algorithm

Visual segmentations (regions and convex hulls):
- Regions (Planes) [Coloured Regions] (285 KB)
- Convex hulls [Coloured Regions with Convex Hull Boundaries] (337 KB)

Number of planes per image: [all_images_regions.txt] (1 KB)

Example visual output:

Stairs

Comparing SNP and Our method

Visual comparison tool (webpage): [comp.hml]

Visual segmentations using distance threshold as 0.05 (SNP): [snp_dt0_05_pngs_cut.tar.gz] (353 KB)

Visual segmentations (our method): [ours_pngs_cut.tar.gz] (285 KB)

Comparison Analysis: [comparison.ods] (49 KB)

TOP OF PAGE

Source Code

Shell Scripts

A set of scripts used for running plane segmentation methods:
- Shell script for running SP [runme_sac_planar.sh]
- Shell script for running SNP [runme_sac_normalp.sh]
- Shell script for running Our method [runme_rgs.sh]
C++ code

A set of Cmake codes implemented to use plane segmentation methods:
- Common source code needed [common.tar.gz]
- Apply SP on the input PCD [read_and_planar_segment.cpp]
- Apply SNP on the input PCD [read_and_normalplanar_segment.cpp]

TOP OF PAGE

References

Stefan Gächter. Results on range image segmentation for service robots. Technical report, Laboratoire de Systèmes Autonomes, Ecole Polytechnique Fédérale de Lausanne (EPFL), 2005. URL http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.90.7553 .

Stefan Gächter, Viet Nguyen, and Roland Siegwart. Results on range image segmentation for service robots. In Proceedings of IEEE International Conference on Computer Vision Systems (ICVS), pages 53–53, 2006. doi:10.1109/ICVS.2006.54 .

Doaa Hegazy and Joachim Denzler. Generic 3D object recognition from time-of-flight images using boosted combined shape features. In Alpesh Ranchordas and Helder Araújo, editors, Proceedings of the Fourth International Conference on Computer Vision, Theory and Applications (VISAPP), volume 2, pages 321–326. INSTICC Press, 2009.

Tahir Rabbani Shah. Automatic reconstruction of industrial installations using point clouds and images. PhD thesis, Delft University of Technology, University of Engineering and Technology, Lahore, 2006. URL http://www.ncgeo.nl/phocadownload/62Rabbani.pdf .

Reza Farid and Claude Sammut, Plane-based object categorisation using relational learning: implementation details and extension of experiments, Technical Report UNSW-CSE-TR-201416, School of Computer Science and Engineering, The University of New South Wales. (PDF on [CSE-Repository] or [local])

Reza Farid and Claude Sammut, Plane-based object categorisation using relational learning, Machine Learning Journal, 94(1):3-23, 2014, doi: 10.1007/s10994-013-5352-9 . ([Manuscript] Note: The final publication is available at link.springer.com)

Reza Farid and Claude Sammut, Plane-based object categorisation using relational learning. In Online Proceedings of the 22nd International Conference on Inductive Logic Programming (ILP2012), Dubrovnik, Croatia, 17-19 September 2012. ([PDF])

Reza Farid and Claude Sammut, A relational approach to plane-based object categorisation. Paper presented at Robotics Science and System (RSS) 2012 workshop on RGB-D: Advanced Reasoning with Depth Cameras, University of Sydney, 9-10 July 2012.(Paper[PDF ], Poster[PDF ])

R. B. Rusu and S. Cousins, “3D is here: Point Cloud Library (PCL),” in Proc. of ICRA2011, 2011, pp. 1–4.

TOP OF PAGE

Contact

Comments, suggestions or bugs can be sent to Reza Farid

This page is under construction...The complete source code will be available soon. (It is partially available now.)

SP

SNP

Our Region growing plane segmentation algorithm

Comparing SNP and Our method

This page is under construction...
The complete source code will be available soon. (It is partially available now.)