artificial vision in road vehicles

Post on 07-Jan-2016

81 Views

Category:

Documents

3 Downloads

Preview:

Click to see full reader

DESCRIPTION

Artificial Vision in Road Vehicles. By: MASSIMO BERTOZZI, ALBERTO BROGGI, MASSIMO CELLARIO, ALESSANDRA FASCIOLI, PAOLO LOMBARDI, AND MARCO PORTA. Presented by: Ali Agha April 6 th , 2009. Main Motivation. - PowerPoint PPT Presentation

TRANSCRIPT

Artificial Vision in Road Vehicles

By: MASSIMO BERTOZZI, ALBERTO BROGGI, MASSIMO CELLARIO, ALESSANDRA FASCIOLI, PAOLO LOMBARDI, AND MARCO PORTA

Presented by: Ali Agha

April 6th, 2009

Main Motivation

Help the driver in case of failure, for example, due to a lack of concentration or due to drowsiness.

Active Sensors in ITS Example: Laser-based sensors and millimeter-

wave radars Drawbacks

Low spatial resolution Slow scanning speed and Expensive Reflection problems maximum signal level must comply with some safety rules interference among sensors of the same type

Advantages: mm-wave radars are robust to rain and fog measure some quantities, such as movement, in a more

direct way require less performing computing resources, as they

acquire a considerably lower amount of data.

Passive Sensors in ITS

Example: Vision-based sensors Advantages:

Noninvasive Useful in specific applications (lane localization,

traffic signs recognition, Obstacle identification)

Drawbacks: Not robust in foggy, night, or direct sunshine

conditions.

Consideration in design a vision system for ITS applications ITS systems require faster processing than other

applications, since vehicle speed is bounded by the processing rate.

computing engines cannot be based on expensive processors.

no assumptions can be made on scene illumination or contrast and the process should be robust to environmental conditions, such as

sun, rain, or fog and their dynamic changes such as transitions between sun and shadow, or the entrance or exit from a tunnel.

robustness to vehicle’s movements and handling the drifts in the camera’s calibration

Road Following

Road Following

Lane Detection Obstacle Detection Pedestrian Detection

Lane Detection Road Following

PedestrianDetection

Obstacle Detection

Lane Detection

Focus ofattention

Fixed Lanewidth

Road ShapePrior Knowledgeon surface/shape

Two basic problem in Lane Detection are:

1)The presence of shadows

2)Occlusion, caused by other vehicles

Focus of attention

Road Following

PedestrianDetection

Obstacle Detection

Lane Detection

Focus ofattention

Fixed Lanewidth

Road ShapePrior Knowledgeon surface/shape

Due to both physical and continuity constraints, the processing of the whole image can be replaced by the analysis of specific regions of interest only (the so-called focus of attention), in which the features of interest are more likely to be found.

Focus of attention

Road Following

PedestrianDetection

Obstacle Detection

Lane Detection

Focus ofattention

Fixed Lanewidth

Road ShapePrior Knowledgeon surface/shape J. Goldbeck, et al. (1998)

R. Chapuis, et al. (2001)

employs a model both for the road and the vehicle’s dynamic to determine the road portionwhere it is most likely to find lane markings

Focus of attention

Road Following

PedestrianDetection

Obstacle Detection

Lane Detection

Focus ofattention

Fixed Lanewidth

Road ShapePrior Knowledgeon surface/shape J. Goldbeck, et al. (1998)

R. Chapuis, et al. (2001)

Dynamically determination of WOI by means of statistical methods; According to the current state and previously detected WOIs.

Fixed Lane Width

Road Following

PedestrianDetection

Obstacle Detection

Lane Detection

Focus ofattention

Fixed Lanewidth

Road ShapePrior Knowledgeon surface/shape

The assumption of a fixed or smoothly varying lane width allows the enhancement of the search criterion, limiting the search to almost parallel lane markings.

Fixed Lane Width

Road Following

PedestrianDetection

Obstacle Detection

Lane Detection

Focus ofattention

Fixed Lanewidth

Road ShapePrior Knowledgeon surface/shape K. Kim, et al. (1995)

D. Pomerleau, et al. (1996)

lane markings can be detected using both neural networks and simple vision algorithms:

two parallel stripes of the acquired image are selected and filtered using Gaussian masks and zero crossing to find vertical edges. The result is matched against a given model.

Fixed Lane Width

Road Following

PedestrianDetection

Obstacle Detection

Lane Detection

Focus ofattention

Fixed Lanewidth

Road ShapePrior Knowledgeon surface/shape K. Kim, et al. (1995)

D. Pomerleau, et al. (1996)

based on the processingof the image portion

The determination of the curvature is carried out according to a number of possible curvature models

Road Shape

Road Following

PedestrianDetection

Obstacle Detection

Lane Detection

Focus ofattention

Fixed Lanewidth

Road ShapePrior Knowledgeon surface/shape

The reconstruction of road geometry can be simplified by assumptions on its shape.

Road Shape

Road Following

PedestrianDetection

Obstacle Detection

Lane Detection

Focus ofattention

Fixed Lanewidth

Road ShapePrior Knowledgeon surface/shape

Lützeler, et al. & Franke, et al. (1998) & Goldbeck, et al. (1999) K. A. Redmill, et al. (2001) S. L. Michael, et al. (1997) K. A. Redmill, et al. (1997) F. Chausse, et al. (2000) J. Goldbeck, et al. (1999) R. Risack, et al. (1998) S. M.Wong, et al. (1999) X. Youchun, et al. (2000) A. Broggi, et al. (1995) & S. Denasi, et al. (1994)

lane markings are modeled as clothoids. In a clothoid thecurvature depends linearly on the curvilinear reference. This model has the advantage that the knowledge of two parameters only allows the full localization of lane markings and the computation of other parameters

Road Shape

Road Following

PedestrianDetection

Obstacle Detection

Lane Detection

Focus ofattention

Fixed Lanewidth

Road ShapePrior Knowledgeon surface/shape

Lützeler, et al. & Franke, et al. (1998) & Goldbeck, et al. (1999) K. A. Redmill, et al. (2001) S. L. Michael, et al. (1997) K. A. Redmill, et al. (1997) F. Chausse, et al. (2000) J. Goldbeck, et al. (1999) R. Risack, et al. (1998) S. M.Wong, et al. (1999) X. Youchun, et al. (2000) A. Broggi, et al. (1995) & S. Denasi, et al. (1994)

dynamic programming optimization method is used to chose among center-line candidates representing the actual geometry of the road

Road Shape

Road Following

PedestrianDetection

Obstacle Detection

Lane Detection

Focus ofattention

Fixed Lanewidth

Road ShapePrior Knowledgeon surface/shape

Lützeler, et al. & Franke, et al. (1998) & Goldbeck, et al. (1999) K. A. Redmill, et al. (2001) S. L. Michael, et al. (1997) K. A. Redmill, et al. (1997) F. Chausse, et al. (2000) J. Goldbeck, et al. (1999) R. Risack, et al. (1998) S. M.Wong, et al. (1999) X. Youchun, et al. (2000) A. Broggi, et al. (1995) & S. Denasi, et al. (1994)

uses a polynomial representation forlane markings. lane markings are modeled as parabolas and a simplified Hough transform is used to accomplish the fittingprocedure.

Road Shape

Road Following

PedestrianDetection

Obstacle Detection

Lane Detection

Focus ofattention

Fixed Lanewidth

Road ShapePrior Knowledgeon surface/shape

Lützeler, et al. & Franke, et al. (1998) & Goldbeck, et al. (1999) K. A. Redmill, et al. (2001) S. L. Michael, et al. (1997) K. A. Redmill, et al. (1997) F. Chausse, et al. (2000) J. Goldbeck, et al. (1999) R. Risack, et al. (1998) S. M.Wong, et al. (1999) X. Youchun, et al. (2000) A. Broggi, et al. (1995) & S. Denasi, et al. (1994)

relies on a polynomial curve. It assumes a flat road with either continuous or dashed bright lane markings. The history of previously located lane markings is used to determine the region of interest

Road Shape

Road Following

PedestrianDetection

Obstacle Detection

Lane Detection

Focus ofattention

Fixed Lanewidth

Road ShapePrior Knowledgeon surface/shape

Lützeler, et al. & Franke, et al. (1998) & Goldbeck, et al. (1999) K. A. Redmill, et al. (2001) S. L. Michael, et al. (1997) K. A. Redmill, et al. (1997) F. Chausse, et al. (2000) J. Goldbeck, et al. (1999) R. Risack, et al. (1998) S. M.Wong, et al. (1999) X. Youchun, et al. (2000) A. Broggi, et al. (1995) & S. Denasi, et al. (1994)

exploits a polynomial road modelization to calculate the impact distance from the vehicle to the nearest road side by considering the intersection between the straight line trajectory followed in case of adriver loss of control and the polynomial function describing the road side

Road Shape

Road Following

PedestrianDetection

Obstacle Detection

Lane Detection

Focus ofattention

Fixed Lanewidth

Road ShapePrior Knowledgeon surface/shape

Lützeler, et al. & Franke, et al. (1998) & Goldbeck, et al. (1999) K. A. Redmill, et al. (2001) S. L. Michael, et al. (1997) K. A. Redmill, et al. (1997) F. Chausse, et al. (2000) J. Goldbeck, et al. (1999) R. Risack, et al. (1998) S. M.Wong, et al. (1999) X. Youchun, et al. (2000) A. Broggi, et al. (1995) & S. Denasi, et al. (1994)

proposed to use concentric circles to represent lane boundaries. circular shape models can in fact be better choices than polynomial approximations

Road Shape

Road Following

PedestrianDetection

Obstacle Detection

Lane Detection

Focus ofattention

Fixed Lanewidth

Road ShapePrior Knowledgeon surface/shape

Lützeler, et al. & Franke, et al. (1998) & Goldbeck, et al. (1999) K. A. Redmill, et al. (2001) S. L. Michael, et al. (1997) K. A. Redmill, et al. (1997) F. Chausse, et al. (2000) J. Goldbeck, et al. (1999) R. Risack, et al. (1998) S. M.Wong, et al. (1999) X. Youchun, et al. (2000) A. Broggi, et al. (1995) & S. Denasi, et al. (1994)

adopt a more generic model for the road. uses a contour-based method. Actually, only straight or small curved roads without intersections are included in this model. The road model is used to follow contours formed by pixels that feature a significant gradient direction value.

Road Shape

Road Following

PedestrianDetection

Obstacle Detection

Lane Detection

Focus ofattention

Fixed Lanewidth

Road ShapePrior Knowledgeon surface/shape

Lützeler, et al. & Franke, et al. (1998) & Goldbeck, et al. (1999) K. A. Redmill, et al. (2001) S. L. Michael, et al. (1997) K. A. Redmill, et al. (1997) F. Chausse, et al. (2000) J. Goldbeck, et al. (1999) R. Risack, et al. (1998) S. M.Wong, et al. (1999) X. Youchun, et al. (2000) A. Broggi, et al. (1995) & S. Denasi, et al. (1994)

uses an edge linking process based on Contour Chains and Causal Neighborhood Windows(areas of interest connected to edge elements). After an initial segmentation phase, the longest chains with slope angles close to 45 and 135 degrees are searched.

Road Shape

Road Following

PedestrianDetection

Obstacle Detection

Lane Detection

Focus ofattention

Fixed Lanewidth

Road ShapePrior Knowledgeon surface/shape

Lützeler, et al. & Franke, et al. (1998) & Goldbeck, et al. (1999) K. A. Redmill, et al. (2001) S. L. Michael, et al. (1997) K. A. Redmill, et al. (1997) F. Chausse, et al. (2000) J. Goldbeck, et al. (1999) R. Risack, et al. (1998) S. M.Wong, et al. (1999) X. Youchun, et al. (2000) A. Broggi, et al. (1995) & S. Denasi, et al. (1994)

based on a linear lane model, where road markers are reconstructed as sequences of straight lines.

Road Shape

Road Following

PedestrianDetection

Obstacle Detection

Lane Detection

Focus ofattention

Fixed Lanewidth

Road ShapePrior Knowledgeon surface/shape

Lützeler, et al. & Franke, et al. (1998) & Goldbeck, et al. (1999) K. A. Redmill, et al. (2001) S. L. Michael, et al. (1997) K. A. Redmill, et al. (1997) F. Chausse, et al. (2000) J. Goldbeck, et al. (1999) R. Risack, et al. (1998) S. M.Wong, et al. (1999) X. Youchun, et al. (2000) A. Broggi, et al. (1995) & S. Denasi, et al. (1994)

A generic triangular road model is proposed

A priori knowledge on the road surface/slope

Road Following

PedestrianDetection

Obstacle Detection

Lane Detection

Focus ofattention

Fixed Lanewidth

Road ShapePrior Knowledgeon surface/shape

The knowledge of the specific camera calibration together with the assumption of an a priori knowledge on the road (i.e., a flat road without bumps) can be exploited to ease the localization of features and/or to simplify the mapping between image pixels and their correspondent world coordinates.

A priori knowledge on the road surface/slope

Road Following

PedestrianDetection

Obstacle Detection

Lane Detection

Focus ofattention

Fixed Lanewidth

Road ShapePrior Knowledgeon surface/shape

M. Bertozzi, et al. (1998)

The GOLD system exploit the assumption of a flat road in front of the vehicle. The lane markings detection is performed in a different image domain, representing a bird’s eye view of the road, which can be obtained thank to the flat road assumption.

Lane Detection (Summery)

Lane Detection (Summery)

Lane Detection (Summery)

Lane Detection (Summery)

Pedestrian Detection

Road Following

Lane Detection

Obstacle Detection

Pedestrian Detection

Segmentation with motion

SegmentationWith stereo

Focus ofattention

Recognition ofHuman gait

Recognition ofHuman shape

Segmentation with motionRoad Following

Lane Detection

Obstacle Detection

Pedestrian Detection

Segmentation with motion

SegmentationWith stereo

Focus ofattention

Recognition ofHuman gait

Recognition ofHuman shape

Pros:Use temporal information and is reliable

Cons:Does not detect standing pedestrianNeeds sequence of a few framesJust for detecting moving objects not their

velocity

Segmentation with motionRoad Following

Lane Detection

Obstacle Detection

Pedestrian Detection

Segmentation with motion

SegmentationWith stereo

Focus ofattention

Recognition ofHuman gait

Recognition ofHuman shape

R. Polana et al. (1995) S. McKenna et al. (1997) R. Cutler et al. (2000)

motion detection with optical flow: They analyzes the scene with a discrete cube, where they assign to each region its average optical flow.

Segmentation with motionRoad Following

Lane Detection

Obstacle Detection

Pedestrian Detection

Segmentation with motion

SegmentationWith stereo

Focus ofattention

Recognition ofHuman gait

Recognition ofHuman shape

R. Polana et al. (1995) S. McKenna et al. (1997) R. Cutler et al. (2000)

uses a zero-crossing detection algorithm using the convolution of a spatio-temporal Gaussian

Segmentation with motionRoad Following

Lane Detection

Obstacle Detection

Pedestrian Detection

Segmentation with motion

SegmentationWith stereo

Focus ofattention

Recognition ofHuman gait

Recognition ofHuman shape

R. Polana et al. (1995) S. McKenna et al. (1997) R. Cutler et al. (2000)

uses a subtraction between an image at time t and a version of the same image stabilized with respect to image at instant t-taw.

Segmentation with StereoRoad Following

Lane Detection

Obstacle Detection

Pedestrian Detection

Segmentation with motion

SegmentationWith stereo

Focus ofattention

Recognition ofHuman gait

Recognition ofHuman shape

D. Beymer et al. (1999) L. Zhao et al. (2000)

In surveillance applications, stereo analysis is sometimes used as a cue to build a disparity map of the background for use with background subtraction.

Segmentation with StereoRoad Following

Lane Detection

Obstacle Detection

Pedestrian Detection

Segmentation with motion

SegmentationWith stereo

Focus ofattention

Recognition ofHuman gait

Recognition ofHuman shape

D. Beymer et al. (1999) L. Zhao et al. (2000)

Range thresholding based on stereo analysis for segmentation

Focus of AttentionRoad Following

Lane Detection

Obstacle Detection

Pedestrian Detection

Segmentation with motion

SegmentationWith stereo

Focus ofattention

Recognition ofHuman gait

Recognition ofHuman shape

A. Broggi et al. (2000) C. Curio et al. (2000)

salient regions in opportune feature maps are interpreted as candidates for pedestrians. In the GOLD system, vertical symmetries are associated with candidates for standing pedestrians

Focus of AttentionRoad Following

Lane Detection

Obstacle Detection

Pedestrian Detection

Segmentation with motion

SegmentationWith stereo

Focus ofattention

Recognition ofHuman gait

Recognition ofHuman shape

A. Broggi et al. (2000) C. Curio et al. (2000)

The focus of attention is directed by a composition of 1) a map of the local image entropy, -2) a model-matching module with the shape of a representing human legs, and -3) finally inverse perspective mapping (binocular vision) for the short distance field.

Recognition of Human GaitRoad Following

Lane Detection

Obstacle Detection

Pedestrian Detection

Segmentation with motion

SegmentationWith stereo

Focus ofattention

Recognition ofHuman gait

Recognition ofHuman shape

C. Wohler et al. (2000) R. Cutler et al. (2000) C. Curio et al. (2000)

the ATDNN performs a local spatio-temporal processing to detect the typical pattern of the movement

Recognition of Human GaitRoad Following

Lane Detection

Obstacle Detection

Pedestrian Detection

Segmentation with motion

SegmentationWith stereo

Focus ofattention

Recognition ofHuman gait

Recognition ofHuman shape

C. Wohler et al. (2000) R. Cutler et al. (2000) C. Curio et al. (2000)

Periodicity of the human gait is often recognized with traditional methods like the Fourier transform

Recognition of Human GaitRoad Following

Lane Detection

Obstacle Detection

Pedestrian Detection

Segmentation with motion

SegmentationWith stereo

Focus ofattention

Recognition ofHuman gait

Recognition ofHuman shape

C. Wohler et al. (2000) R. Cutler et al. (2000) C. Curio et al. (2000)

The periodic movement detected is correlated to an experimental curve derived from the statistical average of human gait periods

Recognition of Human ShapeRoad Following

Lane Detection

Obstacle Detection

Pedestrian Detection

Segmentation with motion

SegmentationWith stereo

Focus ofattention

Recognition ofHuman gait

Recognition ofHuman shape

D. Beymar et al. (1999) & A. Broggi et al. (2000) H. Fujiyoshi et al. (2000) D. Gavrila et al. (2000) V. Philomin et al. (2000) C. Papageorgiou et al. (1999) & A. Mohan et al. (2001)

employ a model for the head and shoulders. This approach is very sensible to scale variation, so multiple models of different scales are needed.

Recognition of Human ShapeRoad Following

Lane Detection

Obstacle Detection

Pedestrian Detection

Segmentation with motion

SegmentationWith stereo

Focus ofattention

Recognition ofHuman gait

Recognition ofHuman shape

D. Beymar et al. (1999) & A. Broggi et al. (2000) H. Fujiyoshi et al. (2000) D. Gavrila et al. (2000) V. Philomin et al. (2000) C. Papageorgiou et al. (1999) & A. Mohan et al. (2001)

uses a skeletonization procedure.calculate first the centroid of the area and then the distances from the centroid to each border points.

Recognition of Human ShapeRoad Following

Lane Detection

Obstacle Detection

Pedestrian Detection

Segmentation with motion

SegmentationWith stereo

Focus ofattention

Recognition ofHuman gait

Recognition ofHuman shape

D. Beymar et al. (1999) & A. Broggi et al. (2000) H. Fujiyoshi et al. (2000) D. Gavrila et al. (2000) V. Philomin et al. (2000) C. Papageorgiou et al. (1999) & A. Mohan et al. (2001)

generic forms are tried first, and similar and more detailed shapes afterwards.

Recognition of Human ShapeRoad Following

Lane Detection

Obstacle Detection

Pedestrian Detection

Segmentation with motion

SegmentationWith stereo

Focus ofattention

Recognition ofHuman gait

Recognition ofHuman shape

D. Beymar et al. (1999) & A. Broggi et al. (2000) H. Fujiyoshi et al. (2000) D. Gavrila et al. (2000) V. Philomin et al. (2000) C. Papageorgiou et al. (1999) & A. Mohan et al. (2001)

Recognition of Human ShapeRoad Following

Lane Detection

Obstacle Detection

Pedestrian Detection

Segmentation with motion

SegmentationWith stereo

Focus ofattention

Recognition ofHuman gait

Recognition ofHuman shape

D. Beymar et al. (1999) & A. Broggi et al. (2000) H. Fujiyoshi et al. (2000) D. Gavrila et al. (2000) V. Philomin et al. (2000) C. Papageorgiou et al. (1999) & A. Mohan et al. (2001)

Pedestrian Detection (Summery)

Pedestrian Detection (Summery)

Pedestrian Detection (Summery)

Pedestrian Detection (Summery)

Pedestrian Detection (Summery)

Thank you

Questions?

Ali

New developments in cameras

current cameras include new important features that permit the solution of some basic problems directly at sensor level. For example, image stabilization can be performed during acquisition, while the extension of camera dynamics allows one to avoid the processing required to adapt the acquisition parameters to specific light conditions. The resolution of the sensors has been drastically enhanced, and, in order to decrease the acquisition and transfer time, new technological solutions can be found in CMOS sensors, such as the possibility of dealing with pixels independently as in traditional memories. Another key advantage of CMOSbased sensors is that their integration on the processing chip seems to be straightforward.

Choosing appropriate camera Many different parameters must be evaluated for the design and choice of an image acquisition device. First of all, some parameters tightly coupled with the algorithms regard the choice of monocular versus binocular (stereo) vision and the sensors’ angle of view (some systems adopt a multicamera approach by using more than one camera with different viewing angles, e.g., fish eye or zoom). The resolution and the depth (number of bit/pixel) of the images have to be selected as well (this also includes the selection of color versus monochrome images).

Other parameters—intrinsic to the sensor—must be considered. Although the frame rate is generally fixed for CCDbased devices (25 or 30 Hz), the dynamics of the sensor is of basic importance: conventional cameras allow an intensity contrast of 500:1 within the same image frame, while most ITS applications require a 10 000:1 dynamic range for each frame and 100 000:1 for a short image sequence. Different approaches have been studied to meet this requirement, ranging from the use of CMOS-based cameras with a logarithmically compressed dynamic [6], [7] to the interpolation and superimposition regarding values of two subsequent images taken from the same camera [8].

Obstacle Detection (Vehicle Detection) M. Lutzeler et al. (1998) S. Kyo et al. (1999) S. Denasi et al. (2001)

W. Kruger et al. (1995) S. M. Smith et al. (1995) Z. Hu et al. (2000) F. Marmoiton et al. (1995)

Obstacle Detection (Generic Obstacle) – Single camera

U. Franke et al. (1998) M. Bertozzi et al. (1998) H. Hattori et al. (2000) M. Hariyama et al. (2000) Y. Ruicheck et al. (2000) D. Koller et al. (1995) S. Denasi et al. (1994) M. Leeuwen et al. (2000) C. Knoeppel et al. (2000)

Obstacle Detection (Generic Obstacle) – Multiple cameras

top related