real-time camera-only 3d object detection for autonomous

1
Bachelor’s/Semester/Master’s Thesis, Guided Research, Interdisciplinary Project (IDP) Real-Time Camera-Only 3D Object Detecon for Autonomous Driving Background As part of the research project Providena++ funded by the federal ministry of transport and digital infrastructure under the iniave “Digital Test Beds for Autonomous Driving”, a group of eminent industry partners and research instutes have come together to conduct research in the field of intelligent transportaon systems, and to come up with soluons and recommendaons for improving traffic safety, efficiency and comfort. Within the framework of this project, an exisng infrastructure for real me localizaon of traffic parcipants on the A9 Highway will be extended from the highway into an adjacent urban environment. A video about the Providena project is available on hps://youtu.be/4oCnQlGFuc4. Descripon A key challenge lies in the reliable and accurate detecon of road users (such as cars, trucks, buses) using the various sensors, among them standard frame-based cameras and event-based camera. Due to the complex mul-sensor system subject to real- life condions and applicaon-oriented challenges, many interesng research topics are available within this project. These include, but are not limited to: Research on temporal object detecon: Using video data to process consecuve frames in order to improve the consistency of detecon results, e.g. via LSTM Research on monocular 3D object detecon: How can we esmate the 3D characteriscs of objects in order to produce 3D bounding boxes using only a monocular camera? Research on 3D object detecon using event-based cameras: How can this new type of sensor be used to complement or replace frame-based cameras for 3D object detecon? Research on automac camera seng and algorithm adaptaon for opmal detecon performance: How can we dynamically and automacally adjust our system (e.g. camera sengs or the algorithms in use) to improve object detecon results in various situaons such as rain, fog, and night-me operaon. Research on combining classical with deep learning approaches : Can we improve the reliability of the detecon results by combining a deep learning approach with a classical, non-learning-based computer vision approach? Your ideas: If you have any other ideas for research in this area you are welcome to suggest your own topic. Your Tasks Familiarizaon with 3D object detecon algorithms and relevant topics via literature research Development of a soluon/approach for the specific problem Evaluaon of the concept using real-world data Requirements A strong interest in computer vision, deep learning, and (3D) object detecon algorithms High movaon and ability to work independently Good knowledge in at least one programming language: Python, C++ Experience with deep learning libraries (TensorFlow, Keras, PyTorch) is a plus Supervisor: Prof. Alois Knoll Contact: Walter Zimmer ([email protected]) Technical University of Munich Faculty of Informacs, Chair of Robocs, Arficial Intelligence and Real-me Systems

Upload: others

Post on 05-May-2022

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Real-Time Camera-Only 3D Object Detection for Autonomous

Bachelor’s/Semester/Master’s Thesis, Guided Research, Interdisciplinary Project (IDP)

Real-Time Camera-Only 3D Object Detection for Autonomous DrivingBackground

As part of the research project Providentia++ funded by the federal ministry of transport and digital infrastructure under theinitiative “Digital Test Beds for Autonomous Driving”, a group of eminent industry partners and research institutes have cometogether to conduct research in the field of intelligent transportation systems, and to come up with solutions andrecommendations for improving traffic safety, efficiency and comfort. Within the framework of this project, an existinginfrastructure for real time localization of traffic participants on the A9 Highway will be extended from the highway into anadjacent urban environment. A video about the Providentia project is available on https://youtu.be/4oCnQlGFuc4.

Description

A key challenge lies in the reliable and accurate detection of road users (such as cars, trucks, buses) using the various sensors,among them standard frame-based cameras and event-based camera. Due to the complex multi-sensor system subject to real-life conditions and application-oriented challenges, many interesting research topics are available within this project. Theseinclude, but are not limited to:

Research on temporal object detection: Using video data to process consecutive frames in order to improve theconsistency of detection results, e.g. via LSTM

Research on monocular 3D object detection: How can we estimate the 3D characteristics of objects in order to produce3D bounding boxes using only a monocular camera?

Research on 3D object detection using event-based cameras: How can this new type of sensor be used to complementor replace frame-based cameras for 3D object detection?

Research on automatic camera setting and algorithm adaptation for optimal detection performance: How can wedynamically and automatically adjust our system (e.g. camera settings or the algorithms in use) to improve objectdetection results in various situations such as rain, fog, and night-time operation.

Research on combining classical with deep learning approaches : Can we improve the reliability of the detectionresults by combining a deep learning approach with a classical, non-learning-based computer vision approach?

Your ideas: If you have any other ideas for research in this area you are welcome to suggest your own topic.

Your Tasks

Familiarization with 3D object detection algorithms and relevant topics via literature research

Development of a solution/approach for the specific problem

Evaluation of the concept using real-world data

Requirements

A strong interest in computer vision, deep learning, and (3D) object detection algorithms

High motivation and ability to work independently Good knowledge in at least one programming language: Python, C++ Experience with deep learning libraries (TensorFlow, Keras, PyTorch) is a plus

Supervisor: Prof. Alois KnollContact: Walter Zimmer ([email protected])

Technical University of MunichFaculty of Informatics, Chair of Robotics, Artificial Intelligence and Real-time Systems