![Page 1: Challenges in Perception for Learning, Cognition and Control Approaches](https://reader031.vdocuments.net/reader031/viewer/2022030313/58ecc73d1a28abbc4b8b45b7/html5/thumbnails/1.jpg)
Challenges in Perception for Learning, Cognition and Control
Approaches
DariusBurschkaMachineVisionPerceptionGroup(MVP)TechnischeUniversitätMünchen,Germany
![Page 2: Challenges in Perception for Learning, Cognition and Control Approaches](https://reader031.vdocuments.net/reader031/viewer/2022030313/58ecc73d1a28abbc4b8b45b7/html5/thumbnails/2.jpg)
DariusBurschka–MachineVisionandPerceptionGroup(TUM)
Structure of a CNN for Robotics Applications
Rawsensorinformation(depthimagefromanRGB-Dsensor)istakenasinputtoconnectwiththerobotcontrolcommanddirectly
Idealcase:learnthecouplingbetweenperceptionandcontrolcommandsfromdemonstrations
![Page 3: Challenges in Perception for Learning, Cognition and Control Approaches](https://reader031.vdocuments.net/reader031/viewer/2022030313/58ecc73d1a28abbc4b8b45b7/html5/thumbnails/3.jpg)
DariusBurschka–MachineVisionandPerceptionGroup(TUM)
Curse of Complexity in Direct ProcessingIntheory,onecouldusealltheextractedfeatureswithaclassifiersuchasasoftmaxclassifier,butthiscanbecomputationallychallenging.Forinstanceimagesofsize96x96pixelswith400learnedfeaturesover8x8inputs.Eachconvolutionresultsinanoutputofsize(96−8+1)∗(96−8+1)=7921,andsincewehave400features,thisresultsinavectorof892∗400=3,168,400featuresperexample.Learningaclassifierwithinputshaving3+millionfeaturescanbeunwieldy,andcanalsobepronetoover-fitting.
![Page 4: Challenges in Perception for Learning, Cognition and Control Approaches](https://reader031.vdocuments.net/reader031/viewer/2022030313/58ecc73d1a28abbc4b8b45b7/html5/thumbnails/4.jpg)
DariusBurschka–MachineVisionandPerceptionGroup(TUM)
Deep-Learning for Perception in Robotics
Whatdoallthesetaskshaveincommon?
Source:L.Tai,“Deep-LearninginMobileRobotics…”
![Page 5: Challenges in Perception for Learning, Cognition and Control Approaches](https://reader031.vdocuments.net/reader031/viewer/2022030313/58ecc73d1a28abbc4b8b45b7/html5/thumbnails/5.jpg)
DariusBurschka–MachineVisionandPerceptionGroup(TUM)
Learning Approaches vs. Conventional Tools
Imagesource:Berkley
Grouping/Segmentation,Labeling,Identification
becausedifferencesinthepropertiesofpixelstoentireimageareconsideredduetothestructureofCNNs
Problem:notclear,whattheprocessingisbasedon!
?
Hand-designedfeatureshelptoestimatemetricdistancesbetweenthemusingcalibrationparameters
![Page 6: Challenges in Perception for Learning, Cognition and Control Approaches](https://reader031.vdocuments.net/reader031/viewer/2022030313/58ecc73d1a28abbc4b8b45b7/html5/thumbnails/6.jpg)
DariusBurschka–MachineVisionandPerceptionGroup(TUM)
Deep-Learning for Control in Robotics
Source:L.Tai,“Deep-LearninginMobileRobotics…”
![Page 7: Challenges in Perception for Learning, Cognition and Control Approaches](https://reader031.vdocuments.net/reader031/viewer/2022030313/58ecc73d1a28abbc4b8b45b7/html5/thumbnails/7.jpg)
DariusBurschka–MachineVisionandPerceptionGroup(TUM)
Can we avoid Metric Information in Representation of Environments?
![Page 8: Challenges in Perception for Learning, Cognition and Control Approaches](https://reader031.vdocuments.net/reader031/viewer/2022030313/58ecc73d1a28abbc4b8b45b7/html5/thumbnails/8.jpg)
DariusBurschka–MachineVisionandPerceptionGroup(TUM)
Fast Uncalibrated Monocular Estimation of Independent Motion Components
![Page 9: Challenges in Perception for Learning, Cognition and Control Approaches](https://reader031.vdocuments.net/reader031/viewer/2022030313/58ecc73d1a28abbc4b8b45b7/html5/thumbnails/9.jpg)
Representation of the Environment in Collision Space
![Page 10: Challenges in Perception for Learning, Cognition and Control Approaches](https://reader031.vdocuments.net/reader031/viewer/2022030313/58ecc73d1a28abbc4b8b45b7/html5/thumbnails/10.jpg)
DariusBurschka–MachineVisionandPerceptionGroup(TUM)
Examples of non-metric Control (RoboMobil)
![Page 11: Challenges in Perception for Learning, Cognition and Control Approaches](https://reader031.vdocuments.net/reader031/viewer/2022030313/58ecc73d1a28abbc4b8b45b7/html5/thumbnails/11.jpg)
DariusBurschka–MachineVisionandPerceptionGroup(TUM)
Examples of non-metric Control (RoboMobil)
![Page 12: Challenges in Perception for Learning, Cognition and Control Approaches](https://reader031.vdocuments.net/reader031/viewer/2022030313/58ecc73d1a28abbc4b8b45b7/html5/thumbnails/12.jpg)
DariusBurschka–MachineVisionandPerceptionGroup(TUM)
Conclusions
• Imagelabelingandimageretrieval(indexing)representalargeapplicationfieldforperceptionandimageprocessing(Google,Apple,etc.),butroboticrequiresoftenametricmappingfromthesensorontocontrolvalues• Workonalternativesforcouplingbetweenthesensorandtheactuatorshouldbeoneoftheperceptiongoals• Currentlyuseddatarepresentationsneedtobere-definedforlearningapplications.