Kuri’s embedded GPU: if we want to process 5 frames a second, we can only use ~2.5 GMACs/frame
MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications
Howard, Andrew G. et al. arXiv preprint 2017 (all authors from Google)
Framework Model mAP Billion Mult-Adds
Single Shot Multibox Detector
300x300
VGG 21.1% 34.9Inception V2 22.0% 3.8MobileNet 19.3% 1.2
pet & person detection
•Uses feature vectors from intermediate layer of object detection network
•Cosine distance of vectors -> difference score
Useful for:
Mapping loop closures
Global localization
“ConvNet features for Place Recognition,” Sunderhauf et al, IROS 2015
place recognition
face detection
Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks
Zhang, Kaipeng. et al. Signal Processing Letters, 2016