xiuwen liu department of computer science florida state university

104
Research Activities at Center for Applied Vision and Imaging Sciences and Florida State Vision Group Florida State University Xiuwen Liu Department of Computer Science Florida State University http://cavis.fsu.edu & http://fsvision.fsu.edu

Upload: tierra

Post on 06-Jan-2016

35 views

Category:

Documents


0 download

DESCRIPTION

Research Activities at Center for Applied Vision and Imaging Sciences and Florida State Vision Group Florida State University. Xiuwen Liu Department of Computer Science Florida State University http://cavis.fsu.edu & http://fsvision.fsu.edu. Research Statement. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Xiuwen Liu Department of Computer  Science Florida State University

Research Activities at Center for Applied Vision and Imaging Sciences and

Florida State Vision GroupFlorida State University

Xiuwen Liu

Department of Computer Science

Florida State University

http://cavis.fsu.edu & http://fsvision.fsu.edu

Page 2: Xiuwen Liu Department of Computer  Science Florida State University

Research Statement

My research goal is to create machines that can “see” with similar human performance

• This seems a trivial problem as each of us can do this without any effort

• Computer + Camera = “A See Machine” ?

Page 3: Xiuwen Liu Department of Computer  Science Florida State University

Visual Pathway

Page 4: Xiuwen Liu Department of Computer  Science Florida State University

Visual Illusion

Page 5: Xiuwen Liu Department of Computer  Science Florida State University

Outline

Motivations• Some applications of computer vision and pattern

recognition techniques

Some of the research projects

Related Courses

Contact information

Page 6: Xiuwen Liu Department of Computer  Science Florida State University

Computer Vision Applications

No hands across America• Sponsored by Delco Electronics, AssistWare

Technology, and Carnegie Mellon University

• Navlab 5 drove from Pittsburgh, PA to San Diego, CA, using the RALPH computer program.

• The trip was 2849 miles of which 2797 miles were driven automatically with no hands

– Which is 98.2%

Page 7: Xiuwen Liu Department of Computer  Science Florida State University

Computer Vision Applications – continued

Page 8: Xiuwen Liu Department of Computer  Science Florida State University

Computer Vision Applications – continued

Page 9: Xiuwen Liu Department of Computer  Science Florida State University
Page 10: Xiuwen Liu Department of Computer  Science Florida State University

Human-Computer Interactions

Page 11: Xiuwen Liu Department of Computer  Science Florida State University

Sign Language Recognition

Page 12: Xiuwen Liu Department of Computer  Science Florida State University

CyberKnife

Page 13: Xiuwen Liu Department of Computer  Science Florida State University

CyberKnife – Cont.

Page 14: Xiuwen Liu Department of Computer  Science Florida State University

Image-Guided Neurosurgery

Page 15: Xiuwen Liu Department of Computer  Science Florida State University

Intelligent Transportation Systems

http://dfwtraffic.dot.state.tx.us/dal-cam-nf.asp

Page 16: Xiuwen Liu Department of Computer  Science Florida State University

Computer Vision Applications – cont.

Military applications• Automated target recognition

Page 17: Xiuwen Liu Department of Computer  Science Florida State University

Computer Vision Applications – continued

Page 18: Xiuwen Liu Department of Computer  Science Florida State University

Biometrics – cont.

Iris code can achieve zero false acceptance

Page 19: Xiuwen Liu Department of Computer  Science Florida State University

Computer Vision in Sports

How was the yellow created?

Page 20: Xiuwen Liu Department of Computer  Science Florida State University

Generic Image Modeling

How can we characterize all these images perceptually?

Page 21: Xiuwen Liu Department of Computer  Science Florida State University

Spectral Histogram Representation

Spectral histogram

• Given a bank of filters F(), = 1, …, K, a spectral histogram is defined as the marginal distribution of filter responses

)I(*)(I )()( vFv

v

vIzδzH ))((|I|

1)( )()(

I

),,,( )(I

)2(I

)1(II

KHHHH

Page 22: Xiuwen Liu Department of Computer  Science Florida State University

Spectral Histogram Representation - continued

Choice of filters • Laplacian of Gaussian filters

• Gabor filters

• Gradient filters

• Intensity filter

LoG filter Gabor filter

Page 23: Xiuwen Liu Department of Computer  Science Florida State University

Spectral Histogram Representation - continued

Page 24: Xiuwen Liu Department of Computer  Science Florida State University

Texture Synthesis Examples - continued

An image with periodic structures

Observed image Synthesized image

Page 25: Xiuwen Liu Department of Computer  Science Florida State University

Object Synthesis Examples - continued

Page 26: Xiuwen Liu Department of Computer  Science Florida State University

Performance Comparison

Page 27: Xiuwen Liu Department of Computer  Science Florida State University

Face Detection Based On Spectral Representations

Face detection is to detect all instances of faces in a given image

Each image window is represented by its spectral histogram• A support vector machine is trained on training faces

• Then the trained support vector machine is used to classify each image window in an input image

More results at http://fsvision.fsu.edu/face-detection

Page 28: Xiuwen Liu Department of Computer  Science Florida State University

Face detection - continued

Page 29: Xiuwen Liu Department of Computer  Science Florida State University

Face detection - continued

Page 30: Xiuwen Liu Department of Computer  Science Florida State University
Page 31: Xiuwen Liu Department of Computer  Science Florida State University

Face detection - continued

Page 32: Xiuwen Liu Department of Computer  Science Florida State University

Rotation Invariant Face Detection

Page 33: Xiuwen Liu Department of Computer  Science Florida State University

Rotation Invariant Face Detection - continued

Page 34: Xiuwen Liu Department of Computer  Science Florida State University

Linear Representations

Linear representations are widely used in appearance-based object recognition and other applications

• Simple to implement and analyze

• Efficient to compute

• Effective for many applications

dT RIUUI ),(

Page 35: Xiuwen Liu Department of Computer  Science Florida State University

Standard Linear Representations

Principal Component Analysis• Designed to minimize the reconstruction error on the training set

• Obtained by calculating eigenvectors of the co-variance matrix

Fisher Discriminant Analysis• Designed to maximize the separation between means of each class

• Obtained by solving a generalized eigen problem

Independent Component Analysis• Designed to maximize the statistical independence among coefficients

along different directions

• Obtained by solving an optimization problem with some object function such as mutual information, negentropy, ....

Page 36: Xiuwen Liu Department of Computer  Science Florida State University

Standard Linear Representations - continued

Standard linear representations are sub optimal for recognition applications• Evidence in the literature

• A toy example– Standard representations give the worst recognition performance

Optimal component analysis

Page 37: Xiuwen Liu Department of Computer  Science Florida State University

Performance Measure - continued

Suppose there are C classes to be recognized• Each class has ktrain training images

• It has kcross cross validation images

• We used h(x) = 1/(1+exp(-2x)

Page 38: Xiuwen Liu Department of Computer  Science Florida State University

Performance Measure - continued

F(U) depends on the span of U but is invariant to change of basis• In other words, F(U)=F(UO) for any orthonormal matrix O

• The search space of F(U) is the set of all the subspaces, which is known as the Grassmann manifold

– It is not a flat vector space and gradient flow must take the underlying geometry of the manifold into account

Page 39: Xiuwen Liu Department of Computer  Science Florida State University

Deterministic Gradient Flow - continued

Gradient at [J] (first d columns of n x n identity matrix)

Page 40: Xiuwen Liu Department of Computer  Science Florida State University

Deterministic Gradient Flow - continued

Gradient at U: Compute Q such that QU=J

Deterministic gradient flow on Grassmann manifold

Page 41: Xiuwen Liu Department of Computer  Science Florida State University

Stochastic Gradient and Updating Rules

Stochastic gradient is obtained by adding a stochastic component

Discrete updating rules

Page 42: Xiuwen Liu Department of Computer  Science Florida State University

MCMC Simulated Annealing Optimization Algorithm

Let X(0) be any initial condition and t=01. Calculate the gradient matrix A(Xt)

2. Generate d(n-d) independent realizations of wij’s

3. Compute Y (Xt+1) according to the updating rules

4. Compute F(Y) and F(Xt) and set dF=F(Y)- F(Xt)

5. Set Xt+1 = Y with probability min{exp(dF/Dt),1}

6. Set Dt+1 = Dt / and set t=t+1

7. Go to step 1

Page 43: Xiuwen Liu Department of Computer  Science Florida State University

ORL Face Dataset

Page 44: Xiuwen Liu Department of Computer  Science Florida State University

Performance Comparison

Page 45: Xiuwen Liu Department of Computer  Science Florida State University

Performance Comparison – cont.

Page 46: Xiuwen Liu Department of Computer  Science Florida State University

Brain Curve Classification

Page 47: Xiuwen Liu Department of Computer  Science Florida State University

Brain Curve Classification – cont.

Page 48: Xiuwen Liu Department of Computer  Science Florida State University

Real-time Scene Interpretation

Object detection and recognition problem• Given a set of images, find regions in these images which

contain instances of relevant objects• Here the number of relevant objects is assumed to be large

– For example, the system should be able to handle 30,000 different kinds of objects, an estimate of the human brain’s capacity for basic level visual categorization [I. Biederman, Psychological Review, vol. 94, pp. 115-147, 1987]

Page 49: Xiuwen Liu Department of Computer  Science Florida State University

Global Monitoring Through High-resolution Satellite Images

Page 50: Xiuwen Liu Department of Computer  Science Florida State University

Problem Statement for Scene Interpretation

Object detection and recognition problem• Given a set of images, find regions in these images which

contain instances of relevant objects• Here the number of relevant objects is assumed to be large

– For example, the system should be able to handle 30,000 different kinds of objects, an estimate of the human’s capacity for basic level visual categorization [I. Biederman, Psychological Review, vol. 94, pp. 115-147, 1987]

Goal • Develop a system that can achieve real-time detection and

recognition for images of size 640 x 480 with high accuracy– Say, at a frame rate of 15 frames per second

Page 51: Xiuwen Liu Department of Computer  Science Florida State University

Existing Approaches

Fast methods but low accuracy• One can for example classify

one pixel at a time

• However, it is to identify airplanes with high accuracy due to high false positives and negatives

Page 52: Xiuwen Liu Department of Computer  Science Florida State University

Existing Approaches – cont.

Fast methods but low accuracy• One can for example classify

one pixel at a time• However, it is to identify

airplanes with high accuracy Methods with good

accuracy but slow• One can in theory use

deformable template matching to locate instances of airplanes

• It may need several hours to process one image

Page 53: Xiuwen Liu Department of Computer  Science Florida State University

Proposed Framework

Page 54: Xiuwen Liu Department of Computer  Science Florida State University

Specifications and Requirements

We want to detect and recognize at least 30,000 object classes in images• At four different scales

• Using exhaustive search of local windows, that is, we do not assume segmentation or other pre-processing

• If we assume objects are in some (e.g. 21 x 21) windows, this means that there will be many (18,432,000) local windows to be classified/processed

• We want to do this on a 3.6 Ghz Dell Precision workstation with an estimated performance of 28,665.4 MIPS

• This amounts to that we have about 1555 instructions to process a 21 x 21 local window

Page 55: Xiuwen Liu Department of Computer  Science Florida State University

Requirements – cont.

To achieve the specifications, we need two critical components• A classifier that can reduce the average classification time

effectively– Note that on average we have 1555 instructions; if we can process

90% of those windows using only 100 instructions per window, we can have on average 14,650 instructions for the remaining 10% local windows

• Features that can discriminate a large number of objects and can be computed using a few instructions

– Do such features exist?

Page 56: Xiuwen Liu Department of Computer  Science Florida State University

Topological Local Spectral Histograms

We introduce a new class of features, which we called TLSH features• It is defined relative to a chosen set of filters

• For a given filter, it is defined as a histogram of a local window of the filtered image

• One bin of the histogram is given by

Page 57: Xiuwen Liu Department of Computer  Science Florida State University

Topological Local Spectral Histogram Example

Convolution is implemented using FPGAs

Page 58: Xiuwen Liu Department of Computer  Science Florida State University

Local Spectral Histogram Features

Page 59: Xiuwen Liu Department of Computer  Science Florida State University

Field Programmable Gate Arrays

• Two primary methods for computation• Hard Wired Application Specific Integrated Circuit (ASIC)

• Software-programmed microprocessors

• New Approach• Programmable hardware

• Field Programmable Gate Arrays (FPGAs) represent a breakthrough in computing technology

– Especially for intrinsically parallel applications

Page 60: Xiuwen Liu Department of Computer  Science Florida State University

μP/ ASIC / FPGA Comparison Summary

μP ASIC FPGAProgrammable (flexible) Fixed Design Functionality (inflexible) Programmable (flexible)

Relatively Slow Serial Computation Very Fast, highly parallelized computation

Fast, Parallel Computation

Floating and Fixed Point Fixed Point / Floating Fixed Point / Floating

Relatively Inexpensive Design Cycle (Software)

Expensive Design Cycle (requires chip design)

Relatively Inexpensive Design Cycle

Limited Bandwidth Very High Bandwidth Near ASIC Bandwidth

Standard High Level Languages C/C++ or Assembly

Hardware Description Language for Design / Simulation

VHDL / Verilog

Hardware Description Language for Design / Simulation

VHDL / Verilog

Page 61: Xiuwen Liu Department of Computer  Science Florida State University

Hardware vs. Software

Sum = 0.0I = 0;While (I < L)

tmp = x(i) * h(i) Sum = Sum + tmp

I = I+1end

A typical software implementation takes 4*L instructions to compute one convolution

1

0

)()(L

kkk hnxny• Software

Implementation:

Page 62: Xiuwen Liu Department of Computer  Science Florida State University

Hardware vs. Software

A custom hardware implementation

Multiply/Accumulate

performed in parallel

Can be done in one clock cycle

Page 63: Xiuwen Liu Department of Computer  Science Florida State University

Convolution Timing Diagram

Convolution Start Signal Clock

All nine response

values finished

Every 7 ClockCycles: 9

new response

values

Page 64: Xiuwen Liu Department of Computer  Science Florida State University

Topological Local Spectral Histograms – cont.

Why TLSH features?• It provides a very rich set of over-complete features

– For example, suppose we have 22 filters, there will be 1,173,942 different TLSH features within a 21 x 21 region, considering different windows and different filters

– TLSH features are more effective than Haar features used by Viola and Jones [P. Viola and M. Jones, International Journal of Computer Vision, vol. 57, pp.

137-154, 2004]

Page 65: Xiuwen Liu Department of Computer  Science Florida State University

ORL Face Dataset

Page 66: Xiuwen Liu Department of Computer  Science Florida State University

Comparison Between Haar and TLSH Features

Page 67: Xiuwen Liu Department of Computer  Science Florida State University

COIL Dataset

Page 68: Xiuwen Liu Department of Computer  Science Florida State University

Comparison Between Haar and TLSH Features

Page 69: Xiuwen Liu Department of Computer  Science Florida State University

Texture Dataset

Page 70: Xiuwen Liu Department of Computer  Science Florida State University

Comparison Between Haar and TLSH Features

Page 71: Xiuwen Liu Department of Computer  Science Florida State University

Mixed Dataset

Page 72: Xiuwen Liu Department of Computer  Science Florida State University

Comparison Between Haar and TLSH Features

Page 73: Xiuwen Liu Department of Computer  Science Florida State University

Comparison Between Haar and TLSH Features

Page 74: Xiuwen Liu Department of Computer  Science Florida State University

Classifier

To achieve the specification, we also need a classifier that takes only a few instructions to make a decision on average• At the same time, we need to achieve high accuracy

We propose to use a look-up table tree classifier• I.e., a decision tree classifier where each node is

implemented by a look-up table

Page 75: Xiuwen Liu Department of Computer  Science Florida State University

Look-up Table Tree Classifier

Page 76: Xiuwen Liu Department of Computer  Science Florida State University

Look-up Table Tree Classifier

Page 77: Xiuwen Liu Department of Computer  Science Florida State University

An Example Path in a Decision Tree

Page 78: Xiuwen Liu Department of Computer  Science Florida State University

Constructing Look-up Table Decision Tree

Joint optimization of clustering, TLSH features, and optimal linear projections• We want to maximize the separations between marginal

distributions of different clusters

• We can do the optimization iteratively– We can do clustering first using current TLSH features and

projections to maximize the separations

– We can find optimal TLSH features given linear projections

– Then we can find optimal linear projections given updated TLSH features

Page 79: Xiuwen Liu Department of Computer  Science Florida State University

Performance Comparison

RCT – Rapid Classification Tree, implemented by Keith Haynes

Page 80: Xiuwen Liu Department of Computer  Science Florida State University

Detection and Recognition

Page 81: Xiuwen Liu Department of Computer  Science Florida State University

Detection and Recognition

Page 82: Xiuwen Liu Department of Computer  Science Florida State University

Shape Theory

We want to quantify the difference between two shapes in a principled way• We do this by constructing a shape space and then use the geodesic

distance of two shapes on the shape manifold as the metric

Page 83: Xiuwen Liu Department of Computer  Science Florida State University

Shape Clustering

Page 84: Xiuwen Liu Department of Computer  Science Florida State University

Shape Clustering

Page 85: Xiuwen Liu Department of Computer  Science Florida State University

Clustering Dendrogram

Page 86: Xiuwen Liu Department of Computer  Science Florida State University

Sulcal Curves

Sulcal curves are important for characterizing brain functions

Page 87: Xiuwen Liu Department of Computer  Science Florida State University

Sulcal Curves

Sulcal curves are important for characterizing brain functions

Page 88: Xiuwen Liu Department of Computer  Science Florida State University

Clustering of Sulcal Curves

Page 89: Xiuwen Liu Department of Computer  Science Florida State University
Page 90: Xiuwen Liu Department of Computer  Science Florida State University

Modeling Mathematical Abilities and Disabilities

As it is possible to acquire detailed surfaces of the human brain, one may ask how characteristics of the brain structure affect the mathematical abilities and disabilities• The U.S. Department of Education wants to know so that they can understand and

find solutions to the mathematical problems young children have

Corpus callosum examples of young children without mathematical disabilities (a) and with (b)

Page 91: Xiuwen Liu Department of Computer  Science Florida State University

SurfaVision – A Surface-based Vision System

One of the challenges is how to build a machine vision that is robust• This has been proven to be very difficult after several decades of

computer vision research We may now have a solution for applications in an indoor environment

Page 92: Xiuwen Liu Department of Computer  Science Florida State University

Multi-Camera Multi-Projector Scanning

Page 93: Xiuwen Liu Department of Computer  Science Florida State University

Surface Parametrization

Page 94: Xiuwen Liu Department of Computer  Science Florida State University

Geodesic Interpolation Between Surfaces

Page 95: Xiuwen Liu Department of Computer  Science Florida State University

Robust Visual Inference

With a common domain for surface representations, we can pose the visual inference in the Bayesian framework by building probability models

Page 96: Xiuwen Liu Department of Computer  Science Florida State University

Human-Robot Collaborative Interaction

The goal is to let robots be aware of the positions, poses, expressions, moods, and other factors of the humans so that robots can interact with humans collaborative

In collaboration with Prof. Emmanuel Collins at the College Engineering

Page 97: Xiuwen Liu Department of Computer  Science Florida State University

Automated 3D Phenotype Measurement

The central problem in biology is to understand the relationship between genotype and phenotype• With availability of genomes of humans and model organisms, the central

problem becomes how to measure phenotype at a large scale

Page 98: Xiuwen Liu Department of Computer  Science Florida State University

3D Urban Models

Page 99: Xiuwen Liu Department of Computer  Science Florida State University
Page 100: Xiuwen Liu Department of Computer  Science Florida State University

Courses

Most Relevant Courses • CAP 5638 Pattern Recognition• CAP 5415 Principles and Algorithms of Computer Vision • CAP 6417 Theoretical Foundations of Computer Vision• STA 5106 Computational Methods in Statistics I • STA 5107 Computational Methods in Statistics I I• Seminars and advanced studies

Related Courses• CAP 5615 Artificial Neural Networks• CAP 5600 Artificial Intelligence• CAP 5xxx Machine Learning

Page 101: Xiuwen Liu Department of Computer  Science Florida State University

Funding of the Group

National Science Foundation• DMS • CISE IIS• FRG• ACT• CCF

NGA – National Geo-spatial Intelligence Agency Army Research Office

• DURIP• Research grant

Companies• Next Century and others under negotiation

Page 102: Xiuwen Liu Department of Computer  Science Florida State University

Summary

CAVIS group and FSvision group offer interesting research topics/projects• Efficient represent for generic images• Real-time detection and recognition• Computational models for object recognition and image

classification• Medical image analysis• Motion/video sequence analysis and modeling

• They are challenging• They are interesting• They are exciting

Page 103: Xiuwen Liu Department of Computer  Science Florida State University

Contact Information

• Name Xiuwen Liu• Web sites http://cavis.fsu.edu

http://fsvision.fsu.edu

http://www.cs.fsu.edu/~liux• Email [email protected]• Offices LOV 166 and 118 North Woodward Ave.

• Phones 644-0050 and 645-2257

Page 104: Xiuwen Liu Department of Computer  Science Florida State University

Thank you!

Any questions?