extending propositional satisfiability to determine minimal fuzzy-rough reducts

16
Richard Jensen, Andrew Tuson and Qiang Shen Qiang Shen Aberystwyth University, UK Richard Jensen Aberystwyth University, UK Andrew Tuson City University, UK Extending Propositional Satisfiability to Determine Minimal Fuzzy-Rough Reducts

Upload: tacey

Post on 23-Feb-2016

47 views

Category:

Documents


0 download

DESCRIPTION

Extending Propositional Satisfiability to Determine Minimal Fuzzy-Rough Reducts. Outline. The importance of feature selection Rough set theory Fuzzy-rough feature selection (FRFS) FRFS-SAT Experimentation Conclusion. Feature selection. Why dimensionality reduction/feature selection? - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Extending Propositional Satisfiability to Determine Minimal Fuzzy-Rough Reducts

Richard Jensen, Andrew Tuson and Qiang Shen

Qiang ShenAberystwyth University, UK

Richard JensenAberystwyth University, UK

Andrew TusonCity University, UK

Extending Propositional Satisfiability to Determine Minimal Fuzzy-Rough

Reducts

Page 2: Extending Propositional Satisfiability to Determine Minimal Fuzzy-Rough Reducts

Richard Jensen, Andrew Tuson and Qiang Shen

Outline• The importance of feature selection

• Rough set theory

• Fuzzy-rough feature selection (FRFS)

• FRFS-SAT

• Experimentation

• Conclusion

Page 3: Extending Propositional Satisfiability to Determine Minimal Fuzzy-Rough Reducts

Richard Jensen, Andrew Tuson and Qiang Shen

• Why dimensionality reduction/feature selection?

• Growth of information - need to manage this effectively• Curse of dimensionality - a problem for machine learning

High dimensionaldata

DimensionalityReduction

Low dimensionaldata

Processing System

Intractable

Feature selection

Page 4: Extending Propositional Satisfiability to Determine Minimal Fuzzy-Rough Reducts

Richard Jensen, Andrew Tuson and Qiang Shen

Rough set theory

Rx is the set of all points that are indiscerniblewith point x in terms of feature subset B

UpperApproximation

Set A

LowerApproximation

Equivalence class Rx

Page 5: Extending Propositional Satisfiability to Determine Minimal Fuzzy-Rough Reducts

Richard Jensen, Andrew Tuson and Qiang Shen

Discernibility approach• Decision-relative discernibility matrix

• Compare objects• Examine attribute values• For attributes that differ:

• If decision values differ, include attributes in matrix• Else leave slot blank

• Construct discernibility function:

)}()( ),()(|{ jijiij xdxdxaxaCac

}|,|1:)({),...,( 1 ijijmC cUijcaaf

Page 6: Extending Propositional Satisfiability to Determine Minimal Fuzzy-Rough Reducts

Richard Jensen, Andrew Tuson and Qiang Shen

Example

• Remove duplicatesfC(a,b,c,d) = {a b c d} {a c d} ⋁ ⋁ ⋁ ⋀ ⋁ ⋁ ⋀

{b c} {d} {a b c} ⋁ ⋀ ⋀ ⋁ ⋁ ⋀{a b d} {b c d} {a d}⋁ ⋁ ⋀ ⋁ ⋁ ⋀ ⋁

• Remove supersetsfC(a,b,c,d) = {b c} {d}⋁ ⋀

Page 7: Extending Propositional Satisfiability to Determine Minimal Fuzzy-Rough Reducts

Richard Jensen, Andrew Tuson and Qiang Shen

Finding reducts• Usually too expensive to search exhaustively for

reducts with minimal cardinality

• Reducts found through:• Converting from CNF to DNF (expensive)• Hill-climbing search using clauses (non-optimal)• Other search methods - GAs etc (non-optimal)

• RSAR-SAT• Solve directly in SAT formulation.• DPLL approach is both fast and ensures optimal

reducts

Page 8: Extending Propositional Satisfiability to Determine Minimal Fuzzy-Rough Reducts

Richard Jensen, Andrew Tuson and Qiang Shen

Fuzzy discernibility matrices• Extension of crisp approach

• Previously, attributes had {0,1} membership to clauses• Now have membership in [0,1]• Allows real-coded data as well as nominal.

• Fuzzy DMs can be used to find fuzzy-rough reducts

Page 9: Extending Propositional Satisfiability to Determine Minimal Fuzzy-Rough Reducts

Richard Jensen, Andrew Tuson and Qiang Shen

Formulation• Fuzzy satisfiability

• In crisp SAT, a clause is fully satisfied if at least one variable in the clause has been set to true

• For the fuzzy case, clauses may be satisfied to a certain degree depending on which variables have been assigned the value true

Page 10: Extending Propositional Satisfiability to Determine Minimal Fuzzy-Rough Reducts

Richard Jensen, Andrew Tuson and Qiang Shen

Experimentation: setup• 9 benchmark datasets

• Features – 10 to 39• Objects – 120 to 690

• Methods used: • FRFS-SAT• Greedy hill-climbing: fuzzy dependency, fuzzy boundary region and

fuzzy discernibility.• Evolutionary algorithms: genetic algorithms (GA) and particle swarm

optimization (PSO) using fuzzy dependency

• 10x10-fold cross validation• FS performed on the training folds, test folds reduced using

discovered reducts

Page 11: Extending Propositional Satisfiability to Determine Minimal Fuzzy-Rough Reducts

Richard Jensen, Andrew Tuson and Qiang Shen

Experimentation: results

Page 12: Extending Propositional Satisfiability to Determine Minimal Fuzzy-Rough Reducts

Richard Jensen, Andrew Tuson and Qiang Shen

Conclusion• Extended propositional satisfiability to enable

search for fuzzy-rough reducts• New framework for fuzzy satisfiability• New DPLL algorithm• Fuzzy clause simplification

• Future work:• Non-chronological backtracking• Better heuristics• Unsupervised FS• Other extensions in propositional satisfiability

Page 13: Extending Propositional Satisfiability to Determine Minimal Fuzzy-Rough Reducts

Richard Jensen, Andrew Tuson and Qiang Shen

• WEKA implementations of all fuzzy-rough feature selectors and classifiers can be downloaded from:

http://users.aber.ac.uk/rkj/book/weka.zip

Page 14: Extending Propositional Satisfiability to Determine Minimal Fuzzy-Rough Reducts

Richard Jensen, Andrew Tuson and Qiang Shen

Feature selection• Feature selection (FS) is a DR technique that

preserves data semantics (meaning of data)

• Subset generation: forwards, backwards, random…• Evaluation function: determines ‘goodness’ of subsets• Stopping criterion: decide when to stop subset search

Generation Evaluation

StoppingCriterion Validation

Feature set Subset

Subsetsuitability

Continue Stop

Page 15: Extending Propositional Satisfiability to Determine Minimal Fuzzy-Rough Reducts

Richard Jensen, Andrew Tuson and Qiang Shen

Algorithm

Page 16: Extending Propositional Satisfiability to Determine Minimal Fuzzy-Rough Reducts

Richard Jensen, Andrew Tuson and Qiang Shen

Example