tomer sagi and avigdor gal technion - israel institute of technology non-binary evaluation for...
TRANSCRIPT
Tomer Sagi and Avigdor GalTechnion - Israel Institute of Technology
Non-binary Evaluation for Schema Matching
Presentation @ ER 2012October 2012, Florence Italy
Presentation Outline
Background• Schema Matching
Schema Matching Evaluation• Current model: Set based Precision and Recall• Proposed Model: Similarity Spaces, a vector-space model• Non-Binary measures
Usage example:• Tuning schema matchers using Non-binary measure
2
3
BackgroundSchema Matching
Schema matching is the task of providing correspondences between concepts describing the meaning of data
Schema matching is recognized to be a basic operation required by data integration and web-query interface integration
BackgroundSchema Matching: Schemas
Schemas contain attributes
Each attributes may have a name, label, type, domain (allowed values), instances, etc.
Structural links and relationships are defined between attributes
4
First Name
Last Name
Gender
What are your favourite hobbies
Requested Password:
Password confirmation:
Yes
Register
Web-form Schema Small Business-Document Schema
With 5 concepts
BackgroundSchema Matching: First Line Matchers
First line matchers (a.k.a similarity measures) compare two schemas, generating correspondences between them
Each correspondence is assigned a confidence value over [0,1] The results is often represented as a similarity matrix:
5
0.32
0.64
0.84
0.350.62
BackgroundSchema Matching: Second Line Matchers
Second line matchers transform similarity matrices Filters transform a matrix by removing those values which do not
satisfy the constraint function. Examples: Threshold, MaxDelta.
6
Similarity Matrix
Transformed Similarity Matrix
BackgroundSchema Matching: Second Line Matchers
Second line matchers transform similarity matrices. Filters transform a matrix by removing those values which do not
satisfy the constraint function. Examples: Threshold, MaxDelta. Decision makers transform a matrix by changing the values of some
correspondence to 1 and the rest to 0.
7
Similarity Matrix
Binary Similarity Matrix
Schema matching systems employ various first and second line matchers; their results are composed, aggregated and combined.
Schema Matching SystemUncertain
Schema Matching System
BackgroundSchema Matching: Systems
8
String Matcher
Domain Matcher
Parent-Child Matcher
Instance Matcher
Aggregation
Filter
Decision maker
Schema Pair
Binary Similarity Matrix
Binary Similarity Matrix
1st Line Matchers 2nd Line Matchers
Presentation Outline
Background• Schema Matching
Schema Matching Evaluation• Current model: Set based Precision and Recall• Proposed Model: Similarity Spaces, a vector-space model• Non-Binary measures
Usage example:• Tuning schema matchers using Non-binary measure
9
Exact Match
Current evaluation model provides measures for evaluating a complete system using set-based measures
Major shortcoming: Evaluation of individual components (e.g. first line matchers) and non-binary results (uncertain schema matching systems) is undefined.
Schema Matching EvaluationCurrent Model
10
Schema Matching
System
String Matcher
Domain Matcher
Parent-Child Matcher
Instance Matcher
Aggregation
Filter
Decision maker
Schema Pair
Binary Similarity Matrix
True Positive (TP)
False Negative
(FN)
False Positive
(FP)
Precision = FPTP
TP
Recall =FNTP
TP
Schema Matching Evaluation
Begin with a similarity matrix:
Taking each entry as an element in a vector transforms this matrix to a similarity vector:
(0.84,0.29,0.34,0.32,1.00,0.33,0.32,0.33,0.35,0.30,0.30,0.64)
11
Similarity Spaces: A Vector Space Model
S1
S2
1 cardNum 2 city 3 arrivalDay 4 checkInTime
1 clientNum 0.84 0.32 0.32 0.30
2 city 0.29 1.00 0.33 0.30
3 checkInDay 0.34 0.33 0.35 0.64
Schema Matching Evaluation
We propose a Vector Space model for evaluation
Dimensions are possible correspondences between an attribute pair
Vectors are matching results
12
Similarity Spaces: A Vector Space Model
S1
S2
1 cardNum 2 city 3 arrivalDay 4 checkInTime
1 clientNum 1 0 0 0
2 city 0 1 0 0
3 checkInDay 0 0 0 1
S1
S2
1 cardNum 2 city 3 arrivalDay 4 checkInTime
1 clientNum 0.84 0.32 0.32 0.30
2 city 0.29 1.00 0.33 0.30
3 checkInDay 0.34 0.33 0.35 0.64
The Schema Matching Evaluation Problem
13
The Schema Matching Evaluation Problem
K
Informed?
0 1 2 >2
Yes?
Non-BinaryBinary
Non-BinaryBinary
No Non-BinaryBinary
Non-BinaryBinary
Non-BinaryBinary
Area in green marks where most research has focused to date. Areas in Yellow designate limited work done.
Schema Matching Evaluation
Over this vector space, evaluation functions are defined:
For example, the well known precision and recall functions are functions of two vectors ( v = outcome of a decision maker. ve = exact match ) :
Similarity Spaces: A Vector Space Model
14
5.02
1,1
1
1
2,1
1
1,1)1,0(
REPR
e
e
e
gg
vv
vv
vv
Schema Matching Evaluation
Accommodating non-binary evaluation is now trivial: allow v to be non-binary
Non-binary measures
15
49.02
99.0,99.0
1
99.0
2,99.0
99.0
1,1)35.0,64.0(
REPR
e
e
e
gg
vv
vv
vv
Implications:
Schema Matching SystemUncertain
Schema Matching System
Schema Matching EvaluationNon-binary measures - Implications
16
String Matcher
Domain Matcher
Parent-Child Matcher
Instance Matcher
Aggregation
Filter
Decision maker
Schema Pair
Binary Similarity Matrix
Binary Similarity Matrix
1st Line Matchers 2nd Line Matchers
Can evaluate individual 1st line matchers
Can evaluate Interim Results
Can evaluate Uncertain Results
Match DistanceSometimes you need a metric…
We define two complementary distance metrics:
17
Match DistanceBehavior vs. NBPrecision and NBRecall
Results of synthetic evaluation
18
Match DistanceBehavior vs. NBPrecision and NBRecall
Results of synthetic evaluation
19
Nonsense solution of increasing magnitudeNoisy solution with increasingly strict filter applied
Schema Matching EvaluationPredictors
Background Model Evaluation
EXACT MATCH
What if you don’t have an exact match?
In most applications, this is the case…
Schema Matching Evaluation
Predictors are a special class of schema matching evaluation methods, which do not use an exact match as part of the input.
Predictors can be classified into two subclasses:• Internalizers that refer to the internal structure of a vector as an
indication of match quality (e.g. max , stdev, average)• Idealizers assume the existence of an ideal vector and compare with it
Predictors
Background Model Evaluation
Schema Matching EvaluationIdealizers
Background Model Evaluation
(Ideal Vector)
Schema Matching Evaluation
Desired design properties:• Tunable: We should be able to tune predictors towards the desired
quality in a specific scenario.• Generalizable: Predictors should be based upon principles which are
applicable at several levels of granularity and can be specialized to some levels of granularity.
Desired empirical properties:• Correlated: Well correlated with the quality they are designed to
evaluate• Robust: Correlations are robust and statistically significant over varied
matching systems and datasets
23
Desired Properties of Predictors
Background Model Evaluation
Using PredictionTunable Prediction Models
24
Loosely correlated predictors can be composed into a model. The weights of its participating predictors can be tuned Construction by (multiple) step-wise regression.
Using PredictionTunable Prediction Models
25
Added bonus: Increased correlation
Schema Matching Evaluation
Consider the following example and how a matrix level vs. an attribute level predictor would fare in it
26
Why Granularity Matters
Exact Match Matcher Vectors
The Schema Matching Evaluation Problem
27
The Schema Matching Evaluation Problem
K
Informed?
0 1 2 >2
Yes?
Non-BinaryBinary
Non-BinaryBinary
No Non-BinaryBinary
Non-BinaryBinary
Non-BinaryBinary
Area in green marks where most research has focused to date. Areas in Yellow designate limited work done.
Presentation Outline
Background• Schema Matching
Schema Matching Evaluation• Current model: Set based Precision and Recall• Proposed Model: Similarity Spaces, a vector-space model• Non-Binary measures
Usage examples• Tuning schema matchers using Non-binary measure• Weighting ensembles using attribute-level prediction
28
Usage Examples
First-line matcher named Term has tunable parameter label score weight (α) defining the relative weight of the term’s label and name
Tuning can be done via machine learning methods or statistical methods
All tuning methods benefit from: • Smoothness: Gradual changes in α gradual changes in measure• Robustness: Observed behavior is robust w.r.t number of test cases
Tuning scenario
29
nameScore-1labelScore
Label
Name: leaveSlice
Usage ExampleSmoothness
30
To use binary precision, a decision maker is required. Introducing a decision maker causes random noise caused by arbitrary
thresholds
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 10
0.02
0.04
0.06
0.08
0.1
0.12
0.14
0.16
Precision Binary Measures
Threshold(0.2)Threshold(0.25)Stable Marriage
Label score weight
Prec
ision
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 10
0.020.040.060.08
0.10.120.140.160.18
0.2
Non Binary Precision
Label score weight
NBP
reci
sion
Usage ExampleRobustness – effect of sample size
31
An outlier in schema pair no.2 causes Binary-precision (fig. (b)) to diverge greatly from the eventual observed behavior with 10 pairs.
Unperturbed by outliers, NBPrecision (fig. (a)) displays robust behavior
(pairs) (pairs)
Using PredictionDynamic Prediction Models - Results
32
Conclusions
Non-binary measures are a useful addition to the schema matching evaluation tool-kit.
Non-binary evaluation presents desired characteristics in tuning scenarios (smoothness and robustness)
Using the similarity vector space model we can generate additional measures, breaking from traditional measures (e.g. binary precision and recall) to measures more attuned to modern schema matching needs.
33
Thank You
34
Questions?