safeguarding and charging for information on the internet hector garcia-molina, steven p. ketchpel,...

13
Safeguarding and Charging Safeguarding and Charging for Information on the for Information on the Internet Internet Hector Garcia-Molina, Steven P. Hector Garcia-Molina, Steven P. Ketchpel, Narayanan Shivakumar Ketchpel, Narayanan Shivakumar Stanford University Stanford University Presented by: Min Ren Presented by: Min Ren

Upload: violet-cameron

Post on 17-Dec-2015

212 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Safeguarding and Charging for Information on the Internet Hector Garcia-Molina, Steven P. Ketchpel, Narayanan Shivakumar Stanford University Presented

Safeguarding and Charging for Information Safeguarding and Charging for Information on the Interneton the Internet

Hector Garcia-Molina, Steven P. Ketchpel, Narayanan Hector Garcia-Molina, Steven P. Ketchpel, Narayanan ShivakumarShivakumar

Stanford UniversityStanford University

Presented by: Min Ren Presented by: Min Ren

Page 2: Safeguarding and Charging for Information on the Internet Hector Garcia-Molina, Steven P. Ketchpel, Narayanan Shivakumar Stanford University Presented

OutlineOutline

Introduction

E-Commerce Components

Safeguarding Information

Conclusion

Page 3: Safeguarding and Charging for Information on the Internet Hector Garcia-Molina, Steven P. Ketchpel, Narayanan Shivakumar Stanford University Presented

IntroductionIntroduction With the rapid expansion of the Internet Population,

publishers are tempted to publish on the Internet to cut down on printing and distribution costs. They need an e-commerce infrastructure!

With the documents transmitted electronically, one concern is how to safeguard the information. This article addresses this problem by providing possible solutions.

Page 4: Safeguarding and Charging for Information on the Internet Hector Garcia-Molina, Steven P. Ketchpel, Narayanan Shivakumar Stanford University Presented

E-Commerce ComponentsE-Commerce Components Shopping Systems

These mechanisms enable a customer to search for a product and select goods for purchase.

Payment Systems

These mechanisms enable a customer to make the payment.

Delivery Systems

These mechanisms deliver goods to customers. Special safeguards are required for e-contents.

Page 5: Safeguarding and Charging for Information on the Internet Hector Garcia-Molina, Steven P. Ketchpel, Narayanan Shivakumar Stanford University Presented

Risks of e-content DeliveryRisks of e-content Delivery

when the information is transmitted electronically, provider runs the risk of the content being copied and made available to everyone through the World Wide Web.

Page 6: Safeguarding and Charging for Information on the Internet Hector Garcia-Molina, Steven P. Ketchpel, Narayanan Shivakumar Stanford University Presented

Possible SolutionsPossible Solutions• Copy-protection strategies (prevention)

1. Physical isolation (e.g. on CD-ROMs accessible only through special-purpose systems)

2 . Special-purpose hardware for authorization.

3. IBM’s Cryptolopes (content is encrypted; users are given a decryption key after their payment; users view the content through a “trusted viewer”).

Page 7: Safeguarding and Charging for Information on the Internet Hector Garcia-Molina, Steven P. Ketchpel, Narayanan Shivakumar Stanford University Presented

Possible SolutionsPossible Solutions

1. Watermarks (detect additional information in distributed copies)

2. Copy Detection (detect exact copies by comparison to originals)

• Watermarks and Copy Detection (detection):

Page 8: Safeguarding and Charging for Information on the Internet Hector Garcia-Molina, Steven P. Ketchpel, Narayanan Shivakumar Stanford University Presented

CDS (Copy Detection System)CDS (Copy Detection System) Registration Repository

a database to keep registered documents, which are checked with the query documents to find possible copies.

Crawler

a mechanism to crawl through the WWW and present a stream of documents and images to the CDS.

Similarity Matcher

a mechanism to compare the query object to objects in the database, if they are match, the owner is notified.

Page 9: Safeguarding and Charging for Information on the Internet Hector Garcia-Molina, Steven P. Ketchpel, Narayanan Shivakumar Stanford University Presented

SCAMSCAM SCAM (Stanford Copy Analysis Mechanism)

This is a “proof-of-concept” CDS system implemented by Stanford for text documents.

Example

Given a document that may have a copy at one of several databases B1,….,Bi. How can we find this copy?

Page 10: Safeguarding and Charging for Information on the Internet Hector Garcia-Molina, Steven P. Ketchpel, Narayanan Shivakumar Stanford University Presented

SCAMSCAM― Registration

The publisher registers the document at the Registration Server. “Chunking” is a method used to break documents into smaller

primitives. The document is chunked and the primitives are stored in the

database.― Crawling

How to choose a Bi for finer testing?

Firstly, to generate a detection query based on important words in the document, submit it to each database, choose the database with high number of documents; Secondly, generate an extraction query to extract the documents that may be copied.

Page 11: Safeguarding and Charging for Information on the Internet Hector Garcia-Molina, Steven P. Ketchpel, Narayanan Shivakumar Stanford University Presented

Implementation of CDSImplementation of CDS― Similarity Matching

Chunking The query object is chunked and compared with the document registered in the database; The same rules of chunking in registration step is applied here.

Filtering Expensive Tests “false negatives” (fail to detect documents that did overlap significantly). “false positives” (indicate two documents overlap when they do not). We can compose these tests to obtain better accuracy and performance.

Page 12: Safeguarding and Charging for Information on the Internet Hector Garcia-Molina, Steven P. Ketchpel, Narayanan Shivakumar Stanford University Presented

ConclusionConclusion

This paper addressed some problems in building a good commerce infrastructure.

The measures of how to efficiently safeguard illegal copies are discussed.

Page 13: Safeguarding and Charging for Information on the Internet Hector Garcia-Molina, Steven P. Ketchpel, Narayanan Shivakumar Stanford University Presented

Questions??Questions??

What is the relationship between watermarks and

CDS?