captcha and recaptcha seminar

25
CAPTCHA Vivek Maskara Vaibhav Goyal Khoshow

Upload: vivek-maskara

Post on 25-May-2015

480 views

Category:

Engineering


4 download

DESCRIPTION

Seminar presentation on captcha and re-captcha for Computer Science

TRANSCRIPT

Page 1: Captcha and Recaptcha Seminar

CAPTCHA

Vivek MaskaraVaibhav GoyalKhoshow

Page 2: Captcha and Recaptcha Seminar

AGENDADEFINITIONBACKGROUNDTYPESAPPLICATIONSCONSTRUCTING CAPTCHABREAKING CAPTCHAISSUES WITH CAPTCHACONCLUSION

Page 3: Captcha and Recaptcha Seminar

INTRODUCTIONCAPTCHA – Completely Automated Public Turing test to tell Computers & Humans Apart.

Invented at CMU by Luis von Ahn, Manuel Blum, et.al.

It is a program that is a challenge response to test to separate humans from computer programs.

Page 4: Captcha and Recaptcha Seminar

Generic CAPTCHAs distort letters & numbers -

Distorted characters are presented to the user.

User has to recognize the distorted letters.

If the guessed letters are correct, the user is inferred to be a human & allowed access.

Page 5: Captcha and Recaptcha Seminar

Contd…Humans can read the distorted & noisy text.

Current OCRs(Optical Character Recognition) cannot read them.

Page 6: Captcha and Recaptcha Seminar

BACKGROUNDWhy CAPTCHA was needed ?

Sabotage of Online Polls. Spam e-mails. Abusing free Online accounts. Tampering with rankings on recommendation systems (like Ebay, Amazon)

Page 7: Captcha and Recaptcha Seminar

What is TURING TEST ?Proposed by Alan Turing.To test a machine’s level of intelligence.Human judge asks questions to two

participants, one is a machine & the other human.

The judge doesn’t know which is which.After listening to the answer, if the judge

fails to recognize which one is the machine, then the machine passes the test.

Page 8: Captcha and Recaptcha Seminar

Contd…CAPTCHA employs a Reverse Turing Test.

Judge = CAPTCHA program, participant = user

If the user passes CAPTCHA, he is human otherwise it is a machine.

Page 9: Captcha and Recaptcha Seminar

Types of CAPTCHAs

Page 10: Captcha and Recaptcha Seminar

1. Text Based-simple, normal questions :-

What is the sum of three & thirty-five ?If today is Saturday, what is day after

tomorrow ?Which of mango, table & water is a

fruit ?Very effective, needs a large question

bank.Congnitively challenged users find it hard.

Page 11: Captcha and Recaptcha Seminar

2. Gimpy-Designed by Yahoo & CMU(Carnegie Mellon

University)Picks up 10 random words from dictionary &

distorts, fills with noise.User has to recognize at least 3 words.If the user is correct, then he is admitted.

Page 12: Captcha and Recaptcha Seminar

3. EZ-Gimpy-A modified version of Gimpy.Yahoo used this version in Messenger.Has only 1 random string of characters.Not a dictionary word, so not prone to

dictionary attack.Not a good implimentation , already

broken by OCRs(Optical Character Recognition).

Page 13: Captcha and Recaptcha Seminar

4. MSNs passport service CAPATCHAs-Provided for Microsoft’s MSN services.Use of 8 characters.Warping is used to distort.Very strong implementation, hasn’t been

broken.It is segmentation-resistant.

Page 14: Captcha and Recaptcha Seminar

5. Graphic based CAPTCHAs-1. BONGO-

After M.M.Bongard, pattern recognition expert.

User has to solve a pattern recognition problem.

Has to tell the distinct characteristic between two sets of figures.

Then tell to which set a given figure belongs to.

Page 15: Captcha and Recaptcha Seminar

Contd…2. PIX-

Uses a large database of labelled images.

It shows a set of images, user has to recognize the common feature among those.

Eg :- pick the common characteristic among the following 4 pictures = “aeroplane”.

Page 16: Captcha and Recaptcha Seminar

6. Audio CAPTCHAs-Consists of downloadable audio clip.User listens & enters the spoken word.Helps visually disabled users.Below is the Google’s audio enabled

CAPTCHA-

Page 17: Captcha and Recaptcha Seminar

7. Applications-Protect Online polls.Prevent web registration abuse,

protect passwords from brute-force attack.

Prevent comment spam & spam e-mails.

E-ticketing, prevent scalping.

Page 18: Captcha and Recaptcha Seminar

Contd…Verify digitized books : “RE-CAPTCHA”

Used in Google books project.Two words are shown, the program knows the

first word.If the user enter the first word correctly, it

assumes that the second unknown word will also be entered correctly.

Second word becomes “known”.

Page 19: Captcha and Recaptcha Seminar

Constructing CAPTCHAsThings to keep in mind :-

Don’t store CAPTCHA solution in web page’s metadata.

A CAPTCHA is no good if it doesn’t distort.

Need a large database of different CAPTCHA questions.

Avoid repetition of question.

Page 20: Captcha and Recaptcha Seminar

CAPTCHA logicGenerate the questionPersist the correct answerPresent the question to the userEvaluate the answer, if incorrect

start again- Generate a different CAPTCHA

If correct allow the access to the user

Page 21: Captcha and Recaptcha Seminar

Breaking CAPTCHAsCracking CAPTCHAs through

programs – Convert CAPTCHA into Grey scale.Detect patterns in the image corresponding to the characters

Greg Mori & Jitendra Malik have broken text CAPTCHAs

Ex:- Easy Gimpy

Page 22: Captcha and Recaptcha Seminar

Contd…Social engineering to break CAPTCHAs –

Spammer encounters a CAPTCHAThat CAPTCHA is copied to another siteHumans are baited, Ex:- free MP3s, free

wallpapers, etc.To get those MP3s or wallpapers, users are

told to solve the copied CAPTCHA.Then the solution is routed back to the

spammer.Solution – Fix a time-to-live period for a

question.

Page 23: Captcha and Recaptcha Seminar

Issues with CAPTCHAsUsability issue –

W3C mandates web to be accessible to all people.

Some CAPTCHAs are inaccessible to visually impaired, cognitively challenged people.

Compatibility issue –Java script may be needed to be activated

in browsers.Some may need Adobe Flash Plugin.

Page 24: Captcha and Recaptcha Seminar

SUMMARYCAPTCHAs are an effective way to counter bots & reduce spam.

They help advance AI knowledge.Some issues with current implementations represent challenges for future improvements.

Page 25: Captcha and Recaptcha Seminar

THANK YOU