Review Summarization System

Team 22: M Manoj Kumar, Srinath Ravichandran, Dharmesh Kakadia, Sandhya S (201107502, 201107625, 201107616, 201107617)

Upload: dharmesh-kakadia

Post on 07-Jul-2015


Category: Technology



DESCRIPTION

This is my project for the Cloud Computing course.

TRANSCRIPT

Page 1: Review Summarization System

Team 22: M Manoj Kumar, Srinath Ravichandran, Dharmesh Kakadia, Sandhya S (201107502, 201107625, 201107616, 201107617)

REVIEW SUMMARY SYSTEM

Page 2: Review Summarization System

OVERVIEW

•  System to summarize reviews from various sources

•  Users can view and compare products based on features

•  Results are exposed as a RESTful web service

•  Ability to cater to different products

Page 3: Review Summarization System

OVERALL WORK FLOW

Feature Extraction → Sentiment Analysis → Sentiment Classification

Page 4: Review Summarization System

DETAILED FLOW CHART

Reviews → Parse and Tag → Feature Extraction

Feature DB

Opinion DB

•  Once for a category of product

•  Nouns #frequency

•  Adjectives #frequency

•  Classifier is designed based on this data.
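
The noun and adjective frequency counts above can be sketched as follows. This is a minimal illustration, not the team's implementation: it assumes the reviews are already part-of-speech tagged with Penn Treebank tags, and the `min_count` cutoff and the sample sentences are hypothetical.

```python
from collections import Counter


def count_candidates(tagged_sentences):
    """Count nouns and adjectives in POS-tagged sentences.

    Each sentence is a list of (word, tag) pairs with Penn Treebank tags
    (assumed input format; the tagger itself is not shown here).
    """
    nouns, adjectives = Counter(), Counter()
    for sentence in tagged_sentences:
        for word, tag in sentence:
            token = word.lower()
            if tag.startswith("NN"):      # NN, NNS, NNP, NNPS
                nouns[token] += 1
            elif tag.startswith("JJ"):    # JJ, JJR, JJS
                adjectives[token] += 1
    return nouns, adjectives


def frequent(counter, min_count):
    """Keep entries seen at least min_count times (hypothetical cutoff)."""
    return {word: n for word, n in counter.items() if n >= min_count}


# Hypothetical sample input, for illustration only.
tagged = [
    [("The", "DT"), ("battery", "NN"), ("is", "VBZ"), ("great", "JJ")],
    [("Great", "JJ"), ("screen", "NN"), ("and", "CC"), ("battery", "NN")],
]
nouns, adjectives = count_candidates(tagged)
candidate_features = frequent(nouns, 2)
candidate_opinions = frequent(adjectives, 2)
```

Frequent nouns serve as candidate features and frequent adjectives as candidate opinion words; since this step runs once per product category, the counts can be stored and reused.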

Page 5: Review Summarization System

Review

•  Raw review

Sentence Pruning

•  Preprocess data

<features>

•  List of valid features

Dependency relations

•  Using Stanford Parser

<Feature Opinions>

Semantic Analyzer

<Ratings>

NoSQL (mongo)

Feature DB

•  Each sentence is passed through NLP logic.

•  Features are extracted and rated according to the opinion of the sentence.
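
One way to picture the step from dependency relations to ratings is the sketch below. It is an assumption-laden illustration: the input is taken to be `(relation, governor, dependent)` triples using the Stanford typed-dependency names `amod`, `nsubj`, and `neg`, and the `POLARITY` table is a hypothetical stand-in for a sentiment lexicon, not the scoring the team used.

```python
def feature_opinions(dependencies, features):
    """Pair known features with the opinion words attached to them.

    dependencies: (relation, governor, dependent) triples for one sentence
    (assumed format). features: set of valid feature names.
    """
    # Words that carry a negation, e.g. neg(good, not) in "not good".
    negated = {gov for rel, gov, dep in dependencies if rel == "neg"}
    pairs = []
    for rel, gov, dep in dependencies:
        if rel == "amod" and gov in features:      # "great battery"
            feature, opinion = gov, dep
        elif rel == "nsubj" and dep in features:   # "battery is great"
            feature, opinion = dep, gov
        else:
            continue
        pairs.append((feature, opinion, opinion in negated))
    return pairs


# Hypothetical polarity scores standing in for a sentiment lexicon.
POLARITY = {"great": 1.0, "good": 0.5, "poor": -0.5, "terrible": -1.0}


def rate(pairs, polarity=POLARITY):
    """Turn (feature, opinion, negated) triples into per-feature ratings.

    Unknown opinion words score 0.0; if a feature appears more than once
    in a sentence, the last opinion wins in this simplified version.
    """
    ratings = {}
    for feature, opinion, is_negated in pairs:
        score = polarity.get(opinion, 0.0)
        ratings[feature] = -score if is_negated else score
    return ratings


# "The battery is great" and "the screen is not good" (hypothetical parse).
deps = [
    ("nsubj", "great", "battery"),
    ("cop", "great", "is"),
    ("neg", "good", "not"),
    ("nsubj", "good", "screen"),
]
ratings = rate(feature_opinions(deps, {"battery", "screen"}))
```

Restricting matches to the list of valid features keeps stray nouns out of the ratings, which is why the feature list is an input to this step.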

Page 6: Review Summarization System

PARALLELIZING WITH HADOOP

Mapper (Tag the Review)

mobile1 mobile2 mobile3

Reducer Reducer Reducer

mobile2

Summary Database
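
The map and reduce roles in this slide can be simulated in plain Python, without Hadoop, to show the data flow. This is a sketch under stated assumptions: review tagging is stubbed out (each review already carries its feature ratings), the product name is assumed to be the shuffle key, and the function names and sample reviews are hypothetical.

```python
from collections import defaultdict


def map_review(review):
    """Mapper: emit (product, (feature, rating)) pairs for one review.

    In the slide the mapper also tags the review; here the ratings are
    assumed to be present already.
    """
    for feature, rating in review["ratings"].items():
        yield review["product"], (feature, rating)


def reduce_product(product, values):
    """Reducer: average the ratings of each feature for one product."""
    totals = defaultdict(lambda: [0.0, 0])   # feature -> [sum, count]
    for feature, rating in values:
        totals[feature][0] += rating
        totals[feature][1] += 1
    return {
        "product": product,
        "features": {f: s / n for f, (s, n) in totals.items()},
    }


def run_job(reviews):
    """Run map, shuffle (group by product), and reduce in one process."""
    grouped = defaultdict(list)
    for review in reviews:
        for key, value in map_review(review):
            grouped[key].append(value)
    return [reduce_product(key, vals) for key, vals in sorted(grouped.items())]


# Hypothetical tagged reviews, for illustration only.
reviews = [
    {"product": "mobile1", "ratings": {"battery": 1.0}},
    {"product": "mobile2", "ratings": {"battery": 0.5, "screen": -0.5}},
    {"product": "mobile2", "ratings": {"battery": -0.5}},
]
summary = run_job(reviews)
```

Keying on the product means every rating for one product reaches the same reducer, so each reducer can write a complete summary record on its own.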

Page 7: Review Summarization System

DATABASE SCHEMA

Trained Data

•  Nouns #

•  Modifiers #

Tagged Reviews

•  Features

•  Ratings

•  Review Text

Review Summary

•  Features

•  Average Rating

Product X
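
As an illustration of the three record types, the documents might look like the Python dictionaries below. All field names and values are hypothetical placeholders chosen to mirror the slide; the actual collection layout is not given in the deck. The `compare` helper shows how the summary records could support comparing products by feature.

```python
# Trained data: noun and modifier counts (placeholder values).
trained_data = {
    "category": "mobile",
    "nouns": {"battery": 120, "screen": 80},
    "modifiers": {"great": 95, "poor": 40},
}

# Tagged review: features, their ratings, and the original text.
tagged_review = {
    "product": "mobile1",
    "features": ["battery"],
    "ratings": {"battery": 1.0},
    "review_text": "The battery is great.",
}

# Review summaries: average rating per feature, one record per product.
review_summaries = [
    {"product": "mobile1", "features": {"battery": 1.0, "screen": 0.5}},
    {"product": "mobile2", "features": {"battery": 0.0}},
]


def compare(summaries, feature):
    """Rank products by average rating for one feature, best first.

    Products with no rating for the feature are left out.
    """
    rated = [
        (s["product"], s["features"][feature])
        for s in summaries
        if feature in s["features"]
    ]
    return sorted(rated, key=lambda pair: pair[1], reverse=True)


battery_ranking = compare(review_summaries, "battery")
screen_ranking = compare(review_summaries, "screen")
```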

Page 8: Review Summarization System

RESTFUL WEB SERVICES

•  System exposes results as RESTful web services.

Review System
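
A minimal sketch of such a service, using only the Python standard library, is given below. The URL scheme `/products/<name>`, the in-memory `SUMMARIES` store, and the function names are assumptions made for illustration; the deck does not specify the actual endpoints or framework.

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer

# Placeholder store; a real deployment would read from the summary database.
SUMMARIES = {"mobile1": {"battery": 1.0, "screen": 0.5}}


def lookup(path, store):
    """Resolve GET /products/<name> to a (status, payload) pair."""
    parts = [p for p in path.split("/") if p]
    if len(parts) == 2 and parts[0] == "products" and parts[1] in store:
        return 200, {"product": parts[1], "features": store[parts[1]]}
    return 404, {"error": "not found"}


class SummaryHandler(BaseHTTPRequestHandler):
    """Serve review summaries as JSON."""

    def do_GET(self):
        status, payload = lookup(self.path, SUMMARIES)
        body = json.dumps(payload).encode("utf-8")
        self.send_response(status)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)


def serve(port=8000):
    """Start the server on localhost; blocks until interrupted."""
    HTTPServer(("127.0.0.1", port), SummaryHandler).serve_forever()
```

Calling `serve()` starts the server; a request to `/products/mobile1` would then return that product's feature ratings as JSON. Keeping the routing in the plain function `lookup` lets it be checked without opening a socket.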

Page 9: Review Summarization System

EXPERIENCES & LEARNING

•  NLP Dependency Relationships!!!

•  REST is BEST

•  SCHEMA defines EVERYTHING!!

Page 10: Review Summarization System

FUTURE WORK

•  Better feature Extraction.

•  Synonym match can be extended with WordNet::Similarity.

•  Can be further optimized for blazing performance.

•  Preprocess user query.
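
Query preprocessing with synonym matching could look roughly like the sketch below. It is only an illustration of the idea: the hand-written `SYNONYMS` map is a hypothetical stand-in for a WordNet lookup, and the function name and sample query are invented here.

```python
import re

# Hypothetical synonym map standing in for a WordNet-based lookup.
SYNONYMS = {"display": "screen", "cell": "battery"}


def preprocess_query(query, features, synonyms=SYNONYMS):
    """Reduce a free-text query to the known features it mentions.

    Lowercases the query, splits it into alphabetic tokens, maps synonyms
    to their canonical feature name, and drops everything else. The order
    of first mention is kept and duplicates are removed.
    """
    found = []
    for token in re.findall(r"[a-z]+", query.lower()):
        canonical = synonyms.get(token, token)
        if canonical in features and canonical not in found:
            found.append(canonical)
    return found


wanted = preprocess_query(
    "Which phone has the best Display and battery?",
    {"screen", "battery"},
)
```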

Page 11: Review Summarization System

TOOLS USED

•  NLP

•  Stanford Parser

•  WordNet (Synonyms)

•  SentiWordNet

•  Hadoop 0.20.2

•  MongoDB