promptcloud-big data crawl and extraction

9
Big Data made Small http://promptcloud.com © PromptCloud Technologies 2012, All rights reserved 1

Upload: promptcloud

Post on 01-Nov-2014

1.840 views

Category:

Technology


0 download

DESCRIPTION

Promptcloud does large-scale data crawl and extraction and is based on Data as a Service (DaaS) model. It aggregates data in the form of reviews, blogs, travel information, etc from the web and delivers this data in a structured format, on a per-client basis. It's the big data era, so PromptCloud aims to abstract the technicalities involved and make it really simple for its users.

TRANSCRIPT

Page 1: PromptCloud-Big Data Crawl and Extraction

Big Data made Small

http://promptcloud.com

© PromptCloud Technologies 2012, All rights reserved

1

Page 2: PromptCloud-Big Data Crawl and Extraction

Problem Identified

2

There’s a lot of data around in the form of reviews, blogs, social media, catalogs, etc. but there’s only 24 hours in a day - to aggregate all the “relevant” data, arrange it in a “format”, derive “insights” from it, and hmph! to realize that your focus was something else.

© PromptCloud Technologies 2012, All rights reserved

Aggregate??

Big Data, very big data,

very very big data..

Identify?? Analyze??

Page 3: PromptCloud-Big Data Crawl and Extraction

Our Answer

3

Crawl Web

• We do deep data crawling and reach where search engines don’t!

Extract Data

• We extract data in the desired format from as many sources as needed.

Normalize Data

• We de-dupe data and join extracts across pages.

We realize that Big Data = More Info = Bigger Opportunities, so we do the following for you.

© PromptCloud Technologies 2012, All rights reserved

Page 4: PromptCloud-Big Data Crawl and Extraction

Underlying Magic

4

Distributed Crawling - Hadoop

Pattern Recognition

-Parsing Agent

Extraction

Lucene

Cloud Computing

Cassandra/ HBase

© PromptCloud Technologies 2012, All rights reserved

Machine Learning

Page 5: PromptCloud-Big Data Crawl and Extraction

Business Model

5

Custom data from deep crawl & incremental crawls

XML

CSV

YAML

PromptClouder Happy Customer

© PromptCloud Technologies 2012, All rights reserved

Data as a Service (DaaS) Platform

Page 6: PromptCloud-Big Data Crawl and Extraction

Features & Functions

6

• Unlimited data in Terabyte/ Petabyte/ Exabyte (YOU ask for it!) that directly converts into business Unlimited Data

• Vertical content based on topicality, media type, or genre of content. Egs- Legal, Medical, Patent, Travel, and Automobile search engines

Vertical Search Engines

• Aggregated data from across social networks viz. Twitter, LinkedIn, Google+ , etc.

Social Media Content

• Collection/ analysis of reviews/ ratings on products and services providing direct insight into consumer preferences

Consumer Insights

• Real-time information about your competitors and BD opportunities (open tenders, project announcements, etc.)

Business Intelligence

© PromptCloud Technologies 2012, All rights reserved

Page 7: PromptCloud-Big Data Crawl and Extraction

Customers Speak

7 © PromptCloud Technologies 2012, All rights reserved

"They have a state-of-art data platform. It was definitely a

good decision to go for customized crawls than get our feet wet with just any other mass data crawler.“

-WisdomTap

“These guys at PromptCloud have done an excellent job. They have not only provided

exhaustive data but also have done the same within

stipulated SLAs. Their technology and

methodology is excellent and they get closely involved

with the business.“ - FunnelScope

Page 8: PromptCloud-Big Data Crawl and Extraction

Our Advantage

8

Price

•Flexible Pricing based on size and frequency of crawls

Performance

•Low ETA’s •Precision Extraction •Exhaustive data available as feed

Technology

•Highly Scalable •Access to real-time data

© PromptCloud Technologies 2012, All rights reserved

Making big data small to alleviate tech-aches

Page 9: PromptCloud-Big Data Crawl and Extraction

Ask Us for Free Demo

9

We can provide you with customized sample data from 2-3 sites of your choice.

Contact Us

Email: [email protected] Phone: +91-96 86 56 70 70

© PromptCloud Technologies 2012, All rights reserved