promptcloud-big data crawl and extraction
DESCRIPTION
Promptcloud does large-scale data crawl and extraction and is based on Data as a Service (DaaS) model. It aggregates data in the form of reviews, blogs, travel information, etc from the web and delivers this data in a structured format, on a per-client basis. It's the big data era, so PromptCloud aims to abstract the technicalities involved and make it really simple for its users.TRANSCRIPT
Big Data made Small
http://promptcloud.com
© PromptCloud Technologies 2012, All rights reserved
1
Problem Identified
2
There’s a lot of data around in the form of reviews, blogs, social media, catalogs, etc. but there’s only 24 hours in a day - to aggregate all the “relevant” data, arrange it in a “format”, derive “insights” from it, and hmph! to realize that your focus was something else.
© PromptCloud Technologies 2012, All rights reserved
Aggregate??
Big Data, very big data,
very very big data..
Identify?? Analyze??
Our Answer
3
Crawl Web
• We do deep data crawling and reach where search engines don’t!
Extract Data
• We extract data in the desired format from as many sources as needed.
Normalize Data
• We de-dupe data and join extracts across pages.
We realize that Big Data = More Info = Bigger Opportunities, so we do the following for you.
© PromptCloud Technologies 2012, All rights reserved
Underlying Magic
4
Distributed Crawling - Hadoop
Pattern Recognition
-Parsing Agent
Extraction
Lucene
Cloud Computing
Cassandra/ HBase
© PromptCloud Technologies 2012, All rights reserved
Machine Learning
Business Model
5
Custom data from deep crawl & incremental crawls
XML
CSV
YAML
PromptClouder Happy Customer
© PromptCloud Technologies 2012, All rights reserved
Data as a Service (DaaS) Platform
Features & Functions
6
• Unlimited data in Terabyte/ Petabyte/ Exabyte (YOU ask for it!) that directly converts into business Unlimited Data
• Vertical content based on topicality, media type, or genre of content. Egs- Legal, Medical, Patent, Travel, and Automobile search engines
Vertical Search Engines
• Aggregated data from across social networks viz. Twitter, LinkedIn, Google+ , etc.
Social Media Content
• Collection/ analysis of reviews/ ratings on products and services providing direct insight into consumer preferences
Consumer Insights
• Real-time information about your competitors and BD opportunities (open tenders, project announcements, etc.)
Business Intelligence
© PromptCloud Technologies 2012, All rights reserved
Customers Speak
7 © PromptCloud Technologies 2012, All rights reserved
"They have a state-of-art data platform. It was definitely a
good decision to go for customized crawls than get our feet wet with just any other mass data crawler.“
-WisdomTap
“These guys at PromptCloud have done an excellent job. They have not only provided
exhaustive data but also have done the same within
stipulated SLAs. Their technology and
methodology is excellent and they get closely involved
with the business.“ - FunnelScope
Our Advantage
8
Price
•Flexible Pricing based on size and frequency of crawls
Performance
•Low ETA’s •Precision Extraction •Exhaustive data available as feed
Technology
•Highly Scalable •Access to real-time data
© PromptCloud Technologies 2012, All rights reserved
Making big data small to alleviate tech-aches
Ask Us for Free Demo
9
We can provide you with customized sample data from 2-3 sites of your choice.
Contact Us
Email: [email protected] Phone: +91-96 86 56 70 70
© PromptCloud Technologies 2012, All rights reserved