


Post on 20-Jan-2016



Categories of Presented Papers

• Papers Ranking Results
– S. Brin and L. Page. The PageRank Citation Ranking: Bringing Order to the Web. Stanford InfoLab Technical Report. January, 1998
– Su, Ja-Hwung, Bo-Wen Wang, and Vincent S. Tseng. Effective ranking and recommendation on Web page retrieval by integrating association mining and PageRank
– J. Kleinberg. Authoritative Sources in a Hyperlinked Environment. ACM-SIAM Symposium on Discrete Algorithms, 1998

• Papers for Determining Pages that Match
– Detecting Near-Duplicates for Web Crawling, Manku et al, WWW Conf., May, 2007
– Finding Near Duplicate Web Pages: A Large-Scale Evaluation of Algorithms

• Web Crawlers
– Mirtaheri, S.M. et al. Dist-RIA Crawler: A Distributed Crawler for Rich Internet Applications
– S. Raghavan and H. Garcia-Molina. Crawling the Hidden Web. VLDB 2001
– J. Madhavan, et al. Google's Deep Web Crawl. VLDB 2008
– Madhavan et al, Harnessing the Deep Web: Present and Future, 2009

• Auto-Completion
– Kraus and Bar-Yossef, Context-Sensitive Query Auto-Completion, Nov, 2010
– Shokouhi, Milad, and Kira Radinsky. Time-sensitive query auto-completion


Categories of Presented Papers

• Google
– S. Brin and L. Page. The Anatomy of a Large-scale Hypertextual Web Search Engine. WWW 1998
– S. Ghemawat et al. The Google File System. SOSP 2003
– J. Dean and S. Ghemawat. MapReduce: Simplified Data Processing on Large Clusters. OSDI 2004
– F. Chang, et al. Bigtable: A Distributed Storage System for Structured Data. OSDI 2006

• Pay-per-Click Advertising

– An Investigation of Pay Per Click Search Engine Advertising: Modeling the PPC Paradigm to lower cost per action
– On the Security of pay-per-click and other Web Advertising Schemes
– A Framework for the Optimizing of WWW Advertising
– Online Ad Auctions, Varian, February, 2009

• Spelling Correction
– Chen, Qing, Mu Li, and Ming Zhou. Improving Query Spelling Correction Using Web Search Results
– Spelling Correction for Search Engine Queries, Martins et al
– Using the Web for Language Independent Spellchecking and Autocorrection, Whitelaw et al