Search Engine Working Technology
![Page 1: Search Engine Working Technology](https://reader038.vdocuments.net/reader038/viewer/2022100602/558cc818d8b42a1d7c8b46d3/html5/thumbnails/1.jpg)
How does a search engine work?
![Page 2: Search Engine Working Technology](https://reader038.vdocuments.net/reader038/viewer/2022100602/558cc818d8b42a1d7c8b46d3/html5/thumbnails/2.jpg)
Easy to understand (?)
• Stores information from the web
• Classifies the information and ranks it
• Returns the content most relevant to the query keywords
![Page 3: Search Engine Working Technology](https://reader038.vdocuments.net/reader038/viewer/2022100602/558cc818d8b42a1d7c8b46d3/html5/thumbnails/3.jpg)
Hard to do (!)
• Billions of web pages
• Billions of gigabytes of data
• Millions of new pieces of data every millisecond
• Language-specific differences (localization)
• Requires a huge amount of investment
• Must respond to a query in less than 1 second
![Page 4: Search Engine Working Technology](https://reader038.vdocuments.net/reader038/viewer/2022100602/558cc818d8b42a1d7c8b46d3/html5/thumbnails/4.jpg)
Google Data Centers
• 11 data centers in the world
• Each data center consists of 10,000 computers
• The latest data center cost 600 million U.S. dollars
![Page 7: Search Engine Working Technology](https://reader038.vdocuments.net/reader038/viewer/2022100602/558cc818d8b42a1d7c8b46d3/html5/thumbnails/7.jpg)
3 Main aspects of Search Engines
• Web crawling
• Indexing
• Searching
![Page 8: Search Engine Working Technology](https://reader038.vdocuments.net/reader038/viewer/2022100602/558cc818d8b42a1d7c8b46d3/html5/thumbnails/8.jpg)
Search Engine Scheme
![Page 9: Search Engine Working Technology](https://reader038.vdocuments.net/reader038/viewer/2022100602/558cc818d8b42a1d7c8b46d3/html5/thumbnails/9.jpg)
Web Crawling
• Walks around the web
• Follows the links from every site
• Retrieves information from the HTML of web pages
• Stores the information in data centers
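The crawling steps above can be sketched in a few lines of Python. This is only a minimal illustration, not how any real search engine is built: the `crawl` function, the `LinkExtractor` class, and the in-memory `FAKE_WEB` dictionary (which stands in for real HTTP requests) are all hypothetical names chosen for this example.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkExtractor(HTMLParser):
    """Collects href values from <a> tags and the visible text of a page."""

    def __init__(self):
        super().__init__()
        self.links = []
        self.text_parts = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)

    def handle_data(self, data):
        if data.strip():
            self.text_parts.append(data.strip())


def crawl(seed_url, fetch, max_pages=100):
    """Breadth-first crawl starting at seed_url.

    `fetch` is any callable that returns the HTML of a URL, or None
    when the page cannot be retrieved. Returns {url: page text}.
    """
    store = {}                  # stands in for storage in a data center
    queue = deque([seed_url])   # pages waiting to be visited
    seen = {seed_url}           # avoids visiting the same page twice
    while queue and len(store) < max_pages:
        url = queue.popleft()
        html = fetch(url)
        if html is None:
            continue
        parser = LinkExtractor()
        parser.feed(html)
        parser.close()
        store[url] = " ".join(parser.text_parts)
        for href in parser.links:
            absolute = urljoin(url, href)   # resolve relative links
            if absolute not in seen:
                seen.add(absolute)
                queue.append(absolute)
    return store


# A tiny made-up "web" so the sketch runs without network access.
FAKE_WEB = {
    "http://example.test/": '<p>home page</p> <a href="/a">A</a> <a href="/b">B</a>',
    "http://example.test/a": '<p>page about search engines</p> <a href="/">home</a>',
    "http://example.test/b": '<p>page about web crawling</p> <a href="/a">A</a>',
}

pages = crawl("http://example.test/", FAKE_WEB.get)
for url, text in pages.items():
    print(url, "->", text)
```

A real crawler would replace `FAKE_WEB.get` with an HTTP client and would also need politeness rules, such as respecting robots.txt and limiting the request rate per site.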
![Page 10: Search Engine Working Technology](https://reader038.vdocuments.net/reader038/viewer/2022100602/558cc818d8b42a1d7c8b46d3/html5/thumbnails/10.jpg)
Indexing
• Analyzes the information stored by the web crawler
• Classifies it
• Ranks it
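One common way to organize crawled text for fast lookup is an inverted index, which maps each word to the pages that contain it. The slides do not name a specific data structure, so the sketch below is an assumption; `tokenize`, `build_index`, and the sample `PAGES` are hypothetical.

```python
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(pages):
    """Build an inverted index: {term: {url: occurrences of term on that page}}."""
    index = defaultdict(dict)
    for url, text in pages.items():
        for term, count in Counter(tokenize(text)).items():
            index[term][url] = count
    return dict(index)


# Made-up page texts, as a crawler might have stored them.
PAGES = {
    "http://example.test/a": "Search engines rank pages. Search is hard.",
    "http://example.test/b": "A web crawler follows links between pages.",
}

index = build_index(PAGES)
print(index["search"])
print(index["pages"])
```

The per-page counts stored here are a very simple signal that a ranking step could use later; production systems combine many more signals.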
![Page 11: Search Engine Working Technology](https://reader038.vdocuments.net/reader038/viewer/2022100602/558cc818d8b42a1d7c8b46d3/html5/thumbnails/11.jpg)
Searching
• A keyword query is entered on the search engine
• Fetches the links related to the keywords
• Lists them according to relevance
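The search step can be sketched as a lookup against an index of the form {term: {url: count}}. Scoring a page by the summed counts of the query terms is an assumption made here for illustration; the slides do not say how relevance is computed. The `search` function and the hard-coded `INDEX` are hypothetical.

```python
from collections import Counter


def search(index, query, limit=10):
    """Return up to `limit` URLs ordered by a simple relevance score.

    The score of a page is the total number of times the query
    terms occur on it (an illustrative choice, not a real ranking).
    """
    scores = Counter()
    for term in query.lower().split():
        for url, count in index.get(term, {}).items():
            scores[url] += count
    # Highest score first; ties are broken alphabetically by URL.
    ranked = sorted(scores.items(), key=lambda item: (-item[1], item[0]))
    return [url for url, _ in ranked[:limit]]


# A small hand-written index so the sketch runs on its own.
INDEX = {
    "search": {"a.test": 2, "b.test": 1},
    "engine": {"a.test": 1},
    "crawler": {"b.test": 3},
}

print(search(INDEX, "search engine"))
print(search(INDEX, "crawler"))
```

Because the lookup touches only the entries for the query terms rather than every stored page, this kind of structure is one reason a query can be answered quickly.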