Dedian  
-- 关注搜索引擎的开发
日历
<2024年12月>
24252627282930
1234567
891011121314
15161718192021
22232425262728
2930311234
统计
  • 随笔 - 82
  • 文章 - 2
  • 评论 - 228
  • 引用 - 0

导航

常用链接

留言簿(8)

随笔分类(45)

随笔档案(82)

文章档案(2)

Java Spaces

搜索

  •  

积分与排名

  • 积分 - 65026
  • 排名 - 816

最新评论

阅读排行榜

评论排行榜

 
It is interesting to explore some issues about how google works. Actually, the basical components for a searching engine is very simple: web crawler, indexer and searcher. I can use Lucene to do indexing and searching part, but I am still curious that how google do web crawling and collect those useful informations. Google uses Googlebot to do those things while need handle lots of statistcs information for web pages, how can he do that? well, i know it is hard to say before doing some researching...again, for a good search engine, there are still lots stuff need to consider, page rank is only one of them, also not easy one.  More difficult or more intellegent, a perfect search  engine should "understand exactly what you mean and give you back exactly what you want." which is said by Larry Page.

Well, can not say that Google is not a miracle in technolgy field... also exciting to read those google stories

reference:
http://www.googleguide.com/google_works.html
http://www.google.com/technology/
posted on 2006-04-18 14:18 Dedian 阅读(140) 评论(0)  编辑  收藏

只有注册用户登录后才能发表评论。


网站导航:
 
 
Copyright © Dedian Powered by: 博客园 模板提供:沪江博客