Found a very interesting website that captures billions of web pages and archive them.发现了一个非常有趣的网站,网页和存档他们捕捉十亿美元。
时间机器的网站
The Internet Archive Wayback Machine is a service that allows people to visit archived versions of Web sites.互联网档案Wayback Machine的是一种服务,让人们访问网站的存档版本。 Visitors to the Wayback Machine can type in a URL, select a date range, and then begin surfing on an archived version of the Web.对档案本可以在一个URL类型,用户选择日期范围,然后开始一个Web存档版本浏览。 Keyword searching is not currently supported.关键字搜索是目前不支持。 Imagine surfing circa 1999 and looking at all the Y2K hype, or revisiting an older version of your favorite Web site.想象一下,冲浪大约在1999年,面向所有的数位炒作,或重新对您最喜爱的网站的旧版本。 The Internet Archive Wayback Machine can make all of this possible.互联网档案Wayback Machine的可以使这一切成为可能。

I tried searching for www.raymond.cc at the internet archive wayback machine and I found…我试图寻找www.raymond.cc在互联网档案馆Wayback Machine的,我发现...

The empty page before this blog even began.在此之前博客的空页甚至开始。

Then I also found the first design I used for this blog.后来我还发现,最初的设计我为我的博客使用。

Brings back the memories…带回来的记忆...

Anyway, the Internet Archive Wayback Machine contains almost 2 petabytes of data and is currently growing at a rate of 20 terabytes per month.总之,互联网档案馆档案本载有近200千兆兆字节的数据,目前在每月20 TB的速度增长。 Terabytes is not very common and the Internet Archive Wayback Machine already has petabytes of data! TB的不是很普遍,互联网档案Wayback Machine的数据已经PB的!

If you have a web site, and you would like to ensure that it is saved for posterity in the Internet Archive, and you've searched wayback and found no results, here's what you can do to allow the Internet Archive Wayback Machine crawl your web site.如果你有一个网站,你想确保它在互联网档案馆后代保存,你搜索wayback,发现没有结果,这里你可以做什么让互联网档案馆Wayback Machine的抓取您的网页网站。

Method 1: Visit the 方法1:访问 Alexa's “Webmasters” page Alexa的“网站管理员”页 .

Method 2: If you have the Alexa tool bar installed, just visit a site. 方法2:如果你有安装Alexa的工具栏,只需访问一个网站。

Method 3: While visiting a site, use the ' show related links ' in Internet Explorer, which uses the Alexa service. 方法3:在访问一个网站,请使用' 显示在Internet Explorer中,它使用Alexa的服务相关链接 '。

Sites are usually crawled within 24 hours and no more than 48.网站检索通常在24小时内不超过48。 Right now there is a 6-12 month lag between the date a site is crawled and the date it appears in the Wayback Machine.现在有1日期间的网站6月12日一个月的滞后,检索和日期它的档案本机显示。

[ [ Visit Internet Archive Wayback Machine 访问互联网档案馆Wayback Machine的 ] ]

Technorati Tags: Technorati标记: , , ,