Found a very interesting website that captures billions of web pages and archive them.發現了一個非常有趣的網站,捕捉十億網頁和存檔的人員。
時間機器的網站
The Internet Archive Wayback Machine is a service that allows people to visit archived versions of Web sites.互聯網檔案Wayback Machine的是一種服務,使人們能夠訪問存檔的版本的網站。 Visitors to the Wayback Machine can type in a URL, select a date range, and then begin surfing on an archived version of the Web.參觀的檔案本可以輸入網址,選擇一個日期範圍,然後開始衝浪封存版本的網站。 Keyword searching is not currently supported.關鍵字搜索是目前不支持。 Imagine surfing circa 1999 and looking at all the Y2K hype, or revisiting an older version of your favorite Web site.想像一下,衝浪大約在1999年,面向所有的數位炒作,或再訪老的版本您喜愛的網站。 The Internet Archive Wayback Machine can make all of this possible.互聯網檔案Wayback Machine的可以使所有這些可能的。

I tried searching for www.raymond.cc at the internet archive wayback machine and I found…我試圖尋找www.raymond.cc在互聯網檔案館Wayback Machine的,我發現...

The empty page before this blog even began.空頁後,此博客甚至開始。

Then I also found the first design I used for this blog.後來我還發現,最初的設計,我用這個博客。

Brings back the memories…帶回來的記憶...

Anyway, the Internet Archive Wayback Machine contains almost 2 petabytes of data and is currently growing at a rate of 20 terabytes per month.總之,互聯網檔案館檔案本載有近2 PB的數據,目前增長速度的每月20萬億字節。 Terabytes is not very common and the Internet Archive Wayback Machine already has petabytes of data! TB的不是很普遍,互聯網檔案Wayback Machine的已經有PB的數據!

If you have a web site, and you would like to ensure that it is saved for posterity in the Internet Archive, and you've searched wayback and found no results, here's what you can do to allow the Internet Archive Wayback Machine crawl your web site.如果你有一個網站,你想確保它是為子孫後代保存在互聯網檔案館,你所搜索wayback,發現沒有結果,這裡你可以做什麼讓互聯網檔案館Wayback Machine的抓取您的網頁網站。

Method 1: Visit the 方法1:訪問 Alexa's “Webmasters” page Alexa的“網站管理員”頁 .

Method 2: If you have the Alexa tool bar installed, just visit a site. 方法2:如果你有安裝Alexa的工具欄,只需訪問一個網站。

Method 3: While visiting a site, use the ' show related links ' in Internet Explorer, which uses the Alexa service. 方法3:在訪問一個網站,請使用' 顯示相關鏈接 '在Internet Explorer中,它使用Alexa的服務。

Sites are usually crawled within 24 hours and no more than 48.網站檢索通常在24小時內不超過48。 Right now there is a 6-12 month lag between the date a site is crawled and the date it appears in the Wayback Machine.現在有一個滯後6-12個月之間的日期一網站抓取的日期出現的檔案本機。

[ [ Visit Internet Archive Wayback Machine 訪問互聯網檔案館Wayback Machine的 ] ]

Technorati Tags: Technorati標記: , , ,