<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>gHacks technology news &#187; website cache</title>
	<atom:link href="http://www.ghacks.net/tag/website-cache/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.ghacks.net</link>
	<description>A technology blog covering software, mobile phones, gadgets, security, the Internet and other relevant areas.</description>
	<lastBuildDate>Tue, 24 Nov 2009 23:31:44 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.6</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Restore Deleted Or Unavailable Websites</title>
		<link>http://www.ghacks.net/2009/06/14/restore-deleted-or-unavailable-websites/</link>
		<comments>http://www.ghacks.net/2009/06/14/restore-deleted-or-unavailable-websites/#comments</comments>
		<pubDate>Sun, 14 Jun 2009 17:47:15 +0000</pubDate>
		<dc:creator>Martin</dc:creator>
				<category><![CDATA[Online Services]]></category>
		<category><![CDATA[The Web]]></category>
		<category><![CDATA[cache]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[internet cache]]></category>
		<category><![CDATA[mirror website]]></category>
		<category><![CDATA[restore website]]></category>
		<category><![CDATA[warrick]]></category>
		<category><![CDATA[website]]></category>
		<category><![CDATA[website cache]]></category>

		<guid isPermaLink="false">http://www.ghacks.net/?p=13558</guid>
		<description><![CDATA[Web users have quite a few possibilities to access websites that have been deleted or are temporarily unavailable for some time. Possibilities include using Google Cache, the Web Archive or other web caches that mirror websites. Web caches are great for accessing single pages of a website but not comfortable when multiple pages need to [...]]]></description>
			<content:encoded><![CDATA[<p><img src="http://www.ghacks.net/wp-content/uploads/2009/06/warrick_websites.jpg" alt="warrick websites" title="warrick websites" width="228" height="82" class="alignleft size-full wp-image-13559" />Web users have quite a few possibilities to access websites that have been deleted or are temporarily unavailable for some time. Possibilities include using Google Cache, the Web Archive or other web caches that mirror websites. Web caches are great for accessing single pages of a website but not comfortable when multiple pages need to be accessed. It can also happen that webmasters have lost their website in a server crash and need to restore the pages from Internet caches.</p>
<p>Warrick is a Perl script that tries to restore websites from various Internet sources including Archive.org and the three popular search engines Google, Yahoo and Bing. Installation is a bit more complex than running an executable but still doable even for inexperienced users. The computer program is provided as a version for the Windows and Linux / Unix operating system. Windows users need to install a Perl interpreter such as Active Perl before they can run the script from the command line.</p>
<p><span id="more-13558"></span><img src="http://www.ghacks.net/wp-content/uploads/2009/06/warrick-500x251.jpg" alt="warrick" title="warrick" width="500" height="251" class="alignnone size-medium wp-image-13560" /></p>
<p>The developers have <a href="http://warrick.cs.odu.edu/warrick-windows-install.html">created</a> a step by step guide for Windows users on how to install and use the script on the operating system. The <a href="http://warrick.cs.odu.edu/warrick.html">Warrick</a> website contains examples on how to use the script to restore single pages and entire web projects.</p>
<p>The command &#8220;warrick.pl -r -wr ia -c http://yourwebsite.com/&#8221; will reconstruct all pages of the website that are stored in at least one of the online sources used in the recovery process.</p>

	Tags: <a href="http://www.ghacks.net/tag/cache/" title="cache" rel="tag">cache</a>, <a href="http://www.ghacks.net/tag/google/" title="Google" rel="tag">Google</a>, <a href="http://www.ghacks.net/tag/internet-cache/" title="internet cache" rel="tag">internet cache</a>, <a href="http://www.ghacks.net/tag/mirror-website/" title="mirror website" rel="tag">mirror website</a>, <a href="http://www.ghacks.net/tag/restore-website/" title="restore website" rel="tag">restore website</a>, <a href="http://www.ghacks.net/tag/warrick/" title="warrick" rel="tag">warrick</a>, <a href="http://www.ghacks.net/tag/website/" title="website" rel="tag">website</a>, <a href="http://www.ghacks.net/tag/website-cache/" title="website cache" rel="tag">website cache</a><br />

	<h4>Related posts</h4>
	<ul class="st-related-posts">
	<li><a href="http://www.ghacks.net/2008/08/05/zoundry-raven-portable-blog-editor/" title="Zoundry Raven portable Blog Editor (August 5, 2008)">Zoundry Raven portable Blog Editor</a> (6)</li>
	<li><a href="http://www.ghacks.net/2009/11/20/youtube-videos-get-automatic-captions-1080p-videos-roll-out/" title="Youtube Videos Get Automatic Captions. 1080p Videos Roll-Out (November 20, 2009)">Youtube Videos Get Automatic Captions. 1080p Videos Roll-Out</a> (1)</li>
	<li><a href="http://www.ghacks.net/2009/08/21/youtube-insight-find-out-who-is-embedding-your-youtube-videos/" title="Youtube Insight: Find Out Who Is Embedding Your Youtube Videos (August 21, 2009)">Youtube Insight: Find Out Who Is Embedding Your Youtube Videos</a> (0)</li>
	<li><a href="http://www.ghacks.net/2005/10/14/yahoo-vs-google-vs-msn-search-commands-compared/" title="Yahoo vs. Google vs. Msn, Search commands compared (October 14, 2005)">Yahoo vs. Google vs. Msn, Search commands compared</a> (4)</li>
	<li><a href="http://www.ghacks.net/2009/10/05/xoopit-to-become-yahoo-mail-exclusive/" title="Xoopit To Become Yahoo Mail Exclusive (October 5, 2009)">Xoopit To Become Yahoo Mail Exclusive</a> (5)</li>
</ul>

]]></content:encoded>
			<wfw:commentRss>http://www.ghacks.net/2009/06/14/restore-deleted-or-unavailable-websites/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Create A Cached Website Copy</title>
		<link>http://www.ghacks.net/2009/02/24/create-a-cached-website-copy/</link>
		<comments>http://www.ghacks.net/2009/02/24/create-a-cached-website-copy/#comments</comments>
		<pubDate>Tue, 24 Feb 2009 17:40:53 +0000</pubDate>
		<dc:creator>Martin</dc:creator>
				<category><![CDATA[The Web]]></category>
		<category><![CDATA[backup url]]></category>
		<category><![CDATA[cache website]]></category>
		<category><![CDATA[httrack]]></category>
		<category><![CDATA[website cache]]></category>
		<category><![CDATA[website copier]]></category>
		<category><![CDATA[website copy]]></category>
		<category><![CDATA[website download]]></category>
		<category><![CDATA[website downloader]]></category>

		<guid isPermaLink="false">http://www.ghacks.net/?p=10736</guid>
		<description><![CDATA[Many websites tend to be discontinued after a time. This can be extremely frustrating if that website did contain some valuable information that are not accessible in the same form anywhere on the Internet. Google Cache might be a solution but it usually caches one of the last states of a page which does not [...]]]></description>
			<content:encoded><![CDATA[<p>Many websites tend to be discontinued after a time. This can be extremely frustrating if that website did contain some valuable information that are not accessible in the same form anywhere on the Internet. Google Cache might be a solution but it usually caches one of the last states of a page which does not necessarily have to be the one containing the important information. There are various ways to preserve information on the Internet. It is possible to save the information on a per-page basis using the web browser&#8217;s Save As option, to use website downloaders like <a href="http://www.ghacks.net/2006/08/16/rip-websites-with-httrack-website-copier/ ">HTTrack</a> or online services like <a href="http://backupurl.com/create.php">BackupUrl</a>.</p>
<p>All methods have various advantages and disadvantages. Using the Save As function in web browsers is probably the fastest way to download a page to the computer. The structure makes it on the other hand uncomfortable to work with on larger projects. Website downloaders on the other hand deal perfectly with large websites, they do require some knowledge and configuration though before they even start to download the first byte.</p>
<p>The online service Backupurl offers another way to create a cached copy of a website. The user enters the url of a page that he wants to preserve in the form on the website. The service will then cache that url for the user and provide two addresses to cached versions of the page. The main advantage of the service is that the cached pages are not stored locally. This might be favorable in environments with strict data storage policies. The disadvantage is obvious as well. Only one page can be cached per run which means it becomes as impracticable and uncomfortable as using Save As if multiple pages need to be cached. There is also no guarantee that the service will be there when the information need to be retrieved. </p>
<p><span id="more-10736"></span><img src="http://www.ghacks.net/wp-content/uploads/2009/02/backup_url-500x314.jpg" alt="backup url" title="backup url" width="500" height="314" class="alignnone size-medium wp-image-10737" /></p>
<p>It would also be an interesting option to retrieve all pages that have been cached at once. The only way to keep track of all cached pages is to copy and paste all created urls into another document. Backup URL can be an interesting option under certain circumstance. Advanced users are better off with applications like HTTrack or similar applications.</p>

	Tags: <a href="http://www.ghacks.net/tag/backup-url/" title="backup url" rel="tag">backup url</a>, <a href="http://www.ghacks.net/tag/cache-website/" title="cache website" rel="tag">cache website</a>, <a href="http://www.ghacks.net/tag/httrack/" title="httrack" rel="tag">httrack</a>, <a href="http://www.ghacks.net/tag/website-cache/" title="website cache" rel="tag">website cache</a>, <a href="http://www.ghacks.net/tag/website-copier/" title="website copier" rel="tag">website copier</a>, <a href="http://www.ghacks.net/tag/website-copy/" title="website copy" rel="tag">website copy</a>, <a href="http://www.ghacks.net/tag/website-download/" title="website download" rel="tag">website download</a>, <a href="http://www.ghacks.net/tag/website-downloader/" title="website downloader" rel="tag">website downloader</a><br />

	<h4>Related posts</h4>
	<ul class="st-related-posts">
	<li><a href="http://www.ghacks.net/2009/03/24/website-monitor-and-downloader/" title="Website Monitor And Downloader (March 24, 2009)">Website Monitor And Downloader</a> (2)</li>
	<li><a href="http://www.ghacks.net/2006/08/16/rip-websites-with-httrack-website-copier/" title="Rip Websites with HTTrack Website Copier (August 16, 2006)">Rip Websites with HTTrack Website Copier</a> (6)</li>
	<li><a href="http://www.ghacks.net/2005/11/24/website-downloader/" title="Website Downloader (November 24, 2005)">Website Downloader</a> (0)</li>
	<li><a href="http://www.ghacks.net/2009/06/14/restore-deleted-or-unavailable-websites/" title="Restore Deleted Or Unavailable Websites (June 14, 2009)">Restore Deleted Or Unavailable Websites</a> (4)</li>
	<li><a href="http://www.ghacks.net/2008/03/10/how-to-rip-most-websites/" title="How to rip most websites (March 10, 2008)">How to rip most websites</a> (10)</li>
</ul>

]]></content:encoded>
			<wfw:commentRss>http://www.ghacks.net/2009/02/24/create-a-cached-website-copy/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
	</channel>
</rss>
