<?xml version="1.0" encoding="UTF-8"?> <rss
version="2.0"
xmlns:content="http://purl.org/rss/1.0/modules/content/"
xmlns:wfw="http://wellformedweb.org/CommentAPI/"
xmlns:dc="http://purl.org/dc/elements/1.1/"
xmlns:atom="http://www.w3.org/2005/Atom"
xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
> <channel><title>gHacks Technology News &#124; Latest Tech News, Software And Tutorials &#187; website cache</title> <atom:link href="http://www.ghacks.net/tag/website-cache/feed/" rel="self" type="application/rss+xml" /><link>http://www.ghacks.net</link> <description>A technology news blog covering software, mobile phones, gadgets, security, the Internet and other relevant areas.</description> <lastBuildDate>Fri, 10 Feb 2012 20:51:26 +0000</lastBuildDate> <language>en</language> <sy:updatePeriod>hourly</sy:updatePeriod> <sy:updateFrequency>1</sy:updateFrequency> <generator>http://wordpress.org/?v=3.3.1</generator> <atom:link rel="hub" href="http://pubsubhubbub.appspot.com"/><atom:link rel="hub" href="http://superfeedr.com/hubbub"/> <item><title>Restore Deleted Or Unavailable Websites</title><link>http://www.ghacks.net/2009/06/14/restore-deleted-or-unavailable-websites/</link> <comments>http://www.ghacks.net/2009/06/14/restore-deleted-or-unavailable-websites/#comments</comments> <pubDate>Sun, 14 Jun 2009 17:47:15 +0000</pubDate> <dc:creator>Martin Brinkmann</dc:creator> <category><![CDATA[Online Services]]></category> <category><![CDATA[The Web]]></category> <category><![CDATA[cache]]></category> <category><![CDATA[Google]]></category> <category><![CDATA[internet cache]]></category> <category><![CDATA[mirror website]]></category> <category><![CDATA[restore website]]></category> <category><![CDATA[warrick]]></category> <category><![CDATA[website]]></category> <category><![CDATA[website cache]]></category> <guid
isPermaLink="false">http://www.ghacks.net/?p=13558</guid> <description><![CDATA[Web users have quite a few possibilities to access websites that have been deleted or are temporarily unavailable for some time. Possibilities include using Google Cache, the Web Archive or other web caches that mirror websites. Web caches are great for accessing single pages of a website but not comfortable when multiple pages need to [...]]]></description> <content:encoded><![CDATA[<p><img
src="http://www.ghacks.net/wp-content/uploads/2009/06/warrick_websites.jpg" alt="warrick websites" title="warrick websites" width="228" height="82" class="alignleft size-full wp-image-13559" />Web users have quite a few possibilities to access websites that have been deleted or are temporarily unavailable for some time. Possibilities include using Google Cache, the Web Archive or other web caches that mirror websites. Web caches are great for accessing single pages of a website but not comfortable when multiple pages need to be accessed. It can also happen that webmasters have lost their website in a server crash and need to restore the pages from Internet caches.</p><p>Warrick is a Perl script that tries to restore websites from various Internet sources including Archive.org and the three popular search engines Google, Yahoo and Bing. Installation is a bit more complex than running an executable but still doable even for inexperienced users. The computer program is provided as a version for the Windows and Linux / Unix operating system. Windows users need to install a Perl interpreter such as Active Perl before they can run the script from the command line.</p><p><span
id="more-13558"></span><img
src="http://www.ghacks.net/wp-content/uploads/2009/06/warrick-500x251.jpg" alt="warrick" title="warrick" width="500" height="251" class="alignnone size-medium wp-image-13560" /></p><p>The developers have <a
href="http://warrick.cs.odu.edu/warrick-windows-install.html">created</a> a step by step guide for Windows users on how to install and use the script on the operating system. The <a
href="http://warrick.cs.odu.edu/warrick.html">Warrick</a> website contains examples on how to use the script to restore single pages and entire web projects.</p><p>The command &#8220;warrick.pl -r -wr ia -c http://yourwebsite.com/&#8221; will reconstruct all pages of the website that are stored in at least one of the online sources used in the recovery process.</p> ]]></content:encoded> <wfw:commentRss>http://www.ghacks.net/2009/06/14/restore-deleted-or-unavailable-websites/feed/</wfw:commentRss> <slash:comments>5</slash:comments> </item> <item><title>Create A Cached Website Copy</title><link>http://www.ghacks.net/2009/02/24/create-a-cached-website-copy/</link> <comments>http://www.ghacks.net/2009/02/24/create-a-cached-website-copy/#comments</comments> <pubDate>Tue, 24 Feb 2009 17:40:53 +0000</pubDate> <dc:creator>Martin Brinkmann</dc:creator> <category><![CDATA[The Web]]></category> <category><![CDATA[backup url]]></category> <category><![CDATA[cache website]]></category> <category><![CDATA[httrack]]></category> <category><![CDATA[website cache]]></category> <category><![CDATA[website copier]]></category> <category><![CDATA[website copy]]></category> <category><![CDATA[website download]]></category> <category><![CDATA[website downloader]]></category> <guid
isPermaLink="false">http://www.ghacks.net/?p=10736</guid> <description><![CDATA[Many websites tend to be discontinued after a time. This can be extremely frustrating if that website did contain some valuable information that are not accessible in the same form anywhere on the Internet. Google Cache might be a solution but it usually caches one of the last states of a page which does not [...]]]></description> <content:encoded><![CDATA[<p>Many websites tend to be discontinued after a time. This can be extremely frustrating if that website did contain some valuable information that are not accessible in the same form anywhere on the Internet. Google Cache might be a solution but it usually caches one of the last states of a page which does not necessarily have to be the one containing the important information. There are various ways to preserve information on the Internet. It is possible to save the information on a per-page basis using the web browser&#8217;s Save As option, to use website downloaders like <a
href="http://www.ghacks.net/2006/08/16/rip-websites-with-httrack-website-copier/ ">HTTrack</a> or online services like <a
href="http://backupurl.com/">BackupUrl</a>.</p><p>All methods have various advantages and disadvantages. Using the Save As function in web browsers is probably the fastest way to download a page to the computer. The structure makes it on the other hand uncomfortable to work with on larger projects. Website downloaders on the other hand deal perfectly with large websites, they do require some knowledge and configuration though before they even start to download the first byte.</p><p>The online service Backupurl offers another way to create a cached copy of a website. The user enters the url of a page that he wants to preserve in the form on the website. The service will then cache that url for the user and provide two addresses to cached versions of the page. The main advantage of the service is that the cached pages are not stored locally. This might be favorable in environments with strict data storage policies. The disadvantage is obvious as well. Only one page can be cached per run which means it becomes as impracticable and uncomfortable as using Save As if multiple pages need to be cached. There is also no guarantee that the service will be there when the information need to be retrieved.</p><p><span
id="more-10736"></span><img
src="http://www.ghacks.net/wp-content/uploads/2009/02/backup_url-500x314.jpg" alt="backup url" title="backup url" width="500" height="314" class="alignnone size-medium wp-image-10737" /></p><p>It would also be an interesting option to retrieve all pages that have been cached at once. The only way to keep track of all cached pages is to copy and paste all created urls into another document. Backup URL can be an interesting option under certain circumstance. Advanced users are better off with applications like HTTrack or similar applications.</p> ]]></content:encoded> <wfw:commentRss>http://www.ghacks.net/2009/02/24/create-a-cached-website-copy/feed/</wfw:commentRss> <slash:comments>4</slash:comments> </item> </channel> </rss>
