<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>gHacks technology news &#187; httrack</title>
	<atom:link href="http://www.ghacks.net/tag/httrack/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.ghacks.net</link>
	<description>A technology blog covering software, mobile phones, gadgets, security, the Internet and other relevant areas.</description>
	<lastBuildDate>Tue, 24 Nov 2009 20:14:29 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.6</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Website Monitor And Downloader</title>
		<link>http://www.ghacks.net/2009/03/24/website-monitor-and-downloader/</link>
		<comments>http://www.ghacks.net/2009/03/24/website-monitor-and-downloader/#comments</comments>
		<pubDate>Tue, 24 Mar 2009 14:47:10 +0000</pubDate>
		<dc:creator>Martin</dc:creator>
				<category><![CDATA[Browsing]]></category>
		<category><![CDATA[Internet Explorer]]></category>
		<category><![CDATA[software]]></category>
		<category><![CDATA[httrack]]></category>
		<category><![CDATA[internet-explorer]]></category>
		<category><![CDATA[monitor website]]></category>
		<category><![CDATA[rip websites]]></category>
		<category><![CDATA[website downloader]]></category>
		<category><![CDATA[website monitor]]></category>
		<category><![CDATA[website monitoring]]></category>
		<category><![CDATA[website snapshots]]></category>
		<category><![CDATA[wysigot]]></category>

		<guid isPermaLink="false">http://www.ghacks.net/2009/03/24/website-monitor-and-downloader/</guid>
		<description><![CDATA[Wysigot is a browser that acts both as a website monitor and downloader. One of its main functions is the download of entire websites or selected pages. The process has been streamlined to make it as easy as possible. To start the download of a website or page one would simply enter the url of [...]]]></description>
			<content:encoded><![CDATA[<p>Wysigot is a browser that acts both as a website monitor and downloader. One of its main functions is the download of entire websites or selected pages. The process has been streamlined to make it as easy as possible. To start the download of a website or page one would simply enter the url of the site in the assistant that pops up after installation. Supported are http, ftp and file protocols. The next step involves selecting the update check frequency which can be set to automatic, periodical or manual. Wysigot will check the url for updated content and update the information automatically if new contents are found.</p>
<p>The last step in the configuration configures the capturing depths which can be set to first page, first page plus links or whole site. The same menu contains options to allow popups and to set the scanning to be careful which deactivates certain scripts and other potentially malicious contents. The download will start immediately after the last step. The program will display the download progress of all objects on the website.</p>
<p>Experienced users can define objects that should not be downloaded. Among them files like videos, cookies or scripts. Once the website or page has been downloaded it can be browsed in the program interface up to the level it was downloaded from the server.</p>
<p><span id="more-11443"></span><img src="http://www.ghacks.net/wp-content/uploads/2009/03/website_monitor-500x327.jpg" alt="website monitor" title="website monitor" width="500" height="327" class="alignnone size-medium wp-image-11442" /></p>
<p>The website downloader will display all pages that have been downloaded in the sidebar sorted by project name. A download will be initiated for every link pointing to a page that has not been downloaded before. The download speed depends on several factors including the connection speed of the computer system the application runs on. </p>
<p>Different view modes are available that differ from the default html view mode. It is possible to take a look at the contents, attached files (e.g. videos or images) or information (modification date, allowed objects, site and so on). Properties can be accessed for each downloaded website differently or combined for a project. They make it possible to set very specific rules for downloading contents including the number of page revisions to keep or the contents that should be downloaded.</p>
<p><a href="http://www.wysigot.com/">Wysigot</a> is not only a website downloader but also a website monitor. It can be set to monitor websites for changes and notify the user about those changes. Change verifications can be automatic, periodical or manual depending on the user&#8217;s choice. Alarms can be set to notify the user if a website has been updated.</p>
<p>The website monitor and downloader uses the rendering engine of Internet Explorer to display the website&#8217;s contents. It contains options to import Internet Explorer favorites which is convenient if several of those should be downloaded. </p>
<p>An alternative is the excellent <a href="http://www.ghacks.net/tag/httrack/">Httrack</a> which runs on Windows, Linux and OSX</p>

	Tags: <a href="http://www.ghacks.net/tag/httrack/" title="httrack" rel="tag">httrack</a>, <a href="http://www.ghacks.net/tag/internet-explorer/" title="internet-explorer" rel="tag">internet-explorer</a>, <a href="http://www.ghacks.net/tag/monitor-website/" title="monitor website" rel="tag">monitor website</a>, <a href="http://www.ghacks.net/tag/rip-websites/" title="rip websites" rel="tag">rip websites</a>, <a href="http://www.ghacks.net/tag/website-downloader/" title="website downloader" rel="tag">website downloader</a>, <a href="http://www.ghacks.net/tag/website-monitor/" title="website monitor" rel="tag">website monitor</a>, <a href="http://www.ghacks.net/tag/website-monitoring/" title="website monitoring" rel="tag">website monitoring</a>, <a href="http://www.ghacks.net/tag/website-snapshots/" title="website snapshots" rel="tag">website snapshots</a>, <a href="http://www.ghacks.net/tag/wysigot/" title="wysigot" rel="tag">wysigot</a><br />

	<h4>Related posts</h4>
	<ul class="st-related-posts">
	<li><a href="http://www.ghacks.net/2005/11/24/website-downloader/" title="Website Downloader (November 24, 2005)">Website Downloader</a> (0)</li>
	<li><a href="http://www.ghacks.net/2006/08/16/rip-websites-with-httrack-website-copier/" title="Rip Websites with HTTrack Website Copier (August 16, 2006)">Rip Websites with HTTrack Website Copier</a> (6)</li>
	<li><a href="http://www.ghacks.net/2009/02/24/create-a-cached-website-copy/" title="Create A Cached Website Copy (February 24, 2009)">Create A Cached Website Copy</a> (4)</li>
	<li><a href="http://www.ghacks.net/2008/07/21/youtube-file-hack-tool-for-internet-explorer/" title="Youtube File Hack Tool For Internet Explorer (July 21, 2008)">Youtube File Hack Tool For Internet Explorer</a> (8)</li>
	<li><a href="http://www.ghacks.net/2008/06/27/you-better-stop-using-internet-explorer-for-now/" title="You better stop using Internet Explorer for now (June 27, 2008)">You better stop using Internet Explorer for now</a> (18)</li>
</ul>

]]></content:encoded>
			<wfw:commentRss>http://www.ghacks.net/2009/03/24/website-monitor-and-downloader/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Create A Cached Website Copy</title>
		<link>http://www.ghacks.net/2009/02/24/create-a-cached-website-copy/</link>
		<comments>http://www.ghacks.net/2009/02/24/create-a-cached-website-copy/#comments</comments>
		<pubDate>Tue, 24 Feb 2009 17:40:53 +0000</pubDate>
		<dc:creator>Martin</dc:creator>
				<category><![CDATA[The Web]]></category>
		<category><![CDATA[backup url]]></category>
		<category><![CDATA[cache website]]></category>
		<category><![CDATA[httrack]]></category>
		<category><![CDATA[website cache]]></category>
		<category><![CDATA[website copier]]></category>
		<category><![CDATA[website copy]]></category>
		<category><![CDATA[website download]]></category>
		<category><![CDATA[website downloader]]></category>

		<guid isPermaLink="false">http://www.ghacks.net/?p=10736</guid>
		<description><![CDATA[Many websites tend to be discontinued after a time. This can be extremely frustrating if that website did contain some valuable information that are not accessible in the same form anywhere on the Internet. Google Cache might be a solution but it usually caches one of the last states of a page which does not [...]]]></description>
			<content:encoded><![CDATA[<p>Many websites tend to be discontinued after a time. This can be extremely frustrating if that website did contain some valuable information that are not accessible in the same form anywhere on the Internet. Google Cache might be a solution but it usually caches one of the last states of a page which does not necessarily have to be the one containing the important information. There are various ways to preserve information on the Internet. It is possible to save the information on a per-page basis using the web browser&#8217;s Save As option, to use website downloaders like <a href="http://www.ghacks.net/2006/08/16/rip-websites-with-httrack-website-copier/ ">HTTrack</a> or online services like <a href="http://backupurl.com/create.php">BackupUrl</a>.</p>
<p>All methods have various advantages and disadvantages. Using the Save As function in web browsers is probably the fastest way to download a page to the computer. The structure makes it on the other hand uncomfortable to work with on larger projects. Website downloaders on the other hand deal perfectly with large websites, they do require some knowledge and configuration though before they even start to download the first byte.</p>
<p>The online service Backupurl offers another way to create a cached copy of a website. The user enters the url of a page that he wants to preserve in the form on the website. The service will then cache that url for the user and provide two addresses to cached versions of the page. The main advantage of the service is that the cached pages are not stored locally. This might be favorable in environments with strict data storage policies. The disadvantage is obvious as well. Only one page can be cached per run which means it becomes as impracticable and uncomfortable as using Save As if multiple pages need to be cached. There is also no guarantee that the service will be there when the information need to be retrieved. </p>
<p><span id="more-10736"></span><img src="http://www.ghacks.net/wp-content/uploads/2009/02/backup_url-500x314.jpg" alt="backup url" title="backup url" width="500" height="314" class="alignnone size-medium wp-image-10737" /></p>
<p>It would also be an interesting option to retrieve all pages that have been cached at once. The only way to keep track of all cached pages is to copy and paste all created urls into another document. Backup URL can be an interesting option under certain circumstance. Advanced users are better off with applications like HTTrack or similar applications.</p>

	Tags: <a href="http://www.ghacks.net/tag/backup-url/" title="backup url" rel="tag">backup url</a>, <a href="http://www.ghacks.net/tag/cache-website/" title="cache website" rel="tag">cache website</a>, <a href="http://www.ghacks.net/tag/httrack/" title="httrack" rel="tag">httrack</a>, <a href="http://www.ghacks.net/tag/website-cache/" title="website cache" rel="tag">website cache</a>, <a href="http://www.ghacks.net/tag/website-copier/" title="website copier" rel="tag">website copier</a>, <a href="http://www.ghacks.net/tag/website-copy/" title="website copy" rel="tag">website copy</a>, <a href="http://www.ghacks.net/tag/website-download/" title="website download" rel="tag">website download</a>, <a href="http://www.ghacks.net/tag/website-downloader/" title="website downloader" rel="tag">website downloader</a><br />

	<h4>Related posts</h4>
	<ul class="st-related-posts">
	<li><a href="http://www.ghacks.net/2009/03/24/website-monitor-and-downloader/" title="Website Monitor And Downloader (March 24, 2009)">Website Monitor And Downloader</a> (2)</li>
	<li><a href="http://www.ghacks.net/2006/08/16/rip-websites-with-httrack-website-copier/" title="Rip Websites with HTTrack Website Copier (August 16, 2006)">Rip Websites with HTTrack Website Copier</a> (6)</li>
	<li><a href="http://www.ghacks.net/2005/11/24/website-downloader/" title="Website Downloader (November 24, 2005)">Website Downloader</a> (0)</li>
	<li><a href="http://www.ghacks.net/2009/06/14/restore-deleted-or-unavailable-websites/" title="Restore Deleted Or Unavailable Websites (June 14, 2009)">Restore Deleted Or Unavailable Websites</a> (4)</li>
	<li><a href="http://www.ghacks.net/2008/03/10/how-to-rip-most-websites/" title="How to rip most websites (March 10, 2008)">How to rip most websites</a> (10)</li>
</ul>

]]></content:encoded>
			<wfw:commentRss>http://www.ghacks.net/2009/02/24/create-a-cached-website-copy/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>How to rip most websites</title>
		<link>http://www.ghacks.net/2008/03/10/how-to-rip-most-websites/</link>
		<comments>http://www.ghacks.net/2008/03/10/how-to-rip-most-websites/#comments</comments>
		<pubDate>Mon, 10 Mar 2008 11:27:35 +0000</pubDate>
		<dc:creator>Martin</dc:creator>
				<category><![CDATA[Operating Systems]]></category>
		<category><![CDATA[The Web]]></category>
		<category><![CDATA[Windows]]></category>
		<category><![CDATA[software]]></category>
		<category><![CDATA[copy websites]]></category>
		<category><![CDATA[httrack]]></category>
		<category><![CDATA[software rip websites]]></category>
		<category><![CDATA[websites]]></category>

		<guid isPermaLink="false">http://www.ghacks.net/2008/03/10/how-to-rip-most-websites/</guid>
		<description><![CDATA[Ripping websites means to create a local copy of a website for offline browsing purposes. Creating a website mirror can actually be a good idea for several purposes. Even with all those caches that save website information many get lost when a website goes into Nirvana. It's also nice if you need information on a computer with no Internet access, or only temporary Internet access, say an HTML course for example.]]></description>
			<content:encoded><![CDATA[<p>Ripping websites means to create a local copy of a website for offline browsing purposes. Creating a website mirror can actually be a good idea for several purposes. Even with all those caches that save website information many get lost when a website goes into Nirvana. It&#8217;s also nice if you need information on a computer with no Internet access, or only temporary Internet access, say an HTML course for example.</p>
<p>One of the most efficient ways to rip websites is by using the program HTTrack which might look a little bit confusing at the beginning because of its many options. I would like to walk you through the process of ripping a website. Please note that this method is not working on all websites but on most.</p>
<p>To begin with you need to download and install the software <a href="http://www.httrack.com/">HTTrack</a> Website Copier. Start it once it has been installed, you will be greeted with a new project dialog. Each project creates the offline copy of one or more urls.</p>
<p><span id="more-3470"></span><img src='http://www.ghacks.net/wp-content/uploads/2008/03/httrack_rip_websites.jpg' alt='rip websites' /></p>
<p>The first screen manages the properties of the project. Just add a name &#8211; i prefer the name of the website that I want to rip &#8211; and a location on your hard drive where you want to save it. Make sure you have enough free disk space on that hard drive. Click Next to continue.</p>
<p>You add urls and the kind of action that you want HTTrack to perform. The standard action will download an exact copy of the website and make it available offline. The most important aspect here is the Set Options button which opens the configuration for the project.</p>
<p>It is very important to load the options and make some changes there. Click on the Browser ID tab and change the ID to Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.0). Some websites check for the default ID of Httrack and deny access to it. This way makes it possible to prevent that from happening.</p>
<p>Access the Limits tab afterwards. Select the maximum mirroring and external depths. The first defines how many links will be scanned beginning from the homepage. If you set that to 2 for instance the homepage will be scanned, page 1 which was linked from the homepage will be scanned and page 11 will be scanned which was linked from page 1.</p>
<p>If you leave the first option blank all links will be scanned on that website. No external links are scanned by default which can be changed in that menu as well. I suggest to leave it at that because it would really bloat the project. Make sure you increase the maximum transfer rate in the same menu to the maximum as well to ensure faster downloads. </p>
<p>The Scan Rules tab is another important one. You can include and exclude files in here. If you do not want to download .exe files for instance you can use the string &#8220;-*.exe&#8221; without the &#8220;&#8221; in the form.</p>
<p><strong>Passworded Websites:</strong></p>
<p>Passworded websites are most of the times harder to come by. You need to supply HTTrack with the username and password for that website. The easiest way to do so is to add it to the url in the main menu. Instead of adding the url http://www.example.com/ you would add it this way: http://username:password@www.example.com/</p>
<p>That&#8217;s for websites with basic authentication which means popups that ask for a username and password. It&#8217;s more difficulty if the website uses form based logins. Your best option to rip those websites is to click on the Add Url button in the main menu and use the capture url feature. </p>
<p>This requires you to set a proxy in your favorite browser for a short time and login into the website that you want to rip so that HTTrack can check the way it is done and hopefully emulate this way when ripping the website.</p>

	Tags: <a href="http://www.ghacks.net/tag/copy-websites/" title="copy websites" rel="tag">copy websites</a>, <a href="http://www.ghacks.net/tag/httrack/" title="httrack" rel="tag">httrack</a>, <a href="http://www.ghacks.net/tag/software-rip-websites/" title="software rip websites" rel="tag">software rip websites</a>, <a href="http://www.ghacks.net/tag/websites/" title="websites" rel="tag">websites</a>, <a href="http://www.ghacks.net/tag/windows/" title="Windows" rel="tag">Windows</a><br />

	<h4>Related posts</h4>
	<ul class="st-related-posts">
	<li><a href="http://www.ghacks.net/2006/08/16/rip-websites-with-httrack-website-copier/" title="Rip Websites with HTTrack Website Copier (August 16, 2006)">Rip Websites with HTTrack Website Copier</a> (6)</li>
	<li><a href="http://www.ghacks.net/2006/10/21/zoom-it/" title="Zoom It (October 21, 2006)">Zoom It</a> (4)</li>
	<li><a href="http://www.ghacks.net/2008/06/08/zip-repair/" title="Zip Repair (June 8, 2008)">Zip Repair</a> (3)</li>
	<li><a href="http://www.ghacks.net/2008/07/15/zen-key-an-all-purpose-application-manager/" title="Zen Key An All Purpose Application Manager (July 15, 2008)">Zen Key An All Purpose Application Manager</a> (3)</li>
	<li><a href="http://www.ghacks.net/2008/05/13/youtube-batch-downloader/" title="Youtube Batch Downloader (May 13, 2008)">Youtube Batch Downloader</a> (13)</li>
</ul>

]]></content:encoded>
			<wfw:commentRss>http://www.ghacks.net/2008/03/10/how-to-rip-most-websites/feed/</wfw:commentRss>
		<slash:comments>10</slash:comments>
		</item>
		<item>
		<title>Rip Websites with HTTrack Website Copier</title>
		<link>http://www.ghacks.net/2006/08/16/rip-websites-with-httrack-website-copier/</link>
		<comments>http://www.ghacks.net/2006/08/16/rip-websites-with-httrack-website-copier/#comments</comments>
		<pubDate>Wed, 16 Aug 2006 19:15:16 +0000</pubDate>
		<dc:creator>Martin</dc:creator>
				<category><![CDATA[Tools]]></category>
		<category><![CDATA[download websites]]></category>
		<category><![CDATA[httrack]]></category>
		<category><![CDATA[rip websites]]></category>
		<category><![CDATA[software]]></category>
		<category><![CDATA[website copier]]></category>
		<category><![CDATA[websites]]></category>

		<guid isPermaLink="false">http://www.ghacks.net/2006/08/16/rip-websites-with-httrack-website-copier/</guid>
		<description><![CDATA[Beta day here at Ghacks. First a way to get Windows Vista Beta 2 shipped right to your doorsteps by answering some easy questions in a quiz and now a new beta release of the great website copier HTTrack. The program aims towards the experienced user with all its options and settings but can be used by novices as well. Most settings are optional that should be used for websites that use a lot of scripting and dynamic pages.]]></description>
			<content:encoded><![CDATA[<p>Beta day here at Ghacks. First a way to get Windows Vista Beta 2 shipped right to your doorsteps by answering some easy questions in a quiz and now a new beta release of the great website copier <a target="_blank" title="copy websites rip" href="http://www.httrack.com/">HTTrack</a>. The program aims towards the experienced user with all its options and settings but can be used by novices as well. Most settings are optional that should be used for websites that use a lot of scripting and dynamic pages.</p>
<p>The question that some of you might be asking is why would someone want to rip a website ? You could rip websites for offline browsing purposes. Maybe you have a second computer without direct internet access but would like to view the website on this computer as well. A website that gives an overview over a programming language is a good example. Another reason would be to save the information that is on the website this way, maybe you fear that the website might be offline soon.</p>
<p><span id="more-723"></span>The HTTrack website offers a <a target="_blank" title="step by step guik" href="http://www.httrack.com/html/step.html">step by step guide</a> that can be used to understand the main features of the tool and rip your first website using the settings of the tutorial. HTTrack is available for Windows and Unix, Linux &#038; BSD.</p>

	Tags: <a href="http://www.ghacks.net/tag/download-websites/" title="download websites" rel="tag">download websites</a>, <a href="http://www.ghacks.net/tag/httrack/" title="httrack" rel="tag">httrack</a>, <a href="http://www.ghacks.net/tag/rip-websites/" title="rip websites" rel="tag">rip websites</a>, <a href="http://www.ghacks.net/tag/software/" title="software" rel="tag">software</a>, <a href="http://www.ghacks.net/tag/website-copier/" title="website copier" rel="tag">website copier</a>, <a href="http://www.ghacks.net/tag/websites/" title="websites" rel="tag">websites</a><br />

	<h4>Related posts</h4>
	<ul class="st-related-posts">
	<li><a href="http://www.ghacks.net/2009/03/24/website-monitor-and-downloader/" title="Website Monitor And Downloader (March 24, 2009)">Website Monitor And Downloader</a> (2)</li>
	<li><a href="http://www.ghacks.net/2005/11/24/website-downloader/" title="Website Downloader (November 24, 2005)">Website Downloader</a> (0)</li>
	<li><a href="http://www.ghacks.net/2006/06/14/mirror-websites-on-your-hard-drive/" title="Mirror Websites on your Hard Drive (June 14, 2006)">Mirror Websites on your Hard Drive</a> (5)</li>
	<li><a href="http://www.ghacks.net/2005/12/27/how-to-save-websites-to-your-hard-drive/" title="How to save websites to your hard drive (December 27, 2005)">How to save websites to your hard drive</a> (4)</li>
	<li><a href="http://www.ghacks.net/2008/03/10/how-to-rip-most-websites/" title="How to rip most websites (March 10, 2008)">How to rip most websites</a> (10)</li>
</ul>

]]></content:encoded>
			<wfw:commentRss>http://www.ghacks.net/2006/08/16/rip-websites-with-httrack-website-copier/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
	</channel>
</rss>
