<?xml version="1.0" encoding="UTF-8"?> <rss
version="2.0"
xmlns:content="http://purl.org/rss/1.0/modules/content/"
xmlns:wfw="http://wellformedweb.org/CommentAPI/"
xmlns:dc="http://purl.org/dc/elements/1.1/"
xmlns:atom="http://www.w3.org/2005/Atom"
xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
> <channel><title>gHacks Technology News &#124; Latest Tech News, Software And Tutorials &#187; httrack</title> <atom:link href="http://www.ghacks.net/tag/httrack/feed/" rel="self" type="application/rss+xml" /><link>http://www.ghacks.net</link> <description>A technology news blog covering software, mobile phones, gadgets, security, the Internet and other relevant areas.</description> <lastBuildDate>Fri, 10 Feb 2012 16:53:42 +0000</lastBuildDate> <language>en</language> <sy:updatePeriod>hourly</sy:updatePeriod> <sy:updateFrequency>1</sy:updateFrequency> <generator>http://wordpress.org/?v=3.3.1</generator> <atom:link rel="hub" href="http://pubsubhubbub.appspot.com"/><atom:link rel="hub" href="http://superfeedr.com/hubbub"/> <item><title>Download Websites With FireMirror</title><link>http://www.ghacks.net/2010/03/31/download-websites-with-firemirror/</link> <comments>http://www.ghacks.net/2010/03/31/download-websites-with-firemirror/#comments</comments> <pubDate>Wed, 31 Mar 2010 15:17:25 +0000</pubDate> <dc:creator>Martin Brinkmann</dc:creator> <category><![CDATA[Browsing]]></category> <category><![CDATA[Firefox]]></category> <category><![CDATA[download websites]]></category> <category><![CDATA[firefox add-ons]]></category> <category><![CDATA[firemirror]]></category> <category><![CDATA[httrack]]></category> <category><![CDATA[mirror websites]]></category> <guid
isPermaLink="false">http://www.ghacks.net/?p=24078</guid> <description><![CDATA[Internet users sometimes come upon information on websites that they want to preserve for the future. They might be inclined to bookmark the page but that is only useful for as long as the website exists. Another option would be to print the information or save the page to the local computer system. The Firefox [...]]]></description> <content:encoded><![CDATA[<p>Internet users sometimes come upon information on websites that they want to preserve for the future. They might be inclined to bookmark the page but that is only useful for as long as the website exists. Another option would be to print the information or save the page to the local computer system.</p><p>The Firefox extension FireMirror uses a similar technique as the last suggestion to download websites to the local hard drive. It basically is a website mirroring software that can automatically download a website based on the user&#8217;s parameters. The default setting will for instance download the active page plus every page that is linked from that page. The depth, which is the parameter that defines the number of pages that the website downloader will download, can be configured in the program&#8217;s options. The maximum depth is 10 and the minimum 0.</p><p><span
id="more-24078"></span><img
src="http://www.ghacks.net/wp-content/uploads/2010/03/download_websites-500x408.jpg" alt="" title="download websites" width="500" height="408" class="alignnone size-medium wp-image-24079" /></p><p>Additional options include configuring the timeout duration, disabling link replacements, enabling reports or configuring filters to include urls with a specific string. The configuration can be saved as a profile to speed up future download runs.</p><p>A basic browser is provided in another tab that can be used to browse the website that is being downloaded but it is usually a better idea to load the downloaded pages instead from the hard drive (the browser will retrieve pages that have not been downloaded).</p><p>The local pages use relative paths so that the downloaded websites and pages can be moved around without breaking the navigation.</p><p>The extension is provided in an early version. There is for instance no stop or cancel button which means the process can only be stopped by clicking on the x button on the extension window. The report is also not working at this time.</p><p>FireMirror could become an interesting alternative to desktop software like <a
href="http://www.ghacks.net/2006/08/16/rip-websites-with-httrack-website-copier/">HTTrack</a>. The add-on is compatible with Firefox 3.6+ and can be downloaded from the Mozilla website.</p><p>Update: The Fire Mirror extension is no longer available. We suggest to use HTTrack which is linked above to download entire websites.</p> ]]></content:encoded> <wfw:commentRss>http://www.ghacks.net/2010/03/31/download-websites-with-firemirror/feed/</wfw:commentRss> <slash:comments>10</slash:comments> </item> <item><title>Website Monitor And Downloader</title><link>http://www.ghacks.net/2009/03/24/website-monitor-and-downloader/</link> <comments>http://www.ghacks.net/2009/03/24/website-monitor-and-downloader/#comments</comments> <pubDate>Tue, 24 Mar 2009 14:47:10 +0000</pubDate> <dc:creator>Martin Brinkmann</dc:creator> <category><![CDATA[Browsing]]></category> <category><![CDATA[Internet Explorer]]></category> <category><![CDATA[Software]]></category> <category><![CDATA[httrack]]></category> <category><![CDATA[internet-explorer]]></category> <category><![CDATA[monitor websites]]></category> <category><![CDATA[rip websites]]></category> <category><![CDATA[website monitor]]></category> <category><![CDATA[website monitoring]]></category> <guid
isPermaLink="false">http://www.ghacks.net/2009/03/24/website-monitor-and-downloader/</guid> <description><![CDATA[Wysigot is a browser that acts both as a website monitor and downloader. One of its main functions is the download of entire websites or selected pages. The process has been streamlined to make it as easy as possible. To start the download of a website or page one would simply enter the url of [...]]]></description> <content:encoded><![CDATA[<p>Wysigot is a browser that acts both as a website monitor and downloader. One of its main functions is the download of entire websites or selected pages. The process has been streamlined to make it as easy as possible. To start the download of a website or page one would simply enter the url of the site in the assistant that pops up after installation. Supported are http, ftp and file protocols. The next step involves selecting the update check frequency which can be set to automatic, periodical or manual. Wysigot will check the url for updated content and update the information automatically if new contents are found.</p><p>The last step in the configuration configures the capturing depths which can be set to first page, first page plus links or whole site. The same menu contains options to allow popups and to set the scanning to be careful which deactivates certain scripts and other potentially malicious contents. The download will start immediately after the last step. The program will display the download progress of all objects on the website.</p><p>Experienced users can define objects that should not be downloaded. Among them files like videos, cookies or scripts. Once the website or page has been downloaded it can be browsed in the program interface up to the level it was downloaded from the server.</p><p><span
id="more-11443"></span><img
src="http://www.ghacks.net/wp-content/uploads/2009/03/website_monitor-500x327.jpg" alt="website monitor" title="website monitor" width="500" height="327" class="alignnone size-medium wp-image-11442" /></p><p>The website downloader will display all pages that have been downloaded in the sidebar sorted by project name. A download will be initiated for every link pointing to a page that has not been downloaded before. The download speed depends on several factors including the connection speed of the computer system the application runs on.</p><p>Different view modes are available that differ from the default html view mode. It is possible to take a look at the contents, attached files (e.g. videos or images) or information (modification date, allowed objects, site and so on). Properties can be accessed for each downloaded website differently or combined for a project. They make it possible to set very specific rules for downloading contents including the number of page revisions to keep or the contents that should be downloaded.</p><p><a
href="http://www.wysigot.com/">Wysigot</a> is not only a website downloader but also a website monitor. It can be set to monitor websites for changes and notify the user about those changes. Change verifications can be automatic, periodical or manual depending on the user&#8217;s choice. Alarms can be set to notify the user if a website has been updated.</p><p>The website monitor and downloader uses the rendering engine of Internet Explorer to display the website&#8217;s contents. It contains options to import Internet Explorer favorites which is convenient if several of those should be downloaded.</p><p>An alternative is the excellent <a
href="http://www.ghacks.net/tag/httrack/">Httrack</a> which runs on Windows, Linux and OSX</p> ]]></content:encoded> <wfw:commentRss>http://www.ghacks.net/2009/03/24/website-monitor-and-downloader/feed/</wfw:commentRss> <slash:comments>2</slash:comments> </item> <item><title>Create A Cached Website Copy</title><link>http://www.ghacks.net/2009/02/24/create-a-cached-website-copy/</link> <comments>http://www.ghacks.net/2009/02/24/create-a-cached-website-copy/#comments</comments> <pubDate>Tue, 24 Feb 2009 17:40:53 +0000</pubDate> <dc:creator>Martin Brinkmann</dc:creator> <category><![CDATA[The Web]]></category> <category><![CDATA[backup url]]></category> <category><![CDATA[cache website]]></category> <category><![CDATA[httrack]]></category> <category><![CDATA[website cache]]></category> <category><![CDATA[website copier]]></category> <category><![CDATA[website copy]]></category> <category><![CDATA[website download]]></category> <category><![CDATA[website downloader]]></category> <guid
isPermaLink="false">http://www.ghacks.net/?p=10736</guid> <description><![CDATA[Many websites tend to be discontinued after a time. This can be extremely frustrating if that website did contain some valuable information that are not accessible in the same form anywhere on the Internet. Google Cache might be a solution but it usually caches one of the last states of a page which does not [...]]]></description> <content:encoded><![CDATA[<p>Many websites tend to be discontinued after a time. This can be extremely frustrating if that website did contain some valuable information that are not accessible in the same form anywhere on the Internet. Google Cache might be a solution but it usually caches one of the last states of a page which does not necessarily have to be the one containing the important information. There are various ways to preserve information on the Internet. It is possible to save the information on a per-page basis using the web browser&#8217;s Save As option, to use website downloaders like <a
href="http://www.ghacks.net/2006/08/16/rip-websites-with-httrack-website-copier/ ">HTTrack</a> or online services like <a
href="http://backupurl.com/">BackupUrl</a>.</p><p>All methods have various advantages and disadvantages. Using the Save As function in web browsers is probably the fastest way to download a page to the computer. The structure makes it on the other hand uncomfortable to work with on larger projects. Website downloaders on the other hand deal perfectly with large websites, they do require some knowledge and configuration though before they even start to download the first byte.</p><p>The online service Backupurl offers another way to create a cached copy of a website. The user enters the url of a page that he wants to preserve in the form on the website. The service will then cache that url for the user and provide two addresses to cached versions of the page. The main advantage of the service is that the cached pages are not stored locally. This might be favorable in environments with strict data storage policies. The disadvantage is obvious as well. Only one page can be cached per run which means it becomes as impracticable and uncomfortable as using Save As if multiple pages need to be cached. There is also no guarantee that the service will be there when the information need to be retrieved.</p><p><span
id="more-10736"></span><img
src="http://www.ghacks.net/wp-content/uploads/2009/02/backup_url-500x314.jpg" alt="backup url" title="backup url" width="500" height="314" class="alignnone size-medium wp-image-10737" /></p><p>It would also be an interesting option to retrieve all pages that have been cached at once. The only way to keep track of all cached pages is to copy and paste all created urls into another document. Backup URL can be an interesting option under certain circumstance. Advanced users are better off with applications like HTTrack or similar applications.</p> ]]></content:encoded> <wfw:commentRss>http://www.ghacks.net/2009/02/24/create-a-cached-website-copy/feed/</wfw:commentRss> <slash:comments>4</slash:comments> </item> <item><title>How to rip most websites</title><link>http://www.ghacks.net/2008/03/10/how-to-rip-most-websites/</link> <comments>http://www.ghacks.net/2008/03/10/how-to-rip-most-websites/#comments</comments> <pubDate>Mon, 10 Mar 2008 11:27:35 +0000</pubDate> <dc:creator>Martin Brinkmann</dc:creator> <category><![CDATA[Operating Systems]]></category> <category><![CDATA[Software]]></category> <category><![CDATA[The Web]]></category> <category><![CDATA[Windows]]></category> <category><![CDATA[copy websites]]></category> <category><![CDATA[httrack]]></category> <category><![CDATA[software rip websites]]></category> <category><![CDATA[websites]]></category> <guid
isPermaLink="false">http://www.ghacks.net/2008/03/10/how-to-rip-most-websites/</guid> <description><![CDATA[Ripping websites means to create a local copy of a website for offline browsing purposes. Creating a website mirror can actually be a good idea for several purposes. Even with all those caches that save website information many get lost when a website goes into Nirvana. It's also nice if you need information on a computer with no Internet access, or only temporary Internet access, say an HTML course for example.]]></description> <content:encoded><![CDATA[<p>Ripping websites means to create a local copy of a website for offline browsing purposes. Creating a website mirror can actually be a good idea for several purposes. Even with all those caches that save website information many get lost when a website goes into Nirvana. It&#8217;s also nice if you need information on a computer with no Internet access, or only temporary Internet access, say an HTML course for example.</p><p>One of the most efficient ways to rip websites is by using the program HTTrack which might look a little bit confusing at the beginning because of its many options. I would like to walk you through the process of ripping a website. Please note that this method is not working on all websites but on most.</p><p>To begin with you need to download and install the software <a
href="http://www.httrack.com/">HTTrack</a> Website Copier. Start it once it has been installed, you will be greeted with a new project dialog. Each project creates the offline copy of one or more urls.</p><p><span
id="more-3470"></span><img
src='http://www.ghacks.net/wp-content/uploads/2008/03/httrack_rip_websites.jpg' alt='rip websites' /></p><p>The first screen manages the properties of the project. Just add a name &#8211; i prefer the name of the website that I want to rip &#8211; and a location on your hard drive where you want to save it. Make sure you have enough free disk space on that hard drive. Click Next to continue.</p><p>You add urls and the kind of action that you want HTTrack to perform. The standard action will download an exact copy of the website and make it available offline. The most important aspect here is the Set Options button which opens the configuration for the project.</p><p>It is very important to load the options and make some changes there. Click on the Browser ID tab and change the ID to Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.0). Some websites check for the default ID of Httrack and deny access to it. This way makes it possible to prevent that from happening.</p><p>Access the Limits tab afterwards. Select the maximum mirroring and external depths. The first defines how many links will be scanned beginning from the homepage. If you set that to 2 for instance the homepage will be scanned, page 1 which was linked from the homepage will be scanned and page 11 will be scanned which was linked from page 1.</p><p>If you leave the first option blank all links will be scanned on that website. No external links are scanned by default which can be changed in that menu as well. I suggest to leave it at that because it would really bloat the project. Make sure you increase the maximum transfer rate in the same menu to the maximum as well to ensure faster downloads.</p><p>The Scan Rules tab is another important one. You can include and exclude files in here. If you do not want to download .exe files for instance you can use the string &#8220;-*.exe&#8221; without the &#8220;&#8221; in the form.</p><p><strong>Passworded Websites:</strong></p><p>Passworded websites are most of the times harder to come by. You need to supply HTTrack with the username and password for that website. The easiest way to do so is to add it to the url in the main menu. Instead of adding the url http://www.example.com/ you would add it this way: http://username:password@www.example.com/</p><p>That&#8217;s for websites with basic authentication which means popups that ask for a username and password. It&#8217;s more difficulty if the website uses form based logins. Your best option to rip those websites is to click on the Add Url button in the main menu and use the capture url feature.</p><p>This requires you to set a proxy in your favorite browser for a short time and login into the website that you want to rip so that HTTrack can check the way it is done and hopefully emulate this way when ripping the website.</p> ]]></content:encoded> <wfw:commentRss>http://www.ghacks.net/2008/03/10/how-to-rip-most-websites/feed/</wfw:commentRss> <slash:comments>15</slash:comments> </item> <item><title>Rip Websites with HTTrack Website Copier</title><link>http://www.ghacks.net/2006/08/16/rip-websites-with-httrack-website-copier/</link> <comments>http://www.ghacks.net/2006/08/16/rip-websites-with-httrack-website-copier/#comments</comments> <pubDate>Wed, 16 Aug 2006 19:15:16 +0000</pubDate> <dc:creator>Martin Brinkmann</dc:creator> <category><![CDATA[Tools]]></category> <category><![CDATA[download websites]]></category> <category><![CDATA[httrack]]></category> <category><![CDATA[rip websites]]></category> <category><![CDATA[Software]]></category> <category><![CDATA[website copier]]></category> <category><![CDATA[websites]]></category> <guid
isPermaLink="false">http://www.ghacks.net/2006/08/16/rip-websites-with-httrack-website-copier/</guid> <description><![CDATA[Beta day here at Ghacks. First a way to get Windows Vista Beta 2 shipped right to your doorsteps by answering some easy questions in a quiz and now a new beta release of the great website copier HTTrack. The program aims towards the experienced user with all its options and settings but can be used by novices as well. Most settings are optional that should be used for websites that use a lot of scripting and dynamic pages.]]></description> <content:encoded><![CDATA[<p>Beta day here at Ghacks. First a way to get Windows Vista Beta 2 shipped right to your doorsteps by answering some easy questions in a quiz and now a new beta release of the great website copier <a
target="_blank" title="copy websites rip" href="http://www.httrack.com/">HTTrack</a>. The program aims towards the experienced user with all its options and settings but can be used by novices as well. Most settings are optional that should be used for websites that use a lot of scripting and dynamic pages.</p><p>The question that some of you might be asking is why would someone want to rip a website ? You could rip websites for offline browsing purposes. Maybe you have a second computer without direct internet access but would like to view the website on this computer as well. A website that gives an overview over a programming language is a good example. Another reason would be to save the information that is on the website this way, maybe you fear that the website might be offline soon.</p><p><span
id="more-723"></span>The HTTrack website offers a <a
target="_blank" title="step by step guik" href="http://www.httrack.com/html/step.html">step by step guide</a> that can be used to understand the main features of the tool and rip your first website using the settings of the tutorial. HTTrack is available for Windows and Unix, Linux &#038; BSD.</p> ]]></content:encoded> <wfw:commentRss>http://www.ghacks.net/2006/08/16/rip-websites-with-httrack-website-copier/feed/</wfw:commentRss> <slash:comments>8</slash:comments> </item> <item><title>Website Downloader</title><link>http://www.ghacks.net/2005/11/24/website-downloader/</link> <comments>http://www.ghacks.net/2005/11/24/website-downloader/#comments</comments> <pubDate>Thu, 24 Nov 2005 11:11:16 +0000</pubDate> <dc:creator>Martin Brinkmann</dc:creator> <category><![CDATA[Tools]]></category> <category><![CDATA[download websites]]></category> <category><![CDATA[httrack]]></category> <category><![CDATA[rip websites]]></category> <category><![CDATA[website downloader]]></category> <guid
isPermaLink="false">http://www.ghacks.net/?p=172</guid> <description><![CDATA[WinHTTrack is an easy-to-use offline browser utility. It allows you to download a World Wide website from the Internet to a local directory, building recursively all directories, getting html, images, and other files from the server to your computer. ]]></description> <content:encoded><![CDATA[<p>WinHTTrack is an easy-to-use offline browser utility. It allows you to download a World Wide website from the Internet to a local directory, building recursively all directories, getting html, images, and other files from the server to your computer.</p><p>WinHTTrack arranges the original site&#8217;s relative link-structure. Simply open a page of the &#8216;mirrored&#8217; website in your browser, and you can browse the site from link to link, as if you were viewing it online. WinHTTrack can also update an existing mirrored site, and resume interrupted downloads. WinHTTrack is fully configurable, and has an integrated help system. NOTES: WinHTTrack is the Windows release of HTTrack.</p><p><img
src="http://freeware.deny.de/screenshots/httrack/snap6.jpg" alt="website download leech rip save page web site" /></p><h3>HTTrack 3.44-1 Offline Browser Utility</h3><p>HTTrack is a free offline browser utility that allows you to download www. Sites directly from the internet and arrange them in a local directory.  This recursively builds all directories, HTML, images, and various other files from the server to your computer.  HTTrack works with the original site and its relative link-structure.</p><p>Offline viewing is similar to collecting a library of websites, text and images.  You are able to browse these sites as if you were online when you are actually offline.  If you anticipate that you will be offline for any given reason and still need to access online information, this is a great way to do it.  Of course, you can only view what you download to the local directory.</p><p>The result is the ability to browse any site from any link to any other link as long as they are downloaded to the local directory.  All you have to do is open the mirrored page of the website in your browser as though you were viewing online.  Offline viewing can be advantageous when you have multiple sites that you need to cross-reference while offline.  Now HTTrack will update existing mirrored sites and any interrupted downloads will be resumed.  It can be fully configured to needed specifications and includes an integrated help system to make it easy to use for beginners.</p><p>The Windows 200/XP/Vista/7 release of HTTrack is called WinHTTrack.  The Linux/Unix/BSD release version is WebHTTrack.  Most Unix versions are available, including Ubuntu, of course.  This is convenient for users utilizing virtualization and running multiple operating systems.  You can download as many versions of HTTrack as needed to fit VHDs or multiple boot systems.  All versions are available on the download page, including versions for both 32-bit and 64-bit systems.  Download from the following link:</p><p><a
href="http://www.httrack.com/page/2/">http://www.httrack.com/page/2/</a></p><p>You will immediately notice that a version is available for virtually any operating system.  Be sure to choose the right one or it simply will not work.  This is only mentioned for those running more than one operating system on a single computer as mentioned above.  It is an easy mistake to make; installing the wrong download to the wrong operating system.</p><p><img
src="http://www.ghacks.net/wp-content/uploads/2005/11/httrack-600x307.png" alt="httrack" title="httrack" width="600" height="307" class="alignnone size-medium wp-image-51365" /></p><p>Another link includes all Documentation for HTTrack.  Anything that you need to know about the latest version of HTTrack is here.  Simply click on the desired header for specifics.  The information is too extensive to detail here, but you can find all of the specifics here:<br
/> http://www.httrack.com/html/index.html</p><p>For the truly technical users out there, you can get all of the specifics regarding the release changes from a link on the same page links above.  For your convenience, the link to the release changes in the current edition is included here: http://www.httrack.com/history.txt</p><p>The utility is pretty much the same as before, with many minor changes that create improvements that may be significant to some and unapparent to other users.  Regardless, this is a useful utility which any user can utilize as an offline browsing utility.</p> ]]></content:encoded> <wfw:commentRss>http://www.ghacks.net/2005/11/24/website-downloader/feed/</wfw:commentRss> <slash:comments>0</slash:comments> </item> </channel> </rss>
