There are reasons why you would want to view a web site off line. Say, for example, you know you are on the go and do not always have access to a network connection, yet you want to be up to date on the latest news. Or you are a developer working on a site and need to be able to make changes, or check a web site for bugs or broken links. Or maybe you are trying to develop a new site and you want to loosely base your new site (all the while crediting the original site of course) on an already existing site.
You can come up with plenty of reasons for this action and fortunately there are plenty of tools to enable this. One of those tools is WebHTTrack. WebHTTrack is the Linux version and WinHTTrack is the Windows version, so not only can you read your sites off line, you can read them in either platform. In this article I will show you how to do just that - only on the Linux platform.
Installation is quite simple. Let's take a look at how to do this from the command line for both Ubuntu and Fedora. The Ubuntu steps look like this:
The Feodra installation is very similar:
You're ready to start downloading sites. When WebHTTrack is installed, you can start it by clicking Applications > Internet > Web HTTrack Website Copier.
Name: Give the project a name (or select from pre-existing projects).
Category: Give the project a category (or select from pre-existing categories).
Base path: Select where you want the project saved (by default it is in ~/websites).
Action: Here you select from a number of options, including Download web site(s), Download web sites + questions, get individual files, Download all sites in pages, Test links in pages. You can also choose Continue interrupted download or Update existing download.
Web Adresses: Enter the URL you want to download.
In this same screen you can also set Preferences and Mirror options. There are plenty of options to select from (such as Build, Scan Rules, Spider, Log/Index/Cache, Flow Control, and more).
This final screen gives you a last warning to make any adjustments and allows you to save your settings only (for later download). Or you can simply click Start to begin the download process.
Once you start downloading you will see a progress screen that will indicate what has been downloaded. Depending on the size and depth of your site, this process can take quite some time. Once the download is finished you can then browse your downloaded site by opening up your browser and navigating to the download directory of that site (it will be a sub-directory within ~/website).
No matter the reason for needing a downloaded web site, it's good to know there are tools that can handle this task. WebHTTrack is one of the easier and more reliable of these tools I have found. And since it's cross platform, you won't miss a beat switching back and forth between Linux and Windows.
Advertising revenue is falling fast across the Internet, and independently-run sites like Ghacks are hit hardest by it. The advertising model in its current form is coming to an end, and we have to find other ways to continue operating this site.
We are committed to keeping our content free and independent, which means no paywalls, no sponsored posts, no annoying ad formats (video ads) or subscription fees.
If you like our content, and would like to help, please consider making a contribution:
Ghacks is a technology news blog that was founded in 2005 by Martin Brinkmann. It has since then become one of the most popular tech news sites on the Internet with five authors and regular contributions from freelance writers.