Duplicate content is one of the things that webmasters need to avoid on their own domains as web search engines like Google or Bing can penalize websites for duplicate content. Webmasters have less options when dealing with content that someone else copied from their websites. Content scraping is a lucrative business, primarily due to the fact that it is possible to automate the whole process with scripts and that setup usually does not take longer than a few minutes.
This too can have ill effects for the original content creator, for instance when content scrapers rank above the webmaster's original content or reputation is at stake.
Search engines ask webmasters to file DMCA requests to take down copied contents, but this requires extensive research. While it is possible to use the search engines for that, for instance by searching for article titles or the first paragraph, it is often better to use specialized tools for the job.
Desktop Plagiarism Checker offers to search on Google, Bing, Yahoo, Google Scholar or Google Books for text queries or documents. A free account is required before the program can be used.
Just paste text into one of the two tabs in the program and select one of the available search engines to search for duplicate content on the Internet. It is alternatively possible to load a text file into the program (supported are txt, htm, doc, pdf, rtf or odt) to search for the file's contents instead.
A click on check duplicate content runs the search query and displays the results in a new program window. The number of hits, the query and domains found to contain the query are listed here in a table. Each sentence is analyzed and displayed separately in the results window.
The query itself links to the results page of the selected search engine. Only the root domain name is displayed in the results window. This is a problem as deep links are needed when filing DMCA requests. Results can be exported as pdf documents for safekeeping or further processing.
Checking online for duplicate content is only one of the ways the program can be used. Teachers can use it to check if students have copied parts or all of their essays and work from other sources.
The program lacks several features that would improve it further. This includes a full url listing of sites where the entered query has been found on, the ability to file DMCA requests directly from the program interface, and options to email the webmaster or the hosters abuse department are just a few that come to mind right away.
Desktop Plagiarism Checker can be downloaded from the developer website. The very same plagiarism checks can be run directly on the website. Guests see only limited results, registered users all details.
Advertising revenue is falling fast across the Internet, and independently-run sites like Ghacks are hit hardest by it. The advertising model in its current form is coming to an end, and we have to find other ways to continue operating this site.
We are committed to keeping our content free and independent, which means no paywalls, no sponsored posts, no annoying ad formats (video ads) or subscription fees.
If you like our content, and would like to help, please consider making a contribution:
Ghacks is a technology news blog that was founded in 2005 by Martin Brinkmann. It has since then become one of the most popular tech news sites on the Internet with five authors and regular contributions from freelance writers.