ghacks Technology News

Unbreak Copied Text From PDF Documents

Users who want to copy and paste text out of pdf documents might have noticed that the text in the destination document will have line breaks just like the original pdf document had. This is usually something that is not wanted and while it is not a big problem to remove the line breaks manually when short paragraphs have been pasted it becomes a bigger problem for longer texts.

Auto Unbreak is a small 22 Kilobyte tool that has only one purpose. It takes text from pdf documents and removes the line breaks of those texts before it provides the user with an option to copy the newly formatted text to the clipboard again.

Auto Unbreak is a portable application that can be executed from any location of a computer system. It ships with two files that define merge and exception rules which might come in handy for users who deal with specifically formatted text.

unbreak pdf

The rule files can be edited in every text editor. The homepage of the developers have been suspended, please download the tool from this link. It is temporarily hosted here at Ghacks until the developers announce their new website.

Enjoyed the article?: Then sign-up for our free newsletter or RSS feed to kick off your day with the latest technology news and tips, or share the article with your friends and contacts on Facebook or Twitter.

Related Articles:

Find and Replace text across multiple documents
PDF OCR Turns PDF Documents Into Text
Convert Text To HTML Documents
How To Extract Images Or Text From PDF Documents
Topicmarks Summarizes Text Documents For Faster Learning



About the Author:Martin Brinkmann is a journalist from Germany who founded Ghacks Technology News Back in 2005. He is passionate about all things tech and knows the Internet and computers like the back of his hand. You can follow Martin on Facebook or Twitter.

Author: , Thursday September 11, 2008 -
Tags:, , , ,


Responses so far:

  1. Josh says:

    Thanks Martin, great find this is major pain for me =)

  2. I use clipboard extender ClipCache Pro, which allows for a multitude of reformating options for any copied text whatever its source as well as handling html and graphics. Very useful little tool, although it’s not free.

  3. Jim says:

    Symantec flags it as as “Construction.Kit” virus. I’ll have to try AVG and Trend to see if it’s a false alert.

  4. Medved says:

    I use http://texthandler.com to remove line breaks online. Copy text from PDF, select options “Every paragraph began by capital ” and click the “execute” button.

  5. Ruth says:

    I have used AutoUnbreak for many years but now suddenly the Clear button has disappeared. This is a real pain, I have to close down the program to clear text. And copied text can only be pasted into word if the program is still open.
    Another issue is how it doesn’t remove the last line break from a slab of text. I could live with that if only I could get the Clear button back again. Do you have a solution?

Leave a Reply   Follow Ghacks   Subscribe To Comment Rss

Subscribe without commenting

© 2005-2012 Ghacks.net. All Rights Reserved. Privacy Policy - About Us