Are you helpful enough? ... to the helpless ..this one made me hopeless! Grrrrr
By zills66
@zills66 (1419)
Saudi Arabia
February 6, 2011 7:58am CST
How do I fix line breaks and big gaps between words in Word 2007?
I am copying and pasting text from a scanned PDF file into Word 2007. Some of it is underlined and some is just plain text.
Now I am worn out from all the editing, backspacing lines to close up the big empty spaces before and after the other lines! lol.. I've tried changing every setting I know of, but nothing worked!
1 response
@owlwings (43897)
• Cambridge, England
6 Feb 11
The reason that you are getting these spaces is that each LINE is formatted as a separate paragraph when you copy and paste it into Word 2007. Turn ON 'Show formatting' (the Show/Hide button with the paragraph mark, ¶, on the 'Home' tab) and you will see all the invisible formatting.
You can do a 'Search and Replace' to replace all paragraph marks or newline marks with a space but it is better to do it in two stages (depending on the document). I find it best to replace all occurrences of two paragraph marks together (^p^p) with a combination that doesn't occur elsewhere (I often use '$$'), then replace all SINGLE paragraph/newline marks ('^p' or '^l') with spaces and then replace all '$$' with paragraph marks ('^p').
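For anyone who would rather clean the text up before it goes into Word, the same three-stage replacement can be sketched in Python. This is only an illustration, assuming the copied text has been saved as plain text, that real paragraphs are separated by a blank line, and that '$$' does not already occur in it. The function name unwrap_lines is made up for this example.

```python
def unwrap_lines(text: str, placeholder: str = "$$") -> str:
    """Join hard-wrapped lines while keeping real paragraph breaks.

    Mirrors the three Search and Replace passes described above.
    """
    if placeholder in text:
        # The trick only works if the placeholder is unique in the text.
        raise ValueError("placeholder already occurs in the text")

    # Normalise Windows line endings so every break is a single "\n".
    text = text.replace("\r\n", "\n")

    # Stage 1: protect real paragraph breaks (two breaks in a row).
    text = text.replace("\n\n", placeholder)
    # Stage 2: turn every remaining single break into a space.
    text = text.replace("\n", " ")
    # Stage 3: restore the protected paragraph breaks.
    return text.replace(placeholder, "\n")


if __name__ == "__main__":
    sample = "This line was\nbroken by the PDF.\n\nSecond paragraph\nhere."
    print(unwrap_lines(sample))
```

If the document uses more than one blank line between paragraphs, a stray leading space can be left behind, so check the result by eye.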
Copies from a PDF can often be more complicated than that, too, because you very often have to remove headers and footers which come in as part of the text. Go look at some 'Help' on advanced Search and Replace. There are some quite powerful options which allow you to specify 'any number' or 'any character' and can speed up the removal of repeating text which you don't need.
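To show what 'any number' matching buys you, here is a rough Python sketch that strips a repeating footer from plain text. The footer format ("Page 3 of 12" on a line by itself) is purely a made-up example, and this is Python's regular expression syntax, not Word's own wildcard syntax. It is best run before joining the lines back together, while each footer is still on its own line.

```python
import re

# Hypothetical footer: "Page <number> of <number>" alone on a line.
# \d+ plays the role of 'any number' in the pattern.
FOOTER = re.compile(r"^[ \t]*Page \d+ of \d+[ \t]*\n?", re.MULTILINE)


def strip_page_footers(text: str) -> str:
    """Remove every line that consists only of the assumed footer."""
    return FOOTER.sub("", text)


if __name__ == "__main__":
    sample = "end of page one\nPage 1 of 2\nstart of page two\n"
    print(strip_page_footers(sample))
```

The pattern would need adjusting to whatever header or footer text your own PDF actually repeats.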
@owlwings (43897)
• Cambridge, England
7 Feb 11
Ah! If they are scanned hard copies, then they are not actually text at all. They are bitmapped images and copying and pasting into Word will simply insert them as images (which cannot be edited as text).
In order to convert them to editable text, you will need to use Optical Character Recognition (OCR) software to 'read' the bitmaps and convert any recognisable characters to text.


