A file is converted from pdf to word, so how to detected whether it is typed....

India
September 2, 2009 9:43pm CST
It is a long story.... But i'll keep it short.... this question is to challenge all the tech freaks.. my friends say it is possible to detect a word file whether the contents of it is typed from the key board by looking into the original pdf file or it(word file) is a converted version of the original pdf... so i tried this to check whether i can do it or not... 1st i scanned a page from my physics book and saved it into a jpeg format(my scanner doesn't have option to directly scan into a pdf file)....then i converted that file into a pdf file... then i converted that file into a word file..saved in .doc format,I gave this file to my friend.. and he was able to detect that it was a converted version of the pdf format... so i'm wondering how he was able to do it... is their any software to detect this or does the word create any code(correspondingly whether it is typed or converted 4m software)so that we can recognize from these codes.... so plz, help me how to do this... i'm very much interested to know how he was able to detect it....
5 responses
@lovedude (4447)
• India
9 Sep 09
Well sagar (I guess your name sagar only )it's just wild guess anyone can do. specially if he is from computer field. he can easily do that.. the file you convert from pdf to word must have some either symbols or special characters which will be detected as image in your word file where in case of typed word file it work as symbol not as image. :-) You just try once.. may be manier characters there in that physics word which can't be typed directly in word too. so at that point too you can easily detect it's scanned document or converted from pdf.. Good Luck.
@etavasi (749)
• Malaysia
3 Sep 09
Hello my friends, long time ago when i lazy to type i just scan the book. I think you do the the long way to get file for Microsoft Word. I don't know what type your scanner but i use my cannon mp 160 to scan and print. When i buy this, it also follow the disc installer for driver and software. Then, i just install and use this software called Scansoft Omnipage from the disc that provided. Then i will be able to scan and direct save to Microsoft word. It also can save to many other filetype like image, and more. It is so easy to use. Maybe you should try download Scansoft Omnipage, for me it detect very well only those word that not very clear u need to do some work. Thanks.
• India
13 Sep 09
He detected it based on the header and footer. Because whenever you type something the the header and footer is managed accurately. this might be the reason how he identified it. And also there is change in the quality of the content that is the alphabets are not that dark as you normally see them when typed.
@Boyetski (986)
• Philippines
3 Sep 09
He said omni page. Depending on the quality of your scanned document. Ominipage which has the OCR it think it's optical character recognition will definitely detect the words from the pictures. But you dont need to convert it to PDF. You can directly scan and recognize characters from JPEG format. I have experience with this software. And it's not too accurate. It really depends on the quality of the scanned document's
• India
30 Jul 10
Hi, I also want to know this, as i also have a same problem facing from one my friend. Please let me know if there are any software that detects the converted files.