Converting text in jpeg format to Word or rtf? Téma indítója: Peter Beauclerk DTCM, CPT
|
I have received several pages for translation in jpeg format- does anyone know how I can convert them into an editable text format? | | | Mulyadi Subali Indonézia Local time: 10:24 ProZ.com-tag angol - indonéz + ...
assuming it's not hand-written, you can use any ocr program. many recommend abbyy fine reader. | | | Brandis (X) Local time: 05:24 angol - német + ... few steps and copy editing | Nov 24, 2007 |
Hi1 Generally the client´´s never mention this or do not know themselves. Steps are 1 .jpeg 2 .pdf 2 .pdf 2 ocr into word and copy edit comparing with the original .jpeg. and translate . I normally convert the target file into a .jpeg and send back. But this is separate process and not related to translation at all. There are quite a few data converting comapnies that do a professional job. Brandis | | | Samuel Murray Hollandia Local time: 05:24 Tag (2006 óta) angol - afrikaans + ... Print and type | Nov 24, 2007 |
Peter Beauclerk MS LAc wrote: I have received several pages for translation in jpeg format- does anyone know how I can convert them into an editable text format? Print them out and let someone type them in. | |
|
|
a2ztranslate Új-Zéland Local time: 15:24 angol - japán + ... depends if text has been outlined first | Nov 24, 2007 |
which it probably has before creating jpeg; try OCR but be real careful on the double check as OCR will often miss content, especially if there are graphics within graphics. for smaller files it is often just quicker to re-type the master. | | | optical character recognition | Nov 24, 2007 |
Peter Beauclerk MS LAc wrote: I have received several pages for translation in jpeg format- does anyone know how I can convert them into an editable text format? Use an OCR program. I highly commend Abbyy FinerReader, even in the free Sprint version. Better, still, Professional. | | | Peter Linton (X) Local time: 04:24 svéd - angol + ... Nuance OmniPage | Nov 24, 2007 |
I use an OCR product called Nuance OmniPage (curently version 15 or 16). Very effective and converts JPEG directly to DOC. In fact converts a wide range of graphics files to text, including PDF. Similar to ABBYY FineReader. A rcent computer magazine survey found them both very good. An essential tool these days - gives you a competitive edge. You may also be able to charge separately for the process, by emphasing the benefits of handing everything electronically. I have certainly re... See more I use an OCR product called Nuance OmniPage (curently version 15 or 16). Very effective and converts JPEG directly to DOC. In fact converts a wide range of graphics files to text, including PDF. Similar to ABBYY FineReader. A rcent computer magazine survey found them both very good. An essential tool these days - gives you a competitive edge. You may also be able to charge separately for the process, by emphasing the benefits of handing everything electronically. I have certainly recouped the cost of the software. But as others have mentioned, the text needs to be carefully checked. ▲ Collapse | | |
Recent versions of OCR programs work very good. I have IRIS, which is bundled to some HP scanners, and it is OK: Recently I needed to OCR some tiffs, and did not have my computer, so I downloaded this: topocr it is shareware, so the usage is limited (number of use, not functions). Small package, so quick download. Very easy to use - no need to read manual. It did the job. | |
|
|
Haiyang Ai (X) Egyesült Államok Local time: 22:24 angol - kínai + ... Manually type them out | Nov 24, 2007 |
Samuel Murray wrote: Print them out and let someone type them in. That's probably the easiest way to do it. If you type it yourself, can you write that in the invoice? Regards, Haiyang | | |
Peter Linton wrote: I use an OCR product called Nuance OmniPage (curently version 15 or 16). It's currently at version 16 (came out a few months ago). Many people think that OCR is tricky because the software will try recognizing text in images or even the images themselves (with, as a result, the usual useless garbles we OCR users are familiar with). I just wanted to say that OmniPage lets you manually process documents, which in turn will tell the software which part of a page should be processed in what way. You can "zone" images so they are left out of the recognition process (you can do this with any type of content, so if you want to exclude footers, for example, it is easy to do). This ensures that images that are not meant to be recognized are simply carried forward into the recognition results without any further recognition, which in turn eliminates the above-mentioned garbles. This is handy with documents that contain lots of images, like for example software manuals, but even more so when it comes to PDFs in image format (scanned or even handwritten documents). OmniPage also recognizes handwriting, although it cannot be perfect in that field, just like any other software. Also, for images, especially when you need to recognize text within images, OmniPage lets you process the images (sharpness, contrast, etc.) to make the initial image prepared for recognition easier to handle and optimize recognition performance (I usually make all images grayscale and increase the contrast). There's even a built-in text editor which will let you correct OCR results before OmniPage exports them into the final format (for our purposes, this would be Word .doc, usually). However, the learning curve of OmniPage is steep. It takes some time before getting a hang of simple functioning, and then some to learn the tricks that will let you get the most out of the software. But I definitely think that once you get past Nuance's lousy costumer service and the learning period, OmniPage truly is an OCR software that is at least one step ahead of the bunch. It offers unique features not found in other software (advanced zoning is one of them - you can precisely determine which parts of a page will be recognized and how they will be processed, and zoning typically doesn't take long) and it is really great at what it was initially invented for, aside from handy unique features, which is to recognize text. The investment is sizable (on the website, these babies go for around USD 500) but if you are willing to tame the software, the benefits can far outweigh the investment.
[Edited at 2007-11-24 15:45] | | | Samuel Murray Hollandia Local time: 05:24 Tag (2006 óta) angol - afrikaans + ... Typing included in invoice | Nov 24, 2007 |
Haiyang Ai wrote: If you type it yourself, can you write that in the invoice? If you want to... but you must tell the client beforehand that you'll charge extra for the typing. And some clients might try to save that money by self-OCR'ing it and the result will be rubbish. So my advice is to quote higher for such texts that are not editable, and then pay the typist from your fees. If the client wants you to sign an NDA that excludes using a typist, refuse the job or get the typist to sign the NDA too. You can easily pay a typist 10% of your own translation rate per word (just an example). | | | Thanks for all responses- good solution found | Nov 27, 2007 |
Abby Scan to Word Wizard worked very nicely & was easy to use- when set for jpeg it rendered the text plus graphics faithfully & at $40 it won't bust my budget. cheers, Peter | | | To report site rules violations or get help, contact a site moderator: You can also contact site staff by submitting a support request » Converting text in jpeg format to Word or rtf? Anycount & Translation Office 3000 | Translation Office 3000
Translation Office 3000 is an advanced accounting tool for freelance translators and small agencies. TO3000 easily and seamlessly integrates with the business life of professional freelance translators.
More info » |
| Trados Studio 2022 Freelance | The leading translation software used by over 270,000 translators.
Designed with your feedback in mind, Trados Studio 2022 delivers an unrivalled, powerful desktop
and cloud solution, empowering you to work in the most efficient and cost-effective way.
More info » |
|
| | | | X Sign in to your ProZ.com account... | | | | | |