Tag soup - accents Téma indítója: Thomas Carey
|
Hi all, I have recently been experiencing problems with tags in source when it comes to apostrophes and accents (source text in French). I use MemoQ 2013 R2 (6.8.56). At first, I thought it was due to the document being a converted .pdf, but I've been told it is actually an original word file. Now, nearly every time I receive a document from this client, I have these tag problems. The original file is a .doc file (97-2003). The main text font is Arial but t... See more Hi all, I have recently been experiencing problems with tags in source when it comes to apostrophes and accents (source text in French). I use MemoQ 2013 R2 (6.8.56). At first, I thought it was due to the document being a converted .pdf, but I've been told it is actually an original word file. Now, nearly every time I receive a document from this client, I have these tag problems. The original file is a .doc file (97-2003). The main text font is Arial but the accents are in Arial Unicode MS (and the style is slightly different). I tried saving the doc as docx, this sometimes works or helps but not perfectly. I also selected all the text and changed all font to normal Arial, this removed tags but messed up the text by missing or repeating parts of sentences (I even ended up with what appeared to be some kind of Asian language!?!). Has anyone else been experiencing such problems recently? Any suggestions? Thanks, Tom ▲ Collapse | | | esperantisto Local time: 21:40 Tag (2006 óta) angol - orosz + ... A WEBOLDALAT LOKALIZÁLÓ FORDÍTÓ OCRd document? | Apr 23, 2014 |
Is your document really in MS Word? Your description makes me think that it's a result of OCRing by ABBYY FineReader or a similar program. In such a case, it's not really DOC, but RTF. Try the following: 1. Re-save to make sure it's really DOC. Or use Apache OpenOffice / LibreOffice and save to ODF. 2. Make sure that the entire text is in the same (desired) language. 3. After verifying the language, apply uniform font attributes such as fontface, height, color to the text. | | | Thomas Carey Local time: 20:40 francia - angol + ... TÉMAINDÍTÓ
Hi, thanks, esperantisto Yes, pretty sure it was Word. Anyway, problem solved by selecting everything again, changing all font to normal arial and saving, and then saving once more in .docx. I don't know why that didn't work the first time I tried... thanks again, Tom | | | LEXpert Egyesült Államok Local time: 13:40 Tag (2008 óta) horvát - angol + ... Similar problem, same solution | Apr 23, 2014 |
Thomas Carey wrote: Hi, thanks, esperantisto Yes, pretty sure it was Word. Anyway, problem solved by selecting everything again, changing all font to normal arial and saving, and then saving once more in .docx. I don't know why that didn't work the first time I tried... thanks again, Tom Glad that worked for you! I just saw your post, and I've had similar issues in MemoQ with umlauted characters in OCR'd German files - the often appear with a tag pair around every such letter. Indeed, selecting all and changing the font to normal Arial usually resolves the problem. For good measure I also change the language to EN, but I'm not sure if that step makes any difference. | |
|
|
As correctly surmised by esperantisto... | Apr 29, 2014 |
Thomas Carey wrote: At first, I thought it was due to the document being a converted .pdf, but I've been told it is actually an original word file. Now, nearly every time I receive a document from this client, I have these tag problems. Tom ... the document is almost certainly a converted PDF if Arial Unicode MS is used for accents. PDFs really get in the way of agencies applying "Trados discounts" so some of them aren't too keen to pass on this information to translators in case they baulk at the idea ("no discounts for PDFs"). They are desperate to convert them and as a result there will usually be a whole host of other formatting and layout problems to be fixed before you can start the translation. | | | Thomas Carey Local time: 20:40 francia - angol + ... TÉMAINDÍTÓ Definitely Word | Apr 29, 2014 |
David Turner wrote: ... the document is almost certainly a converted PDF if Arial Unicode MS is used for accents. Hi, Thanks for the info. Not this time. The sender confirmed the document was created in word. | | | Sofia Costa Portugália Local time: 19:40 angol - portugál + ... Umlauted letters between tags in MemoQ | May 29, 2015 |
Thank you all. Your comments helped me a lot. | | | To report site rules violations or get help, contact a site moderator: You can also contact site staff by submitting a support request » Tag soup - accents Anycount & Translation Office 3000 | Translation Office 3000
Translation Office 3000 is an advanced accounting tool for freelance translators and small agencies. TO3000 easily and seamlessly integrates with the business life of professional freelance translators.
More info » |
| Protemos translation business management system | Create your account in minutes, and start working! 3-month trial for agencies, and free for freelancers!
The system lets you keep client/vendor database, with contacts and rates, manage projects and assign jobs to vendors, issue invoices, track payments, store and manage project files, generate business reports on turnover profit per client/manager etc.
More info » |
|
| | | | X Sign in to your ProZ.com account... | | | | | |