Defining your own segmentation rules for Chinese source for CAT tools ?
Téma indítója: PatentTrans
PatentTrans
PatentTrans
Egyesült Államok
Local time: 17:15
kínai - angol
Oct 31, 2013

Anyone tried defining your own segmentation rules for CAT software, Chinese being the source? I'm using punctuation marks to break up paragraphs and it's not bad. For one of my documents (patent) it showed about 1/3 - 1/2 of the segments as being unique. Is it possible to optimize this further? Of course if the segments are too short then I'll run into readability issues. Chinese grammar is kind of chaotic and I'm having a tough time finding a reliable pattern.

 
Lawrence Lam
Lawrence Lam  Identity Verified
Kína
Local time: 06:15
angol - kínai
+ ...
be careful doing this Nov 2, 2013

You can define some new splitting rules, according to the 1/3 - 1/2 unique content in your patent document.

But changing the type of segmentation considerably changes the way a CAT tool works and, among other things, may also influence the alignment of translations, pre-translations, etc. You should avoid repeatedly changing the type of segmentation for a document format, because this will otherwise have a negative impact on the quality of the translation memory.


 


To report site rules violations or get help, contact a site moderator:


You can also contact site staff by submitting a support request »

Defining your own segmentation rules for Chinese source for CAT tools ?






CafeTran Espresso
You've never met a CAT tool this clever!

Translate faster & easier, using a sophisticated CAT tool built by a translator / developer. Accept jobs from clients who use Trados, MemoQ, Wordfast & major CAT tools. Download and start using CafeTran Espresso -- for free

Buy now! »
Protemos translation business management system
Create your account in minutes, and start working! 3-month trial for agencies, and free for freelancers!

The system lets you keep client/vendor database, with contacts and rates, manage projects and assign jobs to vendors, issue invoices, track payments, store and manage project files, generate business reports on turnover profit per client/manager etc.

More info »