Replace multiple CP-1252 characters with CP-1251 characters in one go
Thread poster: Stepan Konev
Stepan Konev  Identity Verified
Russian Federation
Local time: 06:57
English to Russian
Nov 8, 2021

Hi folks,
I have a very old TM from Trados 2007. I converted it into an sdltm file. However, many segments somehow ended up wrongly encoded.
Also, I have a "mapping" table like this:
———————
1252
à|á|â|ã|ä|å|¸|æ|ç|è|é|ê|ë|ì|í|î|ï|ð|ñ|ò|ó|ô|õ|ö|÷|ø|ù|ú|û|ü|ý|þ|ÿ|À|Á|Â|Ã|Ä|Å|¨|Æ|Ç|È|É|Ê|Ë|Ì|Í|Î|Ï|Ð|Ñ|Ò|Ó|Ô|Õ|Ö|×|Ø|Ù|Ú|Û|Ü|Ý|Þ|ß
———————
1251
а|б|в|г|д|е|ё|ж|з|и|й|к|л|м|н|о|п|р|с|т|у|ф|х|ц|ч|ш|щ|ъ|ы|ь|э|ю|я|А|Б|В|Г|Д|Е|Ё|Ж|З|И|Й|К|Л|М|Н|О|П|Р|С|Т|У|Ф|Х|Ц|Ч|Ш|Щ|Ъ|Ы|Ь|Э|Ю|Я
———————
Is there any way to quickly replace the CP-1252 chars with their CP-1251 counterparts, whether by AHK script, regex, SDL AppStore app, or any other tool/method?
Thank you.
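
For readers who would rather script this than use a CAT tool, here is a minimal Python sketch of the character-for-character replacement asked about above. It assumes the affected text is already available as an ordinary Unicode string; the names `CP1252_ROW`, `CP1251_ROW` and `fix_segment` are hypothetical, and the two rows are copied from the mapping table in this post.

```python
# Rows copied verbatim from the mapping table in the post.
CP1252_ROW = "à|á|â|ã|ä|å|¸|æ|ç|è|é|ê|ë|ì|í|î|ï|ð|ñ|ò|ó|ô|õ|ö|÷|ø|ù|ú|û|ü|ý|þ|ÿ|À|Á|Â|Ã|Ä|Å|¨|Æ|Ç|È|É|Ê|Ë|Ì|Í|Î|Ï|Ð|Ñ|Ò|Ó|Ô|Õ|Ö|×|Ø|Ù|Ú|Û|Ü|Ý|Þ|ß"
CP1251_ROW = "а|б|в|г|д|е|ё|ж|з|и|й|к|л|м|н|о|п|р|с|т|у|ф|х|ц|ч|ш|щ|ъ|ы|ь|э|ю|я|А|Б|В|Г|Д|Е|Ё|Ж|З|И|Й|К|Л|М|Н|О|П|Р|С|Т|У|Ф|Х|Ц|Ч|Ш|Щ|Ъ|Ы|Ь|Э|Ю|Я"

SRC = CP1252_ROW.split("|")
DST = CP1251_ROW.split("|")

# One lookup table for all pairs, so every character is replaced in a single pass.
TABLE = str.maketrans(dict(zip(SRC, DST)))


def fix_segment(text: str) -> str:
    """Replace each CP-1252 look-alike with its Cyrillic counterpart."""
    return text.translate(TABLE)
```

Calling `fix_segment("Ïðèâåò")` returns `"Привет"`. Because all pairs are applied in one pass, no replacement can interfere with another, which is the risk when running 66 separate search-and-replace operations.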


 
Hans Lenting
Netherlands
Member (2006)
German to Dutch
CafeTran Nov 8, 2021

Open the TMX in CafeTran, use the multi replace feature.

 
Stepan Konev  Identity Verified
Russian Federation
Local time: 06:57
English to Russian
TOPIC STARTER
Can't find that Nov 8, 2021

German Dutch Engineering Translation wrote:
Open the TMX in CafeTran, use the multi replace feature.
Thank you for the tip. However, I failed to find it in the UI or via Google. Could you please give further instructions on where I should look for it? Tab/button/section/command?


 
Hans Lenting
Netherlands
Member (2006)
German to Dutch
Well-hidden Nov 8, 2021

Screen Shot 2021-11-08 at 17.35.53

Screen Shot 2021-11-08 at 17.44.42


à=а
á=б
â=в
ã=г
ä=д
å=е
¸=ё
æ=ж
ç=з
è=и
é=й
ê=к
ë=л
ì=м
í=н
î=о
ï=п
ð=р
ñ=с
ò=т
ó=у
ô=ф
õ=х
ö=ц
÷=ч
ø=ш
ù=щ
ú=ъ
û=ы
ü=ь
ý=э
þ=ю
ÿ=я
À=А
Á=Б
Â=В
Ã=Г
Ä=Д
Å=Е
¨=Ё
Æ=Ж
Ç=З
È=И
É=Й
Ê=К
Ë=Л
Ì=М
Í=Н
Î=О
Ï=П
Ð=Р
Ñ=С
Ò=Т
Ó=У
Ô=Ф
Õ=Х
Ö=Ц
×=Ч
Ø=Ш
Ù=Щ
Ú=Ъ
Û=Ы
Ü=Ь
Ý=Э
Þ=Ю
ß=Я
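
The pairs in this list line up byte for byte: each left-hand character has the same byte value in Windows-1252 that the right-hand character has in Windows-1251 (0xC0 to 0xFF, plus 0xA8 and 0xB8). Under that observation, the whole list can be replaced by re-encoding. The following Python sketch is one way to do it, not a feature of any tool mentioned in this thread; the function name `recode_1252_to_1251` is hypothetical.

```python
def recode_1252_to_1251(text: str) -> str:
    """Reinterpret text that was wrongly decoded as CP-1252 as CP-1251."""
    try:
        # Recover the original bytes, then decode them with the intended code page.
        return text.encode("cp1252").decode("cp1251")
    except (UnicodeEncodeError, UnicodeDecodeError):
        # The segment contains characters outside CP-1252 (for example,
        # Cyrillic that is already correct) or a byte that CP-1251 does
        # not define, so leave it untouched.
        return text
```

Unlike the explicit list, this converts every non-ASCII character in the segment, so it should only be applied to segments known to be affected: a legitimate "é" in a French or German segment would become "й".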


 
Stepan Konev  Identity Verified
Russian Federation
Local time: 06:57
English to Russian
TOPIC STARTER
Thank you Nov 8, 2021

Can't open it so far. This is what I do:
1. Drag and drop TMX into CafeTran
2. Select 'Edit translation memory' (?)
3. Go to QA > Word lists > Find and replace in target segments
4. Browse to... err... what? TMX again? Ok, browse the same tmx as in step 1.
After step 4, CT just freezes (with 'OpenJDK Platform binary' process running).
My TMX is 188396 KB. Is that OK?


 
Hans Lenting
Netherlands
Member (2006)
German to Dutch
@Stepan Nov 8, 2021

Stepan Konev wrote:


4. Browse to... err... what?


Save the list with the equal signs as a UTF-8 text file.

à=а
á=б
â=в
ã=г
ä=д
å=е
¸=ё
æ=ж
ç=з


Use this as the replacement list.
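
The same `x=y` list can also be applied outside CafeTran. The sketch below is a rough Python equivalent, not CafeTran's implementation: it reads the UTF-8 list and streams a large file line by line so that a file of this size need not be loaded into memory at once. The function names, the file paths and the assumption that the TMX is UTF-8 are all hypothetical; Trados exports are sometimes UTF-16, in which case pass `encoding="utf-16"`.

```python
from typing import Dict, Iterable


def load_replacements(lines: Iterable[str]) -> Dict[int, str]:
    """Parse 'x=y' lines into a table usable with str.translate."""
    pairs = {}
    for line in lines:
        # Drop line endings and a possible byte-order mark.
        line = line.rstrip("\r\n").lstrip("\ufeff")
        src, sep, dst = line.partition("=")
        # Keep only well-formed single-character entries.
        if sep and len(src) == 1 and dst:
            pairs[src] = dst
    return str.maketrans(pairs)


def convert_file(src_path: str, dst_path: str, list_path: str,
                 encoding: str = "utf-8") -> None:
    """Write a copy of src_path with every listed character replaced."""
    with open(list_path, encoding="utf-8") as f:
        table = load_replacements(f)
    # newline="" keeps the original line endings unchanged.
    with open(src_path, encoding=encoding, newline="") as src, \
         open(dst_path, "w", encoding=encoding, newline="") as dst:
        for line in src:
            dst.write(line.translate(table))
```

Note that this replaces the characters everywhere in the file, source segments included, whereas the CafeTran option used here is limited to target segments. Writing to a new file rather than overwriting the original keeps the untouched TMX as a backup.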


 
Stepan Konev  Identity Verified
Russian Federation
Local time: 06:57
English to Russian
TOPIC STARTER
Great! Nov 8, 2021

German Dutch Engineering Translation wrote:
Save the list with the equal signs as a UTF-8 text file.
Wow! That worked. Thank you!


 
Stepan Konev  Identity Verified
Russian Federation
Local time: 06:57
English to Russian
TOPIC STARTER
Update Nov 10, 2021

Found another solution: the Batch Edit feature. Similar to CafeTran, you can add a list of replacements and execute them all at once.
2021-11-10_211518


 

