Unicode and ANSI troubles
Thread poster: elm0505
elm0505
Spain
Local time: 21:17
French to Spanish
Aug 24, 2011

Hello everyone
I am translating texts from Spanish into French and Portuguese. Some of the source texts contain special characters from other languages, such as Turkish and Hungarian. Following recommendations (and an alert message when saving the texts), I chose Unicode to keep the special characters. The problem comes up when I open the project in OmegaT: the source text is riddled with strange symbols, and it looks different, with the characters strangely spaced. This doesn't happen if I save the text in ANSI encoding, although some special characters and accents then disappear.

Does anyone know how to solve this?


 
Dragomir Kovacevic  Identity Verified
Italy
Local time: 21:17
Italian to Serbian
OmegaT + UTF-8 exclusively Aug 24, 2011

Use OmegaT with UTF-8 encoding exclusively; that guarantees that all characters display correctly.

You probably saved the files as UTF-16, which Windows simply labels "Unicode".
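Both symptoms from the original post can be reproduced in a few lines. This is only a sketch: the sample words are made up, and it assumes the Windows "ANSI" code page is Windows-1252 (the Western European default; other systems use a different code page).

```python
# Sample words with Turkish and Hungarian letters (illustrative only).
text = "Şişli, Győr"

# What Windows editors label "Unicode": UTF-16 little-endian with a byte order mark.
utf16_bytes = b"\xff\xfe" + text.encode("utf-16-le")
utf8_bytes = text.encode("utf-8")

# Symptom 1: UTF-16 bytes read as ANSI (Windows-1252 assumed here).
# Every plain letter is followed by a NUL byte, hence the odd spacing,
# and the byte order mark shows up as two stray symbols at the start.
misread = utf16_bytes.decode("cp1252", errors="replace")

# Symptom 2: saving as ANSI drops letters the code page cannot represent.
ansi_round_trip = text.encode("cp1252", errors="replace").decode("cp1252")

print(repr(misread))
print(ansi_round_trip)
print(utf8_bytes.decode("utf-8") == text)  # UTF-8 round-trips without loss
```

The ANSI round trip turns the Turkish and Hungarian letters into question marks, while the UTF-8 round trip returns the text unchanged.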



 
Didier Briel  Identity Verified
France
Local time: 21:17
English to French
What formats are your source texts? Aug 24, 2011

elm0505 wrote:
I am translating texts from Spanish into French and Portuguese. Some of the source texts contain special characters from other languages, such as Turkish and Hungarian. Following recommendations (and an alert message when saving the texts), I chose Unicode to keep the special characters. The problem comes up when I open the project in OmegaT: the source text is riddled with strange symbols, and it looks different, with the characters strangely spaced. This doesn't happen if I save the text in ANSI encoding, although some special characters and accents then disappear.

I assume you use text files.

By default, OmegaT reads .txt files using the system encoding, which on Windows means ANSI (the legacy code page, e.g. Windows-1252 on Western European systems).

If your files are UTF-16 (what Windows calls "Unicode"), you must configure OmegaT so that the file extension you use (e.g., .utf16) is mapped to that encoding. This is done in Options > File Filters > Text Files > Edit..., and is documented (including the concept of encoding) in the chapter "Working with plain text" of the documentation.
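An alternative to remapping extensions is to convert the files to UTF-8 before adding them to the project. The following is a minimal sketch, not part of OmegaT: the function names are hypothetical, and the Windows-1252 fallback for files without a byte order mark is an assumption that should be adjusted to the actual code page.

```python
import codecs
from pathlib import Path


def detect_bom_encoding(data: bytes):
    """Return a codec name if the data starts with a known byte order mark."""
    if data.startswith(codecs.BOM_UTF8):
        return "utf-8-sig"
    if data.startswith((codecs.BOM_UTF16_LE, codecs.BOM_UTF16_BE)):
        # The "utf-16" codec reads the BOM to pick the byte order and strips it.
        return "utf-16"
    return None


def convert_to_utf8(src: Path, dst: Path, fallback: str = "cp1252") -> str:
    """Re-encode src as UTF-8 (no BOM) into dst; return the encoding that was read."""
    data = src.read_bytes()
    # Assumption: a file without a BOM is in the legacy ANSI code page.
    encoding = detect_bom_encoding(data) or fallback
    text = data.decode(encoding)
    # Write bytes directly so line endings are left exactly as they were.
    dst.write_bytes(text.encode("utf-8"))
    return encoding
```

For example, `convert_to_utf8(Path("source.txt"), Path("source.utf8"))` would write a UTF-8 copy next to the original; both file names here are placeholders.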

Didier


 
elm0505
Spain
Local time: 21:17
French to Spanish
TOPIC STARTER
Solved Aug 29, 2011

You were all right. I resorted to naming the files manually with the .utf8 extension before saving, and it works now. Thank you all. Once again Windows has been messing around!

 

