This site uses cookies.
Some of these cookies are essential to the operation of the site,
while others help to improve your experience by providing insights into how the site is being used.
For more information, please see the ProZ.com privacy policy.
The contents of this post will automatically be included in the ticket generated. Please add any additional comments or explanation (optional)
Petr Oliver Czech Republic Local time: 07:20 English to Czech + ...
Notes on using ExtPhr32 and AntConc as terminology extraction tools
Nov 25, 2011
I tried using ExtPhr32 (recommended earlier ni this discussion) to extract terminology. It worked well, I could easily reduce the outcoming list of terms by defining my "stoplist" and "break set" TXT files. However, this tool has one major drawback for me - it uses the ISO 8859-1 character set where some important Czech characters are missing (ž, Ž, ť, Ť) - the first two exist also in Finnish, if I am not mistaken. So if I want to use the term list, I need to add these charactes manually. Tr... See more
I tried using ExtPhr32 (recommended earlier ni this discussion) to extract terminology. It worked well, I could easily reduce the outcoming list of terms by defining my "stoplist" and "break set" TXT files. However, this tool has one major drawback for me - it uses the ISO 8859-1 character set where some important Czech characters are missing (ž, Ž, ť, Ť) - the first two exist also in Finnish, if I am not mistaken. So if I want to use the term list, I need to add these charactes manually. Trying to specify them in the "additional letter set" didn't help either. Then I found there is a newer version ("ExtPhr33"), but the installation wizard froze twice in a row, so I gave up.
As to AntConc, it is slightly more complicated to learn to use and it doesn't display correctly if you are using different DPI (mine is 144 and I am still willing to keep it). ▲ Collapse
Subject:
Comment:
The contents of this post will automatically be included in the ticket generated. Please add any additional comments or explanation (optional)
Dan Slavoiu Local time: 08:20 English to Romanian + ...
Craven's - it doesn't work on Win7
Jul 25, 2012
David Turner wrote:
Eleni Makantani wrote:
Are there any free solutions on the Internet?
... is an excellent little freeware utility which "Extracts every word and every phrase up to a certain length that occurs at least a minimum number of times in a source text and that does not start or end with a stopword"
A English stopword list file is provided but you can find stopword lists for other languages on the Web.
Exclusive discount for ProZ.com users!
Save over 13% when purchasing Wordfast Pro through ProZ.com. Wordfast is the world's #1 provider of platform-independent Translation Memory software. Consistently ranked the most user-friendly and highest value
Create your account in minutes, and start working! 3-month trial for agencies, and free for freelancers!
The system lets you keep client/vendor database, with contacts and rates, manage projects and assign jobs to vendors, issue invoices, track payments, store and manage project files, generate business reports on turnover profit per client/manager etc.