I've added initial versions of all the files needed for the AFF version of Catalan in MedSLT. Naming conventions are the usual ones, and I was able to build all the AFF Cat resources by running
make role_marked
in the Cat/scripts directory. There should now be config files for all 5 x 5 = 25 pairs of languages in {Ara, Cat, Eng, Fre, Jap}; this involved adding a few new pairs. So far I have only tested Interlingua to Catalan and Catalan to Interlingua. We currently get translations for about 75% of the sentences in IntCat, and about 20% in CatInt. Hopefully it will be easy to improve these figures.
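For anyone who wants to check that no pair is missing, here is a minimal sketch of how the 25 pairs can be enumerated. It assumes the count includes same-language pairs such as Cat to Cat (that is what 5 x 5 implies). The `pair_label` helper and the concatenated labels it produces are hypothetical, loosely modelled on the IntCat/CatInt style above, and are not the actual config file names.

```python
from itertools import product

# The five MedSLT languages listed in the post.
LANGUAGES = ["Ara", "Cat", "Eng", "Fre", "Jap"]


def language_pairs(languages):
    """Return every ordered (source, target) pair, same-language pairs included."""
    return list(product(languages, repeat=2))


def pair_label(source, target):
    """Hypothetical label for a pair; NOT the real config file naming scheme."""
    return source + target


pairs = language_pairs(LANGUAGES)
labels = [pair_label(source, target) for source, target in pairs]

if __name__ == "__main__":
    print(len(pairs), "pairs")
    for label in labels:
        print(label)
```

Comparing a list like this against the config files actually present in the repository is a quick way to spot a pair that was overlooked.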
Over to Pierrette and Bruna to debug the rules. Note that I have macrotised the Cat lexicon to make the AFF version work. It should be mostly OK, but there were a few cases (in particular, WH+ PPs) where I wasn't quite sure how to do the macrotisation, so people who actually know Catalan should review the entries.