The Twenty-One Project University of Twente

Twente word alignment software


Download

Current development is taking place at the Computer Science Department of the University of Minho, Portugal, under the name Natura Alignment Tools (NATools). Many thanks to Alberto Simões and José João Almeida for making their implementations available. The latest version 0.5.5 is from May 2007: NATools-0.5.5.tar.gz

Go to the NATools home page or to NATools at SourceForge.

You can download the original version (1998) of the Twente word alignment software here: twente_word_align_09b.tar.gz


Contact

Djoerd Hiemstra
University of Twente
PO Box 217, 7500 AE Enschede
The Netherlands
E-mail: hiemstra@cs.utwente.nl


Publications

Djoerd Hiemstra, Franciska de Jong and Wessel Kraaij, ``A Domain Specific Lexicon Acquisition Tool for Cross-Language Information Retrieval'', In: Proceedings of the RIAO'97 Conference on Computer-Assisted Information Searching on Internet, pages 255-270, 1997 [abstract] [postscript]

Djoerd Hiemstra, ``Multilingual domain modeling in Twenty-One: automatic creation of a bi-directional translation lexicon from a parallel corpus'', In: Peter-Arno Coppen, Hans van Halteren and Lisanne Teunissen (eds.) Proceedings of the eighth CLIN meeting, pages 41-58, 1998 [abstract] [postscript]


Related publications

Dan Melamed, ``Empirical Methods for MT Lexicon Development'', in L. Gerber and D. Farwell (eds.), Machine Translation and the Information Soup, Springer-Verlag, 1998
ftp://ftp.cis.upenn.edu/pub/melamed/papers/amta98.ps.gz

Eric Gaussier, David Hull and Salah Ait-Mokhtar, ``Term Alignment in Use: Machine-Aided Human Translation'', Xerox Research Centre Technical Report, 1999
http://kb.rxrc.xerox.com/publis/mltt/gaussier-ptp-99.ps

Kyo Kageura, Keita Tsuji, and Akiko N. Aizawa, ``Automatic Thesaurus Generation through Multiple Filtering'', The 18th International Conference on Computational Linguistics (COLING-2000), 2000
http://acl.ldc.upenn.edu/C/C00/C00-1058.pdf

Dan Tufis, ``A cheap and fast way to build useful translation lexicons'', The 19th International Conference on Computational Linguistics (COLING-2002), 2002
http://acl.ldc.upenn.edu/coling2002/proceedings/data/area-03/co-005.pdf

Alberto Simões and José João Almeida, ``NATools -- A Statistical Word Aligner Workbench'', Sociedad Española para el Procesamiento del Lenguaje Natural, 2003
http://alfarrabio.di.uminho.pt/~albie/publications/sepln2003.pdf

Alberto Simões, ``Parallel corpora word alignment and applications'', Master's Thesis, Universidade do Minho, 2004
http://alfarrabio.di.uminho.pt/~albie/publications/msc.pdf


Last modified July 2007 by Djoerd Hiemstra