apertium-tagger-training-tools

This software is GPL. It is provided as is, WITHOUT any guarantee, in
the hope that it will be useful.

(c) 2004-2006 Felipe Sánchez Martínez
(c) 2006 Universitat d'Alacant
---------------------------------------------------------------------

This package provides a set of tools to train and test the
hidden-Markov-model-based part-of-speech tagger module of the apertium
machine translation engine (http://www.apertium.org).

The main program provided with this package, apertium-tagger-tl-trainer,
uses information from the target language to train the part-of-speech
tagger used in apertium in an unsupervised way, i.e. without human
intervention. This method has already been described (see the papers
cited below) and is known to give better results than the standard
Baum-Welch algorithm used to train hidden-Markov-model-based
part-of-speech taggers when no tagged text is available.

Comments on this software are VERY welcome. Do not hesitate to write
to me with comments or feedback, or just to tell me that you found this
software useful.

If you use this software for research, or for any other particular
purpose, I would be very grateful if you could cite one or both of the
following research papers describing this software:

* Felipe Sánchez-Martínez, Juan Antonio Pérez-Ortiz, Mikel
  L. Forcada. Exploring the use of target-language information to
  train the part-of-speech tagger of machine translation systems. In
  Lecture Notes in Computer Science 3230 (Advances in Natural Language
  Processing, Proceedings of EsTAL - España for Natural Language
  Processing), pp. 137-148, October 20-22, 2004, Alicante, Spain.

  @incollection{sanchezmartinez04,
    author = {S{\'{a}}nchez-Mart{\'{\i}}nez, Felipe
              and P{\'{e}}rez-Ortiz, Juan Antonio
              and Forcada, Mikel L.},
    title = {Exploring the use of target-language information to train the
             part-of-speech tagger of machine translation systems},
    booktitle = {Advances in Natural Language Processing, Proceedings of
                 4th International Conference {EsTAL}},
    publisher = {Springer-Verlag},
    series = {Lecture Notes in Computer Science},
    location = {Alicante, Spain},
    copyright = {Springer-Verlag},
    isbn = {3-540-23498-5},
  }

* Felipe Sánchez-Martínez, Juan Antonio Pérez-Ortiz, Mikel L. Forcada.
  Speeding up target-language driven part-of-speech tagger training for
  machine translation. In Lecture Notes in Computer Science 4293 (Advances in
  Artificial Intelligence, Proceedings of MICAI 2006, 5th Mexican
  International Conference on Artificial Intelligence), pp. 844-854,
  November 13-17, 2006, Apizaco, Mexico.

  @incollection{sanchezmartinez06,
    author = {S{\'{a}}nchez-Mart{\'{\i}}nez, Felipe
              and P{\'{e}}rez-Ortiz, Juan Antonio
              and Forcada, Mikel L.},
    title = {Speeding up target-language driven part-of-speech tagger
             training for machine translation},
    booktitle = {{MICAI} 2006: {A}dvances in {A}rtificial {I}ntelligence,
                 Proceedings of the 5th {M}exican {I}nternational {C}onference
                 on {A}rtificial {I}ntelligence},
    publisher = {Springer-Verlag},
    series = {Lecture Notes in Computer Science},
    location = {Apizaco, Mexico},
    copyright = {Springer-Verlag},
    isbn = {3-540-37334-9},
  }

---------------------------------------------------------------------

Felipe Sánchez Martínez
Departament de Llenguatges i Sistemes Informàtics
E-03071 Alacant (Spain)
http://www.dlsi.ua.es/~fsanchez

---------------------------------------------------------------------
To install apertium-tagger-training-tools you need to have installed
the following packages:

  libxml2, at least version 2.6.17
  lttoolbox, at least version 3.0.0
  apertium, at least version 3.0.0

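The minimum versions listed above can be checked from the command
line before building. The following is only a minimal sketch and is
not part of this package. It assumes that pkg-config is installed and
that sort(1) supports the -V (version sort) option, as GNU coreutils
does. The helper names version_ge and check_min are hypothetical, and
the pkg-config module names lttoolbox-3.0 and apertium-3.0 are
assumptions that may not match your installation (libxml-2.0 is the
module name that libxml2 installs).

```shell
#!/bin/sh
# Sketch only: report whether the prerequisites meet the minimum versions.

# version_ge A B: succeed when dotted version A is at least B.
# Assumes a sort(1) with -V (version sort), e.g. GNU coreutils.
version_ge() {
    [ "$(printf '%s\n%s\n' "$1" "$2" | sort -V | head -n 1)" = "$2" ]
}

# check_min MODULE MINIMUM: ask pkg-config for MODULE's version and compare.
check_min() {
    found=$(pkg-config --modversion "$1" 2>/dev/null) || {
        echo "$1: not found by pkg-config"
        return 1
    }
    if version_ge "$found" "$2"; then
        echo "$1: $found (ok, need >= $2)"
    else
        echo "$1: $found (too old, need >= $2)"
        return 1
    fi
}

problems=0
check_min libxml-2.0 2.6.17    || problems=1
# The two module names below are assumptions; adjust them to whatever
# .pc files your lttoolbox and apertium installations provide.
check_min lttoolbox-3.0 3.0.0  || problems=1
check_min apertium-3.0 3.0.0   || problems=1

if [ "$problems" -eq 0 ]; then
    echo "All prerequisites look fine."
else
    echo "Some prerequisites are missing or too old."
fi
```

pkg-config can also do the comparison itself through its
--atleast-version option, for example
"pkg-config --atleast-version=2.6.17 libxml-2.0"; the helper above is
only there so that the version actually found gets printed.
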
Once you have all the required software installed, install
apertium-tagger-training-tools in the usual way:

  $ ./configure && make
---------------------------------------------------------------------
Software funded by the Spanish Ministry of Science and Technology
through project TIC2003-08681-C02-01, and by the Spanish Ministry of
Education and Science and the European Social Fund through research
---------------------------------------------------------------------
Please visit http://www.dlsi.ua.es/~fsanchez/publications.html
---------------------------------------------------------------------
Have a look at the 'example' directory