apertium-deshtml - This application is part of ( apertium
This tool is part of the apertium open-source machine translation toolbox:
[ -h ] [ -i ] [ -n ] [ <input file> [ <output
file> ] ]
is an HTML format processor. Data should be passed
through this processor before being piped to lt-proc. The program takes input
in the form of an HTML document and produces output suitable for processing
with lt-proc. HTML tags and other format information are enclosed in brackets
so that lt-proc treats them as whitespace between words.
- -h, --help
- Display this help. -i Makes the addition of trailing
sentence terminator (".") unconditional, often leading to
duplicates. -n Suppresses the addition of a trailing sentence
- You could write the following to show how the word
"gener" is analysed:
apertium-destxt(1), apertium-desrtf(1), lt-proc(1),
- echo "<b>gener</b>" |
apertium-deshtml | lt-proc ca-es.automorf.bin
Lots of...lurking in the dark and waiting for you!
Copyright (c) 2005, 2006 Universitat d'Alacant / Universidad de Alicante. This
is free software. You may redistribute copies of it under the terms of the GNU
General Public License <http://www.gnu.org/licenses/gpl.html>.