harusame.pl - Multilingual Web page management tool
perl harusame.pl --lang LANGCODE < input.html > output.html
perl harusame.pl --help
The harusame.pl
script extracts a version of the HTML document
written in the specified natural language, from a source HTML document
that contains paragraphs in multiple natural languages.
The document management of a multilingual Web site where there are multiple versions of a (conceptually same) document is somewhat difficult in general. If the author of an HTML document wants to edit a part of the document, then he or she has to ensure not to forget updating translations at the same time, otherwise documents in different language versions also differ in their content versions.
Using the harusame.pl
, one can generate versions of an HTML
document in different language from one source HTML document that
contains paragraphs written in all of those languages, such that
authors no longer have to manage different content versions and
different language versions in separate files.
The source document must be provided to the script using the standard input. It must be encoded in UTF-8.
The script outputs the generated document encoded in UTF-8 to the standard output.
Following command-line options are available to this script:
--help
Show the help message and exit.
--lang LANGCODE
(REQUIRED)
The language of the version to generate. This option must be
specified. The value must be a value that is valid for HTML
lang=""
attribute.
Readme http://suika.fam.cx/www/harusame/readme. How to mark up the source HTML document is described in this document.
Wakaba <w@suika.fam.cx>.
Copyright 2008 Wakaba <w@suika.fam.cx>.
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself.