harusame
$ harusame --lang LANGCODE < input.html > output.html
$ harusame --help
The harusame
script extracts a version of the HTML document written in the specified natural language, from a source HTML document that contains paragraphs in multiple natural languages.
The document management of a multilingual Web site where there are multiple versions of a (conceptually same) document is somewhat difficult in general. If the author of an HTML document wants to edit a part of the document, then he or she has to ensure not to forget updating translations at the same time, otherwise documents in different language versions also differ in their content versions.
Using the harusame
, one can generate versions of an HTML document in different language from one source HTML document that contains paragraphs written in all of those languages, such that authors no longer have to manage different content versions and different language versions in separate files.
The source document must be provided to the script using the standard input. It must be encoded in UTF-8.
The script outputs the generated document encoded in UTF-8 to the standard output.
Following command-line options are available to this script:
--help
Show the help message and exit.
--lang LANGCODE
(REQUIRED)The language of the version to generate. This option must be specified. The value must be a value that is valid for HTML lang=""
attribute.
Readme https://suika.suikawiki.org/www/harusame/readme. How to mark up the source HTML document is described in this document.
Wakaba <wakaba@suikawiki.org>.
Copyright 2008, 2010 Wakaba <wakaba@suikawiki.org>.
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself.