/[suikacvs]/markup/html/whatpm/readme.en.html
Suika

Diff of /markup/html/whatpm/readme.en.html

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.9 by wakaba, Sun Sep 9 05:35:49 2007 UTC revision 1.22 by wakaba, Sat Jul 19 14:47:23 2008 UTC
# Line 13  Technologies (beta)</title> Line 13  Technologies (beta)</title>
13  <div class="section" id="introduction">  <div class="section" id="introduction">
14  <h2>Introduction</h2>  <h2>Introduction</h2>
15    
16  <p><dfn>Whatpm</dfn> is a <em>work-in-progress</em> set of Perl modules for  <p><dfn>Whatpm</dfn> is a <em>work-in-progress</em> set of <m>P</m>erl
17  Web hypertext application technologies.  It is part  <m>m</m>odules for <m>W</m>eb <m>h</m>ypertext <m>a</m>pplication
18    <m>t</m>echnologies.  It is part
19  of the <a href="http://suika.fam.cx/www/2006/manakai/" rel=up>manakai</a>  of the <a href="http://suika.fam.cx/www/2006/manakai/" rel=up>manakai</a>
20  project.</p>  project.</p>
21    
22  <dl>  <dl>
23  <dt>Modules</dt>  <dt>Modules</dt>
24  <dd><dl>  <dd><dl>
25    <dt><a href="Whatpm/CacheManifest.html"><code>Whatpm::CacheManifest</code></a></dt>
26      <dd>An
27      <a href="http://www.whatwg.org/specs/web-apps/current-work/#manifests">HTML5
28      cache manifest</a> parser.</dd>
29    <dt id=whatpm-charset-universalchardet><a href="Whatpm/Charset/UniversalCharDet.html"><code>Whatpm::Charset::UniversalCharDet</code></a></dt>
30      <dd>A Perl interface to universalchardet character encoding detection
31      library.</dd>
32  <dt><a href="Whatpm/ContentChecker.html"><code>Whatpm::ContentChecker</code></a></dt>  <dt><a href="Whatpm/ContentChecker.html"><code>Whatpm::ContentChecker</code></a></dt>
33    <dd>A DOM5 HTML (in-memory representation of a document) conformance    <dd>A DOM5 HTML (in-memory representation of a document) conformance
34    checker with a partial support for Atom 1.0.</dd>    checker with a partial support for Atom 1.0.  (See also
35      <a href="#demo-html-parser">demo</a>.)</dd>
36  <dt><a href="Whatpm/ContentType.html"><code>Whatpm::ContentType</code></a></dt>  <dt><a href="Whatpm/ContentType.html"><code>Whatpm::ContentType</code></a></dt>
37    <dd>An implementation of HTML5 Content Type sniffing algorithm.</dd>    <dd>An implementation of HTML5 Content Type sniffing algorithm.</dd>
38    <dt><a href="Whatpm/CSS/SelectorsParser.html"><code>Whatpm::CSS::SelectorsParser</code></a></dt>
39      <dd>A <a href="http://www.w3.org/TR/css3-selectors/#grouping">group of
40      selectors</a> parser.  (See also <a href="#demo-css-parser">demo</a>.)</dd>
41    <dt><a href="Whatpm/CSS/SelectorsSerializer.html"><code>Whatpm::CSS::SelectorsSerializer</code></a></dt>
42      <dd>A <a href="http://www.w3.org/TR/css3-selectors/#grouping">group of
43      selectors</a> serializer.  (See also <a href="#spec-ssft">specification</a>
44      and <a href="#demo-css-parser">demo</a>.)</dd>
45  <dt><a href="Whatpm/CSS/Tokenizer.html"><code>Whatpm::CSS::Tokenizer</code></a></dt>  <dt><a href="Whatpm/CSS/Tokenizer.html"><code>Whatpm::CSS::Tokenizer</code></a></dt>
46    <dd>A CSS tokenizer.</dd>    <dd>A CSS tokenizer.  (See also <a href="#demo-css-parser">demo</a>.)</dd>
47  <dt><a href="Whatpm/HTML.html"><code>Whatpm::HTML</code></a></dt>  <dt id=module-whatpm-html><a href="Whatpm/HTML.html"><code>Whatpm::HTML</code></a></dt>
48    <dd>An implementation of HTML5 parsing algorithm, fragment    <dd>An implementation of HTML5 document and fragment
49    parsing, and fragment serialization algorithms.  It can be used    parsing algorithms.  It can be used
50    to convert a string into DOM, or <i lang="">vice versa</i>.</dd>    to convert an arbitrary string into a
51      <abbr title="Document Object Model">DOM</abbr>.  (See also
52      <a href="#demo-html-parser">demo</a>.)</dd>
53    <dt id=module-whatpm-html-serializer><a href="Whatpm/HTML/Serializer.html"><code>Whatpm::HTML::Serializer</code></a></dt>
54      <dd>An implementation of HTML5 fragment serialization algorithm.
55      (See also <a href="#demo-html-parser">demo</a>.)</dd>
56  <dt><a href="Whatpm/HTMLTable.html"><code>Whatpm::HTMLTable</code></a></dt>  <dt><a href="Whatpm/HTMLTable.html"><code>Whatpm::HTMLTable</code></a></dt>
57    <dd>An implementation of the HTML5 table algorithm.  It can be    <dd>An implementation of the HTML5 table algorithm.  It can be
58    used to extract a table structure from a DOM <code>table</code>    used to extract a table structure from a DOM <code>table</code>
59    element node.</dd>    element node.  (See also <a href="#demo-html-table">demo</a>.)</dd>
60  <dt><a href="Whatpm/IMTChecker.html"><code>Whatpm::IMTChecker</code></a></dt>  <dt><a href="Whatpm/IMTChecker.html"><code>Whatpm::IMTChecker</code></a></dt>
61    <dd>An Internet Media Type (<abbr>aka</abbr> MIME type) label    <dd>An Internet Media Type (<abbr>aka</abbr> MIME type) label
62    conformance checker.</dd>    conformance checker.</dd>
63  <dt><a href="Whatpm/URIChecker.html"><code>Whatpm::URIChecker</code></a></dt>  <dt><a href="Whatpm/URIChecker.html"><code>Whatpm::URIChecker</code></a></dt>
64    <dd>An IRI reference conformance checker.</dd>    <dd>An IRI reference conformance checker.</dd>
65    
66    <dt><a href="Whatpm/WebIDL.html"><code>Whatpm::WebIDL</code></a></dt>
67      <dd>A WebIDL fragment parser.  It parses an IDL fragment, whether conforming
68      or not, and constructs a DOM-like object model for further processing.
69      Non-conforming (or broken) IDL fragment-like string will be parsed using
70      CSS-like error-tolerant parsing rules, e.g. ignoring anything until next
71      <code>;</code> character.
72    
73  <dt><a href="Whatpm/XMLSerializer.html"><code>Whatpm::XMLSerializer</code></a></dt>  <dt><a href="Whatpm/XMLSerializer.html"><code>Whatpm::XMLSerializer</code></a></dt>
74    <dd>A simple XML serializer.</dd>    <dd>A simple XML serializer.</dd>
75    </dl>    </dl>
# Line 50  project.</p> Line 79  project.</p>
79    
80    <p>Note also that some modules have no documentation for now.</p>    <p>Note also that some modules have no documentation for now.</p>
81    </dd>    </dd>
82  <dt>Documentation</dt>  <dt id=spec>Specifications</dt>
83      <dd><dl>
84        <dt id=spec-ssft><a href="http://suika.fam.cx/www/markup/selectors/ssft/ssft"><abbr title="Selectors Serialization Format for Testing">SSFT</abbr>
85        Specification</a></dt>
86          <dd>The specification for the serialization format used for
87          testing Selectors-related modules.</dd>
88        <dt id=spec-manakai-selectors"><a href="http://suika.fam.cx/gate/2005/sw/manakai/Selectors%20Extensions">manakai's
89        Selectors Extensions</a></dt>
90          <dd>The specification for <code>:-manakai-<var>*</var></code>
91          pseudo-classes implemented by Selectors-related modules.</dd>
92      </dl></dd>
93    <dt>Documentations</dt>
94    <dd><dl>    <dd><dl>
95  <dt><a href="http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types">List of error types</a></dt>  <dt><a href="http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types">List of error types</a></dt>
96    <dd>Description of errors to be notified to callback functions by Whatpm    <dd>Description of errors to be notified to callback functions by Whatpm
97    modules.</dd>    modules.</dd>
98    </dd></dl></dd>      <dt><a href="Whatpm/CSS/selectors-object">Selectors object</a></dt>
99          <dd>Description of data structure for Selectors, as implemented by
100          <a href="Whatpm/CSS/SelectorsParser.html"><code>Whatpm::CSS::SelectorsParser</code></a>
101          (as output), and
102          <a href="Whatpm/CSS/SelectorsSerializer.html"><code>Whatpm::CSS::SelectorsSerializer</code></a>
103          (as input)<!--, and
104          <a href="http://suika.fam.cx/www/manakai-core/lib/Message/DOM/SelectorsAPI.html"><code>Message::DOM::SelectorsAPI</code></a>-->.</dd>
105        <dt id=doc-user-data-names><a href="http://suika.fam.cx/gate/2005/sw/manakai/Predefined%20User%20Data%20Names">List of predefined user data names</a></dt>
106          <dd>List of user data names defined by Whatpm modules.</dd>
107        </dl>
108      </dd>
109  </dl>  </dl>
110  </div>  </div>
111    
# Line 63  project.</p> Line 113  project.</p>
113  <h2>Demo</h2>  <h2>Demo</h2>
114    
115  <ul>  <ul>
116  <li><a href="http://suika.fam.cx/gate/2007/html/parser-interface">HTML5 parser  <li id=demo-html-parser-nanodom><a href="http://suika.fam.cx/gate/2007/html/parser-interface">HTML5 parser
117  and checker demo</a></li>  and checker demo</a>
118  <li><a href="http://suika.fam.cx/gate/2007/html/parser-manakai-interface">HTML5  (<a href="http://suika.fam.cx/gate/cvs/*checkout*/webroot/gate/2007/html/parser.cgi">source</a>,
119  parser and checker demo, with manakai's DOM implementation</a></li>  with <a href="Whatpm/NanoDOM.html">a lightweight non-conforming
120  <li><a href="http://suika.fam.cx/gate/2007/html/table-interface">HTML5 table  DOM implementation</a>)</li>
121  structure visualization demo</a></li>  <li id=demo-html-parser-manakai><a href="http://suika.fam.cx/gate/2007/html/parser-manakai-interface">HTML5
122  <li><a href="http://suika.fam.cx/gate/2007/css/parser-interface">CSS tokenizer  parser and checker demo, with manakai's DOM implementation</a>
123  demo</a></li>  (<a href="http://suika.fam.cx/gate/cvs/*checkout*/webroot/gate/2007/html/parser-manakai.cgi">source</a>)</li>
124    <li id=demo-html-table><a href="http://suika.fam.cx/gate/2007/html/table-interface">HTML5 table
125    structure visualization demo</a>
126    (<a href="http://suika.fam.cx/gate/cvs/*checkout*/webroot/gate/2007/html/table.cgi">source</a>)</li>
127    <li id=demo-css-parser><a href="http://suika.fam.cx/gate/2007/css/parser-interface">CSS tokenizer
128    demo</a>
129    (<a href="http://suika.fam.cx/gate/cvs/*checkout*/webroot/gate/2007/css/parser.cgi">source</a>)</li>
130  </ul>  </ul>
131  </div>  </div>
132    
# Line 78  demo</a></li> Line 134  demo</a></li>
134  <h2>Dependency</h2>  <h2>Dependency</h2>
135    
136  <dl>  <dl>
137  <dt>Perl 5.8 or later</dt>  <dt id=dependency-perl>Perl 5.8 or later</dt>
138    <dd>It is recommended to use newer release of Perl 5.8 or later.</dd>    <dd>It is recommended to use newer stable release of Perl 5.8 (or
139      later).</dd>
140      <dd id=dependency-encode>Some modules require <code>Encode</code>
141      modules, which are part of standard Perl distribution.</dd>
142    <dt id=dependency-manakai-core>Modules from
143    <a href="http://suika.fam.cx/www/2006/manakai/">manakai-core</a></dt>
144      <dd>
145        <dl>
146    <dt id=dependency-error><a href="http://search.cpan.org/author/SHLOMIF/Error-0.17009/lib/Error.pm"><code>Error</code></a></dt>
147      <dd>Module <code>Whatpm::HTML</code> requires <code>Error</code>,
148      which is bundled in
149      <a href="http://suika.fam.cx/www/2006/manakai/">manakai-core</a>.</dd>
150  <dt><code>Message::IMT::InternetMediaType</code></dt>  <dt><code>Message::IMT::InternetMediaType</code></dt>
151    <dd><code>Whatpm::IMTChecker</code> depends on    <dd>Module <code>Whatpm::IMTChecker</code> depends on
152    <code>Message::IMT::InternetMediaType</code>, which is part of    <code>Message::IMT::InternetMediaType</code>, which is part of
153    <a href="http://suika.fam.cx/www/2006/manakai/">manakai</a>.</dd>    <a href="http://suika.fam.cx/www/2006/manakai/">manakai-core</a>.</dd>
154  <dt><code>Message::URI::URIReference</code></dt>  <dt><code>Message::URI::URIReference</code></dt>
155    <dd><code>Whatpm::URIChecker</code> depends on    <dd>Modules <code>Whatpm::URIChecker</code> and
156    <code>Message::URI::URIReference</code>, which is part of    <code>Whatpm::CacheManifest</code> depend on
157    <a href="http://suika.fam.cx/www/2006/manakai/">manakai</a>.</dd>    <a href="http://suika.fam.cx/www/manakai-core/lib/Message/URI/URIReference.html"><code>Message::URI::URIReference</code></a>,
158      which is part of
159      <a href="http://suika.fam.cx/www/2006/manakai/">manakai-core</a>.</dd>
160      <dt><code>Message::Charset::Info</code></dt>
161        <dd>Module <code>Whatpm::ContentChecker</code> depends on
162        <a href="http://suika.fam.cx/www/manakai-core/lib/Message/Charset/Info.html"><code>Message::Charset::Info</code></a>,
163        which is part of
164        <a href="http://suika.fam.cx/www/2006/manakai/">manakai-core</a>.</dd>
165    <dt><code>Message::DOM::DOMImplementation</code> and related modules</dt>
166      <dd><em>Testing</em> for module <code>Whatpm::ContentChecker</code>
167      depends on <code>Message::DOM::DOMImplementation</code> and related modules
168      in <a href="http://suika.fam.cx/www/2006/manakai/">manakai-core</a>.
169      They are not required in practice.</dd>
170        </dl>
171      </dd>
172  <dt><a href="http://suika.fam.cx/www/manakai-charlib/readme">manakai  <dt><a href="http://suika.fam.cx/www/manakai-charlib/readme">manakai
173  charlib</a></dt>  charlib</a></dt>
174    <dd><code>Whatpm::Charset::DeocdeHandle</code> depends on    <dd>Module <code>Whatpm::Charset::DecodeHandle</code> depends on
175    modules in <a href="http://suika.fam.cx/www/manakai-charlib/readme">manakai    modules in <a href="http://suika.fam.cx/www/manakai-charlib/readme">manakai
176    charlib</a> for <em>decoding Japanese character encodings</em>.    charlib</a> for decoding of <em>Japanese character encodings</em>.
177    See the documentation for    See the documentation for
178    <a href="http://suika.fam.cx/www/manakai-charlib/readme">manakai    <a href="http://suika.fam.cx/www/manakai-charlib/readme">manakai
179    charlib</a> for more information.</dd>    charlib</a> for more information.</dd>
180  <dt><code>Message::DOM::DOMImplementation</code> and related modules</dt>  <dt><a href="http://www.python.org/">Python</a>, Perl
181    <dd><em>Testing</em> for <code>Whatpm::ContentChecker</code>  <a href="http://search.cpan.org/~neilw/Inline-Python-0.22/"><code>Inline::Python</code></a>
182    depends on <code>Message::DOM::DOMImplementation</code> and related modules  module, and <a href="http://chardet.feedparser.org/">Universal Encoding
183    in <a href="http://suika.fam.cx/www/2006/manakai/">manakai</a>.  Detector</a></dt>
184    They are not required in practice.</dd>    <dd>For the module <code>Whatpm::Charset::UniversalCharDet</code> being
185      meaningful, these softwares are requires on the system.  See the
186      <a href="Whatpm/Charset/UniversalCharDet.html#dependency">documentation</a>
187      for more information.</dd>
188  <dt><a href="http://search.cpan.org/~makamaka/JSON-1.14/"><code>JSON</code></a></dt>  <dt><a href="http://search.cpan.org/~makamaka/JSON-1.14/"><code>JSON</code></a></dt>
189    <dd><em>Testing</em> for <code>Whatpm::HTML</code>    <dd><em>Testing</em> for modules <code>Whatpm::HTML</code> and
190      <code>Whatpm::CSS::Tokenizer</code>
191    depends on <a href="http://search.cpan.org/~makamaka/JSON-1.14/"><code>JSON</code> and related modules</a>.    depends on <a href="http://search.cpan.org/~makamaka/JSON-1.14/"><code>JSON</code> and related modules</a>.
192    They are not required in practice.</dd>    They are not required in practice.</dd>
193  </dl>  </dl>
# Line 126  repository</a>.</p> Line 211  repository</a>.</p>
211      <a href="t/tokenizer-result">HTML tokenization</a>,      <a href="t/tokenizer-result">HTML tokenization</a>,
212      <a href="t/tree-construction-result">HTML tree construction</a>,      <a href="t/tree-construction-result">HTML tree construction</a>,
213      <a href="t/content-checker-result"><code>Whatpm::ContentChecker</code></a>).</li>      <a href="t/content-checker-result"><code>Whatpm::ContentChecker</code></a>).</li>
214      <li>Merge with the <a href="http://suika.fam.cx/www/2006/manakai/">manakai-core</a>
215          code tree.
216    <li>Charset detection.</li>    <li>Charset detection.</li>
217    <li>Validation for <code>meta</code>.</li>    <li>Validation for <code>meta</code>.</li>
218    <li>Validation for media queries, IRIs (against URI schemes), language tags,    <li>Validation for media queries, IRIs (against URI schemes), language tags,
219      and so on.</li>      and so on.</li>
220    <li>Documentations are missing for some features.</li>    <li>Documentations are missing for some features.</li>
221    <li><q>Whatpm</q> is a code name in fact.  Please let me know    <li>XML parser<!-- with application cache selection algorithm hook-->.</li>
     if you have a better name.</li>  
222    <li>In addition, each module has its own TO DO items.    <li>In addition, each module has its own TO DO items.
223      (Search for <q>## TODO</q> and <q>## ISSUE</q> in each module.)</li>      (Search for <q>## TODO</q> and <q>## ISSUE</q> in each module.)</li>
224  </ul>  </ul>
# Line 142  repository</a>.</p> Line 228  repository</a>.</p>
228  <h2>Acknowledgments</h2>  <h2>Acknowledgments</h2>
229    
230  <p>Thanks to the <a href="http://code.google.com/p/html5lib/">html5lib</a>  <p>Thanks to the <a href="http://code.google.com/p/html5lib/">html5lib</a>
231  team for <a href="http://html5lib.googlecode.com/svn/trunk/testdata/">HTML5  team for their
232    <a href="http://html5lib.googlecode.com/svn/trunk/testdata/">HTML5
233  parser test data</a>.</p>  parser test data</a>.</p>
234  </div>  </div>
235    
# Line 155  parser test data</a>.</p> Line 242  parser test data</a>.</p>
242  <div class="section" id="license">  <div class="section" id="license">
243  <h2>License</h2>  <h2>License</h2>
244    
245  <p>Copyright 2007 Wakaba  <p>Copyright 2007$B!>(B2008 Wakaba
246  <code class="mail">&lt;<a href="mailto:w@suika.fam.cx"  <code class="mail">&lt;<a href="mailto:w@suika.fam.cx"
247      rel="author">w@suika.fam.cx</a>></code>.</p>      rel="author">w@suika.fam.cx</a>></code>.</p>
248    

Legend:
Removed from v.1.9  
changed lines
  Added in v.1.22

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24