Whatpm (beta)

Introduction

Whatpm is a work-in-progress set of Perl modules for Web hypertext application technologies. It is part of the manakai project.

Modules
Whatpm::ContentChecker
A DOM5 HTML (in-memory representation of a document) conformance checker with a partial support for Atom 1.0. (See also demo.)
Whatpm::ContentType
An implementation of HTML5 Content Type sniffing algorithm.
Whatpm::CSS::SelectorsParser
A group of selectors parser. (See also demo.)
Whatpm::CSS::SelectorsSerializer
A group of selectors serializer. (See also specification and demo.)
Whatpm::CSS::Tokenizer
A CSS tokenizer. (See also demo.)
Whatpm::HTML
An implementation of HTML5 parsing algorithm, fragment parsing, and fragment serialization algorithms. It can be used to convert a string into DOM, or vice versa. (See also demo.)
Whatpm::HTMLTable
An implementation of the HTML5 table algorithm. It can be used to extract a table structure from a DOM table element node. (See also demo.)
Whatpm::IMTChecker
An Internet Media Type (aka MIME type) label conformance checker.
Whatpm::URIChecker
An IRI reference conformance checker.
Whatpm::XMLSerializer
A simple XML serializer.

Note that all of these modules are work in progress and have a number of unresolved problems.

Note also that some modules have no documentation for now.

Specification
SSFT Specification
The specification for the serialization format used for testing Selectors-related modules.
manakai's Selectors Extensions
The specification for :-manakai-* pseudo-classes implemented by Selectors-related modules.
Documentations
List of error types
Description of errors to be notified to callback functions by Whatpm modules.
Selectors object
Description of data structure of Selectors object as used by Whatpm::CSS::SelectorsParser (as output), and Whatpm::CSS::SelectorsSerializer (as input).

Demo

Dependency

Perl 5.8 or later
It is recommended to use newer release of Perl 5.8 or later.
Message::IMT::InternetMediaType
Whatpm::IMTChecker depends on Message::IMT::InternetMediaType, which is part of manakai.
Message::URI::URIReference
Whatpm::URIChecker depends on Message::URI::URIReference, which is part of manakai.
manakai charlib
Whatpm::Charset::DeocdeHandle depends on modules in manakai charlib for decoding Japanese character encodings. See the documentation for manakai charlib for more information.
Message::DOM::DOMImplementation and related modules
Testing for Whatpm::ContentChecker depends on Message::DOM::DOMImplementation and related modules in manakai. They are not required in practice.
JSON
Testing for Whatpm::HTML and Whatpm::CSS::Tokenizer depends on JSON and related modules. They are not required in practice.

Distribution

The development version of Whatpm may be found in the CVS repository.

TO DO

Acknowledgments

Thanks to the html5lib team for HTML5 parser test data.

Author

.

License

Copyright 2007 Wakaba <>.

This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself.