2008-03-07 Wakaba
* HTML.pm.src: Simplified "generate implied end tag" (HTML5
revision 1320).
2008-03-07 Wakaba
* HTML.pm.src (_tree_construction_main): Merge rules for "h1"
and "div" (HTML5 revision 1318). Add comments to where
|form| pointer association codes should be inserted (HTML5 revision
1319).
2008-03-06 Wakaba
* HTML.pm.src: treatement refined (HTML5 revision 1314).
2008-03-05 Wakaba
* HTML.pm.src: Since the case t268 should never be reached (no
other token type, there are), it is replaced by a |die| statement.
2008-03-05 Wakaba
* HTML.pm.src: Typo fixed.
2008-03-04 Wakaba
* HTML.pm.src (_tree_construction_initial): Some limited quirks
doctypes were not uppercased for comparison.
2008-03-03 Wakaba
* HTML.pm.src (tree construction and set_inner_html): Checkpoints
are added.
2008-03-03 Wakaba
* HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
are set. Cases that are unlikely reached are noted as so.
2008-03-03 Wakaba
* HTML.pm.src: Checkpoints for debugging are added.
* mkhtmlparser.pl: Support for |!!!cp| syntax.
2008-03-03 Wakaba
* HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
for simplicity.
2008-03-03 Wakaba
* HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
revision 1307).
2008-03-03 Wakaba
* HTML.pm.src (_get_next_token): Where the /incorrect/ flag
is set are changed (HTML5 revision 1305).
2008-03-02 Wakaba
* HTML.pm.src: Raise a parse error for || (HTML5 revision
1292). Entities are not parsed in comment-like part in RCDATA
elements (HTML5 revision 1294). Allow bare & at the end
of attribute value literals (HTML5 revision 1296). More
quirks mode doctypes (HTML5 revision 1302). Requires spaces
between attributes and ban attribute names or unquoted
attribute values containing single or double quotes (HTML5
revision 1303).
2008-03-02 Wakaba
* ContentChecker.pm: Typo fixed. Don't raise "character encoding"
and related errors unless it is an HTML document (though the spec
is unclear on whether it is applied to XHTML document).
* HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
are added.
2008-03-02 Wakaba
* HTML.pm.src: s/local_name/manakai_local_name/g.
2008-03-01 Wakaba
* _NamedEntityList.pm: Updated (HTML5 revision 1286).
* HTML.pm.src: |charset| in |content| attribute is
case-insensitive (HTML5 revision 1270).
2008-02-26 Wakaba
* ContentChecker.pm: New status constants are added.
($ElementDefault): |status| added.
(check_element): Err for non-standard or deprecated elements.
(_attr_status_info): For non-standard or deprecated attributes.
2008-02-24 Wakaba
* ContentChecker.pm (_attr_status_info): New internal method.
2008-02-24 Wakaba
* ContentChecker.pm (check_element): Element standardized
status information is now dispatched.
2008-02-24 Wakaba
* ContentChecker.pm (check_element): Fix |del|-and-significant
problem by adding some more arguments.
2008-02-24 Wakaba
* ContentChecker.pm (check_element): Use context of
container-for-the-purpose-of-content-model element (not transparent
element) for |check_child_element| calling and significant
text flag marking. This reintroduces ||-and-significant
problem again.
2008-02-24 Wakaba
* ContentChecker.pm (check_element): Make semi-transparent
elements ignored for the purpose of phase changes in
content model checking.
2008-02-23 Wakaba
* ContentChecker.pm (check_element): In-element state
was not properly managed for transparent cases.
2008-02-23 Wakaba
* ContentChecker.pm (check_element): Support for |video|
and |audio| as semi-transparent elements.
2008-02-23 Wakaba
* ContentChecker.pm ($HTMLSemiTransparentElements): New.
(check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
Support for |html:object| as a semi-transparent element.
2008-02-23 Wakaba
* ContentChecker.pm (check_element): The way to traverse
the tree is entirely revised to make it easier to track
the state of ancestors/descendants. As a result of this
revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
support for content model checking for HTML elements |figure|,
|object|, |video|, and |audio| and checking for XML elements (and
some XMLNS checkings) are dropped for now. They will be
reimplemented in due cource.
2008-02-17 Wakaba
* HTML.pm.src: |>| in public or system literal closes the DOCTYPE
token (HTML5 revision 1225).
2008-02-17 Wakaba
* ContentChecker.pm ({unsupported_level}): New value.
* HTML.pm.src: Save whether |meta| |content| attribute
contains character references or not.
2008-02-17 Wakaba
* ContentChecker.pm (_get_children): (Incompleted) attempt to
imlement significant content checking for contents
with |del| elements.
2008-02-17 Wakaba
* ContenteChecker.pm ($HTMLTransparentElements): More
elements are added.
(_get_children): HTML |object| elements are now semi-transparent.
* NanoDOM.pm (manakai_html, manakai_head): New methods.
2008-02-16 Wakaba
* CacheManifest.pm: HTML5 revision 1211 implemented.
* CacheManifest.pod: Updated.
2008-02-10 Wakaba
* ContentChecker.pm (check_document, check_element): Support
for second argument ($onsubdoc).
(_get_css_parser): Removed (now it is part of WDCC).
2008-02-09 Wakaba
* ContentChecker.pm (_get_css_parser): New.
2007-11-25 Wakaba
* ContentChecker.pm ($AnyChecker): Old way to add child elements
for checking had been used.
2007-11-25 Wakaba
* ContentChecker.pm (check_element): New todo item type |descendant|.
2007-11-23 Wakaba
* IMTChecker.pm: Revised to raise errors and warnings as (poorly)
specced in RFC 2046 and RFC 4288.
(application/atom+xml): Definition added.
2007-11-23 Wakaba
* URIChecker.pm: Make RFC 3986 should-level errors
warnings (rather than SHOULD-level errors).
2007-11-23 Wakaba
* NanoDOM.pm (get_user_data, set_user_data): New methods.
* HTML.pm.src: A flag for character references in attribute
values are added. Set |manakai_has_reference| user data
to |charset| attribute.
2007-11-23 Wakaba
* NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
attributes.
* ContentChecker.pm (check_document): Warn if charset requirements
cannot be tested.
2007-11-19 Wakaba
* HTML.pm.src (parse_byte_string): Detect charset
by universalchardet if charset parameter is not specified.
* Makefile (Charset-all, Charset-clean): New rules.
2007-11-18 Wakaba
* ContentChecker.pm (check_document): Check the existence
of character encoding declaration and selection of encoding
for HTML document.
2007-11-18 Wakaba
* ContentType.pm (get_sniffed_type): Return also the official
type in list context.
2007-11-18 Wakaba
* ContentType.pm: Sniffing with leading white space
ignoring (HTML5 revisions 1013 and 1016).
2007-11-18 Wakaba
* ContentType.pm: HTML5 revision 1013 changes, except for leading
white spaces, are implemented.
2007-11-11 Wakaba
* HTML.pm.src (parse_char_string): Set |inner_encoding|
attribute if possible.
2007-11-11 Wakaba
* HTML.pm.src (parse_byte_string): New method.
(parse_char_string): New alias for |parse_string|.
(main phase): Invoking "change the encoding" algorithm if desired.
* HTML.pod: Updated.
2007-11-11 Wakaba
* HTML.pod (get_inner_html): Removed.
* Makefile (HTML-all, HTML-clean): New.
2007-11-11 Wakaba
* HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
2007-11-08 Wakaba
* mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
2007-11-04 Wakaba
* Makefile: |CacheManifest.html| is added.
* CacheManifest.pod: New file.
2007-11-04 Wakaba
* CacheManifest.pm: New module.
2007-11-04 Wakaba
* HTML.pm.src: Support for application cache selection algorithm
callback.
2007-11-04 Wakaba
* mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
2007-10-17 Wakaba
* Makefile (clean): New rule.
* NanoDOM.pm (public_id, system_id): New attributes.
2007-10-17 Wakaba
* Makefile (CSS-all, CSS-clean, clean): New rules.
2007-10-14 Wakaba
* ContentChecker.pm (check_document): Support for
new |is_xml_root| flag.
(check_element): Support for new |pluses| state.
(_add_pluses): New method.
(_remove_minuses): Support for new |minus| item.
2007-09-24 Wakaba
* ContentChecker.pm: Raise specific error for invalid
root element.
2007-09-24 Wakaba
* ContentChecker.pm: Set level values for later uses.
2007-09-09 Wakaba
* ContentChecker.pm: Support for language tag validation.
2007-09-09 Wakaba
* LangTag.pm (check_rfc3066_language_tag): New method.
2007-09-09 Wakaba
* LangTag.pm: New module.
2007-09-04 Wakaba
* HTML.pm.src: Some error types were wrong.
2007-08-17 Wakaba
* CSS/: New directory.
2007-08-17 Wakaba
* ContentChecker.pm (_check_get_children): Support
for |noscript| in |head|.
2007-08-12 Wakaba
* URI/: New directory.
2007-08-11 Wakaba
* HTML.pm.src: Tokenizer's states are now represented in
number.
2007-08-11 Wakaba
* HTML.pm.src: |or|s for insertion modes are replaced
by |&|s.
2007-08-11 Wakaba
* HTML.pm.src: Token types are now represented in number.
2007-08-11 Wakaba
* ContentType.pm (SEE ALSO): Updated.
* HTML.pm.src: Insertion modes are now represented in number.
2007-08-11 Wakaba
* ContentType.pm: Sniffing for bitmap images (HTML5 revision
999) is implemented.
2007-08-08 Wakaba
* ContentType.pm: Sniffing for |