/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory | Revision Log | View Patch Patch

-revision 1.276 by wakaba,
Sun Aug 17 05:09:12 2008 UTC
+revision 1.341 by wakaba,
Sat Oct  4 08:58:02 2008 UTC
 Line 1
+-10-04  Wakaba  <wakaba@suika.fam.cx>
+         * HTML.pm.src: Support for new definition of |param| and |source|
+         start tag parsing (HTML5 revision 1731).
+-10-04  Wakaba  <wakaba@suika.fam.cx>
+         * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
+-10-04  Wakaba  <wakaba@suika.fam.cx>
+         * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
+         revisions 1731 and 1831).
+-10-04  Wakaba  <wakaba@suika.fam.cx>
+         * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
+         revisions 1731 and 1778).  Support for the </sarcasm> end tag (HTML5
+         revision 1731).
+-10-04  Wakaba  <wakaba@suika.fam.cx>
+         * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
+         revision 1731).  End tags of |option| and |optgroup| elements are
+         now optional (HTML5 revision 1731).
+-10-04  Wakaba  <wakaba@suika.fam.cx>
+         * HTML.pm.src: New "special" elements added to the list (HTML5
+         revision 1778).  "strile" -> "strike".
+-10-02  Wakaba  <wakaba@suika.fam.cx>
+         * ContentType.pm (get_sniffed_type): Support for the "better"
+         content sniffing (HTML5 revision 1927).  In a case the official
+         type was not returned when the method is invoked in the list
+         context.
+-09-22  Wakaba  <wakaba@suika.fam.cx>
+         * HTML.pm.src: Character references for non-space C0 characters,
+         including U+000B VT, DEL character, noncharacter code points, are
+         now converted to the U+FFFD character (cf. HTML5 revision 2138).
+-09-21  Wakaba  <wakaba@suika.fam.cx>
+         * ContentChecker.pm: |form=""| check support added.
+-09-21  Wakaba  <wakaba@suika.fam.cx>
+         * ContentChecker.pm: |contextmenu| validness is now checked using
+         |id| and |id_type| properties, and |menu| property is removed.
+-09-21  Wakaba  <wakaba@suika.fam.cx>
+         * ContentChecker.pm: Prepare for |form| |name| attribute's
+         duplication checking.
+-09-21  Wakaba  <wakaba@suika.fam.cx>
+         * HTML.pm.src (parse_byte_stream): Support (or non-support) for
+         unsupported charset="" parameter value (HTML5 revision 2131).
+-09-20  Wakaba  <wakaba@suika.fam.cx>
+         * HTML.pm.src: Reminding places where U+000B is allowed as a space
+         character is fixed (cf. HTML5 revision 1738).
+         * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
+         space characters (HTML5 revision 1738).
+-09-20  Wakaba  <wakaba@suika.fam.cx>
+         * HTML.pm.src: The "anything else" case for the "after after body"
+         insertion mode was not updated to swtich to the "in body"
+         insertion mode.  U+000B is no longer a space character for the
+         purpose of tree construction phase (HTML5 revision 1738).
+-09-20  Wakaba  <wakaba@suika.fam.cx>
+         * HTML.pm.src: U+000B is no longer a space character (HTML5
+         revision 1738).
+-09-20  Wakaba  <wakaba@suika.fam.cx>
+         * ContentType.pm: 0x0B is no longer a space character (HTML5
+         revision 1738).
+         * HTML.pm.src: U+000B is no longer a space character for the
+         algorithm for extracting an encoding from a Content-Type (HTML5
+         revision 1738).
+-09-20  Wakaba  <wakaba@suika.fam.cx>
+         * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
+-09-18  Wakaba  <wakaba@suika.fam.cx>
+         * LangTag.pm: Add checks for remaining requirements from RFC 4646.
+         * mklangreg.pl: Sort 'Prefix' values by their length, to ease
+         matching.
+-09-18  Wakaba  <wakaba@suika.fam.cx>
+         * LangTag.pm: Warn for private use language subtags.  Error level
+         typos fixed.  Support for Suppress-Script field.
+         * mklangreg.pl: Support for dumping of nested structure.
+-09-18  Wakaba  <wakaba@suika.fam.cx>
+         * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
+         recommended case as per RFC 4646.
+-09-18  Wakaba  <wakaba@suika.fam.cx>
+         * LangTag.pm (check_rfc4646_langtag): New method.
+-09-18  Wakaba  <wakaba@suika.fam.cx>
+         * mklangreg.pl: New script.
+         * Makefile: Updated for creation of the module for language subtag
+         registry.
+-09-16  Wakaba  <wakaba@suika.fam.cx>
+         * Makefile: WebIDL.html added.
+         * WebIDL.pod: New documentation.
+-09-16  Wakaba  <wakaba@suika.fam.cx>
+         * WebIDL.pm: Checker's error types are redefined.
+-09-16  Wakaba  <wakaba@suika.fam.cx>
+         * WebIDL.pm: Parser's error types are redefined.  Some forward
+         compatible parsing bugs are fixed.  Some unreachable codes are
+         commented out.
+-09-16  Wakaba  <wakaba@suika.fam.cx>
+         * WebIDL.pm: Support for the reminding extended attributes are
+         added.  It does not satisfy the definition that a forward
+         interface declaration has an extended attribute.  It seems that
+         unless explicitly allowed multiple extended attributes with the
+         same name is not allowed, though it is not explicitly mentioned in
+         the spec.
+-09-16  Wakaba  <wakaba@suika.fam.cx>
+         * WebIDL.pm: Unescapes extended attribute names and extended
+         attribute identifiers.  Preserve whether an extended attribute has
+         an argument list of not.  Support for extended attributes:
+         Constructor, ExceptionConsts, IndexGetter, IndexSetter,
+         NameGetter, NameSetter, and Null.
+         (has_argument_list): New attribute.
+         (idl_text): Stringifies argument lists, if any, even if it is
+         empty.
+-09-15  Wakaba  <wakaba@suika.fam.cx>
+         * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
+         |{s_kwd}| in DATA_STATE as default.
+-09-15  Wakaba  <wakaba@suika.fam.cx>
+         * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
+         by |{s_kwd}| in DATA_STATE.
+-09-15  Wakaba  <wakaba@suika.fam.cx>
+         * HTML.pm.src: Shorten keys.
+-09-15  Wakaba  <wakaba@suika.fam.cx>
+         * HTML.pm.src: Remove checking for control character, surrogate
+         pair, or noncharacter code points and non-Unicode code
+         points (they should be handled by Whatpm::Charset::UnicodeChecker).
+         (parse_char_stream): Support for the |$get_wrapper| argument and
+         character stream error handlers.
+-09-15  Wakaba  <wakaba@suika.fam.cx>
+         * ContentChecker.pm: Don't call |loda_ns_module|
+         for null-namespace elements/attributes.
+         * HTML.pm.src: Fact out $disallowed_control_chars
+         as a hash.
+-09-14  Wakaba  <wakaba@suika.fam.cx>
+         * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
+         and |{next_char}| initializations are moved to initialization
+         method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
+         with |parse_char_stream|.
+-09-14  Wakaba  <wakaba@suika.fam.cx>
+         * HTML.pm.src (parse_char_stream): Make |set_next_char|
+         invoke |manakai_read_until|, not only |read|, where
+         possible, to decrease the number of |read| method calls.
+         * mkhtmlparser.pl: Related changes to the aforementioned
+         modification.
+-09-14  Wakaba  <wakaba@suika.fam.cx>
+         * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
+         would report character error from now.
+-09-14  Wakaba  <wakaba@suika.fam.cx>
+         * HTML.pm.src: White-space-leaded non-white-space character
+         tokens in "before head insertion mode" was not
+         correctly handled.
+         (set_inner_html): Reimplemented using CharString decodehandle
+         class.  Support for $get_wrapper argument.  Support
+         for |{read_until}| feature.
+-09-14  Wakaba  <wakaba@suika.fam.cx>
+         * HTML.pm.src: Make a "bare ero" error for unknown
+         entities point the "&" character.
+-09-14  Wakaba  <wakaba@suika.fam.cx>
+         * HTML.pm.src: It turns out that U+FFFD don't have to
+         be added to the list of excluded characters.
+-09-14  Wakaba  <wakaba@suika.fam.cx>
+         * HTML.pm.src ($char_onerror): Have character decoder's |line|
+         and |column| a higher priority than the one set by the
+         tokenizer's input handler.
+         ($self->{read_until}): Exclude U+FFFD (but this might
+         not be necessary, since now we do line/column fixup in
+         the character decode handle).
+-09-14  Wakaba  <wakaba@suika.fam.cx>
+         * HTML.pm.src: Use |{read_until}| where possible.
+-09-14  Wakaba  <wakaba@suika.fam.cx>
+         * HTML.pm.src: Change |{getc_until}| to |{read_until}|
+         and |manakai_getc_until| to |manakai_read_until| to
+         reduce the number of string copies.
+-09-14  Wakaba  <wakaba@suika.fam.cx>
+         * HTML.pm.src (parse_char_string): Use newly created
+         |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
+         standard feature to |open| a string as a filehandle,
+         since Perl's string filehandle seems not supporting |ungetc|
+         method correctly.
+         (parse_char_stream): Define |{getc_until}| method.
+         (DATA_STATE): Experimental support for |getc_until| feature.
+-09-13  Wakaba  <wakaba@suika.fam.cx>
+         * HTML.pm.src: Check points added to newly added branches.
+-09-13  Wakaba  <wakaba@suika.fam.cx>
+         * HTML.pm.src: Remove |{char}|, which is no longer used.
+         Remove |{entity_in_attr}| and |{last_attribute_value_state}|
+         and replaced by |{prev_state}|.
+         * mkhtmlparser.pl: Remove |{char}| feature.
+         Remove |!!!back-next-input-character;| macro.
+-09-13  Wakaba  <wakaba@suika.fam.cx>
+         * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
+         entity related tokenizer states in favor of new states
+         implementing the consume character reference algorithm.
+-09-13  Wakaba  <wakaba@suika.fam.cx>
+         * HTML.pm.src: "Consume a character reference" algorithm is
+         now implemented as a tokenizer's state, rather than
+         a method, with minimum changes (more changes will
+         be made, in due course).  "Bogus comment state"'s inner
+         loop gets removed.
+-09-13  Wakaba  <wakaba@suika.fam.cx>
+         * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
+         into their own tokenizer states.
+-09-13  Wakaba  <wakaba@suika.fam.cx>
+         * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
+         is split into three states.
+-09-13  Wakaba  <wakaba@suika.fam.cx>
+         * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
+         itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
+         no longer does the tokenizer have to push back next input
+         characters in those states.
+-09-13  Wakaba  <wakaba@suika.fam.cx>
+         * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
+         into four states so that no longer does the tokenizer have to push
+         back next input characters in that state.
+-09-11  Wakaba  <wakaba@suika.fam.cx>
+         * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
+         which can be used to insert some wrapper between the character
+         stream handle and the tokenizer.  (It is currently not supported
+         for |set_inner_html| for |Element|s).
+-09-10  Wakaba  <wakaba@suika.fam.cx>
+         * HTML.pm.src: Ignore punctuations in charset names.
+-09-10  Wakaba  <wakaba@suika.fam.cx>
+         * ContentChecker.pm: Support for charset-layer error levels.
+         * HTML.pm.src: Don't specify |text| argument for the
+         |chardecode:fallback| error, since it is not the encoding
+         being used alternatively.
+-09-06  Wakaba  <wakaba@suika.fam.cx>
+         * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
+-08-31  Wakaba  <wakaba@suika.fam.cx>
+         * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
+-08-31  Wakaba  <wakaba@suika.fam.cx>
+         * HTML.pm.src: Bug fix and sync with the spec with regard
+         to after after frameset insertion mode processing (HTML5
+         revision 1909).  Note that the implementation was wrong
+         per the old spec before the r1909 changes.
+-08-30  Wakaba  <wakaba@suika.fam.cx>
+         * HTMLTable.pm: scope=auto algorithm fix synced with the
+         spec (HTML5 revision 2093).
+         ($process_row): Algorithm step numbers synced with the
+         spec (HTML5 revision 2092).
+-08-30  Wakaba  <wakaba@suika.fam.cx>
+         * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
+         revision 2094).
+-08-30  Wakaba  <wakaba@suika.fam.cx>
+         * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
+-08-30  Wakaba  <wakaba@suika.fam.cx>
+         * HTML.pm.src: '"' and "'" at the end of attribute
+         name (after another attribute) now raise parse error (HTML5
+         revision 2123).  Empty unquoted attribute values are no
+         longer allowed (HTML5 revision 2122).
+-08-30  Wakaba  <wakaba@suika.fam.cx>
+         * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
+         revision 2130).
+-08-30  Wakaba  <wakaba@suika.fam.cx>
+         * ContentChecker.pm: |xml:lang| attribute value must be same
+         as |lang| attribute value for HTML elements (HTML5 revision 2062
+         and so on).
+-08-30  Wakaba  <wakaba@suika.fam.cx>
+         * ContentChecker.pm: Error level definition for |xml_id_error|
+         was missing.
+         * URIChecker.pm: The end of the URL should be marked as the
+         error location for an empty path error.  The position
+         between the userinfo and the port components should be
+         marked as the error location for an empty host error.
+-08-30  Wakaba  <wakaba@suika.fam.cx>
+         * URIChecker.pm: Set parameters representing where in the
+         value the error occurs for errors.  Report unknown
+         address format error in warning level, since address
+         formats are rarely added.  Path segments starting with "/.."
+         were misinterpreted as a dot-segment.
+-08-30  Wakaba  <wakaba@suika.fam.cx>
+         * URIChecker.pm (check_iri_reference): Requires
+         |Message::DOM::DOMImplementation|.
+-08-29  Wakaba  <wakaba@suika.fam.cx>
+         * IMTChecker.pm: Updated for the new error reporting architecture.
+         * ContentChecker.pm: Error levels for IMTs are added.
 -08-17  Wakaba  <wakaba@suika.fam.cx>
          * H2H.pm (_shift_token): Support for unquoted HTML attribute

 Legend:



Removed from v.1.276
 


changed lines


 
Added in v.1.341
 Legend:



Removed from v.1.276
 


changed lines


 
Added in v.1.341
-Removed from v.1.276
+Added in v.1.341

admin@suikawiki.org	ViewVC Help
Powered by ViewVC 1.1.24