Diff of /markup/html/whatpm/Whatpm/ChangeLog

revision 1.154 by wakaba, Mon Mar 3 09:17:09 2008 UTC revision 1.237 by wakaba, Sun May 18 03:46:26 2008 UTC
# Line 1  Line 1 
1    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src (parse_byte_string): Fix the column number reported
4            by encoding layer error reporter.
5    
6    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
7    
8            * HTML.pm.src (parse_byte_string): Use streaming decoder
9            rather than converting the whole byte string and then parsing.
10            Propagate errors in character encoding layer.
11            (get_next_token): Precise error reporting for |bare stago| error.
12    
13    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
14    
15            * HTML.pm.src (parse_char_stream): New method.
16            (parse_char_string): This method is now defined as an invocation
17            of the |parse_char_stream| method.
18    
19    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
20    
21            * HTML.pm.src (parse_byte_string): Report various status
22            of the sniffing as info-level errors.  Support for new
23            decoding framework in parser resetting.
24            (new): Various default error levels were not set.
25    
26    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
27    
28            * HTML.pm.src (parse_byte_string): HTML5 encoding sniffing
29            algorithm, except for the actual sniffing, is implemented
30            with new framework with Message::Charset::Info.
31    
32    2008-05-16  Wakaba  <wakaba@suika.fam.cx>
33    
34            * CacheManifest.pm (_parse): Drop fragment identifiers from
35            URIs in fallback section (HTML5 revision 1596).
36    
37    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
38    
39            * Makefile (Entities.html): URI has changed.
40    
41    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
42    
43            * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
44            1553).
45    
46    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
47    
48            * ContentChecker.pm: Noted that those returned in |table| are
49            no longer table elements, but table objects returned
50            by Whatpm::HTMLTable.
51    
52            * HTMLTable.pm (form_table): Return table element node
53            as |$table->{element}|.
54            (assign_header): Support for the |headers=""| attribute.
55    
56    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
57    
58            * HTMLTable.pm (assign_header): New function; first version
59            with no support for headers="".
60            (form_table): Include table width and height to the returned
61            table object for convenience.  Indexing in column assignment was wrong.
62            Set whether a data cell is empty or not for convenience.
63    
64    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
65    
66            * HTMLTable.pm: More robust caption support (HTML5 revision 1393).
67    
68    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
69    
70            * HTMLTable.pm: How table model errors are detected is
71            changed (HTML5 revision 1387).
72    
73    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
74    
75            * HTMLTable.pm: The algorithm now moves |tfoot| elements
76            to the end of the table (HTML5 revision 1380).
77    
78    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
79    
80            * HTMLTable.pm: The algorithm is now 0-based indexing, instead
81            of 1-based (HTML5 revision 1376).
82    
83    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
84    
85            * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
86            and "Content-Encoding" no longer prevent sniffing (HTML5
87            revision 1288).
88    
89    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
90    
91            * ContentType.pm: Skip BOMs in feed or HTML algorithm (HTML5 revision
92            1282).
93    
94    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
95    
96            * ContentChecker.pm: Support for global attributes.
97            Status of XML specs is added.
98    
99    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
100    
101            * ContentChecker.pm (check_element): Support for |template=""|
102            and |ref=""| attributes (referent element type checking).
103    
104    2008-04-29  Wakaba  <wakaba@suika.fam.cx>
105    
106            * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
107            1500) is implemented (except for IDNA part and URI-scheme-specific
108            knowledge).  Line number counting was wrong for LF-only
109            documents.
110    
111    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
112    
113            * HTML.pm.src: Raise a parse error for any disallowed
114            character (HTML5 revision 1263).
115    
116    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
117    
118            * mkentitylist.pl: Support for new HTML5 entity table format (the
119            definition for |AElig;| was missing).
120    
121    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
122    
123            * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
124            name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
125    
126    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
127    
128            * HTML.pm.src: List of element names that close foreign content
129            insertion mode is added (HTML5 revisions 1412 and 1418).
130    
131    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
132    
133            * HTML.pm.src: Support for |mglyph| and |malignmark|
134            elements (HTML5 revision 1410).
135    
136    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
137    
138            * HTML.pm.src: Support for new long MathML entities (HTML5
139            revision 1406).
140    
141    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
142    
143            * HTML.pm.src: CDATA section support for MathML and SVG
144            elements (HTML5 revisions 1404 and 1420).
145    
146    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
147    
148            * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
149            elements (HTML5 revision 1404).  Unused !!!macro definitions
150            are removed.
151    
152    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
153    
154            * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
155            are raised is changed (HTML5 revision 1404).
156    
157    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
158    
159            * HTML.pm.src: Category sets that are no longer used are removed.
160    
161    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
162    
163            * HTML.pm.src: The ->[1] property of stack entries is now
164            replaced by a constant representing the element category.
165    
166    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
167    
168            * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
169            for error reporting.  (This is a preparation for using constant
170            value for ->[1].)
171    
172    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
173    
174            * RDFXML.pm: Typo fixed.
175    
176    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
177    
178            * ContentChecker.pm: |fact_level| is now treated
179            the same as |must_level|, i.e. level = |m|.
180            (check_element): Make list of URIs in the DOM.
181    
182    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
183    
184            * RDFXML.pm: Language accessor implemented.
185            Local (null-namespace) attribute support.
186    
187    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
188    
189            * RDFXML.pm: Factored out ID checking code.
190    
191    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
192    
193            * RDFXML.pm: TODO items noted.  Validation of ID
194            and URI attributes is implemented.  Warn if unknown
195            value is used in rdf:parseType="" attribute.
196    
197            * URIChecker.pm (check_rdf_uri_reference): New function.
198    
199    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
200    
201            * RDFXML.pm: bnodeid implemented.  Relative references
202            are now resolved.
203    
204    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
205    
206            * ContentChecker.pm: RDF reification implemented.
207    
208            * RDFXML.pm: undef vs false bug fixed.
209            Reification implemented.
210    
211    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
212    
213            * RDFXML.pm: s/id/ID/ for attribute name.
214            The |node| arguments are added for |ontriple| calls.
215            Too many "attribute not allowed" errors were raised.
216    
217            * ContentChecker.pm: Initial experimental support for rdf:RDF
218            element.
219    
220    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
221    
222            * RDFXML.pm: New module.
223    
224    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
225    
226            * HTML.pm.src (set_inner_html): Line/column number
227            code was still the old one.
228    
229    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
230    
231            * HTML.pm.src: Better line/column reporting for "duplicate attribute"
232            errors.  Line/column markings for DOCTYPE, comment, and
233            character tokens are reintroduced; otherwise,
234            error location for "not HTML5" error and errors
235            for implied elements are not attached.
236    
237    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
238    
239            * HTML.pm.src: Set line/column numbers to attributes.
240    
241            * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
242            (value): Setter implemented.
243    
244            * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
245    
246    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
247    
248            * HTML.pm.src: Unused line/column markings are removed.
249    
250    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
251    
252            * HTML.pm.src (_get_next_token): Remove |first_start_tag|
253            flag, which is no longer used.
254    
255    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
256    
257            * HTML.pm.src: Set line/column information to element nodes.
258    
259            * mkhtmlparser.pl (!!!create-element, !!!insert-element,
260            and !!!insert-element-t): Set line/column information to
261            element nodes.
262    
263    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
264    
265            * HTML.pm.src (_get_next_token): The first "<" character
266            in "<?", "<>", or "</>" should be the error point.
267    
268    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
269    
270            * HTML.pm.src: Some more fixes on error position reporting.
271    
272    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
273    
274            * HTML.pm.src: Token-level precise error reporting.
275    
276    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
277    
278            * HTML.pm.src: Preparation for more precise error point
279            reporting.
280    
281    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
282    
283            * HTML.pm.src: Error type revised.
284    
285    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
286    
287            * HTML.pm.src: Similar code is merged together, again.
288    
289    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
290    
291            * HTML.pm.src: Similar code is merged together.
292    
293    2008-03-10  Wakaba  <wakaba@suika.fam.cx>
294    
295            * mkhtmlparser.pl: Set "level" parameter to parse errors.
296    
297            * HTML.pm.src: Code refined.
298    
299    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
300    
301            * HTML.pm.src: |</body>| treatment has been changed (HTML5
302            revision 1348).  Note that I really don't know whether this makes
303            any difference in the black-box behavior of the parser.
304    
305    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
306    
307            * HTML.pm.src: New end-of-file token implementation (HTML5
308            revision 1348).
309    
310    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
311    
312            * HTML.pm.src: |applet| support (HTML5 revision 1347).
313    
314    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
315    
316            * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
317    
318    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
319    
320            * HTML.pm.src: Support for |<input>| in the "in select" insertion
321            mode and support for the "in select in table" insertion mode (HTML5
322            revision 1342).
323    
324    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
325    
326            * HTML.pm.src: No foster parenting for <style> and <script>
327            in non-tainted <table>s (HTML5 revision 1336).
328    
329    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
330    
331            * HTML.pm.src: Ignore white space characters between <html>
332            and <head> (HTML5 revision 1332).
333    
334    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
335    
336            * HTML.pm.src: Treat <input type=hidden> as if it were a
337            white space (HTML5 revision 1331).
338    
339    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
340    
341            * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
342            element (HTML5 revision 1330).
343    
344    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
345    
346            * HTML.pm.src: <title> is always appended to the current
347            element (HTML5 revision 1328).
348    
349    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
350    
351            * HTML.pm.src: White space in tainted tables is moved
352            into foster parents (HTML5 revision 1326).
353    
354    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
355    
356            * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
357            revision 1321).
358    
359    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
360    
361            * HTML.pm.src: |</h/n/>| case code rearranged to align with
362            the spec (HTML5 revision 1320).  Note that we finally complete
363            all of HTML5 revision 1320 changes.
364    
365    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
366    
367            * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
368            tags (HTML5 revision 1320).
369    
370    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
371    
372            * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
373    
374    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
375    
376            * HTML.pm.src: A "generate implied end tags" code (t409.1)
377            could not be reached so that it is now removed (HTML5 revision
378            1320).
379    
380    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
381    
382            * HTML.pm.src: Code for the case of |</div>| and so on is revised
383            to align with new spec text (HTML5 revision 1320).
384    
385    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
386    
387            * HTML.pm.src: Remove strange |if| condition; however, it should
388            have done no harm in theory.
389    
390    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
391    
392            * HTML.pm.src (_tree_construction_main): '</p>' in body
393            case is split from other end tags for the preparation
394            of implementing HTML5 revision 1320.
395    
396    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
397    
398            * HTML.pm.src: Simplified "generate implied end tag" (HTML5
399            revision 1320).
400    
401    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
402    
403            * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
404            and "div" (HTML5 revision 1318).  Add comments to where
405            |form| pointer association code should be inserted (HTML5 revision
406            1319).
407    
408    2008-03-06  Wakaba  <wakaba@suika.fam.cx>
409    
410            * HTML.pm.src: <html> treatment refined (HTML5 revision 1314).
411    
412    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
413    
414            * HTML.pm.src: Since the case t268 should never be reached (there
415            are no other token types), it is replaced by a |die| statement.
416    
417    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
418    
419            * HTML.pm.src: Typo fixed.
420    
421    2008-03-04  Wakaba  <wakaba@suika.fam.cx>
422    
423            * HTML.pm.src (_tree_construction_initial): Some limited quirks
424            doctypes were not uppercased for comparison.
425    
426    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
427    
428            * HTML.pm.src (tree construction and set_inner_html): Checkpoints
429            are added.
430    
431    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
432    
433            * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
434            are set.  Cases that are unlikely to be reached are noted as such.
435    
436    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
437    
438            * HTML.pm.src: Checkpoints for debugging are added.
439    
440            * mkhtmlparser.pl: Support for |!!!cp| syntax.
441    
442  2008-03-03  Wakaba  <wakaba@suika.fam.cx>
443    
444          * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g