[Date Prev] [Date Next] [Thread Prev] [Thread Next] Indexes: Main | Date | Thread | Author

[ba-ohs-talk] NekoHTML scanner-balancer


http://www.apache.org/~andyc/nekohtml/doc/index.html
Java, Apache    (01)

"NekoHTML is a simple HTML scanner and tag balancer that enables 
application programmers to parse HTML documents and access the information 
using standard XML interfaces. The parser can scan HTML files and "fix up" 
many common mistakes that human (and computer) authors make in writing HTML 
documents. NekoHTML adds missing parent elements; automatically closes 
elements with optional end tags; and can handle mismatched inline element 
tags.
NekoHTML is written using the Xerces Native Interface (XNI) that is the 
foundation of the Xerces2 implementation. This enables you to use the 
NekoHTML parser with existing XNI tools without modification or rewriting 
code. "    (02)

This is likely a required widget for any HyperScope project.    (03)