applications/text

nekohtml - HTML scanner and tag balancer

Website: http://www.apache.org/~andyc/neko/doc/html/
License: Apache License
Vendor: Fedora Project
Description:
NekoHTML is a simple HTML scanner and tag balancer that enables
application programmers to parse HTML documents and access the
information using standard XML interfaces. The parser can scan HTML
files and "fix up" many common mistakes that human (and computer)
authors make in writing HTML documents.  NekoHTML adds missing parent
elements; automatically closes elements with optional end tags; and
can handle mismatched inline element tags.
NekoHTML is written using the Xerces Native Interface (XNI) that is
the foundation of the Xerces2 implementation. This enables you to use
the NekoHTML parser with existing XNI tools without modification or
rewriting code.

Packages

nekohtml-0.9.5-4jpp.1.fc7.src [395 KiB] Changelog by Jeff Johnston (2007-02-12):
- Update to address Fedora review comments.

Listing created by Repoview-0.6.2-1.fc9