| Name: | tagsoup |
|---|---|
| Version: | 1.2.1 |
| Release: | 8.el6 |
| Architecture: | noarch |
| Group: | Text Processing/Markup/XML |
| Size: | 143062 |
| License: | ASL 2.0 and (GPLv2+ or AFL) |
| RPM: | tagsoup-1.2.1-8.el6.noarch.rpm |
| Source RPM: | tagsoup-1.2.1-8.el6.src.rpm |
| Build Date: | Tue Oct 14 2014 |
| Build Host: | ca-buildj3.us.oracle.com |
| Vendor: | Oracle America |
| URL: | http://home.ccil.org/~cowan/XML/tagsoup/ |
| Summary: | A SAX-compliant HTML parser written in Java |
| Description: | TagSoup is a SAX-compliant parser written in Java that, instead of parsing well-formed or valid XML, parses HTML as it is found in the wild: nasty and brutish, though quite often far from short. TagSoup is designed for people who have to process this stuff using some semblance of a rational application design. By providing a SAX interface, it allows standard XML tools to be applied to even the worst HTML. |