Parsing and extracting information from (possibly malformed) HTML/XML documents
TagSoup is a library for parsing HTML/XML. It supports the HTML 5 specification, and can be used to parse either well-formed XML, or unstructured and malformed HTML from the web. The library also provides useful functions to extract information from an HTML document, making it ideal for screen-scraping. Users should start from the "Text.HTML.TagSoup" module.
Release | Stable | Testing |
---|---|---|
Fedora Rawhide | 0.14.8-9.fc35 | - |
Fedora 35 | 0.14.8-9.fc35 | - |
Fedora 34 | 0.14.8-7.fc34 | - |
EPEL 7 | 0.12.8-4.el7 | - |
You can contact the maintainers of this package via email at
ghc-tagsoup dash maintainers at fedoraproject dot org
.