Parsing and extracting information from (possibly malformed) HTML/XML documents

Edit Package ghc-tagsoup
http://hackage.haskell.org/package/tagsoup

TagSoup is a library for parsing HTML/XML. It supports the HTML 5
specification, and can be used to parse either well-formed XML, or
unstructured and malformed HTML from the web. The library also provides
useful functions to extract information from an HTML document, making
it ideal for screen-scraping.

Users should start from the Text.HTML.TagSoup module.

Refresh
Refresh
Source Files
Filename Size Changed
_service 0000000075 75 Bytes
ghc-tagsoup.changes 0000000960 960 Bytes
ghc-tagsoup.spec 0000002740 2.68 KB
tagsoup-0.12.8.tar.gz 0000030647 29.9 KB
Revision 1 (latest revision is 22)
Stephan Kulow's avatar Stephan Kulow (coolo) accepted request 227584 from Peter Trommler's avatar Peter Trommler (ptrommler) (revision 1)
New package for pandoc.
See: http://lists.opensuse.org/archive/opensuse-factory/2014-03/msg00332.html
Comments 0
openSUSE Build Service is sponsored by