Parsing and extracting information from (possibly malformed) HTML/XML documents
http://hackage.haskell.org/package/tagsoup
TagSoup is a library for parsing HTML/XML. It supports the HTML 5
specification, and can be used to parse either well-formed XML, or
unstructured and malformed HTML from the web. The library also provides
useful functions to extract information from an HTML document, making
it ideal for screen-scraping.
Users should start from the Text.HTML.TagSoup module.
- Developed at devel:languages:haskell
- Sources inherited from project openSUSE:Factory
-
2
derived packages
- Download package
-
Checkout Package
osc -A https://api.opensuse.org checkout devel:ARM:Factory:Contrib:ILP32/ghc-tagsoup && cd $_
- Create Badge
Refresh
Refresh
Source Files
Filename | Size | Changed |
---|---|---|
_service | 0000000075 75 Bytes | |
ghc-tagsoup.changes | 0000000960 960 Bytes | |
ghc-tagsoup.spec | 0000002740 2.68 KB | |
tagsoup-0.12.8.tar.gz | 0000030647 29.9 KB |
Revision 1 (latest revision is 22)
Stephan Kulow (coolo)
accepted
request 227584
from
Peter Trommler (ptrommler)
(revision 1)
New package for pandoc. See: http://lists.opensuse.org/archive/opensuse-factory/2014-03/msg00332.html
Comments 0