Search Criteria
Package Details: justext 1.1-1
Git Clone URL: | https://aur.archlinux.org/justext.git (read-only) |
---|---|
Package Base: | justext |
Description: | jusText removes boilerplate content (such as navigation links, headers, and footers) from HTML pages. Designed to preserve text with full sentences, it is suited for creating linguistic resources like Web corpora. |
Upstream URL: | https://code.google.com/p/justext/ |
Licenses: | |
Submitter: | unhammer |
Maintainer: | unhammer |
Last Packager: | unhammer |
Votes: | 3 |
Popularity: | 0.000000 |
First Submitted: | 2011-08-06 14:35 |
Last Updated: | 2015-07-14 08:24 |
Dependencies (1)
- python2>=2.2.4 (pypy19, stackless-python2, placeholder)
not sure about the Category, anyone got a better suggestion?