Package Details: ucto 0.35-1
| Git Clone URL: | https://aur.archlinux.org/ucto.git (read-only) |
|---|---|
| Package Base: | ucto |
| Description: | An advanced rule-based (regular-expression) and unicode-aware tokenizer for various languages. Tokenization is an essential first step in any NLP pipeline. |
| Upstream URL: | https://languagemachines.github.io/ucto |
| Keywords: | nlp tokenization tokenizer |
| Licenses: | GPL3 |
| Submitter: | proycon |
| Maintainer: | proycon |
| Last Packager: | proycon |
| Votes: | 1 |
| Popularity: | 0.000000 |
| First Submitted: | 2014-11-28 18:58 (UTC) |
| Last Updated: | 2024-12-16 14:41 (UTC) |
Dependencies (8)
- icu (icu-git [AUR])
- libfolia [AUR]
- libxml2 (libxml2-git [AUR])
- ticcutils [AUR]
- uctodata [AUR]
- autoconf (autoconf-git [AUR]) (make)
- autoconf-archive (autoconf-archive-git [AUR]) (make)
- libtool (libtool-git [AUR]) (make)