Search Criteria
Package Details: ucto 0.28.1-1
Package Actions
Git Clone URL: | https://aur.archlinux.org/ucto.git (read-only, click to copy) |
---|---|
Package Base: | ucto |
Description: | An advanced rule-based (regular-expression) and unicode-aware tokenizer for various languages. Tokenization is an essential first step in any NLP pipeline. |
Upstream URL: | https://languagemachines.github.io/ucto |
Keywords: | nlp tokenization tokenizer |
Licenses: | GPL3 |
Submitter: | proycon |
Maintainer: | proycon |
Last Packager: | proycon |
Votes: | 1 |
Popularity: | 0.000000 |
First Submitted: | 2014-11-28 18:58 (UTC) |
Last Updated: | 2023-02-22 11:47 (UTC) |
Dependencies (8)
- icu (icu-git-static, icu-git)
- libfolia (libfolia-git)
- libxml2 (libxml2-git)
- ticcutils (ticcutils-git)
- uctodata (uctodata-git)
- autoconf (autoconf-git) (make)
- autoconf-archive (autoconf-archive-git) (make)
- libtool (libtool-git) (make)