Package Details: uctodata 0.8-1

Git Clone URL: https://aur.archlinux.org/uctodata.git (read-only)
Package Base: uctodata
Description: An advanced rule-based (regular-expression) and unicode-aware tokenizer for various languages. Tokenization is an essential first step in any NLP pipeline. This package contains the necessary data.
Upstream URL: https://languagemachines.github.io/ucto
Licenses: GPL3
Submitter: proycon
Maintainer: proycon
Last Packager: proycon
Votes: 0
Popularity: 0.000000
First Submitted: 2016-07-11 16:57
Last Updated: 2018-12-07 17:37