Package Details: uctodata 0.11-1

Git Clone URL: https://aur.archlinux.org/uctodata.git (read-only)
Package Base: uctodata
Description: An advanced rule-based (regular-expression) and Unicode-aware tokenizer for various languages. Tokenization is an essential first step in any NLP pipeline. This package contains the necessary data.
Upstream URL: https://languagemachines.github.io/ucto
Licenses: GPL3
Submitter: proycon
Maintainer: proycon
Last Packager: proycon
Votes: 0
Popularity: 0.000000
First Submitted: 2016-07-11 16:57 (UTC)
Last Updated: 2024-05-17 09:16 (UTC)