Package Details: ucto 0.28.1-1

Git Clone URL: (read-only)
Package Base: ucto
Description: An advanced rule-based (regular-expression) and Unicode-aware tokenizer for various languages. Tokenization is an essential first step in any NLP pipeline.
Upstream URL:
Keywords: nlp tokenization tokenizer
Licenses: GPL3
Submitter: proycon
Maintainer: proycon
Last Packager: proycon
Votes: 1
Popularity: 0.000000
First Submitted: 2014-11-28 18:58 (UTC)
Last Updated: 2023-02-22 11:47 (UTC)