2 packages found. Page 1 of 1.

Name Version Votes Popularity? Description Maintainer Last Updated
uctodata 0.10.1-1 0 0.00 An advanced rule-based (regular-expression) and unicode-aware tokenizer for various languages. Tokenization is an essential first step in any NLP pipeline. This package contains the necessary data. proycon 2024-04-16 08:22 (UTC)
ucto 0.32.1-1 1 0.00 An advanced rule-based (regular-expression) and unicode-aware tokenizer for various languages. Tokenization is an essential first step in any NLP pipeline. proycon 2024-03-21 11:50 (UTC)

2 packages found. Page 1 of 1.