2 packages found. Page 1 of 1.

Name Version Votes Popularity? Description Maintainer Last Updated
uctodata 0.11-1 0 0.00 An advanced rule-based (regular-expression) and unicode-aware tokenizer for various languages. Tokenization is an essential first step in any NLP pipeline. This package contains the necessary data. proycon 2024-05-17 09:16 (UTC)
ucto 0.33-1 1 0.00 An advanced rule-based (regular-expression) and unicode-aware tokenizer for various languages. Tokenization is an essential first step in any NLP pipeline. proycon 2024-05-17 09:16 (UTC)

2 packages found. Page 1 of 1.