1 package found. Page 1 of 1.

Name Version Votes Popularity? Description Maintainer Last Updated
ucto 0.32.1-1 1 0.00 An advanced rule-based (regular-expression) and unicode-aware tokenizer for various languages. Tokenization is an essential first step in any NLP pipeline. proycon 2024-03-21 11:50 (UTC)

1 package found. Page 1 of 1.