36 packages found. Page 1 of 1.

Name Version Votes Popularity Description Maintainer Last Updated
foliautils 0.20-1 0 0.00 Tools for working with the FoLiA format, based on libfolia. *NOT* the same as Python package FoLiA-tools! proycon 2023-03-13 21:47 (UTC)
ucto 0.28.1-1 1 0.00 An advanced rule-based (regular-expression) and unicode-aware tokenizer for various languages. Tokenization is an essential first step in any NLP pipeline. proycon 2023-02-22 11:47 (UTC)
frog 0.27.1-1 1 0.00 Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. It includes a tokenizer, part-of-speech tagger, lemmatizer, morphological analyser, named entity recognition, shallow parser and dependency parser. proycon 2023-02-22 11:47 (UTC)
libfolia 2.14-1 2 0.00 C++ library for FoLiA (Format for Linguistic Annotation) proycon 2023-02-21 08:59 (UTC)
ticcutils 0.32-1 4 0.00 Common library with functions for tools developed at Tilburg Centre for Cognition and Communication (Tilburg University) and Centre for Language and Speech Technology (Radboud University Nijmegen) proycon 2023-02-21 08:59 (UTC)
timbl 6.8.1-1 3 0.00 Tilburg Memory-Based Learner, implementations of k-nearest neighbour classification proycon 2023-01-04 16:59 (UTC)
mbtserver 0.16-1 2 0.00 Memory-based tagger-generator and tagger server. proycon 2023-01-03 11:37 (UTC)
timblserver 1.16-1 2 0.00 Tilburg Memory Based Learner Server. proycon 2023-01-03 11:36 (UTC)
mbt 3.9-1 2 0.00 Memory-based tagger-generator and tagger in one. proycon 2023-01-03 11:35 (UTC)
frogdata 0.21-1 1 0.00 Data for Frog. Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. proycon 2022-07-22 10:10 (UTC)
uctodata 0.9.1-1 0 0.00 An advanced rule-based (regular-expression) and unicode-aware tokenizer for various languages. Tokenization is an essential first step in any NLP pipeline. This package contains the necessary data. proycon 2022-07-22 09:43 (UTC)
python-frog-git 1-2 1 0.00 Python binding for Frog, an NLP suite for Dutch containing a part-of-speech tagger, lemmatizer, morphological analyser, named entity recognition, shallow parser and dependency parser proycon 2022-06-12 18:22 (UTC)
mbt-git 1-3 1 0.00 Memory-based tagger-generator and tagger in one. proycon 2022-06-12 18:21 (UTC)
toad-git 1-3 0 0.00 Toad: Trainer Of All Data, the Frog training collection proycon 2022-06-12 18:20 (UTC)
foliautils-git 1-2 0 0.00 Tools for working with the FoLiA format, based on libfolia. *NOT* the same as Python package FoLiA-tools! proycon 2022-06-12 18:14 (UTC)
toad 0.7-1 0 0.00 Toad: Trainer Of All Data, the Frog training collection proycon 2021-12-19 23:24 (UTC)
python-folia-git 479-3 1 0.00 Command line tools for dealing with the FoLiA format (Format for Linguistic Annotation). proycon 2021-04-02 13:34 (UTC)
python-pynlpl-git 1324-3 2 0.00 Python Natural Language Processing Library (pronounce as: pineapple). Contains various modules useful for common, and less common, NLP tasks. Includes full FoLiA library. proycon 2021-04-02 13:33 (UTC)
ticcltools 0.7.1-1 0 0.00 Tools for TICCL: A spelling normalisation engine proycon 2020-09-15 19:02 (UTC)
vocage-git 1.0.0.42.g86f9c6f-1 0 0.00 A minimalistic terminal-based vocabulary learner or flashcard tool, using a spaced repetition algorithm. proycon 2020-09-13 12:31 (UTC)
mbtserver-git 1-1 0 0.00 Memory Based Tagger Server proycon 2017-04-06 12:17 (UTC)
ucto-git 1-3 2 0.00 An advanced rule-based (regular-expression) and unicode-aware tokenizer for various languages. Tokenization is an essential first step in any NLP pipeline. proycon 2016-07-11 17:16 (UTC)
uctodata-git 1-1 0 0.00 An advanced rule-based (regular-expression) and unicode-aware tokenizer for various languages. Tokenization is an essential first step in any NLP pipeline. These are the data files. proycon 2016-07-11 17:14 (UTC)
ticcltools-git 16-1 0 0.00 Tools for TICCL: A spelling normalisation engine proycon 2016-06-02 10:09 (UTC)
ticcutils-git 201-1 1 0.00 Common library with functions for tools developed at Tilburg Centre for Cognition and Communication (Tilburg University) and Centre for Language and Speech Technology (Radboud University Nijmegen) proycon 2016-02-10 16:58 (UTC)
frog-git 1-4 1 0.00 Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. It includes a tokenizer, part-of-speech tagger, lemmatizer, morphological analyser, named entity recognition, shallow parser and dependency parser. proycon 2016-02-02 16:03 (UTC)
wopr-git 1-2 1 0.00 Memory Based Word Predictor/Language Model proycon 2015-11-23 22:04 (UTC)
libfolia-git 1-2 1 0.00 C++ library for FoLiA (Format for Linguistic Annotation) proycon 2015-11-23 22:02 (UTC)
frogdata-git 1-2 1 0.00 Data for Frog. Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. proycon 2015-11-23 22:01 (UTC)
timblserver-git 1-3 1 0.00 Tilburg Memory Based Learner Server. proycon 2015-11-23 21:59 (UTC)
timbl-git 1-4 1 0.00 Tilburg Memory-Based Learner, implementations of k-nearest neighbour classification proycon 2015-11-23 21:59 (UTC)
python-ucto-git 10-1 1 0.00 Python binding for Ucto, an advanced tokenizer (for NLP) proycon 2015-06-21 10:54 (UTC)
python-timbl-git 48-1 2 0.00 Python binding for Timbl, a k-Nearest Neighbours machine learning suite proycon 2015-06-21 10:54 (UTC)
python2-ucto-git 10-1 1 0.00 Python binding for Ucto, an advanced tokenizer (for NLP) proycon 2015-06-21 10:53 (UTC)
python2-timbl-git 41-3 1 0.00 Python binding for Timbl, a k-Nearest Neighbours machine learning suite proycon 2015-06-21 10:53 (UTC)
colibri-core-git 735-2 1 0.00 Colibri Core is a set of command-line tools as well as a C++ library (with Python binding) for NLP. It enables you to work with basic linguistic constructions such as n-grams and skipgrams in a quick and memory-efficient way. proycon 2015-06-21 10:53 (UTC)
