AUR (en) - Packages

Search Criteria

Enter search criteria

Search by

Keywords

Out of Date

Sort by

Sort order

Per page

20 packages found. Page 1 of 1.

Name	Version	Votes	Popularity^?	Description	Maintainer	Last Updated
uctodata	0.10.1-1	0	0.00	An advanced rule-based (regular-expression) and unicode-aware tokenizer for various languages. Tokenization is an essential first step in any NLP pipeline. This package contains the necessary data.	proycon	2024-04-16 08:22 (UTC)
ucto	0.32.1-1	1	0.00	An advanced rule-based (regular-expression) and unicode-aware tokenizer for various languages. Tokenization is an essential first step in any NLP pipeline.	proycon	2024-03-21 11:50 (UTC)
timblserver	1.18-1	2	0.00	Tilburg Memory Based Learner Server.	proycon	2023-12-05 14:52 (UTC)
frog	0.32-1	1	0.00	Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. It includes a tokenizer, part-of-speech tagger, lemmatizer, morphological analyser, named entity recognition, shallow parser and dependency parser.	proycon	2023-12-05 14:52 (UTC)
frogdata	0.22-1	1	0.00	Data for Frog. Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch.	proycon	2023-10-31 11:01 (UTC)
mbt	3.10-1	2	0.00	Memory-based tagger-generator and tagger in one.	proycon	2023-10-31 11:00 (UTC)
timbl	6.9-1	3	0.00	Tilburg Memory-Based Learner, implementations of k-nearest neighbour classification	proycon	2023-10-31 11:00 (UTC)
libfolia	2.17-1	3	0.10	C++ library for FoLiA (Format for Linguistic Annotation)	proycon	2023-10-31 10:59 (UTC)
ticcutils	0.34-1	4	0.00	Common library with functions for tools developed at Tilburg Centre for Cognition and Communication (Tilburg University) and Centre for Language and Speech Technology (Radboud University Nijmegen)	proycon	2023-10-31 10:58 (UTC)
foliautils	0.20-1	0	0.00	Tools for working with the FoLiA format, based on libfolia. NOT the same as Python package FoLiA-tools!	proycon	2023-03-13 21:47 (UTC)
mbtserver	0.16-1	2	0.00	Memory-based tagger-generator and tagger server.	proycon	2023-01-03 11:37 (UTC)
python-frog-git	1-2	1	0.00	Python binding for Frog, a NLP suite for Dutch containing a part-of-speech tagger, lemmatizer, morphological analyser, named entity recognition, shallow parser and dependency parser	proycon	2022-06-12 18:22 (UTC)
mbt-git	1-3	1	0.00	Memory-based tagger-generator and tagger in one.	proycon	2022-06-12 18:21 (UTC)
toad-git	1-3	0	0.00	Toad: Trainer Of All Data, the Frog training collection	proycon	2022-06-12 18:20 (UTC)
foliautils-git	1-2	1	0.10	Tools for working with the FoLiA format, based on libfolia. NOT the same as Python package FoLiA-tools!	proycon	2022-06-12 18:14 (UTC)
toad	0.7-1	0	0.00	Toad: Trainer Of All Data, the Frog training collection	proycon	2021-12-19 23:24 (UTC)
python-folia-git	479-3	1	0.00	Command line tools for dealing with the FoLiA format (Format for Linguistic Annotation).	proycon	2021-04-02 13:34 (UTC)
python-pynlpl-git	1324-3	3	0.10	Python Natural Language Processing Library (pronounce as: pineapple). Contains various modules useful for common, and less common, NLP tasks. Includes full FoLiA library.	proycon	2021-04-02 13:33 (UTC)
ticcltools	0.7.1-1	0	0.00	Tools for TICCL: A spelling normalisation engine	proycon	2020-09-15 19:02 (UTC)
vocage-git	1.0.0.42.g86f9c6f-1	0	0.00	A minimalistic terminal-based vocabulary learner or flashcard tool, using a spaced repetition algorithm.	proycon	2020-09-13 12:31 (UTC)

20 packages found. Page 1 of 1.

Arch Linux User Repository

Search Criteria