python-pynlpl-git
|
1324-3 |
3 |
0.00
|
Python Natural Language Processing Library (pronounce as: pineapple). Contains various modules useful for common, and less common, NLP tasks. Includes full FoLiA library. |
proycon
|
2021-04-02 13:33 (UTC) |
libfolia
|
2.20-1 |
3 |
0.00
|
C++ library for FoLiA (Format for Linguistic Annotation) |
proycon
|
2024-09-12 11:11 (UTC) |
foliautils-git
|
1-2 |
1 |
0.00
|
Tools for working with the FoLiA format, based on libfolia. *NOT* the same as Python package FoLiA-tools! |
proycon
|
2022-06-12 18:14 (UTC) |
python-textblob-git
|
0.17.1.r1.g9945064-1 |
3 |
0.00
|
Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more. |
ttc0419
|
2023-04-21 16:40 (UTC) |
python-evaluate
|
0.4.3-1 |
1 |
0.00
|
HuggigFace library for easily evaluating machine learning models and datasets |
daskol
|
2024-09-21 23:03 (UTC) |
ucto
|
0.34-1 |
1 |
0.00
|
An advanced rule-based (regular-expression) and unicode-aware tokenizer for various languages. Tokenization is an essential first step in any NLP pipeline. |
proycon
|
2024-09-12 09:57 (UTC) |
tomita-parser
|
1.0.73-1 |
0 |
0.00
|
GLR parser |
wagn3r
|
2017-06-09 06:44 (UTC) |
timbl
|
6.9-2 |
3 |
0.00
|
Tilburg Memory-Based Learner, implementations of k-nearest neighbour classification |
proycon
|
2024-07-03 14:17 (UTC) |
python-textacy
|
0.12.0-1 |
0 |
0.00
|
A Python library for performing a variety of natural language processing (NLP) tasks, built on the high-performance spaCy library. |
Cycatz
|
2022-04-24 15:39 (UTC) |
python-svgling
|
0.3.1-1 |
0 |
0.00
|
linguistics tree drawing to SVG in python, aimed at Jupyter |
Cycatz
|
2022-04-10 16:33 (UTC) |
python-sacrebleu
|
2.3.0-1 |
0 |
0.00
|
Reference BLEU implementation that auto-downloads test sets |
daskol
|
2023-10-21 22:02 (UTC) |
python-gensim
|
4.3.3-1 |
16 |
0.00
|
Library for topic modelling, document indexing and similarity retrieval with large corpora |
edh
|
2024-07-31 04:01 (UTC) |
mbt
|
3.10-2 |
2 |
0.00
|
Memory-based tagger-generator and tagger in one. |
proycon
|
2024-07-03 14:24 (UTC) |
manatee
|
2.151.5-1 |
0 |
0.00
|
Corpus management tool including corpus building and indexing, fast querying and providing basic statistical measures |
lenoch
|
2017-08-11 15:58 (UTC) |
juman++
|
1.02-1 |
0 |
0.00
|
Morphological Analyzer for Japanese |
orphan
|
2021-05-14 14:52 (UTC) |
juman
|
7.01-2 |
2 |
0.00
|
Morphological Analyzer for Japanese |
orphan
|
2021-05-14 14:16 (UTC) |
frog
|
0.33-2 |
1 |
0.00
|
Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. It includes a tokenizer, part-of-speech tagger, lemmatizer, morphological analyser, named entity recognition, shallow parser and dependency parser. |
proycon
|
2024-07-03 14:34 (UTC) |
finlib
|
2.36.5-1 |
0 |
0.00
|
Fast indexing library for the Manatee corpus management tool (a part of NoSketch Engine) |
lenoch
|
2017-08-11 14:10 (UTC) |