frog
|
0.32-1 |
1 |
0.00
|
Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. It includes a tokenizer, part-of-speech tagger, lemmatizer, morphological analyser, named entity recognition, shallow parser and dependency parser. |
proycon
|
2023-12-05 14:52 (UTC) |
sayit
|
1.5-1 |
1 |
0.00
|
A text-to-speech command line tool backed by Azure Cognitive Services. |
skypixel
|
2023-11-18 16:14 (UTC) |
opensmile
|
3.0.2-2 |
3 |
0.00
|
A fast, real-time (audio) feature extraction utility for automatic speech, music and paralinguistic recognition research |
agkphysics
|
2023-11-06 05:54 (UTC) |
flite-unpatched
|
2.2-1 |
0 |
0.00
|
A lightweight speech synthesis engine (built without Arch linux patches) |
Teyras
|
2023-11-01 16:07 (UTC) |
ticcutils
|
0.34-1 |
4 |
0.00
|
Common library with functions for tools developed at Tilburg Centre for Cognition and Communication (Tilburg University) and Centre for Language and Speech Technology (Radboud University Nijmegen) |
proycon
|
2023-10-31 10:58 (UTC) |
libdf-git
|
v0.5.6.r10.g59789e1-1 |
1 |
0.57
|
A Low Complexity Speech Enhancement Framework for Full-Band Audio (48kHz) using Deep Filtering (Git version) - core library |
openglfreak
|
2023-09-16 19:55 (UTC) |
libdeep_filter_ladspa-git
|
v0.5.6.r10.g59789e1-1 |
1 |
0.57
|
A Low Complexity Speech Enhancement Framework for Full-Band Audio (48kHz) using Deep Filtering (Git version) - ladspa plugin |
openglfreak
|
2023-09-16 19:55 (UTC) |
deepfilternet-demos-git
|
v0.5.6.r10.g59789e1-1 |
1 |
0.57
|
A Low Complexity Speech Enhancement Framework for Full-Band Audio (48kHz) using Deep Filtering (Git version) - demo application |
openglfreak
|
2023-09-16 19:55 (UTC) |
libdeep_filter_ladspa-bin
|
0.5.6-1 |
5 |
1.44
|
A Low Complexity Speech Enhancement Framework for Full-Band Audio (48kHz) using on Deep Filtering (LASPDA) |
rvasilev
|
2023-09-16 18:43 (UTC) |
nerd-dictation-git
|
0.0.r155.1d52c1d-1 |
6 |
0.29
|
Simple, hackable offline speech to text - using the VOSK-API. |
majewsky
|
2023-08-29 15:55 (UTC) |
lib32-opencore-amr
|
0.1.6-1 |
18 |
0.00
|
Open source implementation of the Adaptive Multi Rate (AMR) speech codec, lib32 |
autinerd
|
2023-08-13 14:43 (UTC) |
codec2-lpcnet
|
1:1.2.0-1 |
0 |
0.00
|
Open source speech codec designed for communications quality speech between 450 and 3200 bit/s with support for LPCNet |
ra1nb0w
|
2023-07-26 07:24 (UTC) |
python-speechbrain-git
|
0.5.14-1 |
0 |
0.00
|
All-in-one speech toolkit in pure Python and Pytorch |
lumaku
|
2023-07-23 18:28 (UTC) |
python-espnet-git
|
202304-1 |
0 |
0.00
|
End-to-End Speech Processing Toolkit Python Package |
lumaku
|
2023-07-23 18:27 (UTC) |
espeak
|
1:1.48.04-4 |
10 |
0.00
|
Text to Speech engine for English, with support for other languages |
SanskritFritz
|
2023-07-23 16:43 (UTC) |
mimic-voices
|
1.0.0-1 |
0 |
0.00
|
Voice models for Mimic 3 text to speech system |
AlphaJack
|
2023-06-30 16:53 (UTC) |
mimic
|
0.2.4-3 |
20 |
0.00
|
A fast, local, neural text to speech system for Mycroft |
AlphaJack
|
2023-06-30 11:23 (UTC) |
aspeak-bin
|
6.0.0-1 |
2 |
0.04
|
A simple text-to-speech client for Azure TTS API |
kxxt
|
2023-06-29 00:59 (UTC) |
aspeak
|
6.0.0-1 |
1 |
0.10
|
A simple text-to-speech client for Azure TTS API |
kxxt
|
2023-06-29 00:58 (UTC) |
ekho
|
8.9.3-1 |
11 |
0.00
|
Multilingual text-to-speech (TTS) software for Cantonese, Mandarin, Toisanese, Zhaoan Hakka, Tibetan, Ngangien, Korean and English |
malacology
|
2023-06-25 09:48 (UTC) |
flite1
|
1.4-6 |
42 |
0.57
|
A lighweight speech synthesis engine (version 1.x) |
dbermond
|
2023-06-20 22:33 (UTC) |
python-google-speak
|
0.2.1-1 |
0 |
0.00
|
Simple class to create speech files using Google Translate URL |
carlosal1015
|
2023-05-30 05:35 (UTC) |
lpcnetfreedv
|
0.5-1 |
0 |
0.00
|
Experimental Neural Net speech coding for FreeDV |
ra1nb0w
|
2023-05-06 05:37 (UTC) |
python-textblob-git
|
0.17.1.r1.g9945064-1 |
3 |
0.00
|
Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more. |
ttc0419
|
2023-04-21 16:40 (UTC) |
speex-git
|
1.2.1.r8.gf39602d-1 |
0 |
0.00
|
An patent-free audio compression format designed for speech |
Chocobo1
|
2023-04-13 08:39 (UTC) |
python2-pyttsx
|
1.1-1 |
0 |
0.00
|
pyttsx - cross platform text-to-speech |
mai
|
2023-04-11 20:41 (UTC) |
google-lyra
|
1.3.2-2 |
0 |
0.00
|
A very low-bitrate codec for speech compression |
Chocobo1
|
2023-04-10 15:20 (UTC) |
google-lyra-git
|
1.3.2.r0.g47698da-1 |
0 |
0.00
|
A very low-bitrate codec for speech compression |
Chocobo1
|
2023-04-10 14:50 (UTC) |
obs-captions-plugin-bin
|
0.28-1 |
1 |
0.04
|
Standalone OBS Studio plugin providing closed captioning via Google Cloud Speech Recognition API |
Freso
|
2023-04-04 22:25 (UTC) |
espeak-ng-git
|
1.51.r708.gff46761d-1 |
11 |
0.00
|
Multi-lingual software speech synthesizer (development version) |
kyle
|
2023-03-23 20:12 (UTC) |
python-speechrecognition
|
3.9.0-1 |
3 |
0.00
|
Google-powered speech recognition for Python |
orphan
|
2023-03-07 19:02 (UTC) |
python-google-cloud-texttospeech
|
2.14.1-1 |
0 |
0.00
|
A google cloud speech api for python to convert text to audio. |
gee
|
2023-02-26 08:09 (UTC) |
subsync
|
0.17-5 |
5 |
0.00
|
Subtitle Speech Synchronizer |
orphan
|
2023-02-24 16:26 (UTC) |
gosling-git
|
r15.31f87c7-1 |
1 |
0.00
|
Natural sounding text-to-speech in the terminal (and more). |
gmy
|
2023-02-09 11:18 (UTC) |
mimic1
|
1.3.0.1-1 |
0 |
0.00
|
Text-to-speech voice synthesis from the Mycroft project. |
robertfoster
|
2023-01-25 11:03 (UTC) |
serenade.ai
|
2.0.2-1 |
0 |
0.00
|
Serenade is the most powerful way to program using natural speech. |
cedricfarinazzo
|
2023-01-08 14:41 (UTC) |
pytranscriber-bin
|
1.9-1 |
4 |
0.03
|
UI for generating transcription (subtitles) using Google Speech Recognition (X11) |
manni
|
2023-01-01 18:00 (UTC) |
python-vosk-bin
|
0.3.45-1 |
2 |
0.00
|
Offline open source speech recognition API based on Kaldi and Vosk |
danielquinn
|
2022-12-18 12:25 (UTC) |
vosk-api-bin
|
0.3.45-1 |
1 |
0.01
|
Offline speech recognition toolkit |
hamblingreen
|
2022-12-16 20:41 (UTC) |
vosk-api-git
|
0.3.45.r0.gcf2560c-1 |
2 |
0.00
|
Offline speech recognition toolkit (git version) |
dbermond
|
2022-12-15 21:09 (UTC) |
libsonic-git
|
0.2.0_53+r182.20180706.71c5119-1 |
1 |
0.00
|
Simple library to speed up or slow down speech |
stormdragon2976
|
2022-11-28 16:15 (UTC) |
rhvoice-git
|
1.8.0.r73.a4481436-1 |
23 |
0.00
|
Free and open source speech synthesizer for Russian and other languages. (development version) |
vantu5z
|
2022-10-22 16:59 (UTC) |
nvda2speechd-bin
|
0.1-1 |
0 |
0.00
|
A bridge between Windows applications that speak through NVDA and Speech dispatcher |
jticket1024
|
2022-09-15 16:31 (UTC) |
deepspeech-models-zh-cn
|
0.9.3-1 |
0 |
0.00
|
A TensorFlow implementation of Baidu's DeepSpeech architecture - models and supporting files. |
orphan
|
2022-09-06 16:19 (UTC) |
deepspeech-models
|
0.9.3-1 |
2 |
0.00
|
A TensorFlow implementation of Baidu's DeepSpeech architecture - models and supporting files. |
orphan
|
2022-09-06 16:04 (UTC) |
deepspeech-bin
|
0.9.3-1 |
2 |
0.00
|
A TensorFlow implementation of Baidu's DeepSpeech architecture - C++ native client + devel files. |
orphan
|
2022-09-06 15:46 (UTC) |
voicegen
|
1.6.3-1 |
1 |
0.00
|
Convert text to speech using multiple engines |
Master81
|
2022-08-09 19:47 (UTC) |
mingw-w64-gsm
|
1.0.22-1 |
5 |
0.00
|
Shared libraries for GSM 06.10 lossy speech compression (mingw-w64) |
kfg
|
2022-08-09 17:21 (UTC) |
lib32-gsm
|
1.0.22-1 |
36 |
0.00
|
Shared libraries for GSM 06.10 lossy speech compression |
RavuAlHemio
|
2022-08-08 15:24 (UTC) |
python-lhotse-git
|
1.5.0.dev0+git.08a613a0.clean-1 |
0 |
0.00
|
Speech and audio data preparation toolkit |
lumaku
|
2022-08-07 12:58 (UTC) |