Speech to text software in repos?

Is there a software package in the repositories that will allow me to speak and have it write out text in whatever app has the focus. For example, suppose i have kate open and it has the focus, then i say “hello how are you” and see it typed into kate at the cursor.

No. No ready solution. A couple of years ago I needed to search through many podcasts by text and I was looking for a speech to text solution. I didn’t find an easy straight forward solution but I did find Vosk. There are a couple of community repositories in build.o.o that seem to have tried to build Vosk but failed. I downloaded Vosk from github and used it in my project and it worked fine.

Thanks for the info, it seems it only supports python version 3.5 - 3.9
What version of python do you have?
$ python3 --version
Python 3.11.5
$ pip3 --version
pip 23.2.1 from /usr/lib/python3.11/site-packages/pip (python 3.11)

1 Like

@erbenton Maybe https://www.rev.com/blog/resources/the-5-best-open-source-speech-recognition-engines-apis?

https://github.com/kaldi-asr/kaldi