speech to text ubuntu

Speech Note Transcribes Voice to Text on Linux

' src=

  • Posted by by Scott Bouvier
  • August 28, 2023

Speech Note is an offline, AI-powered app able to transcribe your speech into text in a variety of different languages.

A reader got in touch to point me towards the app — thanks, David! — and given that it sounds pretty cool I figured I’d give it a spotlight on the site.

Speech Note use OpenAI’s Whisper and a stack of other open-source libraries, voice engines, and other doohickeys to perform its transliterative magic.

It supports Speech to Text (i.e you speak, it types), Text to Speech (i.e. you type, it speaks), and machine translation to translate text/speech from one language to another.

speech to text ubuntu

“Text and voice processing take place entirely offline, locally on your computer, without using a network connection. Your privacy is always respected. No data is sent to the Internet,” assures the application’s Flathub listing.

Those with a supported GPU will want to turn GPU acceleration on as will hugely improve processing times (which are on the slow side if only using CPU processing).

Speech Note is a 620MB download from Flathub (excluding any runtimes or platforms required) and takes up around ~2GB when installed – if you’re data or disk constrained, do keep those factors in mind.

• Get Speech Note on Flathub

Home > Apps > Speech Note Transcribes Voice to Text on Linux

' src=

Scott Bouvier

An international man of mystery, Scott enjoys personal computing, mobile technology, and outdoor pursuits. He once went wing-walking on a biplane over London – scary, huh?

Suramya's Blog : Welcome to my crazy life…

January 21, 2022, nerd-dictation: a fantastic open source speech to text software for linux.

After a long time of searching I finally found a speech to text software for Linux that actually works well enough that I can use it for dictating without having to jump through too many hoops to configure and use. The software is called nerd-dictation and is an open source software. It is fairly easy to setup as compared to the other voice-to-text systems that are available but still not at a stage where a non-tech savvy person would be able to install it easily. (There is effort ongoing to fix that)

The steps to install are fairly simple and documented below for reference:

  • pip3 install vosk
  • git clone https://github.com/ideasman42/nerd-dictation.git
  • cd nerd-dictation
  • wget https://alphacephei.com/kaldi/models/vosk-model-small-en-us-0.15.zip
  • unzip vosk-model-small-en-us-0.15.zip
  • mv vosk-model-small-en-us-0.15 model

nerd-dictation allows you to dictate text into any software or editor which is open so I can dictate into a word document or a blog post or even the command prompt. Previously I have used tried using software like otter.ai which actually works quite well but doesn’t allow you to edit the text as you’re typing, so you basically dictate the whole thing and the system gives you the transcription after you are done. So, you have to go back and edit/correct the transcript which can be a pain for long dictations. This software works more like Microsoft dictate which is built into Word. Unfortunately my word install on Linux using Crossover doesn’t allow me to use the built in dictate function and I have no desire to boot into windows just so that I can dictate a document.

This downloads the software in the current directory. I set it up on /usr/local but it is up to you where you want it. In addition, I would recommend that you install one of the larger dictionaries/models which makes the voice recognition a lot more accurate. However, do keep in mind that the larger models use up a lot more memory so you need to ensure that your computer has enough memory to support the larger models. The smaller ones can run on systems as small as a raspberry pi, so depending on your system configuration you can choose. The models are available here .

The software does have some quirks, like when you are talking and you pause it will take it as a start of a new sentence and for some reason it doesn’t put a space after the last word. So unless you’re careful you need to go back and add spaces to all the sentences that you have dictated, which can get annoying. (I started manually pressing space everytime I paused to add the space). Another issue is that it doesn’t automatically capitalize the words when you dictate such as those at the beginning of the sentence or the word ‘I’. This requires you to go back and edit, but that being said it still works a lot better than the other software that I have used so far on Linux. For Windows system Dragon Voice Dictation works quite well but is expensive. I tested it out by typing out this post using it and for the most part it does work it worked quite well.

Running the software again requires you to run commands on the commandline, but I configured shortcut keys to start and stop the dictation which makes it very convenient to use. Instructions on how to configure custom shortcut keys are available here . If you don’t want to do that, then you can start the transcription by issuing the following command (assuming the software is installed in /usr/local/nerd-dictation):

This starts the software and tells it that we are going to dictate for a long time. More details on the options available are available on the project site. To stop the software you should run the following command:

I suggest you try this if you are looking for a speech-to-text software for Linux. Well this is all for now. Will post more later.

Thanks to Hacker News: Nerd-dictation, hackable speech to text on Linux for the link.

– Suramya

No Comments »

No comments yet.

RSS feed for comments on this post. TrackBack URL

Leave a comment

Name (required)

Mail (will not be published) (required)

XHTML: You can use these tags: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <s> <strike> <strong>

  • Article Releases
  • Astronomy / Space
  • Reviews-Fantasy
  • Reviews-Paranormal
  • Reviews-Romance
  • Reviews-Science Fiction
  • Reviews-Thriller
  • Reviews-Urban Fantasy
  • Reviews-Young Adult Fantasy
  • Computer Hardware
  • Security Tools
  • Security Tutorials
  • Computer Software
  • Computer Tips
  • Artificial Intelligence
  • Quantum Computing
  • General/News
  • Interesting Sites
  • Knowledgebase
  • Linux/Unix Related
  • My Thoughts
  • Computer Related
  • Science Related
  • Tech Related
  • Travel/Trips
  • Uncategorized
  • Website Updates
  • Search for:
  • Suramya on Trip to Historic Hampi – Part 1 : “ Hey, Yes, you should be able to cover the ruins in a single day. We did a relatively relaxed visit… ” Jul 11, 03:35
  • Ami on Trip to Historic Hampi – Part 1 : “ U cover the hampi ruins on day 1 and 2 We have lesser days so can we do it in… ” Jul 10, 16:10
  • Suramya on Getting my Pocket C.H.I.P. to finally work : “ All the best! Do share your experience if you manage to get Raspbian running on the system. – Suramya ” May 25, 05:21
  • Dave on Getting my Pocket C.H.I.P. to finally work : “ Thanks! Was hoping to get my old one running again!! Will try to update it to maybe a rasbian flavor.… ” May 24, 21:01
  • September 2024
  • August 2024
  • February 2024
  • January 2024
  • December 2023
  • November 2023
  • October 2023
  • September 2023
  • August 2023
  • February 2023
  • January 2023
  • December 2022
  • November 2022
  • October 2022
  • September 2022
  • August 2022
  • January 2022
  • September 2021
  • August 2021
  • February 2021
  • January 2021
  • November 2020
  • October 2020
  • September 2020
  • August 2020
  • January 2020
  • December 2019
  • November 2019
  • October 2019
  • September 2019
  • August 2019
  • January 2019
  • September 2018
  • August 2018
  • February 2018
  • January 2018
  • December 2017
  • October 2017
  • September 2016
  • February 2016
  • January 2016
  • October 2015
  • September 2015
  • August 2015
  • January 2015
  • December 2014
  • November 2014
  • October 2014
  • September 2014
  • February 2013
  • November 2012
  • October 2012
  • August 2012
  • February 2012
  • January 2012
  • December 2011
  • November 2011
  • October 2011
  • September 2011
  • December 2010
  • October 2010
  • September 2010
  • August 2010
  • February 2010
  • January 2010
  • December 2009
  • November 2009
  • October 2009
  • September 2009
  • August 2009
  • February 2009
  • January 2009
  • December 2008
  • November 2008
  • October 2008
  • September 2008
  • August 2008
  • February 2008
  • January 2008
  • December 2007
  • November 2007
  • October 2007
  • September 2007
  • August 2007
  • February 2007
  • January 2007
  • December 2006
  • November 2006
  • October 2006
  • September 2006
  • August 2006
  • February 2006
  • January 2006
  • December 2005
  • November 2005
  • October 2005
  • September 2005
  • August 2005
  • February 2005
  • January 2005
  • December 2004
  • November 2004
  • October 2004
  • Entries feed
  • Comments feed
  • WordPress.org

Powered by WordPress

IMAGES

  1. Text to Speech on Ubuntu [1 minute tutorial]

    speech to text ubuntu

  2. How to Install eSpeak Text to Speech Software on Ubuntu 22.04

    speech to text ubuntu

  3. Speech Recognition to Text in Linux, Ubuntu using Google Docs

    speech to text ubuntu

  4. Ubuntu Speech-to-Text Tutorial

    speech to text ubuntu

  5. GitHub

    speech to text ubuntu

  6. Text To Speech in Ubuntu

    speech to text ubuntu

VIDEO

  1. Change Boot Screen (Plymouth Theme) ~ Ubuntu 10.10

  2. UBUNTU

  3. Ubuntu philosophy.

  4. Things To Remove/Install After Upgrading to Gnome 3 (Gnome Shell) ~ Ubuntu 11.04

  5. Speech to Text Voice Recognition Dragon NaturallySpeaking on Linux

  6. Mark Shuttleworth Keynote