Go to a document or other text field and place the insertion point where you want your dictated text to appear. Press the keyboard shortcut for starting dictation, or choose Edit > Start Dictation. The default shortcut is Fn Fn (press the Fn key twice). On the Mac and Windows the 'desktop app' is more than just a shortcut to the website. Disconnect your wifi and close your default browser (set to clear cookies, offline data, active logins, cache, site preferences, browsing and download history).
Looking for the best free Windows software for speech to text? The most-repeated paid recommendation is Dragon Naturally Speaking (DNS). But some might scoff at paying money for software. Fortunately, there are several great free programs out there like Google Docs Voice Typing (GDVT) and Windows Speech Recognition (WSR).
Dragon NaturallySpeaking Basics 13 (Discontinued)Dragon NaturallySpeaking Basics 13 (Discontinued) Buy Now On Amazon $25.00
You might wonder how these two products compare against DNS — and whether or not DNS meets your needs. For this article, I’ve identified three kinds of users: those who need speech to text transcription for writing novels, those who need academic transcription, and those who write business documents, like memos. To this end, I tested three speech transcription programs (DNS, GDVT, and WSR).
Speech Transcription Setup
Before we head into the test, let’s first look at the recommended hardware and software setup.
Hardware and Software Requirements
Google Docs Voice Typing requires the Chrome Browser and a microphone. And it also needs a persistent internet connection (which isn’t mentioned in the requirements). Otherwise, this is probably the easiest method to get started with speech transcription.
DNS requires a processor made in 2001 or later, Windows 7 or later, and around 4GB of free storage. Its strictest requirement is 2GB of RAM. Here’s a complete list of DNS’s hardware requirements.
Windows Speech RecognitionEnable Speech-to-Text and Voice Control by Setting Up Speech Recognition in WindowsEnable Speech-to-Text and Voice Control by Setting Up Speech Recognition in WindowsSpeech Recognition remains more powerful than Cortana. It drives speech to text and voice control. This article will show you what Speech Recognition can do, how to set it up, train it, and use it.Read More, also known as Speech Platform Runtime (download WSR), costs nothing and requires only a Windows 7 or later operating system.
Microphone Considerations
I use a dynamic microphone (best microphone for podcastingThe Best Podcast Microphone for Your Money in 2017The Best Podcast Microphone for Your Money in 2017Microphones matter a lot when making a podcast. The initial costs are fairly low: recording software, a desktop or laptop, and of course, a microphone. So what's the best podcast microphone?Read More) and a relatively fast Intel Core i7 processor. While a high-quality microphone is desired, it isn’t required. Even so, your results will improve with better sound quality and reduced background noise.
The lowest-priced microphone that I would recommend for high-quality recording is the Audio-Technica ATR-2100. However, the accuracy difference between $5 microphone and a $200 device is pretty minimal.
Audio-Technica ATR2100-USB Cardioid Dynamic USB/XLR MicrophoneAudio-Technica ATR2100-USB Cardioid Dynamic USB/XLR Microphone Buy Now On Amazon $62.99
On the other hand, the minimum requirements for Microsoft Speech are pretty much any remotely modern computer (most computers made in the last ten years) and a microphone. If you own a laptop or tablet made in the last five years, it should have what you need by default.
Configuring Speech Transcription Programs
Here’s how to use Google Voice Typing:
Here’s how to get started with Windows Speech Recognition:
And, finally, here’s how to get started with Dragon Naturally Speaking:
Test Methodology
I want to find the best free Windows-based speech to text application. Because different consumers may need a different product, I’ve devised a simple test. I read three different passages from texts without copyright: one from Charles Darwin’s On the Tendency of Species to Form Varieties. One from H.P. Lovecraft’s Call of Cthulu. The last hails from Jerry Brown’s 2017 State of the Union speech. My methodology is by no means perfect, but it does give an impression of each voice recognition suite’s accuracies.
Fiction Writing Sample (From H.P. Lovecraft’s Call of Cthulu)
“The most merciful thing in the world, I think, is the inability of the human mind to correlate all its contents. We live on a placid island of ignorance in the midst of black seas of infinity, and it was not meant that we should voyage far. The sciences, each straining in its own direction, have hitherto harmed us little; but some day the piecing together of dissociated knowledge will open up such terrifying vistas of reality, and of our frightful position therein, that we shall either go mad from the revelation or flee from the deadly light into the peace and safety of a new dark age.”
Business Writing Sample (Jerry Brown’s 2017 State of the Union speech)
“It is customary on an occasion like this to lay out a specific agenda for the year ahead. Six times before from this rostrum, I have done that, and in some detail. And, as I reread those proposals set forth in previous State of the State speeches, I was amazed to see how much we have accomplished together.”
Academic Writing Sample (Charles Darwin’s On the Tendency of the Species to Form Varieties)
“Now when a variety of such an animal occurs, having increased power or capacity in any organ or sense, such increase is totally useless, is never called into action, and may even exist without the animal ever becoming aware of it. In the wild animal, on the contrary, all its faculties and power being brought into full action for the necessities of existence, any increase becomes immediately available, is strengthened by exercise, and must even slightly modify the food, the habits, and the whole economy of the race.”
3 Voice Transcription Suites Tested
It’s surprising how free voice recognitionThe Best List-Making Apps for Voice CommandsThe Best List-Making Apps for Voice CommandsMany free productivity tools support voice to text for lists. We'll show you the best list-making apps with voice recognition and commands you must try.Read More tests performed against paid software. But at the end of the day, the most accurate app is Dragon Naturally Speaking. However, both Google Voice Typing and Windows Speech Recognition cost nothing and deliver over 90 percent accuracy. But each has its own strengths and weaknesses and you might prefer one over the other.
H.P. Lovecraft (Fiction Writing Test)
Lovecraft loved writing in long, unbroken, parenthetically dense prose. While all three suites do a great job of accurately transcribing Lovecraft’s vocalized text, DNS comes out ahead of its competitors. It includes both capitalization and punctuation (which is completely insane).
DNS: DNS only dropped a single word from the text. Overall, it scored 107 correct out of 108 words. It nailed several long, non-stop sentences as well.
WSR: Windows did a very good job — but not amazing — of transcribing Lovecraft. It got around 97 of around 108 words correct. While that falls short of both GDVT and DNS, it’s still good for a free speech to text program that doesn’t require online access.
GDVT: I’m not sure what happened because Google nailed the transcription for the other excerpts. GDVT only achieved 103 right out of 108, dropping two words and mistranscribing three. It even once spelled out “semicolon” instead of inserting the correct punctuation. It also capitalized certain words, turning them into proper nouns (but I won’t penalize them since it’s accuracy and not capitalization that matters).
I’m pretty sure that if I reread the document a second time, it wouldn’t have any errors.
Charles Darwin (Scientific or Academic Writing Test)
Darwin writes in, like Lovecraft, long sentences loaded with parenthetical information. However, his use of language is very clear and he uses almost no jargon, which differs from nearly incomprehensible science writing today.
DNS: Darwin’s text comes out near perfect in Dragon Naturally Speaking. DNS misspelled only one word (“into”) and otherwise completely nailed the test with 87 words right out of 88.
WSR: Microsoft did a great job, matching 82 out of 88 words. It made some relatively bizarre errors, though, like spelling “sense” as “cents”.
GDVT: Google did great on Darwin’s excerpt. GDVT only fouled up two words, out of 88. Overall, for a free application, you can’t find a more accurate alternative.
Jerry Brown State of the State Address 2016 (Business Writing Test)
Brown’s speech doesn’t use a lot of complicated sentences or vocabulary (aside from the word “rostrum”). Overall, most of the transcription services performed amazingly. More or less, if you need a service that handles simple sentences and limited vocabulary, any one of these works great.
DNS: DNS nailed Brown’s State of the State Address. While it dropped a period, otherwise, it got every word perfectly. Note, though, that political speeches oftentimes lack the sort of complex language that you might see in fiction or academia. A memo or speech is direct and to the point. That’s something a speech recognition client shouldn’t have any problems handling.
WSR: Windows Speech Recognition did a great job — although not as great as DNS or Google — at transcribing Brown’s speech. It scored 55 out of 58 words. It even recognized the word “rostrum,” which I didn’t even know was a word, nor did I know how to pronounce it. Apparently, either I got it right or speech recognition technology can even catch mispronunciations.
GDVT: Google’s transcription software absolutely nailed the transcription, with 100 percent accuracy. It even managed to correctly capitalize “State of the State”, without needing user input. It did oddly use the number, rather than the spelling, for the word “six”. Which resulted in a stylistic error.
Are Free Transcription Services Worth Using?
There is a difference between Dragon Naturally Speaking, Google Voice Typing, and Microsoft’s Windows Speech Recognition. Dragon is more accurate than its competitors. However, the best free program in terms of accuracy is — by a narrow margin — Google’s Voice Typing. While both Microsoft’s and Google’s transcription services compare less-than-favorably against DNS, they do not cost $30.
Contrasting the two free services against each other, Google offers better voice recognition accuracy, punctuation, and case, it requires an internet connection. Google also captures a lot that you don’t intend, like punctuation and capitalization.
However, if you want a free transcription program that you don’t need an internet connection to use, Windows Speech Recognition fits the bill. It’s by no means bad and offers 90 percent of what Dragon Naturally Speaking offers. Give it a shot if you haven’t already.
For more, take a look at ways to make your computer read documents to you5 Ways to Make Your Computer Read Documents to You5 Ways to Make Your Computer Read Documents to YouCan you make your computer read aloud to you? Of course! Try these text-to-speech methods and tools on your PC.Read More. And for help with your Android phone, check out the best speech-to-text Android apps7 Best Android Dictation Apps for Easy Speech-to-Text7 Best Android Dictation Apps for Easy Speech-to-TextLooking for the best speech-to-text apps for Android? These Android dictation apps let you take notes and more.Read More.
What’s your favorite transcription service? Please let us know in the comments!
Explore more about: Speech Recognition, Speech to Text.
- Do you need to be online to the SW work?
- Thank you for this article! You did the research for me and made my choice easier. I appreciate it.
- Nice article. You did miss one error by GDVT in the Jerry Brown speech. The word 'from' is missing in the 2nd sentence (or maybe it was inadvertently not spoken?)
- Try the free office dictate add in for word and outlook. Better than windows default one and free. Also msft. Check garage site.Finally, Translator live feature, though not initially built for this does the best job. Hyyp://translate. It
- office (365) Dictate is a free add in, but you must have Microsoft Office 365 which is NOT free. It's only free if you're a student. You can do a free 30 day trial with a credit card, but you'll get charged unless you remember to cancel it in time. I contacted Microsoft asking if any discounts or free versions to the disabled. I was told no. I told them to voice my request that it should be.
Text to Speech (TTS) software allows you to have text read aloud to you. This is useful for struggling readers and for writers, when editing and revising their work. You can also convert eBooks to audiobooks so you can listen to them on long drives.
We’ve posted some websites here where you can find some good TTS software programs and online tools that are free or at least have free versions available.
NaturalReader
NaturalReader is a free TTS program that allows you to read aloud any text. The free version of the software converts Microsoft Word files, webpages, PDF files, and emails into spoken words. It includes Microsoft Voices and allows you to change voices and adjust the reading speed. Simply select any text and press one hotkey to have NaturalReader read the text to you. There are also paid versions that offer more features and more available voices.
Ultra Hal TTS Reader
Ultra Hal TTS Reader is a program that will read text out loud in one of its many high quality voices. The free version includes many high quality computerized voices and reads text files out loud, as well as instant messages, standard Windows dialogs, and text from the clipboard, which allows the program to read text from webpages and emails. You can also use Ultra HAL TTS Reader to convert a document into a WAV audio file, which can be burned to a CD or converted to an MP3 file.
ReadClip
ReadClip is a TTS reader that also offers a rich text editor that can read and spell check any text document, and allows you to manage several text and picture clips on the clipboard, and generate MP3 files. The TTS reader part of the software is free and will never expire. However, the other features are “try before you buy” features and you must buy the software to continue using them. You can keep the TTS reader hidden or it can display the text it’s reading in the clipboard and highlight each word as it’s read aloud. Besides monitoring the clipboard, you can also copy and paste text into the program, or type the text into the program, or load the text from a file.
Read4Me TTS Clipboard Reader
The Read4Me TTS Clipboard Reader allows you to read the contents of the clipboard aloud using a pre-installed SAPI5 TTS voice when you press a hotkey. Multiple hotkeys can be set for different languages, voices, speech rates, and volumes. Read4Me can also convert text files to MP3 files.
Kyrathasoft Text To Speech
Kyrathasoft Text To Speech is a portable program that allows you to use the default installed Microsoft Voice and SAPI to convert text files to the spoken word, that it saves into a WAV audio file. It is completely free and fully functional. There is no evaluation period and no crippled features.
FeyRecorder
FeyRecorder is a TTS conversion tool with natural voices that allows you to listen to any text document spoken aloud. You can also use the software to convert other sound sources into audio files, such as CDs, tapes, DVDs, online radio, and video games. The formats FeyRecorder can generate include MP3, WMA, OGG, VOX, AU, and AIFF. The audio files can be transferred to any portable device that handles them for on-the-go listening.
yRead
yRead3 allows you to load a plain text (TXT) file in a resizable window to be read out loud using human speech. Use yRead to listen to eBooks, your own writing, or any other piece of text.
yRead3 is an updated version of the software that runs on XP, Vista, and Windows 7, and requires at least .NET Framework 3.0 to run. You can also download yRead2 and run both versions on the same computer.
Panopreter
The free version of Panopreter Basic will read a text file, an RTF file, an MS Word document, or an HTML webpage to you aloud. You can also input text into the program window to be read aloud. It supports a variety of languages and voices and allows you to create WAV audio files and MP3 audio files from the text.
Text2Speech
Text2Speech is a free program that converts text into audible speech. You can play the text at a custom rate and volume, have the text be highlighted as it’s read, and export the text into a WAV file or an MP3 file. The program required .NET Framework 2.0 to run.
DeskBot
DeskBot is a free program that includes a clipboard reader, text reader, and time announcer for Windows. Select text in any application and press Ctrl + C to have it read aloud. For available commands and options, right-click on the DeskBot icon in the system tray. DeskBot will also read the contents of the clipboard when it changes.
DeskBot adds a “Read with DeskBot” item to the Internet Explorer context menu, when you right-click on a webpage.
PowerTalk
PowerTalk is a free program that allows you to have your Microsoft PowerPoint presentations spoken out loud. When you open a PowerPoint presentation and let it run as usual, PowerTalk speaks the text on the slides as it appears, and also hidden text attached to images. The speech in PowerTalk is provided by synthesized computer voices that come with Windows 7, Vista, and XP.
ClipSpeak
ClipSpeak is a small, portable, TTS tool that speaks text copied or cut to the clipboard. It’s compatible with all SAPI5 speech synthesizers. You can also use ClipSpeak to convert text to MP3 files for listening to on CDs, computers, smartphones, and portable media players. If you want other languages, look at eSpeak, which is a compact, open source speech synthesizer for English and other languages that works in Windows and Linux.
DSpeech
DSpeech is a free, portable TTS program that can read written text files in different formats aloud (such as TXT, RTF, DOC, DOCX, and HTML files) and also has Automatic Speech Recognition (ASR) functionality. The ASR allows you to use DSpeech to convert your own voice to text.
DSpeech allows you to save the output as a WAV, MP3, AAC, WMA, or OGG file. You can select different voices, or combine them to create dialogs among different voices for books or scripts, and DSpeech is compatible with all the vocal engines (SAPI4 and SAPI5 compliant). You can also have the content of the clipboard read to you.
Text To Speech Reader For Mac
Balabolka
Balabolka is a TTS program that allows you to read clipboard content and text from several types of files, such as DOC, EPUB, HTML, MOBI, LIT, CHM, PRC, PDF, and RTF files. The program uses various versions of the Microsoft Speech API (SAPI). This allows you to change a voice’s parameters, including rate and pitch.
To use the Microsoft SAPI4 voices, download and install the Microsoft Speech API file. You can also download the MS Speech Control Panel for the Windows Control Panel that allows you to easily list the compatible TTS engines installed on your system and customize their settings.
Balabolka also allows you to create digital audio files from text, including MP3, WMA, OGG, WAV, AAC, and AMR (Adaptive Multi-Rate).
One interesting feature of Balabolka is that you can save subtitled text in the LRC format or in the metadata of the audio file. This allows you to follow along with the text as the audio plays.
ReadTheWords.com
ReadTheWords.com is an online TTS tool that can generate a clear sounding audio file from almost any written material. Simply copy text from your file into their text box, or upload a Microsoft Office document, PDF file, TXT file, or HTML document. You can also enter a web address, or RSS feed URL, and ReadTheWords.com will read the text from that webpage or RSS feed out loud.
ReadTheWords.com allows you to save what it’s reading. You can download it to your computer or portable music player or smartphone. You can even embed the file in your website.
Odiogo
Odiogo allows you to create TTS podcasts from RSS feeds that can be downloaded to a PC, iPods/MP3 players, and mobile phones. People wanting to listen to your content can subscribe to your podcasts through iTunes, iPodder, or other similar services. You can also promote your audio content on podcast directories.
If you run a blog, you can have your blog posts turned into high quality audio files. Odiogo is compatible with all blog engines that publish RSS feeds, such as WordPress, Typepad, and Blogger. They generate MP3 files that are stored on their servers, and they let you know when the audio version of your blog is ready.
You can also make money from embedded ads in the audio versions of your blog posts and RSS feeds.
NOTE: As of the writing of this article, Odiogo was upgrading their service and they were not accepting new signups.
TTSReader
TTSReader is a free, TTS program that allows you to read TXT files or RTF files aloud and save them to WAV or MP3 files. It highlights the text being currently read and allows you to skip sentences or paragraphs while reading. TTSReader supports rich text formatting and both SAPI4 and SAPI5 voices. It can automatically read what’s in the clipboard and you can convert multiple documents to audio at a time.
TTS Add-ons for Browsers
You can also read text using add-ons or extensions in web browsers.
- SpeakIt! – SpeakIt for Google Chrome reads selected text using TTS technology with language auto-detection. It can read text in more than 50 languages.
- FoxVox – FoxVox for Firefox allows you to turn your blogs and articles into podcasts. It speaks any text you highlight in a webpage, and it can create audiobooks from the text in MP3, OGG, and WAV formats.
- SpokenText – The SpokenText Firefox extension allows you to easily record any text on public webpages simply by clicking a Record Web Page button on the toolbar. This extension is also available for Chrome.
- SpeakingFox – The SpeakingFox add-on for Firefox for Mac OS X converts text to audible speech.
Simultaneous Stanza Reader – For Mac
Pdf Text To Speech Reader For Mac
Simultaneous Stanza Reader for Mac OS X is a free, TTS reader that reads text files aloud and displays the text stanza-after-stanza. You can easily use this program to read books from Project Gutenberg aloud.
If you’ve found any other useful TTS readers, let us know.
READ NEXTFree Text To Speech Programs
- › What Can I Do with My Old iPhone?
- › How to Stream UFC 242 Khabib vs. Poirier Live Online
- › Free Download: Microsoft’s PowerToys for Windows 10
- › How to Overclock Your Computer’s RAM
- › What’s New in Chrome 77, Arriving September 10