Next Up for Google Chrome: Speech Recognition

Chrome's rapid evolution leads to more and better features

If the engineers at Google are correct, a budding new capability in the Chrome web browser will transform the way we interact with web apps. The recently-released Chrome 25 Beta is the first to offer native support for speech recognition, a natural user interface (NUI) that may ultimately be far more important than multi-touch.

Google has posted a decidedly nerdy video demonstrating this capability, but the uses for this are pretty cool: Dragon Naturally Speaking-style voice control, such as the ability to dictate an email in a web-based mail solution, or a word processing doc in an office productivity service.

In a separate announcement, Google noted that the Web Speech API in Chrome 25 Beta would open up a wellspring of speech-enable web apps. I think they’re right. This is going to be a big deal. I can’t wait to pick up my mouse and say, “Chrome, on!” like Scotty in “Star Trek 4.” You know, ironically.

Chrome 25 Beta also sports another, unrelated, advance: It automatically disables some extensions that may were added by third party programs without proper acknowledgement from users, and will prevent this kind of silent extension install going forward. A notification will appear with the option to re-enable the affected extensions, Google says.

I’ve seen some chatter about the version numbering in Chrome, because some people can’t understand why anyone would want to version software differently than we did in the 1990s. But Chrome is updated like a web service, and this practice will become more and more common going forward. Yes, even at Microsoft.

Discuss this Article 10

MikeGalos
on Jan 15, 2013

Of course, you COULD just use the speech recognition system that's already built into Windows (and has been since Windows Vista shipped in 2006)

brians (not verified)
on Jan 15, 2013

The speech recognition in "legacy" Windows is really very good. However, Microsoft did not see fit to make it available for "Metro" developers to use, let alone make it available in IE 10 (which, even with all the changes they made, still significantly lags the competition in HTML 5 features implemented).

Here are the current html5test.com results:

Browser Score Bonus
Maxthon 3.4.5 457 15
Chrome 23 448 13
Opera 12.10 419 9
Firefox 17 392 10
Safari 6.0 378 8
Internet Explorer 10 320 6

Just another two major failings of Windows 8.

Google not only has this new API in Chrome, but Android has had similar capability for quite some time.

Christopher Geo...
on Jan 15, 2013

The video, and the Google Now App on Android, are far superior. It auto corrects sentence structure as you continue to dictate.

rjohn05
on Jan 15, 2013

It's too bad we cannot get Microsoft to update IE10 with new/pending standards as often as Chrome. Hopefully this changes this year. MS needs to update IE10 more regularly, not only with new end user features but with new HTML/css capabilities.

tbsteph
on Jan 15, 2013

Can't wait - not only will people be talking on their cell phones incessantly but now they will be talking to their phones, laptops, etc. incessantly. Don't get me wrong, I see the advantages it's just that there will soon be more noise in the public arena. Maybe I'll just buy a set of ear plugs?

JimmyFal
on Jan 15, 2013

I'm happy to have speech on WinPho8, and it works great. But I'd be happier with more. Siri, whatever you think of it, actually is fantastic at setting appointment, canceling appointments, something I do all day long and cannot do with WinPho8.

On the Surface, I truly expected a total revamp of the existing speech input tools. MS has been in the speech game as long as anyone, but where it the great speech capabilities from all that research?!

On the Surface tablet, the built in speech recognition tools are antiquated, although they kind of work, it's far from hitting a microphone button, and speaking a search term. On the Xbox speech recognition for search is masterful. When will we see this next big speech update from MS? Soon I hope.

Josh602
on Jan 15, 2013

And where is the spec for this "Web Speech API"? I don't see it at the W3C or WHATWG's Web sites.

Sigh. It looks like we've got a new proprietary API for WebKit.

brians (not verified)
on Jan 15, 2013

This is not a proprietary API. Nor could "WebKit" contain one with its licensing. Google and others do add various experimental APIs early on, but they are namespaced to prevent a problem.
The documentation for the Web Speech API can be found at: https://dvcs.w3.org/hg/speech-api/raw-file/tip/speechapi.html

saqrkh
on Jan 15, 2013

I think MS should put out a parallel browser program, one that has a clear disclaimer of being indefinitely beta and used for evaluation purposes. Call it "Daredevil" and load it up with WebKit and whatever else to make it the fastest browser available.

Mark from CO
on Jan 15, 2013

Does Microsoft see the potentially massive shift in computing that is coming from the low end, driven by a unique business model (free OS/Services from Google/Android, and high quality and reasonably priced hardware from Samsung). Momentum is building and may not be long before it reaches a level not stoppable by Microsoft, or even Apple. Time is running short - Microsoft please show us a sense of urgency here!!

(Isn't Samsung in a very enviable position. It is playing both sides (Microsoft and Google) so well. It is positioning itself for outside profits, especially for a hardware maker.)

Please or Register to post comments.

IT/Dev Connections

Las Vegas
September 30th - October 4th

Paul ThurottYou'll have the opportunity to experience:
• 120 Technical
Sessions
• Networking with Peers
• Expert Speakers


Come See Paul Thurrott & Mary Jo Foley in Person!

Register Now

Office 365 InfoCenter

Get the latest insight and info from Paul

Read Now!

What I Use