BusinessWeek Logo

Voice Recognition on the iPhone

Posted by: Arik Hesseldahl on July 29, 2008

From our intern Jay Yarow:

Recently I spoke with a sports industry executive who purchased an iPhone 3G. While he loves the gadget, he isn’t abandoning his Blackberry because the iPhone’s touch screen hinders his ability to write email.

The touch screen complaint is all too common. When typing email you either like Apple’s on-screen QWERTY keyboard, or you don’t and prefer the Blackberry, the Treo or something else, no matter what great applications Apple puts on the App Store.

Eventually, Apple may build a model with a keyboard. Or not. It’s hard to say about a company whose CEO is said to despise buttons. But wouldn’t it be faster if you could speak your emails directly into the phone and have your words turned into text?

Meet Steve Chambers, president of Nuance Communications’ mobile division, the company behind Naturally Speaking, a voice recognition software package for the PC. “Voice is the answer to restrictive keyboards,” on mobile devices he says. Nuance has software on nearly 3 billion handsets worldwide. The firm developed owns the T9 predictive text software, which is the technology that tries to figure out the word you mean to use in a text message on many wireless phones. Of Nuance’s $900 million in total revenue, about $200 million comes from mobile software on mobile phones.

Why not speak what you mean to say into the phone rather than fumble with the tiny keys to type it? “Virtual keyboards are tough, we are actively working on active dictation for mobile devices,” says Chambers. In particular he sees the iPhone as a great opportunity.

A user can say, “Byte of The Apple” and a list of search results, culled from Google appears as a list. Click on the result and Safari opens up with the web page. Likewise, the search function works with maps. For instance, say, “1221 Avenue of the Americas, New York,” and you could click on an icon that shows you BusinessWeek’s New York offices on a map.

Nuance is still deciding what it’s going to do with this application. “The App Store is so new,” says Chambers, “how good a vehicle is it for us” as compared with selling to carriers? Or Nuance can work with software developers. “We can make the API available to people to integrate it,” Sejnoha says. “It could be the front end for someone like Powerset,” he says, referring the company that’s working on natural language search queries.

Chambers and Vlad Sejnoha, Nuance’s Chief Scientist demonstrated two prototype applications, one running on the iPhone, the other on a Treo, showing the potential, which you can see in a video (sorry, the close-ups are a little blurry) after the jump.

Chambers hopes to have dictation software for the iPhone, like these software featured above, ready by the beginning of 2009.

The great hurdle for Nuance (and others) is the relatively puny computing power of a mobile phone, when compared to a notebook or desktop. While the iPhone sports a beefy Samsung processor, its not nearly as powerful a the processor in a Mac or PC. To translate speech being dictated into the body of an email, you need some serious processing power. Nuance sends the speech to a server, which transforms it into text, then sends the text into the body of an email. The speed of the connection, determines the speed of the conversion.

There are already a few programs that do something similar on the iPhone. Jott turns short speech recordings into text that can then be sent by email. Unfortunately it takes 5-10 minutes to get the text translated and the message sent. Not ideal for when you’re in a hurry. There is also Reqall, which does a lot of the same things that Jott does. Both have their own dedicated iPhone App. Both hand off the voice message to a server in the cloud that does the heavy lifting required to turn the voice message into text.

One other issue: Bandwidth. Sending voice to a server in the cloud and getting a quick response takes some serious speed to be useful. You’ll need a solid 3G connection that isn’t always available or a Wi-Fi connection. Without it, you’re probably best taking your chance with the virtual keyboard.

TrackBack URL for this entry: http://blogs.businessweek.com/mt/mt-tb.cgi/

Reader Comments

Bob

July 29, 2008 05:38 PM

This sounds like a promising soultion.

In the meantime, I certainly have no qualms with the iPhone keyboard. Two weeks after ditching my blackberry, I'm already just as fast. It's all about trusting the keyboard to know what you are typing. Just let it roll.

Xopowo

July 29, 2008 06:53 PM

If they allowed you to turn the iPhone horitontally it would allow for a larger keyboard. Why do they empeche this ?

Talk? Into a phone?

July 29, 2008 08:34 PM

A phone? Sending voice over a network? What a marvelous new world we've entered.

Toby

July 30, 2008 03:18 AM

Nuance didn't invent T9, but merely acquired the company that did: Tegic.


dennis

July 30, 2008 11:14 AM

Christ,
here we go again with the keyboard.

you're boring me.

Jay Yarow

July 30, 2008 11:48 AM

Thanks Toby, we fixed that.

@Bob, what kind of typing have you done on the iPhone? I tried writing a blog post last night using the Wordpress application and my thumbs were killing me by the end.

@Xopowo, I agree, I'd like to see them make the horizontal keyboard an option. Maybe they'll release that by yearend. Along with copy and paste and some of the other things people are belly aching about. (Like fewer bugs and fewer crashes.)

If someone could create a portable blue tooth enabled keyboard that hooks up to the iPhone that'd be cool too.

Eric

July 30, 2008 06:32 PM

I switched to an iPhone from a Treo. In the two weeks or so I have had the iPhone I am already much faster on it than I ever was on the Treo after 2 years. I am as surprised as anyone but the predictive typing makes it almost magic.

@Jay Yarow - Seriously, writing a blog post from a cell phone, even an iPhone... I would expect there would be some discomfort. I can't imagine any small form factor device/phone keyboard being comfortable over a long time - certainly not any that I have owned.

Dave

August 1, 2008 06:20 AM

If I spoke an email, I could just as well make a call. I use SMSes when I'm say in a meeting and can't speak out loud.
Besides that though, yes it woud be great to have email at least that you can speak.

Impalaman

August 1, 2008 10:54 AM

The Fonix iSpeak solution would be a better fit. It is a purely client based application with no remote server. Small memory foot print coupled with highly efficient VR make this a perfect fit for the iPhone.

air max 90

April 21, 2010 11:11 PM

The newest version of air max are available now.Welcome to see our special air max 90 and Air Max 360.They are your must-have cheap air max.nike air max 2009.air max 2010.air max 97.

Post a comment

 

About

A blog on the daily doings of Apple and the many companies in its orbit, with insight and analysis by two longtime Apple-watchers BusinessWeek Senior Writer Peter Burrows and BusinessWeek.com Senior Technology Writer Arik Hesseldahl.

Leave us a voice message. Learn more.

BW Mall - Sponsored Links

Buy a link now!