Posts Tagged ‘srgs’

Talking to the Cloud: Build Speech Recognition Applications with Tropo

Tuesday, December 28th, 2010

One of the things that makes Tropo truly amazing is it’s support for speech recognition.

Speech recognition is becoming increasingly important in our everyday lives. Smartphones and powerful handheld devices enable multimodality, and there are more and more restrictions placed on our use of phones while doing other things (like driving).

If you want to build cloud-based mobile applications, you’ve got to have speech recognition. Why would anyone use a platform that tied a developer to nothing but DTMF input (touch tone entry using a key pad) – a technology that was initially rolled out to the public when John F. Kennedy was still in the White House.

The screencast below demonstrates how to use Tropo to build a speech recognition application that can capture a caller’s address. All of the code used in this demo can be obtained on GitHub.

If you are building a mobile application and would like to use speech recognition, Tropo provides some nice choices.

If the input you want to collect is relatively simple, you can use one of the prebuilt simple Tropo grammars. In addition, as discussed in the screencast above, you can use industry standard SRGS grammars as well. Tropo also supports advanced grammar features like semantic interpretation.

While this post focused on using speech recognition to collect an address, you can use it for almost anything your application requires. With Tropo, the sky is the limit.

Which I guess is the point of cloud telephony…

Latest Tropo Upgrade Completed

Tuesday, September 8th, 2009

We are continuing to evolve Tropo, by releasing a new upgrade to the Tropo cloud. This upgrade includes the following:

  • Support for fetching Java Speech Grammar Format (JSGF) and Speech Recognition Grammar Specification (SRGS) files from an external HTTP or FTP server, in addition to the built in support for Simple Grammar. We are working on a couple of follow up posts for how-tos on using these enhanced speech grammar capabilities.
  • When placing an outbound call, you must now include the ‘+’ and country code. To dial in the US would then need to be ‘+1415551212′ for every outbound call.
  • The ‘#’ symbol on the telephone keypad may now be used to terminate a recording in a Tropo application.
  • Addition of MP3 as an audio file playback format.
  • You may now play touch-tones (DTMF) after a call has connected. You may now issue a call with these additional parameters: “+14155551212;postd=1234;pause=22000ms”. Where ‘postd’ is the digits to be dialed and ‘pause’ is the amount of time to wait after connecting the call to issue the digits.
  • New accounts will now need to request outbound dialing access from support@voxeo.com. All existing accounts have outbound enabled and will continue to do so.

We continue to work on many new features and will roll them out as they become available. Enjoy the new features!