VoiceInteraction ASR

About

VoiceInteraction
Rua Alves Redol, 9
1000-029 Lisboa
Portugal

Website: http://www.voiceinteraction.pt

Definition

VoiceInteraction, founded in 2008 and based in Lisboa (Portugal), is a company specialised in the development of speech technologies. VoiceInteraction develops voice synthesis and speech recognition engines for many web and telephony applications.

Languages

Portuguese · Portuguese (Brazil)

Installation

This is for de Debian Lenny version, for other Linux, read the official Audimus Installation manual. To configure the apt client, just edit and add to the file /etc/apt/sources.list one of the following line set.

For asterisk 1.Y.X

deb http://services.voiceinteraction.pt/repo/Debian/5.0 engines 3rdparty Dixi Audimus
deb http://services.voiceinteraction.pt/repo/Debian/5.0 asterisk.1.Y.X Dixi Audimus

Refresh the package lacal base with:

# apt-get update

And Install the package audimus-asterisk-xx-xx (where xx-xx is the language requested):

# apt-get install audimus-asterisk-es-es

The following extra packages will be installed:

audimus audimus-config audimus-model-es-es-monophones-g2p-phonemodels audimus-model-es-es-monophones-mlp-telephone audimus-model-es-es-monophones-task-asterisk

Activate the license with:

# audimus_activate_license

Please enter your Audimus license:

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
Your License was successfully activated!

Configuration

To enable speech recognition, change the main speech parameter. The “speech” parameter can get three values, “yes”, “automatic”, “no” or “emulation” (don’t generate errors if you enable speech grammars).

…
speech=automatic
speechprovider=verbio
…

In the VoiceXML browser configuration file:

############################
# ASR server configuration #
############################

client.rec.resource.0.cacheDir              VXIString   /tmp/cacheContent
client.rec.resource.0.format                VXIString   txt
client.rec.resource.0.syntax                VXIString   doctype

You need to restart the Vxi and Asterisk to get all the changes.

Logs file from the ASR engine:

# tail -f /var/log/VI/VI.log

The following VoiceXML example uses the speech recognition, with the built in grammar, ‘digits.’

<?xml version="1.0" encoding="iso-8859-1"?>
<vxml version="2.0" xmlns="http://www.w3.org/2001/vxml" xml:lang="en-UK">
 <form>
  <property name="inputmodes" value="voice"/>
  <property name="timeout" value="30s"/>
  <field name="text" type="digits">
   <catch event="noinput nomatch">
    <reprompt/>
   </catch>
   <prompt>
    Speak to me:
   </prompt>
  </field>
  <filled>
   <prompt>
    You say me:
    <value expr="text" />
   </prompt>
   <clear namelist="text" />
  </filled>
 </form>
</vxml>
Log In