Artificial Solutions Enhances Automatic Speech Recognition

automatic-speech-recognition-mobile

New developments implemented in our award-winning Teneo platform help address the limitations of Automatic Speech Recognition (ASR).

ASR technology automatically transcribes spoken words into text and provides the front end interface to solutions such as dictation systems and voice dialing solutions (e.g. – “Call Home”, “Change Channel”). Another popular use for ASR technology is to transcribe speech into the input used by mobile personal assistants such as Apple’s Siri and Artificial Solutions’ Teneo assistants.

However, despite significant progress with ASR, errors still regularly occur such as adding extra words that were not spoken or transposing one word for another potentially changing the meaning of what the user wanted to say. A further limitation of ASR is that it doesn’t have humanlike intelligence. It can’t qualify a question by asking for more information. It can’t remember. It can’t search external data services for information. In summary, it’s not able to deliver intelligent solutions that give a high quality user experience.

However, by combining ASR with NLI, a much more humanlike experience is delivered. ASR is used to turn the speech into a textual input that the NLI engine processes in three steps. First it analyzes the query using powerful linguistic understanding libraries that understand and derive the meaning. It then interprets this using advanced linguistic and business rules that simulate ‘intelligent thinking’, allowing it to reason like a human and determine the most appropriate action. Finally it performs the necessary action – for example give a response or delivering the requested information.

Andy Peart, CMO at Artificial Solutions said “Whilst speech enablement is not a new concept in consumer devices the user experience has, and still is, proving to be quite restrictive in that it’s 2 or 3 word command-based; there’s no conversational flow based on the natural language that you would use if you were talking to another person. This is where Artificial Solutions steps in and takes the technology to the next level. Our Teneo technology is powered by our (NLI) engine in order to deliver intelligent conversations between consumers and the everyday devices they use. Imagine having an intelligent virtual assistant on your Smartphone, Smart TV, SatNav, games console, laptop or tablet – not command based but able to hold two-way conversations using everyday language.”

Improvements to the underlying algorithms used by Teneo ensure that it is able to cope with typical ASR errors such as poor grammar, fragmentary input and superfluous small words. The improvements also allow Teneo to handle the differences between spoken language and written language.

“There are many factors influencing the quality of ASR implementations, some of which depend on the user and the context, and some on the type of ASR system used,” adds Peart. “It’s impractical to expect a user to come up with perfectly formed, grammatically correct, non-fragmentary sentences. By combining the capabilities of Teneo with the capabilities of ASR systems the user has a far superior experience when talking to an application or device.”

The Teneo Platform is a powerful and easy-to-use platform used to build sophisticated Natural Language Interaction (NLI) solutions. Using a number of proven ASR-handling strategies, the Teneo Platform precisely fills the gaps left by ASR systems, increasing the quality of understanding and the end-user experience.

 

Tags

One comment

Leave a Reply

Your email address will not be published.

Please fill the empty box below: * Time limit is exhausted. Please reload CAPTCHA.

top