Giving voice to the web, the World Wide Web Consortium (W3C) has published VoiceXML 2.0 as a W3C Proposed Recommendation. The goal of VoiceXML 2.0 is to bring the advantages of web-based development and content delivery to interactive voice response applications.
Advancement of a W3C technical report to Proposed Recommendation indicates that the Working Group has successfully completed both public and W3C Working Group review, and has provided evidence of successful interoperable implementations. It is the final review before a document becomes a W3C Recommendation, commonly understood as a web standard.
Since 1999, W3C has been working on its Speech Interface Framework to expand access to the web to allow people to interact via key pads, spoken commands, listening to prerecorded speech, synthetic speech and music. With the number of telephone lines and mobile phones exceeding one billion units worldwide, the specifications of W3C‘s Speech Interface Framework will allow an unprecedented number of people to use any telephone to access appropriately designed web-based services.
VoiceXML 2.0 allows developers to create audio dialogs that feature synthesised speech, digitised audio, recognition of spoken and DTMF (touch-tone) key input, recording of spoken input, telephony, and mixed-initiative conversations.
“VoiceXML 2.0 has the power to change the way phone-based information and customer services are developed. No longer will we have to press ‘one’ for this or ‘two’ for that. Instead, we will be able to make selections and provide information by speech,” explained Dave Raggett, W3C Voice Browser Activity Lead. “In addition, VoiceXML 2.0 creates opportunities for people with visual impairments or those needing Web access while keeping their hands and eyes free for other things, such as getting directions while driving.”
In the W3C Speech Interface Framework, VoiceXML controls how the application interacts with the user, while the Speech Synthesis Markup Language (SSML) is used for spoken prompts and the Speech Recognition Grammar Specification (SRGS) for guiding the speech recognisers via grammars that describe the expected user responses. Other specifications in the framework include Voice Browser Call Control (CCXML), which provides telephony call control support for VoiceXML or other dialog systems, and Semantic Interpretation for Speech Recognition, which defines the syntax and semantics of the contents of tags in SRGS.
Patent Issues Resolved
Patent issues arising from inconsistencies with the Voice Browser Working Group’s Royalty-Free Licensing Mode have been addressed by a Patent Advisory Group within the W3C, per the W3C’s Current Patent Practice. As a result, the W3C Voice Browser Working Group committed to the production of an open specification, and the Voice Browser Patent Advisory Group resolved the remaining issues, making it possible for the core VoiceXML 2.0 specification to be made available according to W3C’s Royalty-Free License.