ImageVerifierCode 换一换
格式:PDF , 页数:16 ,大小:123.80KB ,
资源ID:740567      下载积分:10000 积分
快捷下载
登录下载
邮箱/手机:
温馨提示:
如需开发票,请勿充值!快捷下载时,用户名和密码都是您填写的邮箱或者手机号,方便查询和重复下载(系统自动生成)。
如填写123,账号就是123,密码也是123。
特别说明:
请自助下载,系统不会自动发送文件的哦; 如果您已付费,想二次下载,请登录后访问:我的下载记录
支付方式: 支付宝扫码支付 微信扫码支付   
注意:如需开发票,请勿充值!
验证码:   换一换

加入VIP,免费下载
 

温馨提示:由于个人手机设置不同,如果发现不能下载,请复制以下地址【http://www.mydoc123.com/d-740567.html】到电脑端继续下载(重复下载不扣费)。

已注册用户请登录:
账号:
密码:
验证码:   换一换
  忘记密码?
三方登录: 微信登录  

下载须知

1: 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。
2: 试题试卷类文档,如果标题没有明确说明有答案则都视为没有答案,请知晓。
3: 文件的所有权益归上传用户所有。
4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
5. 本站仅提供交流平台,并不能对任何下载内容负责。
6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。

版权提示 | 免责声明

本文(ETSI TS 122 243-2017 Digital cellular telecommunications system (Phase 2+) (GSM) Universal Mobile Telecommunications System (UMTS) LTE Speech recognition framework for automated vo_1.pdf)为本站会员(outsidejudge265)主动上传,麦多课文库仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对上载内容本身不做任何修改或编辑。 若此文所含内容侵犯了您的版权或隐私,请立即通知麦多课文库(发送邮件至master@mydoc123.com或直接QQ联系客服),我们立即给予删除!

ETSI TS 122 243-2017 Digital cellular telecommunications system (Phase 2+) (GSM) Universal Mobile Telecommunications System (UMTS) LTE Speech recognition framework for automated vo_1.pdf

1、 ETSI TS 122 243 V14.0.0 (2017-03) Digital cellular telecommunications system (Phase 2+) (GSM); Universal Mobile Telecommunications System (UMTS); LTE; Speech recognition framework for automated voice services; Stage 1 (3GPP TS 22.243 version 14.0.0 Release 14) TECHNICAL SPECIFICATION ETSI ETSI TS 1

2、22 243 V14.0.0 (2017-03)13GPP TS 22.243 version 14.0.0 Release 14Reference RTS/TSGS-0122243ve00 Keywords GSM,LTE,UMTS ETSI 650 Route des Lucioles F-06921 Sophia Antipolis Cedex - FRANCE Tel.: +33 4 92 94 42 00 Fax: +33 4 93 65 47 16 Siret N 348 623 562 00017 - NAF 742 C Association but non lucratif

3、enregistre la Sous-Prfecture de Grasse (06) N 7803/88 Important notice The present document can be downloaded from: http:/www.etsi.org/standards-search The present document may be made available in electronic versions and/or in print. The content of any electronic and/or print versions of the presen

4、t document shall not be modified without the prior written authorization of ETSI. In case of any existing or perceived difference in contents between such versions and/or in print, the only prevailing document is the print of the Portable Document Format (PDF) version kept on a specific network driv

5、e within ETSI Secretariat. Users of the present document should be aware that the document may be subject to revision or change of status. Information on the current status of this and other ETSI documents is available at https:/portal.etsi.org/TB/ETSIDeliverableStatus.aspx If you find errors in the

6、 present document, please send your comment to one of the following services: https:/portal.etsi.org/People/CommiteeSupportStaff.aspx Copyright Notification No part may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying and microfilm except as aut

7、horized by written permission of ETSI. The content of the PDF version shall not be modified without the written authorization of ETSI. The copyright and the foregoing restriction extend to reproduction in all media. European Telecommunications Standards Institute 2017. All rights reserved. DECTTM, P

8、LUGTESTSTM, UMTSTMand the ETSI logo are Trade Marks of ETSI registered for the benefit of its Members. 3GPPTM and LTE are Trade Marks of ETSI registered for the benefit of its Members and of the 3GPP Organizational Partners. GSM and the GSM logo are Trade Marks registered and owned by the GSM Associ

9、ation. ETSI ETSI TS 122 243 V14.0.0 (2017-03)23GPP TS 22.243 version 14.0.0 Release 14Intellectual Property Rights IPRs essential or potentially essential to the present document may have been declared to ETSI. The information pertaining to these essential IPRs, if any, is publicly available for ETS

10、I members and non-members, and can be found in ETSI SR 000 314: “Intellectual Property Rights (IPRs); Essential, or potentially Essential, IPRs notified to ETSI in respect of ETSI standards“, which is available from the ETSI Secretariat. Latest updates are available on the ETSI Web server (https:/ip

11、r.etsi.org/). Pursuant to the ETSI IPR Policy, no investigation, including IPR searches, has been carried out by ETSI. No guarantee can be given as to the existence of other IPRs not referenced in ETSI SR 000 314 (or the updates on the ETSI Web server) which are, or may be, or may become, essential

12、to the present document. Foreword The present document may refer to technical specifications or reports using their 3GPP identities, UMTS identities or GSM identities. These should be interpreted as being references to the corresponding ETSI deliverables. The cross reference between GSM, UMTS, 3GPP

13、and ETSI identities can be found under http:/webapp.etsi.org/key/queryform.asp. Modal verbs terminology In the present document “shall“, “shall not“, “should“, “should not“, “may“, “need not“, “will“, “will not“, “can“ and “cannot“ are to be interpreted as described in clause 3.2 of the ETSI Draftin

14、g Rules (Verbal forms for the expression of provisions). “must“ and “must not“ are NOT allowed in ETSI deliverables except when used in direct citation. ETSI ETSI TS 122 243 V14.0.0 (2017-03)33GPP TS 22.243 version 14.0.0 Release 14Contents Intellectual Property Rights 2g3Foreword . 2g3Modal verbs t

15、erminology 2g3Foreword . 4g3Introduction 4g31 Scope 5g32 References 6g32.1 Normative References 6g32.2 Informative References 6g33. Definitions and abbreviations . 6g33.1 Definitions 6g33.2 Abbreviations . 7g34 Requirements 8g34.1 Initiation . 8g34.2 Information during the speech recognition session

16、 9g34.3 Control 9g34.4 User Perspective (User Interface) . 9g35 UE and network capabilities. 9g36 Administration 10g36.1 Authorization 10g36.2 Deauthorization 10g36.3 Registration 10g36.4 Deregistration . 10g36.5 Activation . 10g36.6 Deactivation . 11g37 Service Provisioning. 11g38 Security. 11g39 P

17、rivacy 11g310 Charging . 11g311 Roaming . 12g312 Interaction with other services . 12g3Annex A (informative): Speech recognition Framework-based automated voice service examples 13g3Annex B (informative): Change History 14g3History 15g3ETSI ETSI TS 122 243 V14.0.0 (2017-03)43GPP TS 22.243 version 14

18、.0.0 Release 14Foreword This Technical Specification (TS) has been produced by ETSI 3rd Generation Partnership Project (3GPP). The contents of the present document are subject to continuing work within the TSG and may change following formal TSG approval. Should the TSG modify the contents of the pr

19、esent document, it will be re-released by the TSG with an identifying change of release date and an increase in version number as follows: Version x.y.z where: x the first digit: 1 presented to TSG for information; 2 presented to TSG for approval; 3 or greater indicates TSG approved document under c

20、hange control. y the second digit is incremented for all changes of substance, i.e. technical enhancements, corrections, updates, etc. z the third digit is incremented when editorial only changes have been incorporated in the document. Introduction Forecasts show that speech-driven services will pla

21、y an important role on the 3G market. People want the ability to access information while on the move and the small portable mobile devices that will be used to access this information need improved user interfaces using speech input. At present, however, the complexity of medium and large vocabular

22、y speech recognition systems is beyond the memory and computational resources of such devices. Also associated delay to download speech data files (e.g. grammars, acoustic models, language models, vocabularies etc. .) may be prohibitive. Eventually, it may not always be acceptable for the speech ser

23、vice providers to allow download of these speech data files if they contained confidential information (password (security issue), customer names and address (privacy issue) or intellectual properties; for example a well crafted speech grammar is often considered by speech service providers as a tra

24、de secret. Server-side processing of the combined speech and DTMF input and speech output can overcome these constraints by taking full advantage of memory and processing power as well as specialized speech engines and data files. However, the distortions introduced by the encoding used to send the

25、audio between the client and the server as well as additional network errors can degrade the performance of the speech engines; therefore also limiting the achievable speech functionalities. A server-side speech service is generally equivalent to a phone call to an automatic service. As for any othe

26、r telephony service, DTMF is a feature that should always be considered as needed. This document describes a generic speech recognition framework to distribute the audio sub-system and the speech services by sending encoded speech and meta-information between the client and the server. Instead of us

27、ing a voice channel as in todays server-based speech services, an error-protected data channel will be used to transport encoded speech from the client audio sub-system (terminal client) to remote speech engines (on server) for processing (e.g. speech recognition, speaker recognition,). The speech r

28、ecognition framework will also enable downlink data streaming of voice and recorded audio prompt generated by server to the terminal client audio subsystem. The speech recognition framework may use conventional codecs like AMR or Distributed Speech Recognition (DSR) optimized codecs. The speech reco

29、gnition framework will provide users with a high performance distributed speech interface to server-based automatic speech services with communication, information access or transactional purposes. The types of supported user interfaces include those that are voice only, for example, automatic speec

30、h access to information, such as a voice portal described in this section. These typically support combined speech or DTMF input. In the future, a new range of multi-modal applications is also envisaged incorporating different modes of input (e.g. speech, keyboard, pen) and speech and visual output.

31、 ETSI ETSI TS 122 243 V14.0.0 (2017-03)53GPP TS 22.243 version 14.0.0 Release 141 Scope The present document defines the stage one description of the Speech Recognition Framework for Automated Voice Services. Stage One is the set of requirements for data seen primarily from the users and service pro

32、viders points of view. This Technical Specification includes information applicable to network operators, service providers, terminal and network manufacturers. This Technical Specification contains the core requirements for the Speech Recognition Framework for automated voice services. The scope of

33、 this Stage 1 is to identify the requirements for 3G networks to support the deployments of a speech recognition framework - based automated voice services and therefore to introduce a 3GPP speech recognition framework as part of speech-enabled services. The Speech Recognition Framework for automate

34、d voice services is an optional feature in a 3GPP system. Figure 1 positions the Speech recognition Framework (SRF) with respect to other speech-enabled services as discussed in 6. As illustrated, SRF is designed to support server-side speech recognition over packet switched network (e.g. IMS). As s

35、uch SRF also enable configurations of multimodal and multi-device services that include distribute the speech engines. Note that it is possible to design speech-enabled services that alternate or combine the use of client-side only engines and SRF. Figure 1 - Positions the scope of the speech recogn

36、ition framework as part of general speech enabled services. Speech-enabled Services Multimodal Services Speech-only ServicesMulti-device ServicesClient-side only Speech enginesServer-side Speech enginesSpeech Recognition Framework (Packet Switched)Circuit SwitchedSpeech RecognitionDSR optimized Code

37、cConventional CodecOtherETSI ETSI TS 122 243 V14.0.0 (2017-03)63GPP TS 22.243 version 14.0.0 Release 142 References The following documents contain provisions which, through reference in this text, constitute provisions of the present document. References are either specific (identified by date of p

38、ublication, edition number, version number, etc.) or non-specific. For a specific reference, subsequent revisions do not apply. For a non-specific reference, the latest version applies. In the case of a reference to a 3GPP document (including a GSM document), a non-specific reference implicitly refe

39、rs to the latest version of that document in the same Release as the present document. 2.1 Normative References 1 3GPP TS 21.133: “3G security; Security threats and requirements“. 2 3GPP TR 21.905: “Vocabulary for 3GPP Specifications“. 3 3GPP TR 22.941: “IP based multimedia framework; Stage 0“. 4 3G

40、PP TS 22.105: “Services and service capabilities“. 5 3GPP TS 22.228: “Service requirements for the Internet Protocol (IP) multimedia core network subsystem; Stage 1“. 6 3GPP TR 22.977: “Feasibility study for speech-enabled services“. 2.2 Informative References 7 ETSI ES 201 108 v1.1.2: “Distributed

41、Speech Recognition: Front-end Feature Extraction Algorithm; Compression Algorithm“, April 2000. 8 Void 9 Void 10 ETSI ES 202 050 v0.0.0 “Speech Processing, Transmission and Quality aspects (STQ); Distributed speech recognition; Advanced front-end feature extraction algorithm; Compression algorithms;

42、 DSR advanced front end“, standard selected; document in preparation. 3. Definitions and abbreviations Definitions and abbreviations used in the present document are listed in TR 21.905 2. For the purposes of this document the following definitions and abbreviations apply: 3.1 Definitions Automated

43、Voice Services: Voice applications that provide a voice interface driven by a voice dialog manager to drive the conversation with the user in order to complete a transaction and possibly execute requested actions. It relies on speech recognition engines to map user voice input into textual or semant

44、ic inputs to the dialog manager and mechanisms to generate voice or recorded audio prompts (text-to-speech synthesis, audio playback,). It is possible that it relies on additional speech processing (e.g. speaker verification). Typically telephony-based automated voice services also provide call proc

45、essing and DTMF recognition capabilities. Examples of traditional automated voice services are traditional IVR (Interactive Voice Response Systems) and VoiceXML Browsers. Barge-in event: Event that takes place when the user starts to speak while audio output is generated. ETSI ETSI TS 122 243 V14.0.

46、0 (2017-03)73GPP TS 22.243 version 14.0.0 Release 14Conventional Codec: The module in UE that encodes the speech input waveform , similar to the encoder in a vocoder e.g. EFR, AMR. Downlink exchanges: Exchanges from servers and networks to the terminal. Dialog manager: A technology to drive a dialog

47、 between user and automated voice services. For example a VoiceXML voice browser is essentially a dialog manager programmed by VoiceXML that drives speech recognition and text-to-speech engines. DSR Optimised Codec: The module in UE which takes speech input, extracts acoustic features and encodes th

48、em with a scheme optimised for speech recognition. This module is similar to the conventional codec, such as AMR. On the server-side, the uplink encoded stream can be directly consumed by speech engines without having to be converted to a waveform. Meta information: Data that may be required to faci

49、litate and enhance the server-side processing of the input speech and facilitate the dialog management in an automated voice service. These may include keypad events over-riding spoken input, notification that the UE is in hands-free mode, client-side collected information (speech/no-speech, barge-in), etc. Speech Recognition Framework: A generic framework to distribute the audio sub-system and the speech services by sending encoded speech between the client and the server. For the uplink, it can rely on conventional (ASR) or on DSR optimise

copyright@ 2008-2019 麦多课文库(www.mydoc123.com)网站版权所有
备案/许可证编号:苏ICP备17064731号-1