ETSI TS 126 243-2017 Digital cellular telecommunications system (Phase 2+) (GSM) Universal Mobile Telecommunications System (UMTS) LTE ANSI-C code for the fixed-point distributed s.pdf

上传人:eventdump275 文档编号:741883 上传时间:2019-01-11 格式:PDF 页数:22 大小:154.73KB
下载 相关 举报
ETSI TS 126 243-2017 Digital cellular telecommunications system (Phase 2+) (GSM) Universal Mobile Telecommunications System (UMTS) LTE ANSI-C code for the fixed-point distributed s.pdf_第1页
第1页 / 共22页
ETSI TS 126 243-2017 Digital cellular telecommunications system (Phase 2+) (GSM) Universal Mobile Telecommunications System (UMTS) LTE ANSI-C code for the fixed-point distributed s.pdf_第2页
第2页 / 共22页
ETSI TS 126 243-2017 Digital cellular telecommunications system (Phase 2+) (GSM) Universal Mobile Telecommunications System (UMTS) LTE ANSI-C code for the fixed-point distributed s.pdf_第3页
第3页 / 共22页
ETSI TS 126 243-2017 Digital cellular telecommunications system (Phase 2+) (GSM) Universal Mobile Telecommunications System (UMTS) LTE ANSI-C code for the fixed-point distributed s.pdf_第4页
第4页 / 共22页
ETSI TS 126 243-2017 Digital cellular telecommunications system (Phase 2+) (GSM) Universal Mobile Telecommunications System (UMTS) LTE ANSI-C code for the fixed-point distributed s.pdf_第5页
第5页 / 共22页
点击查看更多>>
资源描述

1、 ETSI TS 126 243 V14.0.0 (2017-04) Digital cellular telecommunications system (Phase 2+) (GSM); Universal Mobile Telecommunications System (UMTS); LTE; ANSI-C code for the fixed-point distributed speech recognition extended advanced front-end (3GPP TS 26.243 version 14.0.0 Release 14) floppy3TECHNIC

2、AL SPECIFICATION ETSI ETSI TS 126 243 V14.0.0 (2017-04)13GPP TS 26.243 version 14.0.0 Release 14Reference RTS/TSGS-0426243ve00 Keywords GSM,LTE,UMTS ETSI 650 Route des Lucioles F-06921 Sophia Antipolis Cedex - FRANCE Tel.: +33 4 92 94 42 00 Fax: +33 4 93 65 47 16 Siret N 348 623 562 00017 - NAF 742

3、C Association but non lucratif enregistre la Sous-Prfecture de Grasse (06) N 7803/88 Important notice The present document can be downloaded from: http:/www.etsi.org/standards-search The present document may be made available in electronic versions and/or in print. The content of any electronic and/

4、or print versions of the present document shall not be modified without the prior written authorization of ETSI. In case of any existing or perceived difference in contents between such versions and/or in print, the only prevailing document is the print of the Portable Document Format (PDF) version

5、kept on a specific network drive within ETSI Secretariat. Users of the present document should be aware that the document may be subject to revision or change of status. Information on the current status of this and other ETSI documents is available at https:/portal.etsi.org/TB/ETSIDeliverableStatus

6、.aspx If you find errors in the present document, please send your comment to one of the following services: https:/portal.etsi.org/People/CommiteeSupportStaff.aspx Copyright Notification No part may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopy

7、ing and microfilm except as authorized by written permission of ETSI. The content of the PDF version shall not be modified without the written authorization of ETSI. The copyright and the foregoing restriction extend to reproduction in all media. European Telecommunications Standards Institute 2017.

8、 All rights reserved. DECTTM, PLUGTESTSTM, UMTSTMand the ETSI logo are Trade Marks of ETSI registered for the benefit of its Members. 3GPPTM and LTE are Trade Marks of ETSI registered for the benefit of its Members and of the 3GPP Organizational Partners. GSM and the GSM logo are Trade Marks registe

9、red and owned by the GSM Association. ETSI ETSI TS 126 243 V14.0.0 (2017-04)23GPP TS 26.243 version 14.0.0 Release 14Intellectual Property Rights IPRs essential or potentially essential to the present document may have been declared to ETSI. The information pertaining to these essential IPRs, if any

10、, is publicly available for ETSI members and non-members, and can be found in ETSI SR 000 314: “Intellectual Property Rights (IPRs); Essential, or potentially Essential, IPRs notified to ETSI in respect of ETSI standards“, which is available from the ETSI Secretariat. Latest updates are available on

11、 the ETSI Web server (https:/ipr.etsi.org/). Pursuant to the ETSI IPR Policy, no investigation, including IPR searches, has been carried out by ETSI. No guarantee can be given as to the existence of other IPRs not referenced in ETSI SR 000 314 (or the updates on the ETSI Web server) which are, or ma

12、y be, or may become, essential to the present document. Foreword This Technical Specification (TS) has been produced by ETSI 3rd Generation Partnership Project (3GPP). The present document may refer to technical specifications or reports using their 3GPP identities, UMTS identities or GSM identities

13、. These should be interpreted as being references to the corresponding ETSI deliverables. The cross reference between GSM, UMTS, 3GPP and ETSI identities can be found under http:/webapp.etsi.org/key/queryform.asp. Modal verbs terminology In the present document “shall“, “shall not“, “should“, “shoul

14、d not“, “may“, “need not“, “will“, “will not“, “can“ and “cannot“ are to be interpreted as described in clause 3.2 of the ETSI Drafting Rules (Verbal forms for the expression of provisions). “must“ and “must not“ are NOT allowed in ETSI deliverables except when used in direct citation. ETSI ETSI TS

15、126 243 V14.0.0 (2017-04)33GPP TS 26.243 version 14.0.0 Release 14Contents Intellectual Property Rights 2g3Foreword . 2g3Modal verbs terminology 2g3Foreword . 4g31 Scope 5g32 References 5g33 Definitions and abbreviations . 5g33.1 Definitions 5g33.2 Abbreviations . 5g34 C code structure 5g34.1 Conten

16、ts of the C source code 5g34.2 Program execution 6g34.3 Code hierarchy . 7g34.5 Variables, constants and tables . 11g34.5.1 Description of constants used in the C-code . 12g34.5.2 Description of fixed tables used in the C-code . 15g34.5.3 Static variables used in the C-code . 16g35 File formats 19g3

17、5.1 Speech file 19g3Annex A (informative): Change history . 20g3History 21g3ETSI ETSI TS 126 243 V14.0.0 (2017-04)43GPP TS 26.243 version 14.0.0 Release 14Foreword This Technical Specification has been produced by the 3rdGeneration Partnership Project (3GPP). The contents of the present document are

18、 subject to continuing work within the TSG and may change following formal TSG approval. Should the TSG modify the contents of the present document, it will be re-released by the TSG with an identifying change of release date and an increase in version number as follows: Version x.y.z where: x the f

19、irst digit: 1 presented to TSG for information; 2 presented to TSG for approval; 3 or greater indicates TSG approved document under change control. y the second digit is incremented for all changes of substance, i.e. technical enhancements, corrections, updates, etc. z the third digit is incremented

20、 when editorial only changes have been incorporated in the document. ETSI ETSI TS 126 243 V14.0.0 (2017-04)53GPP TS 26.243 version 14.0.0 Release 141 Scope The present document contains an electronic copy of the ANSI-C code for DSR Extended Advanced Front-end. The ANSI-C code is necessary for a bit

21、exact implementation of DSR Extended Advanced Front-end. 2 References The following documents contain provisions which, through reference in this text, constitute provisions of the present document. 1 ETSI ES 202 050 (2007-01) V1.1.5: “Distributed Speech Recognition; Advanced Front-end Feature Extra

22、ction Algorithm; Compression Algorithm“. 2 ETSI ES 202 212 (2005-11) V1.1.2: “Distributed Speech Recognition; Extended Advanced Front-end Feature Extraction Algorithm; Compression Algorithm, Back-end Speech Reconstruction Algorithm“. 3 3GPP TS 26.177: “Speech Enabled Services (SES); Distributed Spee

23、ch Recognition (DSR) extended advanced front-end test sequences“. 3 Definitions and abbreviations 3.1 Definitions Definition of terms used in the present document, can be found in 1, 2 3.2 Abbreviations For the purpose of the present document, the following abbreviations apply: ANSI American Nationa

24、l Standards Institute I/O Input/OutputRAM Random Access Memory ROM Read Only Memory AFE Advanced Front-end X-AFE eXtended Advanced Front-end DSR Distributed Speech Recognition 4 C code structure This clause gives an overview of the structure of the bit-exact C code and provides an overview of the co

25、ntents and organization of the C code attached to this document. The C code has been verified on the following systems: - Sun Microsystems workstations and GNU gcc compiler - IBM PC compatible computers with Linux operating system and GNU gcc compiler. ANSI-C was selected as the programming language

26、 because portability was desirable. 4.1 Contents of the C source code The distributed files with suffix “c“ contain the source code and the files with suffix “h“ are the header files. Makefiles are provided for the platforms in which the C code has been verified (listed above). ETSI ETSI TS 126 243

27、V14.0.0 (2017-04)63GPP TS 26.243 version 14.0.0 Release 144.2 Program execution There are separate executables for the FrontEnd and Vector Quantization, with and without Extensions. The command line options are described below. - indicates parameters for the given option for running the executable (

28、) indicates default parameter. FrontEnd w/ Extension: USAGE: bin/ExtAdvFrontEnd infile HTK_outfile pitch_outfile class_outfile options OPTIONS: -q Quiet Mode (FALSE) -F format Input file format (NIST) -fs freq Sampling frequency in kHz (8) -swap Change input byte ordering (Native) -noh No HTK header

29、 to output file (FALSE) -noc0 No c0 coefficient to output feature vector (FALSE) -nologE No logE component to output feature vector (FALSE) -skip_header_bytes n - Skip header, first n bytes ( Only for -F RAW) -noh, -noc0, -nologE and skip_header_bytes are not used and should not be changed. FrontEnd

30、 w/o Extension: USAGE: bin/AdvFrontEnd infile HTK_outfile options OPTIONS: - Same as FrontEnd w/ Extension Vector Quantization w/ Extension: Usage: extcoder htk_file_in pitch_file_in class_file_in bitstream_file_out pitch_file_out txt_file_out -freq x -VAD/No_VAD htk_file_in Input mel-frequency ceps

31、tral coefficient file in HTK MFCC format. pitch_file_in Input pitch period file. class_file_in Input classification file. bit_file_out Output binary bitstream. pitch_file_out Output quantised pitch period file. txt_file_out Vector quantiser output in text format. -freq x Sampling frequency in kHz (8

32、 or 16). -VAD Use voice activity detector data. Voice activity input file must have same name as htk_file, but extension .vad -No_VAD Do not incorporate voice activity detector information in output bitstream. Vector Quantization w/o Extension: Usage: coder htk_file_in bitstream_file_out txt_file_ou

33、t -freq x -VAD/No_VAD htk_file_in Input mel-frequency cepstral coefficient file in HTK MFCC format. bit_file_out Binary output bitstream. txt_file_out Vector quantiser output in text format. -freq x Sampling frequency in kHz (8 or 16). -VAD Use voice activity detector data. Voice activity input file

34、 must have same name as htk_file, but extension .vad -No_VAD Do not incorporate voice activity detector information in output bitstream. File extension descriptions as generated by the sample script: .cep Binary file containing cepstral features in HTK format. Output from the FrontEnd, input to the

35、vector quantizer. .pitch Binary file containing pitch information. Output from the FrontEnd, input to the vector quantizer. Only used for Extension. .class Ascii file containing class information. Output from the FrontEnd, input to the vector quantizer. Only used for Extension. .bs Binary file conta

36、ining the bitstream. Output from the vector quantizer. .log Log files from the different executables. ETSI ETSI TS 126 243 V14.0.0 (2017-04)73GPP TS 26.243 version 14.0.0 Release 144.3 Code hierarchy Tables 1 to 3 are call graphs that show the functions used for AFE (table 1), VQ (table 2), and Exte

37、nsion (table 3). Each column represents a call level and each cell a function. The functions contain calls to the functions in rightwards neighboring cells. The time order in the call graphs is from the top downwards as the processing of a frame advances. All standard C functions: printf(), fwrite()

38、, etc. have been omitted. Also, no basic operations (add(), L_add(), mac(), etc.) or double precision extended operations (e.g. L_Extract() appear in the graphs. The basic operations are not counted as extending the depth, therefore the deepest level in this software is level 7. Table 1: AFE call st

39、ructure main() AdvProcessInit_B() DoNoiseSupInit_B() DoWaveProcInit_B() DoCompCepsInit_B()DoPostProcInit_B() DoVADInit_F() Do16kProcInit_B()QMF_FIR_Init_B() fir_initialization_B() DP_HP_filters_B()BufIn32Alloc() AdvProcessAlloc_B() DoNoiseSupAlloc_B()DoWaveProcAlloc_B() DoCompCepsAlloc_B() DoPostPro

40、cAlloc_B()DoVADAlloc_F() Do16kProcAlloc_B() FlushAdvProcess_B() DoVADFlush_F() CvFeatInt2Float() AdvProcessDelete_B() DoNoiseSupDelete_B() DoWaveProcDelete_B() DoCompCepsDelete_B()DoPostProcDelete_B() DoVADDelete_B() BufIn32Free()DoAdvProcess_B() Do16kProcessing_B() DoNoiseSup_B()Get16k_p_bufferData

41、16k_B() Get16k_bufData16kSize_B()Get16k_p_BandsForCoding16k_B()Get16k_p_CodeForBands16k_B() Get16k_dataHP_B() VAD_F() Log_2() DoSigWindowing16_F1() DoSigWindowing16_F2()ff4NRFix32_B() GetL15() GetH15()Mult16x32()Add_Mult16x16_16() Sub_Mult16x16_16()Permut() FFTtoPSD_F() Square24d2_B() Square24_B()Ge

42、t16k_BFC_dec_B() GetBandsForCoding16k_B()PSDMean_F() NoiseEstimation_F1() Sqrt_2() Sqrt16_2()NoiseEstimation_F2() Sqrt_2() Sqrt16_2()FilterCalc_F() SpeechQVar()FilterBank16()SpeechQSpec() SpeechQMel() DoGainFact_F1() Log_2() DoGainFact_F2() Log_2()DoMelIDCT_F16() ApplyWF() Get16k_dec1()Get16k_dec2()

43、Get16k_dec3() DoSigWindowing16_F3() ff4NRFix32_B() ETSI ETSI TS 126 243 V14.0.0 (2017-04)83GPP TS 26.243 version 14.0.0 Release 14GetL15() GetH15()Mult16x32()Add_Mult16x16_16() Sub_Mult16x16_16()Permut() FFTtoPSD_F() Square24d2_B() Square24_B()DoMelFB_B() CodeBands16k_B()DoSpecSub16k_B()Log_2() UpDa

44、teDecal() ApplyDecal()DCOffsetFil_F() Get16k_hpBandsSize_B() Get16k_p_hpBands_B()Get16k_p_bufferCodeForBands16k_B()Get16k_p_CodeForBands16k_B() Get16k_p_bufferCodeWeights_B()Get16k_p_codeWeights_B()Set16k_hpBands_dec_B() DoWaveProc_B() TeagerEng() GetTeagerFilter() GetMaximaPositions() DoCompCeps_B(

45、) CepsCompute() Get16k_p_bufferCodeWeights_B() Get16k_p_bufferCodeForBands16k_B() PreEmphHamm() ff4NB16_B()GetBandsForDecoding16k_B() DecodeBands16k_B() FilterBank() Get16k_hpBands_dec_B()Get16k_p_hpBands_B() MergeSSandCoded_B()CorrectEnergy_B()CosInv16Khz() cosInv() (only for 8kHz) DoPostProc_B() D

46、oVADProc_F()focalpoint() Table 2: VQ call structure main() quantize_and_print()get_best_dataframe() best_centroid() quant_pitch_abs() get_class_bit()quant_pitch_diff()get_class_bit() mfcc_crc_encode()pc_crc_encode()ETSI ETSI TS 126 243 V14.0.0 (2017-04)93GPP TS 26.243 version 14.0.0 Release 14Table

47、3: Extension call structure main() RVC_ConstructPitchRom_be() RVC_ConstructPitchMeter_be() Allocate_InterpolatedDft_be() RVC_ResetPitchMeter_be() RVC_DestructPitchRom_be() RVC_DestructPitchMeter_be() Deallocate_InterpolatedDft_be() DoAdvProcess_B() DoPitchExtract() FilterBank() dsr_afe_vad() get_vm(

48、) fnLog2() IsLowBandNoise() get_zcm() pre_process() iir_d() iir_s() RVC_MeasurePitch_be() ClearPitch_be() DirichletInterpolation_be() IsLowLevelInput_be() Finalize_be() IsContinuousPitch_be() Mpy_lw_sw() Mpy_lw_sw() PrepareSpectralPeaks_be() CalcSpectrum_be() Mpy_lw_sw() Mpy_lw_sw_Add() FindPeaks_be

49、() Prelim_ScaleDownAmpsOfHighFreqPeaks_be() qsort_be()* swap() CompareIpointAmp_be() RefineSpectralPeaks_be() sqrt_l_fix() Final_ScaleDownAmpsOfHighFreqPeaks_be() Mpy_lw_sw() FindPitchCandidates_be() NormalizeAmplitudes_be() CalcUtilityFunction_be() CreatePieceWiseConstantFunction_be() L_Extract() Mpy_32_16() qsort_be()* swap() Compare_ARRAY_OF_XPOINTS_be() LinkArrayOfPoints_be() AddSortedArrayOfPoints_be() LinkArrayOfPoints_be() ConvertLinkedListOfDiffPointsToUtilFunc_be() FindDominantLocalMaximaInU

展开阅读全文
相关资源
猜你喜欢
相关搜索

当前位置:首页 > 标准规范 > 国际标准 > 其他

copyright@ 2008-2019 麦多课文库(www.mydoc123.com)网站版权所有
备案/许可证编号:苏ICP备17064731号-1