標題: Titlebook: Speech and Computer; 25th International C Alexey Karpov,K. Samudravijaya,S. R. Mahadeva Pras Conference proceedings 2023 The Editor(s) (if [打印本頁] 作者: Suture 時間: 2025-3-21 18:55
書目名稱Speech and Computer影響因子(影響力)
書目名稱Speech and Computer影響因子(影響力)學(xué)科排名
書目名稱Speech and Computer網(wǎng)絡(luò)公開度
書目名稱Speech and Computer網(wǎng)絡(luò)公開度學(xué)科排名
書目名稱Speech and Computer被引頻次
書目名稱Speech and Computer被引頻次學(xué)科排名
書目名稱Speech and Computer年度引用
書目名稱Speech and Computer年度引用學(xué)科排名
書目名稱Speech and Computer讀者反饋
書目名稱Speech and Computer讀者反饋學(xué)科排名
作者: 苦笑 時間: 2025-3-21 23:07 作者: 舔食 時間: 2025-3-22 01:58
Gauri Deshpande,Bj?rn W. Schuller,Pallavi Deshpande,Anuradha Rajiv Joshi,S. K. Oza,Sachin Patels of the three methodologies (probabilistic or stochastic modelling, fuzzy sets based analysis, antioptimization of structures) to deal with various uncertainties? and deepen the discussion of their pros and cons..978-3-7091-1670-8978-3-7091-1306-6Series ISSN 0254-1971 Series E-ISSN 2309-3706 作者: 指派 時間: 2025-3-22 07:58 作者: 愛哭 時間: 2025-3-22 12:01 作者: Creatinine-Test 時間: 2025-3-22 15:45
Pradeep Rangappa,Aditya Kiran Brahma,Venkatesh Vayyavuru,Rishi Yadav,Hemant Misra,Kasturi Karuna with nonnegative entries {a;}f=l. n Denote by R[a](:e) monomial in n variables of the form: n R[a](:e) = IT :ef‘; ;=1 d(a) = 2:7=1 ai is the total degree of monomial R[a]. Each polynomial in n variables can be written as sum of monomials with nonzero coefficients: P(:e) = L caR[a](:e), aEA{P) IX x 作者: 共同確定為確 時間: 2025-3-22 17:37 作者: elastic 時間: 2025-3-23 00:18 作者: Ascribe 時間: 2025-3-23 04:40
Irina Kipyatkova,Ildar Kagirovpplications of NDO. The following topics were considered in separate sessions: General motivation for research in NDO: nondifferentiability in applied problems, nondifferentiable mathematical models. Numerical methods for solving nondifferentiable optimization problems, numerical experiments, compar作者: cathartic 時間: 2025-3-23 08:00
Sougata Mukherjee,Jagabandhu Mishra,S. R. Mahadeva Prasannapplications of NDO. The following topics were considered in separate sessions: General motivation for research in NDO: nondifferentiability in applied problems, nondifferentiable mathematical models. Numerical methods for solving nondifferentiable optimization problems, numerical experiments, compar作者: THROB 時間: 2025-3-23 12:27 作者: 飛來飛去真休 時間: 2025-3-23 14:31
Ashwini Dasare,Amartya Roy Chowdhury,Aditya Srinivas Menon,Konjengbam Anand,K. T. Deepak,S. R. M. Pr indicates, our chief concern is with (i) nondifferentiable mathematical programs, and (ii) two-level optimization problems. In the first half of the book, we study basic theory for general smooth and nonsmooth functions of many variables. After providing some background, we extend traditional (diff作者: 釘牢 時間: 2025-3-23 19:31
Ankita,Shambhavi,Syed Shahnawazuddin indicates, our chief concern is with (i) nondifferentiable mathematical programs, and (ii) two-level optimization problems. In the first half of the book, we study basic theory for general smooth and nonsmooth functions of many variables. After providing some background, we extend traditional (diff作者: Aggressive 時間: 2025-3-23 23:41
Analysing Breathing Patterns in?Reading and?Spontaneous Speech. By comparing the performance across speakers, speech categories, and speech-breathing categories, we aim to uncover the factors influencing SBreathNet’s effectiveness when applied to these two types of speech signals.作者: BIBLE 時間: 2025-3-24 04:46 作者: 價值在貶值 時間: 2025-3-24 08:14
Analysis of?a?Hinglish ASR System’s Performance for?Fraud Detectionother equally important aspect while doing deployment of speech technology based products is that it is rather difficult to know if the performance of an ASR engine is adequate for its output to be used for a down-stream task. In this paper, we present our study of how the performance of an ASR engi作者: Goblet-Cells 時間: 2025-3-24 12:59 作者: Pelvic-Floor 時間: 2025-3-24 15:18
Improvements in?Language Modeling, Voice Activity Detection, and?Lexicon in?OpenASR21 Low Resource Lxicon from public text is beneficial for languages where the out-of-vocabulary rate is high, and outline conditions for reducing the WER. Adding an attention layer to the TDNN (time delay neural net) based voice activity detector reduced the WER for 17 out of the 18 languages. With all the improveme作者: 葡萄糖 時間: 2025-3-24 21:47 作者: 領(lǐng)袖氣質(zhì) 時間: 2025-3-24 23:30 作者: Incumbent 時間: 2025-3-25 04:23 作者: ETHER 時間: 2025-3-25 07:47 作者: 終點 時間: 2025-3-25 15:08
Code-Mixed Text-to-Speech Synthesis Under Low-Resource Constraintspeaker adaptation and multi-speaker training with Tacotron2 + Waveglow setup to show that the former approach works better. These approaches are also coupled with transfer learning and decoder-only fine-tuning to improve performance. We compare these approaches with the Google TTS and report a posit作者: Hla461 時間: 2025-3-25 18:43
Curriculum Learning Based Approach for?Faster Convergence of?TTS Modeloring functions based on text and acoustic features and achieved faster convergence of the end-to-end TTS model. We found ’text-length’ or the number of phonemes/characters in text to be a simple yet most effective measure of difficulty for designing curriculum for Text-to-Speech task. Using text-le作者: 針葉樹 時間: 2025-3-25 21:19 作者: RENIN 時間: 2025-3-26 01:22 作者: Chemotherapy 時間: 2025-3-26 06:07 作者: 洞察力 時間: 2025-3-26 10:17
Leena Dihingia,Prashant Bannulmath,Amartya Roy Chowdhury,S.R.M Prasanna,K.T Deepak,Tehreem Sheikh作者: 極大痛苦 時間: 2025-3-26 13:06
Alexey Karpov,K. Samudravijaya,S. R. Mahadeva Pras作者: photophobia 時間: 2025-3-26 20:05
978-3-031-48311-0The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Switzerl作者: 正式通知 時間: 2025-3-27 00:31
Speech and Computer978-3-031-48312-7Series ISSN 0302-9743 Series E-ISSN 1611-3349 作者: ethnology 時間: 2025-3-27 02:34
Lecture Notes in Computer Sciencehttp://image.papertrans.cn/s/image/874038.jpg作者: bisphosphonate 時間: 2025-3-27 08:36
https://doi.org/10.1007/978-3-031-48312-7acoustic signal processing; artificial intelligence; automatic speech recognition; correlation analysis作者: 改革運動 時間: 2025-3-27 09:43
Conference proceedings 2023igital signal processing; speech prosody; natural language processing; child speech processing; speech processing for medicine;?industrial speech and language technology; speech technology for under-resourced languages; speech analysis and synthesis; speaker and language identification, verification and diarization..作者: Permanent 時間: 2025-3-27 14:04 作者: 敵手 時間: 2025-3-27 21:25 作者: notification 時間: 2025-3-27 23:19 作者: FISC 時間: 2025-3-28 02:44 作者: 小平面 時間: 2025-3-28 06:22 作者: crescendo 時間: 2025-3-28 11:02
Improvements in?Language Modeling, Voice Activity Detection, and?Lexicon in?OpenASR21 Low Resource L word error rates (WER) with text downloaded from the internet for only the case sensitive languages, since the development and evaluation audio contained broadcast news. For the 15 low resource languages, participants showed only small gains for some of the languages. The reason is that the develop作者: Lasting 時間: 2025-3-28 14:49
Phone Durations Modeling for Livvi-Karelian ASRnguage (Livvi-Karelian dialect). The main issues addressed within this work are related to acoustic modeling, viz.?the treatment of long and short phonemes. There are two approaches to modeling phonological duration in the so-called quantity languages: representation of long and short phonemes as di作者: 鴿子 時間: 2025-3-28 19:39 作者: Tractable 時間: 2025-3-28 23:06
Study of?Various End-to-End Keyword Spotting Systems on?the?Bengali Language Under Low-Resource Condvarious keyword techniques in the Indian regional Bengali language under low-resource conditions. In this context, we study several KWS techniques which are common in the English language in Bengali namely: Conv1D, Conv2D+attention, Conv2D+multi head attention, VGG, Dense-net, and Vision transformer作者: 刪減 時間: 2025-3-29 06:13
Bridging the?Gap: Towards Linguistic Resource Development for?the?Low-Resource Lambani Languagestic resources makes it challenging for technology development of under-resource languages. This paper aims at developing linguistic tools for Lambamni, an under-resourced tribal language of India through corpora creation, annotation, and transfer learning from contact language. Based on the annotat作者: Asparagus 時間: 2025-3-29 09:42
Studying the?Effect of?Frame-Level Concatenation of?GFCC and?TS-MFCC Features on?Zero-Shot Children’catenation of two complementary front-end acoustic features. The acoustic features chosen are TANDEM-STRAIGHT-based Mel-frequency cepstral coefficients (TS-MFCC) and Gamma-tone frequency cepstral coefficients (GFCC). The GFCC model the cochlear response of the human auditory system. The MFCC feature作者: 聯(lián)邦 時間: 2025-3-29 13:14 作者: 奇思怪想 時間: 2025-3-29 19:36 作者: Herd-Immunity 時間: 2025-3-29 23:13
An ASR Corpus in?Chhattisgarhi, a?Low Resource Indian Language including Chhattisgarhi. The paper elaborates on the entire process of such a low-resource database preparation in a crowd-sourced manner. Through this work we have open-sourced around 250?h of dialect-rich, domain-rich Chhattisgarhi ASR dataset to popularize the scope of voice technology to the Ch作者: ARY 時間: 2025-3-30 00:46
Cross Lingual Style Transfer Using Multiscale Loss Function for?Soliga: A Low Resource Tribal Langua on a multi-scale loss function, using a deep learning framework for syntactically similar languages Kannada and Soliga, under a low resource setup. The existing speaker adaptation methods usually depend on monolingual data and cannot be directly adopted for cross-lingual data. The proposed method c作者: degradation 時間: 2025-3-30 07:28 作者: 小故事 時間: 2025-3-30 08:19
Curriculum Learning Based Approach for?Faster Convergence of?TTS Modellmost human-like speech. Recent Text-to-Speech models use a sequence-to-sequence architecture that directly converts text or phoneme sequence into low-level acoustic representation such as spectrogram. These end-to-end models need a large dataset for training, and with conventional learning methodol作者: Congestion 時間: 2025-3-30 12:50
0302-9743 peech and language technology; speech technology for under-resourced languages; speech analysis and synthesis; speaker and language identification, verification and diarization..978-3-031-48311-0978-3-031-48312-7Series ISSN 0302-9743 Series E-ISSN 1611-3349 作者: DIKE 時間: 2025-3-30 16:57 作者: DALLY 時間: 2025-3-30 23:45
0302-9743 Computer, SPECOM 2023, held in Dharwad, India, during November 29–December 2, 2023..The 94 papers included in these proceedings were carefully reviewed and selected from 174 submissions.?They focus on all aspects of speech science and technology:??automatic speech recognition; computational paraling作者: 延期 時間: 2025-3-31 02:13 作者: 喧鬧 時間: 2025-3-31 09:03 作者: 字形刻痕 時間: 2025-3-31 11:38