Titlebook: Advances in Speech and Language Technologies for Iberian Languages; IberSPEECH 2014 Conf Juan Luis Navarro Mesa, Alfonso Ortega, Doroteo T. T

Original poster: Causalgia
41#
Posted on 2025-3-28 17:04:50
https://doi.org/10.1007/978-3-319-48354-2
... across the AHC iterations is done using reference speaker ground-truth labels to select the purer clustering as input for the global framework. Experiments on the REPERE phase 1 test database show improvements of around 6% absolute DER compared to the baseline system output.
42#
Posted on 2025-3-28 19:47:46
43#
Posted on 2025-3-29 00:12:10
Xiaobin Qiu, Hongqian Chen, Nan Zhou
...d. All processing is done locally on the phone, which is able to react in real time to incoming keywords. In this paper we describe the application, review the matching algorithm we used and show experimentally that it successfully reacts to voice commands in a variety of acoustic conditions.
44#
Posted on 2025-3-29 06:08:03
Global Speaker Clustering towards Optimal Stopping Criterion in Binary Key Speaker Diarization
... across the AHC iterations is done using reference speaker ground-truth labels to select the purer clustering as input for the global framework. Experiments on the REPERE phase 1 test database show improvements of around 6% absolute DER compared to the baseline system output.
45#
Posted on 2025-3-29 11:14:38
CVX-Optimized Beamforming and Vector Taylor Series Compensation with German ASR Employing Star-Shaped...
...ium-vocabulary German database for microphone arrays, made of embedded clean signals contaminated with real room impulse responses and mixed in a 'natural' way with real noises. We show that the proposed enhancement framework performs better than other related systems on the presented database.
46#
Posted on 2025-3-29 13:35:42
Flexible Stand-Alone Keyword Recognition Application Using Dynamic Time Warping
...d. All processing is done locally on the phone, which is able to react in real time to incoming keywords. In this paper we describe the application, review the matching algorithm we used and show experimentally that it successfully reacts to voice commands in a variety of acoustic conditions.
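For readers unfamiliar with the matching step, here is a minimal sketch of textbook dynamic time warping used for template-based keyword matching. It is not the paper's implementation: the function names `dtw_distance` and `recognize`, the Euclidean frame distance, the length normalization and the rejection threshold are all assumptions made for illustration.

```python
import math

def dtw_distance(template, query):
    """Accumulated DTW cost between two sequences of feature vectors.

    Each sequence is a list of equal-dimension frames (lists of floats).
    Uses the classic step pattern: match, insertion, deletion.
    """
    n, m = len(template), len(query)
    acc = [[math.inf] * (m + 1) for _ in range(n + 1)]
    acc[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            # Local cost: Euclidean distance between the two frames (assumed metric).
            local = math.dist(template[i - 1], query[j - 1])
            acc[i][j] = local + min(acc[i - 1][j], acc[i][j - 1], acc[i - 1][j - 1])
    return acc[n][m]

def recognize(query, templates, threshold):
    """Return the label of the closest template, or None if nothing is close enough.

    `templates` maps a keyword label to one recorded feature sequence.
    The cost is divided by the summed lengths so that long and short
    keywords are comparable (an assumed normalization).
    """
    best_label, best_cost = None, math.inf
    for label, tmpl in templates.items():
        cost = dtw_distance(tmpl, query) / (len(tmpl) + len(query))
        if cost < best_cost:
            best_label, best_cost = label, cost
    return best_label if best_cost <= threshold else None

# Toy one-dimensional "features"; a real system would use e.g. MFCC frames.
templates = {"yes": [[0.0], [1.0], [2.0]], "no": [[5.0], [5.0], [5.0]]}
print(recognize([[0.0], [1.0], [1.0], [2.0]], templates, threshold=0.5))
```

The quadratic table is cheap for short keyword templates, which is one reason this kind of matching can plausibly run locally on a phone.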
47#
Posted on 2025-3-29 19:37:59
Statistical Text-to-Speech Synthesis of Spanish Subtitles
...the best of our knowledge, this is the first time that a DNN-based TTS system has been implemented for the synthesis of Spanish. A comparative objective evaluation between both models has been carried out. Our results show that DNN-based systems can reconstruct speech waveforms more accurately.
48#
Posted on 2025-3-29 20:50:04
Unsupervised Training of PLDA with Variational Bayes
...re latent variables. We experimented on unlabeled NIST SRE data. The trained models were evaluated on NIST SRE10. Compared to cosine distance, unsupervised PLDA improved EER by 28% and minimum DCF by 36%.
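As a rough illustration of the baseline and metric named in the abstract above, the sketch below scores trial pairs with cosine similarity and estimates the equal error rate (EER) by sweeping a threshold over the observed scores. The function names, the toy scores and the simple threshold sweep are assumptions for illustration; this is not the NIST scoring tooling and does not reproduce the paper's numbers.

```python
import math

def cosine_score(x, y):
    """Cosine similarity between two speaker vectors (higher means more similar)."""
    dot = sum(a * b for a, b in zip(x, y))
    return dot / (math.sqrt(sum(a * a for a in x)) * math.sqrt(sum(b * b for b in y)))

def equal_error_rate(target_scores, nontarget_scores):
    """Approximate EER: the operating point where miss and false-alarm rates meet.

    A trial is accepted when its score is >= the threshold. Every observed
    score is tried as a threshold, and the point with the smallest gap
    between the two error rates is reported as their average.
    """
    best_gap, eer = math.inf, None
    for t in sorted(set(target_scores) | set(nontarget_scores)):
        miss = sum(s < t for s in target_scores) / len(target_scores)
        false_alarm = sum(s >= t for s in nontarget_scores) / len(nontarget_scores)
        gap = abs(miss - false_alarm)
        if gap < best_gap:
            best_gap, eer = gap, (miss + false_alarm) / 2
    return eer

# Made-up scores: same-speaker trials vs. different-speaker trials.
targets = [0.9, 0.6, 0.4, 0.8]
nontargets = [0.1, 0.5, 0.3, 0.7]
print(equal_error_rate(targets, nontargets))
```

Swapping `cosine_score` for a PLDA log-likelihood ratio leaves `equal_error_rate` unchanged, since the metric only sees the list of trial scores.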
49#
Posted on 2025-3-30 02:32:30
50#
Posted on 2025-3-30 06:08:38