找回密碼
 To register

QQ登錄

只需一步,快速開(kāi)始

掃一掃,訪問(wèn)微社區(qū)

打印 上一主題 下一主題

Titlebook: Advances in Speech and Language Technologies for Iberian Languages; IberSPEECH 2014 Conf Juan Luis Navarro Mesa,Alfonso Ortega,Doroteo T. T

[復(fù)制鏈接]
樓主: Causalgia
41#
發(fā)表于 2025-3-28 17:04:50 | 只看該作者
https://doi.org/10.1007/978-3-319-48354-2across the AHC iterations is done using reference speaker ground-truth labels to select the purer clustering as input for the global framework. Experiments on the REPERE phase 1 test database show improvements of around 6% absolute DER compared to the baseline system output.
42#
發(fā)表于 2025-3-28 19:47:46 | 只看該作者
43#
發(fā)表于 2025-3-29 00:12:10 | 只看該作者
Xiaobin Qiu,Hongqian Chen,Nan Zhoud. All processing is done locally on the phone, which is able to react in real-time to incoming keywords. In this paper we describe the application, review the matching algorithm we used and show experimentally that it successfully reacts to voice commands in a variety of acoustic conditions.
44#
發(fā)表于 2025-3-29 06:08:03 | 只看該作者
Global Speaker Clustering towards Optimal Stopping Criterion in Binary Key Speaker Diarizationacross the AHC iterations is done using reference speaker ground-truth labels to select the purer clustering as input for the global framework. Experiments on the REPERE phase 1 test database show improvements of around 6% absolute DER compared to the baseline system output.
45#
發(fā)表于 2025-3-29 11:14:38 | 只看該作者
CVX-Optimized Beamforming and Vector Taylor Series Compensation with German ASR Employing Star-Shapedium-vocabulary German database for microphone array made of embedded clean signals contaminated with real room impulsive responses and mixed in a ‘natural’ way with real noises. We show that the proposed enhancement framework performs better than other related systems on the presented database.
46#
發(fā)表于 2025-3-29 13:35:42 | 只看該作者
Flexible Stand-Alone Keyword Recognition Application Using Dynamic Time Warpingd. All processing is done locally on the phone, which is able to react in real-time to incoming keywords. In this paper we describe the application, review the matching algorithm we used and show experimentally that it successfully reacts to voice commands in a variety of acoustic conditions.
47#
發(fā)表于 2025-3-29 19:37:59 | 只看該作者
Statistical Text-to-Speech Synthesis of Spanish Subtitlesthe best of our knowledge, this is the first time that a DNN-based TTS system has been implemented for the synthesis of Spanish. A comparative objective evaluation between both models has been carried out. Our results show that DNN-based systems can reconstruct speech waveforms more accurately.
48#
發(fā)表于 2025-3-29 20:50:04 | 只看該作者
Unsupervised Training of PLDA with Variational Bayesre latent variables. We experimented on unlabeled NIST SRE data. The trained models were evaluated on NIST SRE10. Compared to cosine distance, unsupervised PLDA improved EER by 28% and minimum DCF by 36%.
49#
發(fā)表于 2025-3-30 02:32:30 | 只看該作者
50#
發(fā)表于 2025-3-30 06:08:38 | 只看該作者
 關(guān)于派博傳思  派博傳思旗下網(wǎng)站  友情鏈接
派博傳思介紹 公司地理位置 論文服務(wù)流程 影響因子官網(wǎng) 吾愛(ài)論文網(wǎng) 大講堂 北京大學(xué) Oxford Uni. Harvard Uni.
發(fā)展歷史沿革 期刊點(diǎn)評(píng) 投稿經(jīng)驗(yàn)總結(jié) SCIENCEGARD IMPACTFACTOR 派博系數(shù) 清華大學(xué) Yale Uni. Stanford Uni.
QQ|Archiver|手機(jī)版|小黑屋| 派博傳思國(guó)際 ( 京公網(wǎng)安備110108008328) GMT+8, 2025-10-11 17:14
Copyright © 2001-2015 派博傳思   京公網(wǎng)安備110108008328 版權(quán)所有 All rights reserved
快速回復(fù) 返回頂部 返回列表
林芝县| 略阳县| 呼和浩特市| 巧家县| 平度市| 股票| 连州市| 西安市| 保康县| 通河县| 青浦区| 富锦市| 湘西| 稻城县| 晋江市| 特克斯县| 泽州县| 天柱县| 绥滨县| 宣恩县| 朝阳市| 响水县| 安吉县| 南通市| 隆昌县| 正定县| 金寨县| 许昌县| 石阡县| 临泽县| 静乐县| 颍上县| 乌海市| 富锦市| 中阳县| 阿拉尔市| 德兴市| 双牌县| 高邑县| 南郑县| 新乡县|