Titlebook: Advances in Speech and Language Technologies for Iberian Languages; IberSPEECH 2014 Conf Juan Luis Navarro Mesa,Alfonso Ortega,Doroteo T. T

Original poster: Causalgia
41#
Posted on 2025-3-28 17:04:50
https://doi.org/10.1007/978-3-319-48354-2
... across the AHC iterations is done using reference speaker ground-truth labels to select the purer clustering as input for the global framework. Experiments on the REPERE phase 1 test database show improvements of around 6% absolute DER compared to the baseline system output.
42#
Posted on 2025-3-28 19:47:46
43#
Posted on 2025-3-29 00:12:10
Xiaobin Qiu, Hongqian Chen, Nan Zhou
...d. All processing is done locally on the phone, which is able to react in real time to incoming keywords. In this paper we describe the application, review the matching algorithm we used, and show experimentally that it successfully reacts to voice commands in a variety of acoustic conditions.
44#
Posted on 2025-3-29 06:08:03
Global Speaker Clustering towards Optimal Stopping Criterion in Binary Key Speaker Diarization
... across the AHC iterations is done using reference speaker ground-truth labels to select the purer clustering as input for the global framework. Experiments on the REPERE phase 1 test database show improvements of around 6% absolute DER compared to the baseline system output.
45#
Posted on 2025-3-29 11:14:38
CVX-Optimized Beamforming and Vector Taylor Series Compensation with German ASR Employing Star-Shaped
...medium-vocabulary German database for microphone array made of embedded clean signals contaminated with real room impulse responses and mixed in a ‘natural’ way with real noises. We show that the proposed enhancement framework performs better than other related systems on the presented database.
46#
Posted on 2025-3-29 13:35:42
Flexible Stand-Alone Keyword Recognition Application Using Dynamic Time Warping
...d. All processing is done locally on the phone, which is able to react in real time to incoming keywords. In this paper we describe the application, review the matching algorithm we used, and show experimentally that it successfully reacts to voice commands in a variety of acoustic conditions.
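For readers unfamiliar with the matching technique named in this title, here is a minimal sketch of keyword matching by dynamic time warping. It is not the paper's implementation: the function names `dtw_distance` and `best_keyword`, the Euclidean local cost, and the toy one-dimensional features are all assumptions made for illustration.

```python
import math


def dtw_distance(a, b):
    """Accumulated DTW cost between two sequences of feature vectors.

    Assumption: Euclidean distance as the local frame-to-frame cost and
    the basic step pattern (match, insertion, deletion) with no
    path constraint or length normalization.
    """
    n, m = len(a), len(b)
    # cost[i][j] = best accumulated cost aligning a[:i] with b[:j]
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = math.dist(a[i - 1], b[j - 1])
            cost[i][j] = local + min(
                cost[i - 1][j],      # frame of a repeated against b[j-1]
                cost[i][j - 1],      # frame of b repeated against a[i-1]
                cost[i - 1][j - 1],  # both sequences advance
            )
    return cost[n][m]


def best_keyword(query, templates):
    """Return the name of the stored template closest to the query."""
    return min(templates, key=lambda name: dtw_distance(query, templates[name]))


# Hypothetical toy templates: one feature per frame.
templates = {
    "yes": [(0.0,), (1.0,), (2.0,)],
    "no": [(5.0,), (5.0,), (5.0,)],
}
# A slower utterance of "yes": the middle frame is stretched.
query = [(0.0,), (1.0,), (1.0,), (2.0,)]
match = best_keyword(query, templates)
```

Because the warping path may repeat frames, the stretched query aligns with the "yes" template at zero cost, which is what makes DTW tolerant of speaking-rate differences. A real application would compare frames of acoustic features such as MFCCs and would likely add a rejection threshold for non-keyword audio.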
47#
Posted on 2025-3-29 19:37:59
Statistical Text-to-Speech Synthesis of Spanish Subtitles
... the best of our knowledge, this is the first time that a DNN-based TTS system has been implemented for the synthesis of Spanish. A comparative objective evaluation between both models has been carried out. Our results show that DNN-based systems can reconstruct speech waveforms more accurately.
48#
Posted on 2025-3-29 20:50:04
Unsupervised Training of PLDA with Variational Bayes
...re latent variables. We experimented on unlabeled NIST SRE data. The trained models were evaluated on NIST SRE10. Compared to cosine distance, unsupervised PLDA improved EER by 28% and minimum DCF by 36%.
49#
Posted on 2025-3-30 02:32:30
50#
Posted on 2025-3-30 06:08:38