找回密碼
 To register

QQ登錄

只需一步,快速開始

掃一掃,訪問微社區(qū)

打印 上一主題 下一主題

Titlebook: Document Analysis and Recognition - ICDAR 2023; 17th International C Gernot A. Fink,Rajiv Jain,Richard Zanibbi Conference proceedings 2023

[復(fù)制鏈接]
樓主: MEDAL
41#
發(fā)表于 2025-3-28 14:35:12 | 只看該作者
42#
發(fā)表于 2025-3-28 22:36:47 | 只看該作者
43#
發(fā)表于 2025-3-29 00:22:43 | 只看該作者
E2TIMT: Efficient and?Effective Modal Adapter for?Text Image Machine Translation, both two-stage cascade and one-stage end-to-end architectures, suffer from different issues. The cascade models can benefit from the large-scale optical character recognition (OCR) and MT datasets but the two-stage architecture is redundant. The end-to-end models are efficient but suffer from trai
44#
發(fā)表于 2025-3-29 03:09:06 | 只看該作者
Open-Set Text Recognition via?Shape-Awareness Visual Reconstructionmpared to conventional counterparts, the OSTR task demands actively spotting and incrementally recognizing novel characters. Existing methods have demonstrated some success, yet confusion among similar characters remains to be a major challenge, potentially due to insufficient shape information pres
45#
發(fā)表于 2025-3-29 10:58:21 | 只看該作者
Accelerating Transformer-Based Scene Text Detection and?Recognition via?Token Pruningnd all current state-of-the-art models and have achieved excellent performance. However, the computational requirements of the transformer architecture makes training these methods slow and resource heavy. In this paper, we introduce a new token pruning strategy that significantly decreases training
46#
發(fā)表于 2025-3-29 15:21:34 | 只看該作者
Text Enhancement: Scene Text Recognition in?Hazy Weatherer adverse weather conditions with poor visibility remains challenging. To address this problem, we propose a text image enhancement network that can be embedded into a scene text recognizer in a pluggable manner. This network comprises multiple sets of digital image processing (DIP) units, which ar
47#
發(fā)表于 2025-3-29 18:40:03 | 只看該作者
Reading Between the?Lanes: Text VideoQA on?the?Roadtion is a challenging problem, while textual cues typically appear for a short time span, and early detection at a distance is necessary. Systems that exploit such information to assist the driver should not only extract and incorporate visual and textual cues from the video stream but also reason o
 關(guān)于派博傳思  派博傳思旗下網(wǎng)站  友情鏈接
派博傳思介紹 公司地理位置 論文服務(wù)流程 影響因子官網(wǎng) 吾愛論文網(wǎng) 大講堂 北京大學(xué) Oxford Uni. Harvard Uni.
發(fā)展歷史沿革 期刊點(diǎn)評 投稿經(jīng)驗(yàn)總結(jié) SCIENCEGARD IMPACTFACTOR 派博系數(shù) 清華大學(xué) Yale Uni. Stanford Uni.
QQ|Archiver|手機(jī)版|小黑屋| 派博傳思國際 ( 京公網(wǎng)安備110108008328) GMT+8, 2025-10-26 09:48
Copyright © 2001-2015 派博傳思   京公網(wǎng)安備110108008328 版權(quán)所有 All rights reserved
快速回復(fù) 返回頂部 返回列表
句容市| 清徐县| 广州市| 枣阳市| 伊通| 左贡县| 曲周县| 江都市| 山东省| 绥棱县| 东宁县| 梅河口市| 苏尼特左旗| 濉溪县| 河西区| 定西市| 承德县| 姜堰市| 九江市| 安溪县| 上高县| 巴马| 宁陵县| 霍林郭勒市| 平定县| 鹰潭市| 宾阳县| 灯塔市| 和政县| 汝南县| 平遥县| 南漳县| 鹿邑县| 游戏| 永春县| 汉源县| 晋宁县| 安阳市| 钟山县| 曲水县| 大英县|