標題: Titlebook: Document Analysis and Recognition - ICDAR 2023; 17th International C Gernot A. Fink,Rajiv Jain,Richard Zanibbi Conference proceedings 2023 [打印本頁] 作者: 非決定性 時間: 2025-3-21 16:54
書目名稱Document Analysis and Recognition - ICDAR 2023影響因子(影響力)
書目名稱Document Analysis and Recognition - ICDAR 2023影響因子(影響力)學科排名
書目名稱Document Analysis and Recognition - ICDAR 2023網絡公開度
書目名稱Document Analysis and Recognition - ICDAR 2023網絡公開度學科排名
書目名稱Document Analysis and Recognition - ICDAR 2023被引頻次
書目名稱Document Analysis and Recognition - ICDAR 2023被引頻次學科排名
書目名稱Document Analysis and Recognition - ICDAR 2023年度引用
書目名稱Document Analysis and Recognition - ICDAR 2023年度引用學科排名
書目名稱Document Analysis and Recognition - ICDAR 2023讀者反饋
書目名稱Document Analysis and Recognition - ICDAR 2023讀者反饋學科排名
作者: 休息 時間: 2025-3-21 22:24
An End-to-End Local Attention Based Model for Table Recognitiony powerful for table recognition. However, Transformer-based models usually struggle to process big tables due to the limitation of their global attention mechanism. In this paper, we propose a local attention mechanism to address the limitation of the global attention mechanism. We also present an 作者: 圣人 時間: 2025-3-22 00:43
Optimized Table Tokenization for?Table Structure Recognitione-structure can be recognized with impressive accuracy using Image-to-Markup-Sequence (Im2Seq) approaches. Taking only the image of a table, such models predict a sequence of tokens (e.g. in HTML, LaTeX) which represent the structure of the table. Since the token representation of the table structur作者: ALE 時間: 2025-3-22 06:18
Towards End-to-End Semi-Supervised Table Detection with?Deformable Transformerwe observe remarkable success in table detection. However, a significant amount of labeled data is required to train these models effectively. Many semi-supervised approaches are introduced to mitigate the need for a substantial amount of label data. These approaches use CNN-based detectors that rel作者: laxative 時間: 2025-3-22 12:43
SpaDen: Sparse and?Dense Keypoint Estimation for?Real-World Chart Understanding (KP), which are used to reconstruct the components within the plot area. Our novelty lies in detecting a fusion of continuous and discrete KP as predicted heatmaps. A combination of sparse and dense per-pixel objectives coupled with a uni-modal self-attention-based feature-fusion layer is applied t作者: 羊齒 時間: 2025-3-22 16:40
Generalization of?Fine Granular Extractions from?ChartsAnnotating a dataset and retraining for every new chart type with a shift in the spatial composition of chart elements, text role regions, legend preview styles, chart element shapes and text-role definitions, is a time-consuming and costly affair. Current approaches struggle to generalize to new ch作者: 羊齒 時間: 2025-3-22 20:56 作者: 表否定 時間: 2025-3-22 23:16
Language Independent Neuro-Symbolic Semantic Parsing for?Form Understandingpre-training. In contrast, humans can usually identify key-value pairings from a form only by looking at layouts, even if they don’t comprehend the language used. No prior research has been conducted to investigate how helpful layout information alone is for form understanding. Hence, we propose a u作者: EXALT 時間: 2025-3-23 04:23
DocILE Benchmark for?Document Information Localization and?Extractions documents, 100k synthetically generated documents, and nearly?1M unlabeled documents for unsupervised pre-training. The dataset has been built with knowledge of domain- and task-specific aspects, resulting in the following key features: (i) annotations in 55 classes, which surpasses the granularit作者: wreathe 時間: 2025-3-23 08:14
Robustness Evaluation of?Transformer-Based Form Field Extractors via?Form Attacksm transformations to evaluate the vulnerability of the state-of-the-art field extractors against form attacks from both OCR level and form level, including OCR location/order rearrangement, form background manipulation and form field-value augmentation. We conduct robustness evaluation using real in作者: 柳樹;枯黃 時間: 2025-3-23 12:44
Key-Value Information Extraction from?Full Handwritten Pages different steps that were so far performed by separate models: feature extraction, handwriting recognition and named entity recognition. We compare this integrated approach with traditional two-stage methods that perform handwriting recognition before named entity recognition, and present results a作者: Metamorphosis 時間: 2025-3-23 16:19 作者: 編輯才信任 時間: 2025-3-23 19:38 作者: 儀式 時間: 2025-3-24 01:32
DQ-DETR: Dynamic Queries Enhanced Detection Transformer for?Arbitrary Shape Text Detectiont instances from images with high localization accuracy. Unlike previous Transformer-based methods which take all control points on the boundaries/center-lines of all text instances as the queries of each Transformer decoder layer, we extend the query set for each decoder layer gradually, allowing t作者: Triglyceride 時間: 2025-3-24 04:39
Decoupling Visual-Semantic Features Learning with?Dual Masked Autoencoder for?Self-Supervised Scene ays, Masked Image Modeling?(MIM) shows superiority in visual representation learning, and several works introduce it into text recognition. In this paper, we take a further step and design a method for text-recognition-friendly self-supervised feature learning. Specifically, we propose to decouple v作者: 無思維能力 時間: 2025-3-24 09:23 作者: 殘廢的火焰 時間: 2025-3-24 11:47 作者: 開頭 時間: 2025-3-24 16:00
Conference proceedings 2023om 316 submissions, and are presented with 101 poster presentations...The papers are organized into the following topical sections: Graphics Recognition, Frontiers in Handwriting Recognition, Document Analysis and Recognition..作者: accrete 時間: 2025-3-24 22:53
0302-9743 elected from 316 submissions, and are presented with 101 poster presentations...The papers are organized into the following topical sections: Graphics Recognition, Frontiers in Handwriting Recognition, Document Analysis and Recognition..978-3-031-41678-1978-3-031-41679-8Series ISSN 0302-9743 Series E-ISSN 1611-3349 作者: Silent-Ischemia 時間: 2025-3-25 01:17 作者: 做方舟 時間: 2025-3-25 03:46
Palgrave Studies in Languages at Wars with their corresponding named entities. We compare our models to state-of-the-art methods on three public databases (IAM, ESPOSALLES, and POPP) and outperform previous performances on all three datasets.作者: Indict 時間: 2025-3-25 09:34
Robustness Evaluation of?Transformer-Based Form Field Extractors via?Form Attacksisruption of the neighboring words of field-values(. 10% drop in F1 score). Guided by the analysis, we make recommendations to improve the design of field extractors and the process of data collection. Code will be available at ..作者: 使長胖 時間: 2025-3-25 12:12
Key-Value Information Extraction from?Full Handwritten Pagess with their corresponding named entities. We compare our models to state-of-the-art methods on three public databases (IAM, ESPOSALLES, and POPP) and outperform previous performances on all three datasets.作者: FELON 時間: 2025-3-25 16:31
0302-9743 nference on Document Analysis and Recognition, ICDAR 2021, held in San José, CA, USA, in August 2023.?The 53 full papers were carefully reviewed and selected from 316 submissions, and are presented with 101 poster presentations...The papers are organized into the following topical sections: Graphics作者: 不開心 時間: 2025-3-25 21:07 作者: 惡心 時間: 2025-3-26 01:09 作者: 護航艦 時間: 2025-3-26 06:13 作者: 悲痛 時間: 2025-3-26 10:23 作者: 紳士 時間: 2025-3-26 13:21
SpaDen: Sparse and?Dense Keypoint Estimation for?Real-World Chart Understanding estimation and the combination of deep layer aggregation and corner pooling approaches. The results of our experiments provide extensive evaluation for the task of real-world chart data extraction. Our Code is publicly available (.).作者: Arteriography 時間: 2025-3-26 16:49
Generalization of?Fine Granular Extractions from?Chartse mentioned shifts in chart element distributions. We demonstrate the generalization capabilities of our models trained on the PlotQA train set by providing chart extraction results on out-of-distribution charts selected from the LeafQA dataset. We achieve an mAP of 90.64% and 92.18% for @0.90 IOU f作者: N防腐劑 時間: 2025-3-26 22:36
Improving Information Extraction from?Semi-structured Documents Using Attention Based Semi-variation. We tested the architecture on two artificially generated datasets: Gen-Invoices and Gen-Payslips and one real dataset: receipts issued from the SROIE ICDAR 2019 competition. The latter data set yielded an important F1 score of 97.94%, placing our system among the best systems on this dataset.作者: conceal 時間: 2025-3-27 03:04
Language Independent Neuro-Symbolic Semantic Parsing for?Form Understanding layout information to facilitate easy transfer across languages. To further improve the performance of ., and achieve isomorphism between entity-relation graphs and word-relation graphs, we use integer linear programming (ILP) based inference. Code is publicly available at ..作者: 表被動 時間: 2025-3-27 06:28
DocILE Benchmark for?Document Information Localization and?Extraction and DETR-based Table Transformer; applied to both tasks of the DocILE benchmark, with results shared in this paper, offering a quick starting point for future work. The dataset, baselines and supplementary material are available at ..作者: brother 時間: 2025-3-27 13:15
Information Extraction from?Documents: Question Answering Vs Token Classification in?Real-World Setud on Few-Shot Learning and finally Zero-Shot Learning..Our research showed that when dealing with clean and relatively short entities, it is still best to use token classification-based approach, while the QA approach could be a good alternative for noisy environment or long entities use-cases.作者: 別名 時間: 2025-3-27 17:00 作者: hallow 時間: 2025-3-27 20:14 作者: 舞蹈編排 時間: 2025-3-28 01:18
Decoupling Visual-Semantic Features Learning with?Dual Masked Autoencoder for?Self-Supervised Scene on this idea, we first propose a siamese network that aligns dual features with each other, then we explore the dual distillation with a co-teacher framework. Our proposed method shows the effectiveness of self-supervised scene text recognition with state-of-the-art performances on most benchmarks.作者: jocular 時間: 2025-3-28 02:28 作者: Hirsutism 時間: 2025-3-28 06:19
Ernestine Wohlfart,Manfred Zaumseilfied as replicable using the similar dataset under certain IoU values. No paper is identified as replicable using the new dataset. We offer observations on the causes of irreproducibility and irreplicability. All code and data are available on Codeocean at ..作者: 誘拐 時間: 2025-3-28 12:52 作者: 令人發(fā)膩 時間: 2025-3-28 15:23
Translanguaging for Empowerment and Equitycy improves significantly, inference time is halved compared to HTML-based models, and the predicted table structures are always syntactically correct. This in turn eliminates most post-processing needs. Popular table structure data-sets will be published in OTSL format to the community.作者: 無效 時間: 2025-3-28 22:15
https://doi.org/10.1007/978-981-99-8589-0able transformer) by +3.4 points on 10% labels of TableBank-both dataset and the previous CNN-based semi-supervised approach (Soft Teacher) by +1.8 points on 10% labels of PubLayNet dataset. We hope this work opens new possibilities towards semi-supervised and unsupervised table detection methods.作者: 使虛弱 時間: 2025-3-29 01:31
https://doi.org/10.1007/978-981-99-8589-0 estimation and the combination of deep layer aggregation and corner pooling approaches. The results of our experiments provide extensive evaluation for the task of real-world chart data extraction. Our Code is publicly available (.).作者: 蝕刻 時間: 2025-3-29 04:33 作者: 提名的名單 時間: 2025-3-29 10:28 作者: CREEK 時間: 2025-3-29 13:20 作者: 杠桿支點 時間: 2025-3-29 16:38
https://doi.org/10.1057/9780230289703 and DETR-based Table Transformer; applied to both tasks of the DocILE benchmark, with results shared in this paper, offering a quick starting point for future work. The dataset, baselines and supplementary material are available at ..作者: 咯咯笑 時間: 2025-3-29 19:45 作者: 抱狗不敢前 時間: 2025-3-30 03:01 作者: Scleroderma 時間: 2025-3-30 08:00 作者: uveitis 時間: 2025-3-30 10:15
https://doi.org/10.1007/978-3-031-27700-9 on this idea, we first propose a siamese network that aligns dual features with each other, then we explore the dual distillation with a co-teacher framework. Our proposed method shows the effectiveness of self-supervised scene text recognition with state-of-the-art performances on most benchmarks.作者: 熒光 時間: 2025-3-30 14:15 作者: FECK 時間: 2025-3-30 16:36 作者: 神經 時間: 2025-3-30 21:51 作者: Lipoprotein 時間: 2025-3-31 00:51
978-3-031-41678-1The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Switzerl作者: 改變立場 時間: 2025-3-31 08:21
Ernestine Wohlfart,Manfred Zaumseilpublished findings in the field. Replicability, the ability to affirm a finding using the same procedures on new data, has not been well studied. In this paper, we examine both reproducibility and replicability of a corpus of 16 papers on table structure recognition (TSR), an AI task aimed at identi作者: plasma-cells 時間: 2025-3-31 12:13 作者: FICE 時間: 2025-3-31 17:21