找回密碼
 To register

QQ登錄

只需一步,快速開始

掃一掃,訪問微社區(qū)

打印 上一主題 下一主題

Titlebook: Web Corpus Construction; Roland Sch?fer,Felix Bildhauer Book 2013 Springer Nature Switzerland AG 2013

[復(fù)制鏈接]
樓主: nourish
11#
發(fā)表于 2025-3-23 11:34:05 | 只看該作者
Roland Sch?fer,Felix Bildhauerlin erschienenen Journal ?Stadterneuerung“ — vier zweispaltige Din A4-Seiten plus zwei Seiten Inhaltsangabe der Zeitschrift, aus der hervorgeht, da? der Aufsatz relativ weit hinten, als zweitletzter, plaziert worden ist. Die Ausgabe dieser Zeitschrift von 1987 tr?gt den Titel ?Idee, Proze?, Ergebnis
12#
發(fā)表于 2025-3-23 15:51:54 | 只看該作者
13#
發(fā)表于 2025-3-23 18:26:51 | 只看該作者
Web Corpora,rpus for Computational Linguistics tasks. In addition, many freely available corpora cannot be downloaded as a whole, which is required for many applications in Computational Linguistics. Examples of the above include:
14#
發(fā)表于 2025-3-24 01:31:34 | 只看該作者
Web Corpora,rpus for Computational Linguistics tasks. In addition, many freely available corpora cannot be downloaded as a whole, which is required for many applications in Computational Linguistics. Examples of the above include:
15#
發(fā)表于 2025-3-24 02:40:05 | 只看該作者
Web Corpora,etimes too small, sometimes too unbalanced, or they are balanced according to inappropriate criteria for the task, sometimes too close to the respective standard language (again, for certain types of research questions), and sometimes they are simply too expensive. Sometimes, it is also the case tha
16#
發(fā)表于 2025-3-24 08:55:57 | 只看該作者
Web Corpora,etimes too small, sometimes too unbalanced, or they are balanced according to inappropriate criteria for the task, sometimes too close to the respective standard language (again, for certain types of research questions), and sometimes they are simply too expensive. Sometimes, it is also the case tha
17#
發(fā)表于 2025-3-24 13:12:40 | 只看該作者
Book 2013is data for linguistic research is to compile a static corpus for a given language. There are several adavantages of this approach: (i) Working with such corpora obviates the problems encountered when using Internet search engines in quantitative linguistic research (such as non-transparent ranking
18#
發(fā)表于 2025-3-24 17:09:11 | 只看該作者
19#
發(fā)表于 2025-3-24 20:12:14 | 只看該作者
Book 2013Among these tasks are the sampling process (i.e., web crawling) and the usual cleanups including boilerplate removal and removal of duplicated content. Linguistic processing and problems with linguistic processing coming from the different kinds of noise in web corpora are also covered. Finally, the
20#
發(fā)表于 2025-3-25 01:26:06 | 只看該作者
1947-4040 ed content. Linguistic processing and problems with linguistic processing coming from the different kinds of noise in web corpora are also covered. Finally, the978-3-031-01024-8978-3-031-02152-7Series ISSN 1947-4040 Series E-ISSN 1947-4059
 關(guān)于派博傳思  派博傳思旗下網(wǎng)站  友情鏈接
派博傳思介紹 公司地理位置 論文服務(wù)流程 影響因子官網(wǎng) 吾愛論文網(wǎng) 大講堂 北京大學(xué) Oxford Uni. Harvard Uni.
發(fā)展歷史沿革 期刊點(diǎn)評 投稿經(jīng)驗(yàn)總結(jié) SCIENCEGARD IMPACTFACTOR 派博系數(shù) 清華大學(xué) Yale Uni. Stanford Uni.
QQ|Archiver|手機(jī)版|小黑屋| 派博傳思國際 ( 京公網(wǎng)安備110108008328) GMT+8, 2025-10-15 01:44
Copyright © 2001-2015 派博傳思   京公網(wǎng)安備110108008328 版權(quán)所有 All rights reserved
快速回復(fù) 返回頂部 返回列表
大化| 景宁| 宁陵县| 恭城| 阿合奇县| 临泉县| 香格里拉县| 宜阳县| 胶州市| 吉隆县| 田东县| 邵东县| 虹口区| 廊坊市| 绥滨县| 永登县| 长寿区| 沂水县| 牙克石市| 达拉特旗| 刚察县| 双峰县| 芦溪县| 宜兰县| 神农架林区| 北京市| 辛集市| 桂东县| 巨鹿县| 武穴市| 台中市| 五大连池市| 隆化县| 栾川县| 翁源县| 罗平县| 石河子市| 仁化县| 肇庆市| 株洲市| 开封县|