派博傳思國(guó)際中心

標(biāo)題: Titlebook: Building and Using Comparable Corpora; Serge Sharoff,Reinhard Rapp,Pascale Fung Book 2013 Springer-Verlag Berlin Heidelberg 2013 68T50, 91 [打印本頁(yè)]

作者: 債務(wù)人    時(shí)間: 2025-3-21 16:45
書目名稱Building and Using Comparable Corpora影響因子(影響力)




書目名稱Building and Using Comparable Corpora影響因子(影響力)學(xué)科排名




書目名稱Building and Using Comparable Corpora網(wǎng)絡(luò)公開度




書目名稱Building and Using Comparable Corpora網(wǎng)絡(luò)公開度學(xué)科排名




書目名稱Building and Using Comparable Corpora被引頻次




書目名稱Building and Using Comparable Corpora被引頻次學(xué)科排名




書目名稱Building and Using Comparable Corpora年度引用




書目名稱Building and Using Comparable Corpora年度引用學(xué)科排名




書目名稱Building and Using Comparable Corpora讀者反饋




書目名稱Building and Using Comparable Corpora讀者反饋學(xué)科排名





作者: flammable    時(shí)間: 2025-3-21 23:56
http://image.papertrans.cn/b/image/191857.jpg
作者: 脫離    時(shí)間: 2025-3-22 01:40
https://doi.org/10.1007/978-3-642-60550-5The beginning of the 1990s marked a radical turn in various NLP applications towards using large collections of texts.
作者: 破裂    時(shí)間: 2025-3-22 06:27

作者: seduce    時(shí)間: 2025-3-22 10:19

作者: 碳水化合物    時(shí)間: 2025-3-22 14:01
https://doi.org/10.1007/978-3-642-20128-868T50, 91F20; Corpus linguistics; Machine Translation; Natural Language Processing; Terminology
作者: aptitude    時(shí)間: 2025-3-22 17:26

作者: Femish    時(shí)間: 2025-3-22 22:13

作者: 盡管    時(shí)間: 2025-3-23 04:50
Geometrie verstehen: statisch – kinematischextraction out of comparable corpora, and an evaluation we have subjected them to: bilingual terminology has been extracted out of automatically collected domain-comparable web corpora, in Basque and English, and the resulting terminology lists have been validated automatically using a specialized d
作者: Exclaim    時(shí)間: 2025-3-23 08:37
https://doi.org/10.1007/978-3-8348-9836-4thods because statistical comparison requires the use of statistical tests, which again require certain properties of the data under analysis. Linguistic data, however, do not automatically meet these requirements. In corpus linguistics and other linguistic fields, statistical methods are often appl
作者: uveitis    時(shí)間: 2025-3-23 10:48

作者: MOT    時(shí)間: 2025-3-23 17:43
Mathematik semantologisch verstehens still art rather than proper science. Here I will discuss attempts at approximating the content of corpora collected from the Web using various methods, also in comparison to traditional corpora, such as the BNC. The procedure for estimating the corpus composition is based on selecting keywords, f
作者: Evolve    時(shí)間: 2025-3-23 18:24

作者: BET    時(shí)間: 2025-3-23 22:15
Die Grenzen wissenschaftlicher Verfahren,. These include, among many others, the field of Natural Language Processing. The Leipzig Corpora Collection creates and uses such corpora for more than fifteen years. However, the work on preprocessing distributed resources to ensure homogeneity and thus comparability is a steady process. As a resu
作者: aesthetic    時(shí)間: 2025-3-24 02:59

作者: TOM    時(shí)間: 2025-3-24 07:13

作者: negotiable    時(shí)間: 2025-3-24 12:55

作者: ADOPT    時(shí)間: 2025-3-24 18:32
https://doi.org/10.1007/978-3-322-84284-8en two comparable corpora in the same language and the same domain, but displaying two different discourse types (lay and specialized), specific paraphrases can be spotted which provide a dimension along which these discourse types can be contrasted. Detecting such paraphrases in comparable corpora
作者: CONE    時(shí)間: 2025-3-24 22:59

作者: 圣人    時(shí)間: 2025-3-25 01:38
https://doi.org/10.1007/978-3-662-46382-6uld be the comparability for specialized comparable corpus and stress the distinction between expert and non-experts documents. We then turn to the contextual information method that concentrates on bilingual lexicon extraction and show its limits: the context vectors do not discriminate very much d
作者: affluent    時(shí)間: 2025-3-25 03:47
https://doi.org/10.1007/978-3-662-46382-6lping online translators. In the past 15?years, we have witnessed great advances in bilingual term extraction technologies based both on parallel and comparable corpora. The use of comparable corpora is widely held to be especially important, because not many parallel corpora are available in many l
作者: Affectation    時(shí)間: 2025-3-25 11:01
https://doi.org/10.1007/978-3-662-46382-6 be perceived as less demanding in terms of time and effort required to obtain and (learn to) use them (e.g. translation memory software, term bases and so forth). These resources however have limitations that could be compensated for through the integration of comparable corpora and corpus building
作者: 方便    時(shí)間: 2025-3-25 13:18
https://doi.org/10.1007/978-3-662-46382-6ularly, if we aim at quantifying differences and commonalities. We approach this problem by investigating the CroCo Corpus of English and German originals and translations in eight different registers concentrating on the (in)comparability of the different subcorpora. They are comparable across lang
作者: 金哥占卜者    時(shí)間: 2025-3-25 16:35

作者: 治愈    時(shí)間: 2025-3-25 21:58
Automatic Comparable Web Corpora Collection and Bilingual Terminology Extraction for Specialized Dicictionary, in order to evaluate the quality of the extracted terminology lists. Thus, this evaluation measures the usefulness of putting these two automatic tools to work together in a real-world task, that is, specialized dictionary making.
作者: Lamina    時(shí)間: 2025-3-26 00:40
Measuring the Distance Between Comparable Corpora Between Languagesollowed by hard clustering or by building topic models. This can apply to corpora within the same language, e.g.,? the BNC against ukWac as well as to corpora in different languages, e.g.,? webpages collected using the same procedure for English and Russian.
作者: instill    時(shí)間: 2025-3-26 06:59

作者: 紡織品    時(shí)間: 2025-3-26 10:06

作者: 雪白    時(shí)間: 2025-3-26 14:20

作者: 妨礙議事    時(shí)間: 2025-3-26 20:29

作者: interrogate    時(shí)間: 2025-3-26 21:07

作者: 禁令    時(shí)間: 2025-3-27 03:26

作者: 牲畜欄    時(shí)間: 2025-3-27 09:05

作者: 圓木可阻礙    時(shí)間: 2025-3-27 12:30

作者: HERE    時(shí)間: 2025-3-27 16:26

作者: Alveoli    時(shí)間: 2025-3-27 18:06

作者: Incompetent    時(shí)間: 2025-3-27 22:57
Information Network Construction and Alignment from Automatically Acquired Comparable Corporae two state-of-the-art name translation mining methods and use two existing name translation gazetteers to compare with our approach. Comparisons show our approach can effectively augment the results from each of these alternative methods and resources.
作者: 加強(qiáng)防衛(wèi)    時(shí)間: 2025-3-28 03:37

作者: Brain-Waves    時(shí)間: 2025-3-28 08:37
Geometrie verstehen: statisch – kinematischictionary, in order to evaluate the quality of the extracted terminology lists. Thus, this evaluation measures the usefulness of putting these two automatic tools to work together in a real-world task, that is, specialized dictionary making.
作者: 地名詞典    時(shí)間: 2025-3-28 11:48

作者: GLEAN    時(shí)間: 2025-3-28 17:50
Mathematik als Bildungsgrundlageof existing comparable corpora, prior to showing that the bilingual lexicons extracted from corpora enhanced in this way are of better quality. All the experiments in this chapter are performed on French-English comparable corpora.
作者: Emg827    時(shí)間: 2025-3-28 19:37
Die Grenzen wissenschaftlicher Verfahren,lt created corpora in identical formats allow the use of different statistical methods to generate various data for manual or automatic analysis. These are basis for applications in intra- and inter-language comparison or quality assurance of text stocks.
作者: 1分開    時(shí)間: 2025-3-29 00:03
Book 2013logy mining) this implied the use of parallel corpora. However, parallel resources are relatively scarce: many more texts are produced daily by native speakers of any given language than translated. This situation resulted in a natural drive towards the use of comparable corpora, i.e. non-parallel t
作者: 肌肉    時(shí)間: 2025-3-29 03:44
https://doi.org/10.1007/978-3-8348-9836-4mber of other frequently used terms and concepts, such as representativeness, homogeneity, and balanced corpora, play a central role in corpus-linguistic argumentations and will be analysed in the paper, too, as they concern compilation and use of comparable corpora.
作者: Flounder    時(shí)間: 2025-3-29 07:48

作者: 截?cái)?nbsp;   時(shí)間: 2025-3-29 12:31

作者: 颶風(fēng)    時(shí)間: 2025-3-29 16:14

作者: 忙碌    時(shí)間: 2025-3-29 21:43

作者: Predigest    時(shí)間: 2025-3-30 02:19
Automatic Comparable Web Corpora Collection and Bilingual Terminology Extraction for Specialized Dicextraction out of comparable corpora, and an evaluation we have subjected them to: bilingual terminology has been extracted out of automatically collected domain-comparable web corpora, in Basque and English, and the resulting terminology lists have been validated automatically using a specialized d
作者: Orgasm    時(shí)間: 2025-3-30 06:49

作者: 新娘    時(shí)間: 2025-3-30 12:05
Methods for Collection and Evaluation of Comparable DocumentsSMT) but for other applications as well, e.g. the extraction of paraphrases. The potential value of such corpora requires efficient and effective methods for gathering and evaluating them. Most of these methods have been tested in retrieving document pairs for well resourced languages, however there
作者: Observe    時(shí)間: 2025-3-30 14:25
Measuring the Distance Between Comparable Corpora Between Languagess still art rather than proper science. Here I will discuss attempts at approximating the content of corpora collected from the Web using various methods, also in comparison to traditional corpora, such as the BNC. The procedure for estimating the corpus composition is based on selecting keywords, f
作者: gastritis    時(shí)間: 2025-3-30 19:15
Exploiting Comparable Corpora for Lexicon Extraction: Measuring and Improving Corpus Quality We first develop a measure which can capture different comparability levels. This measure correlates very well with gold-standard comparability levels and is relatively robust to dictionary coverage. We then propose a well-founded algorithm to improve the quality, in terms of comparability scores,
作者: 破譯    時(shí)間: 2025-3-30 21:47

作者: Paradox    時(shí)間: 2025-3-31 03:18
Comparable Multilingual Patents as Large-Scale Parallel Corporaal. In this chapter, we explore a new but important area involving patents by investigating the potential of cultivating large-scale parallel corpora from comparable multilingual patents. Two major issues are investigated on multilingual patents: (1) How to build large-scale corpora of comparable pa
作者: 吞下    時(shí)間: 2025-3-31 08:29
Extracting Parallel Phrases from Comparable DataLP applications. Even if two comparable documents have few or no parallel sentence pairs, there is still potential for parallelism in the sub-sentential level. The ability to detect these phrases creates a valuable resource, especially for low-resource languages. In this chapter we explore three phr
作者: 動(dòng)機(jī)    時(shí)間: 2025-3-31 11:28

作者: Comedienne    時(shí)間: 2025-3-31 15:26
Paraphrase Detection in Monolingual Specialized/Lay Comparable Corporaen two comparable corpora in the same language and the same domain, but displaying two different discourse types (lay and specialized), specific paraphrases can be spotted which provide a dimension along which these discourse types can be contrasted. Detecting such paraphrases in comparable corpora
作者: WAG    時(shí)間: 2025-3-31 18:32





歡迎光臨 派博傳思國(guó)際中心 (http://www.pjsxioz.cn/) Powered by Discuz! X3.5
内江市| 唐河县| 宁乡县| 罗山县| 澄迈县| 巴南区| 图片| 墨脱县| 新民市| 湘潭市| 山东| 桓台县| 潮州市| 孟津县| 赫章县| 于田县| 河曲县| 遂平县| 蒙阴县| 万载县| 广河县| 康乐县| 育儿| 浏阳市| 湖北省| 娱乐| 贵港市| 桐柏县| 江城| 鹿泉市| 大理市| 江津市| 皋兰县| 塘沽区| 怀远县| 安多县| 黄大仙区| 定襄县| 秭归县| 龙胜| 兰西县|