標(biāo)題: Titlebook: Building and Using Comparable Corpora; Serge Sharoff,Reinhard Rapp,Pascale Fung Book 2013 Springer-Verlag Berlin Heidelberg 2013 68T50, 91 [打印本頁(yè)] 作者: 債務(wù)人 時(shí)間: 2025-3-21 16:45
書目名稱Building and Using Comparable Corpora影響因子(影響力)
書目名稱Building and Using Comparable Corpora影響因子(影響力)學(xué)科排名
書目名稱Building and Using Comparable Corpora網(wǎng)絡(luò)公開度
書目名稱Building and Using Comparable Corpora網(wǎng)絡(luò)公開度學(xué)科排名
書目名稱Building and Using Comparable Corpora被引頻次
書目名稱Building and Using Comparable Corpora被引頻次學(xué)科排名
書目名稱Building and Using Comparable Corpora年度引用
書目名稱Building and Using Comparable Corpora年度引用學(xué)科排名
書目名稱Building and Using Comparable Corpora讀者反饋
書目名稱Building and Using Comparable Corpora讀者反饋學(xué)科排名
作者: flammable 時(shí)間: 2025-3-21 23:56
http://image.papertrans.cn/b/image/191857.jpg作者: 脫離 時(shí)間: 2025-3-22 01:40
https://doi.org/10.1007/978-3-642-60550-5The beginning of the 1990s marked a radical turn in various NLP applications towards using large collections of texts.作者: 破裂 時(shí)間: 2025-3-22 06:27 作者: seduce 時(shí)間: 2025-3-22 10:19 作者: 碳水化合物 時(shí)間: 2025-3-22 14:01
https://doi.org/10.1007/978-3-642-20128-868T50, 91F20; Corpus linguistics; Machine Translation; Natural Language Processing; Terminology作者: aptitude 時(shí)間: 2025-3-22 17:26 作者: Femish 時(shí)間: 2025-3-22 22:13 作者: 盡管 時(shí)間: 2025-3-23 04:50
Geometrie verstehen: statisch – kinematischextraction out of comparable corpora, and an evaluation we have subjected them to: bilingual terminology has been extracted out of automatically collected domain-comparable web corpora, in Basque and English, and the resulting terminology lists have been validated automatically using a specialized d作者: Exclaim 時(shí)間: 2025-3-23 08:37
https://doi.org/10.1007/978-3-8348-9836-4thods because statistical comparison requires the use of statistical tests, which again require certain properties of the data under analysis. Linguistic data, however, do not automatically meet these requirements. In corpus linguistics and other linguistic fields, statistical methods are often appl作者: uveitis 時(shí)間: 2025-3-23 10:48 作者: MOT 時(shí)間: 2025-3-23 17:43
Mathematik semantologisch verstehens still art rather than proper science. Here I will discuss attempts at approximating the content of corpora collected from the Web using various methods, also in comparison to traditional corpora, such as the BNC. The procedure for estimating the corpus composition is based on selecting keywords, f作者: Evolve 時(shí)間: 2025-3-23 18:24 作者: BET 時(shí)間: 2025-3-23 22:15
Die Grenzen wissenschaftlicher Verfahren,. These include, among many others, the field of Natural Language Processing. The Leipzig Corpora Collection creates and uses such corpora for more than fifteen years. However, the work on preprocessing distributed resources to ensure homogeneity and thus comparability is a steady process. As a resu作者: aesthetic 時(shí)間: 2025-3-24 02:59 作者: TOM 時(shí)間: 2025-3-24 07:13 作者: negotiable 時(shí)間: 2025-3-24 12:55 作者: ADOPT 時(shí)間: 2025-3-24 18:32
https://doi.org/10.1007/978-3-322-84284-8en two comparable corpora in the same language and the same domain, but displaying two different discourse types (lay and specialized), specific paraphrases can be spotted which provide a dimension along which these discourse types can be contrasted. Detecting such paraphrases in comparable corpora 作者: CONE 時(shí)間: 2025-3-24 22:59 作者: 圣人 時(shí)間: 2025-3-25 01:38
https://doi.org/10.1007/978-3-662-46382-6uld be the comparability for specialized comparable corpus and stress the distinction between expert and non-experts documents. We then turn to the contextual information method that concentrates on bilingual lexicon extraction and show its limits: the context vectors do not discriminate very much d作者: affluent 時(shí)間: 2025-3-25 03:47
https://doi.org/10.1007/978-3-662-46382-6lping online translators. In the past 15?years, we have witnessed great advances in bilingual term extraction technologies based both on parallel and comparable corpora. The use of comparable corpora is widely held to be especially important, because not many parallel corpora are available in many l作者: Affectation 時(shí)間: 2025-3-25 11:01
https://doi.org/10.1007/978-3-662-46382-6 be perceived as less demanding in terms of time and effort required to obtain and (learn to) use them (e.g. translation memory software, term bases and so forth). These resources however have limitations that could be compensated for through the integration of comparable corpora and corpus building作者: 方便 時(shí)間: 2025-3-25 13:18
https://doi.org/10.1007/978-3-662-46382-6ularly, if we aim at quantifying differences and commonalities. We approach this problem by investigating the CroCo Corpus of English and German originals and translations in eight different registers concentrating on the (in)comparability of the different subcorpora. They are comparable across lang作者: 金哥占卜者 時(shí)間: 2025-3-25 16:35 作者: 治愈 時(shí)間: 2025-3-25 21:58
Automatic Comparable Web Corpora Collection and Bilingual Terminology Extraction for Specialized Dicictionary, in order to evaluate the quality of the extracted terminology lists. Thus, this evaluation measures the usefulness of putting these two automatic tools to work together in a real-world task, that is, specialized dictionary making.作者: Lamina 時(shí)間: 2025-3-26 00:40
Measuring the Distance Between Comparable Corpora Between Languagesollowed by hard clustering or by building topic models. This can apply to corpora within the same language, e.g.,? the BNC against ukWac as well as to corpora in different languages, e.g.,? webpages collected using the same procedure for English and Russian.作者: instill 時(shí)間: 2025-3-26 06:59 作者: 紡織品 時(shí)間: 2025-3-26 10:06 作者: 雪白 時(shí)間: 2025-3-26 14:20 作者: 妨礙議事 時(shí)間: 2025-3-26 20:29 作者: interrogate 時(shí)間: 2025-3-26 21:07 作者: 禁令 時(shí)間: 2025-3-27 03:26 作者: 牲畜欄 時(shí)間: 2025-3-27 09:05 作者: 圓木可阻礙 時(shí)間: 2025-3-27 12:30 作者: HERE 時(shí)間: 2025-3-27 16:26 作者: Alveoli 時(shí)間: 2025-3-27 18:06 作者: Incompetent 時(shí)間: 2025-3-27 22:57
Information Network Construction and Alignment from Automatically Acquired Comparable Corporae two state-of-the-art name translation mining methods and use two existing name translation gazetteers to compare with our approach. Comparisons show our approach can effectively augment the results from each of these alternative methods and resources.作者: 加強(qiáng)防衛(wèi) 時(shí)間: 2025-3-28 03:37 作者: Brain-Waves 時(shí)間: 2025-3-28 08:37
Geometrie verstehen: statisch – kinematischictionary, in order to evaluate the quality of the extracted terminology lists. Thus, this evaluation measures the usefulness of putting these two automatic tools to work together in a real-world task, that is, specialized dictionary making.作者: 地名詞典 時(shí)間: 2025-3-28 11:48 作者: GLEAN 時(shí)間: 2025-3-28 17:50
Mathematik als Bildungsgrundlageof existing comparable corpora, prior to showing that the bilingual lexicons extracted from corpora enhanced in this way are of better quality. All the experiments in this chapter are performed on French-English comparable corpora.作者: Emg827 時(shí)間: 2025-3-28 19:37
Die Grenzen wissenschaftlicher Verfahren,lt created corpora in identical formats allow the use of different statistical methods to generate various data for manual or automatic analysis. These are basis for applications in intra- and inter-language comparison or quality assurance of text stocks.作者: 1分開 時(shí)間: 2025-3-29 00:03
Book 2013logy mining) this implied the use of parallel corpora. However, parallel resources are relatively scarce: many more texts are produced daily by native speakers of any given language than translated. This situation resulted in a natural drive towards the use of comparable corpora, i.e. non-parallel t作者: 肌肉 時(shí)間: 2025-3-29 03:44
https://doi.org/10.1007/978-3-8348-9836-4mber of other frequently used terms and concepts, such as representativeness, homogeneity, and balanced corpora, play a central role in corpus-linguistic argumentations and will be analysed in the paper, too, as they concern compilation and use of comparable corpora.作者: Flounder 時(shí)間: 2025-3-29 07:48 作者: 截?cái)?nbsp; 時(shí)間: 2025-3-29 12:31 作者: 颶風(fēng) 時(shí)間: 2025-3-29 16:14 作者: 忙碌 時(shí)間: 2025-3-29 21:43 作者: Predigest 時(shí)間: 2025-3-30 02:19
Automatic Comparable Web Corpora Collection and Bilingual Terminology Extraction for Specialized Dicextraction out of comparable corpora, and an evaluation we have subjected them to: bilingual terminology has been extracted out of automatically collected domain-comparable web corpora, in Basque and English, and the resulting terminology lists have been validated automatically using a specialized d作者: Orgasm 時(shí)間: 2025-3-30 06:49 作者: 新娘 時(shí)間: 2025-3-30 12:05
Methods for Collection and Evaluation of Comparable DocumentsSMT) but for other applications as well, e.g. the extraction of paraphrases. The potential value of such corpora requires efficient and effective methods for gathering and evaluating them. Most of these methods have been tested in retrieving document pairs for well resourced languages, however there作者: Observe 時(shí)間: 2025-3-30 14:25
Measuring the Distance Between Comparable Corpora Between Languagess still art rather than proper science. Here I will discuss attempts at approximating the content of corpora collected from the Web using various methods, also in comparison to traditional corpora, such as the BNC. The procedure for estimating the corpus composition is based on selecting keywords, f作者: gastritis 時(shí)間: 2025-3-30 19:15
Exploiting Comparable Corpora for Lexicon Extraction: Measuring and Improving Corpus Quality We first develop a measure which can capture different comparability levels. This measure correlates very well with gold-standard comparability levels and is relatively robust to dictionary coverage. We then propose a well-founded algorithm to improve the quality, in terms of comparability scores, 作者: 破譯 時(shí)間: 2025-3-30 21:47 作者: Paradox 時(shí)間: 2025-3-31 03:18
Comparable Multilingual Patents as Large-Scale Parallel Corporaal. In this chapter, we explore a new but important area involving patents by investigating the potential of cultivating large-scale parallel corpora from comparable multilingual patents. Two major issues are investigated on multilingual patents: (1) How to build large-scale corpora of comparable pa作者: 吞下 時(shí)間: 2025-3-31 08:29
Extracting Parallel Phrases from Comparable DataLP applications. Even if two comparable documents have few or no parallel sentence pairs, there is still potential for parallelism in the sub-sentential level. The ability to detect these phrases creates a valuable resource, especially for low-resource languages. In this chapter we explore three phr作者: 動(dòng)機(jī) 時(shí)間: 2025-3-31 11:28 作者: Comedienne 時(shí)間: 2025-3-31 15:26
Paraphrase Detection in Monolingual Specialized/Lay Comparable Corporaen two comparable corpora in the same language and the same domain, but displaying two different discourse types (lay and specialized), specific paraphrases can be spotted which provide a dimension along which these discourse types can be contrasted. Detecting such paraphrases in comparable corpora 作者: WAG 時(shí)間: 2025-3-31 18:32