QUICK REVIEW

[Paper Review] The EcoLexicon English Corpus as an open corpus in Sketch Engine

Pilar León-Araúz, Antonio San Martín|arXiv (Cornell University)|Jul 16, 2018
linguistics and terminology studies|References 8|Citations 51
ABSTRACT

The EcoLexicon English Corpus (EEC) is a 23.1-million-word corpus of contemporary environmental texts. It was compiled by the LexiCon research group for the development of EcoLexicon (Faber, Leon-Arauz & Reimerink 2016; San Martin et al. 2017), a terminological knowledge base on the environment. It is available as an open corpus in the well-known corpus query system Sketch Engine (Kilgarriff et al. 2014), which means that any user, even without a subscription, can freely access and query the corpus. In this paper, the EEC is introduced by describing how it was built and compiled and how it can be queried and exploited, based both on the functionalities provided by Sketch Engine and on the parameters in which the texts in the EEC are classified.

RESEARCH MOTIVATION AND OBJECTIVES

  • Describe how the EcoLexicon English Corpus (EEC) was built and compiled.
  • Explain how the EEC can be queried and exploited using Sketch Engine.
  • Detail the classification parameters used for texts in the EEC.
  • Demonstrate open access for users without a Sketch Engine subscription.

PROPOSED METHOD

  • Present the workflow for building the EEC and integrating it into Sketch Engine.
  • Describe the steps and criteria for compiling the environmental texts.
  • Explain the Sketch Engine functionalities used to query and analyze the EEC.
  • Outline the parameterization and classification schemes applied to the corpus texts.

EXPERIMENTAL RESULTS

RESEARCH QUESTIONS

  • RQ1: How was the EcoLexicon English Corpus constructed and what sources were included?
  • RQ2: How can users query and exploit the EEC within Sketch Engine, including non-subscriber access?
  • RQ3: What classification parameters are used to organize and categorize EEC texts?
  • RQ4: What are the practical capabilities and limitations of using the EEC as an open corpus in Sketch Engine?

KEY FINDINGS

  • The EcoLexicon English Corpus comprises 23.1 million words of contemporary environmental texts.
  • The corpus is made available as an open corpus in Sketch Engine, accessible to non-subscribers.
  • The paper describes construction, compilation criteria, and querying/exploitation methods within Sketch Engine.
  • Users can query the EEC through Sketch Engine's functionalities and by the parameters under which its texts are classified.

This review was generated by AI and checked by a human editor.