QUICK REVIEW

[论文解读] Friend or Foe? Exploring the Implications of Large Language Models on the Science System

Benedikt Fecher, Marcel Hebing|arXiv (Cornell University)|Jun 16, 2023

Artificial Intelligence in Healthcare and Education被引用 8

一句话总结

该论文报告了一项包含72名科学与人工智能专家的德尔菲研究，评估大型语言模型（LLMs）的应用、局限性及对科学系统的影响，强调其变革潜力与需监管和教育以规避风险的重要性。

ABSTRACT

The advent of ChatGPT by OpenAI has prompted extensive discourse on its potential implications for science and higher education. While the impact on education has been a primary focus, there is limited empirical research on the effects of large language models (LLMs) and LLM-based chatbots on science and scientific practice. To investigate this further, we conducted a Delphi study involving 72 experts specialising in research and AI. The study focused on applications and limitations of LLMs, their effects on the science system, ethical and legal considerations, and the required competencies for their effective use. Our findings highlight the transformative potential of LLMs in science, particularly in administrative, creative, and analytical tasks. However, risks related to bias, misinformation, and quality assurance need to be addressed through proactive regulation and science education. This research contributes to informed discussions on the impact of generative AI in science and helps identify areas for future action.

研究动机与目标

评估大型语言模型（LLMs）及基于LLM的聊天机器人如何影响科学实践与科学系统。
识别在行政、创造与分析任务中的潜在应用。
研究伦理、法律与监管考虑以及有效使用所需的能力。
强调偏见、错误信息与质量保障等风险，并提出教育与政策行动建议。

提出的方法

对72名研究与AI领域专家进行德尔菲研究，以获得对LLM在科学中使用的结构化专家判断。
通过循环问卷探讨应用领域、局限性以及对科学系统的影响。
审查在科学中有效使用LLMs所需的伦理与法律考量以及能力。
综合研究结果，讨论规制与教育需求以降低风险。

实验结果

研究问题

RQ1LLMs及基于LLM的聊天机器人在科学的行政、创造与分析任务中可能的应用有哪些？
RQ2在科学实践中使用LLM时有哪些局限性和风险（偏见、错误信息、质量保障）？
RQ3为负责任地将LLMs整合进科学系统，需要哪些监管、伦理与教育方面的措施？
RQ4科学家应发展哪些能力与技能，以有效且安全地使用LLMs？

主要发现

LLMs在科学的行政、创造和分析任务方面具有变革潜力。
风险包括偏见、错误信息和质量保障挑战，需要主动管理。
需要监管与科学教育来应对伦理与法律考量。
本研究为理性讨论做出贡献并指出生成式AI与科学领域未来行动的方向。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。