[논문 리뷰] EduChat: A Large-Scale Language Model-based Chatbot System for Intelligent Education
EduChat은 도메인 특화 사전 학습, 교육 초점 지침에 대한 파인튜닝, 그리고 검색 증강 오픈 질의응답을 결합한 오픈 소스 LLM 기반 교육용 챗봇으로, 개방형 질문, 에세이 평가, 소크라테스식 지도, 정서적 지원을 개선합니다.
EduChat (https://www.educhat.top/) is a large-scale language model (LLM)-based chatbot system in the education domain. Its goal is to support personalized, fair, and compassionate intelligent education, serving teachers, students, and parents. Guided by theories from psychology and education, it further strengthens educational functions such as open question answering, essay assessment, Socratic teaching, and emotional support based on the existing basic LLMs. Particularly, we learn domain-specific knowledge by pre-training on the educational corpus and stimulate various skills with tool use by fine-tuning on designed system prompts and instructions. Currently, EduChat is available online as an open-source project, with its code, data, and model parameters available on platforms (e.g., GitHub https://github.com/icalk-nlp/EduChat, Hugging Face https://huggingface.co/ecnu-icalk ). We also prepare a demonstration of its capabilities online (https://vimeo.com/851004454). This initiative aims to promote research and applications of LLMs for intelligent education.
연구 동기 및 목표
- 심리학 및 교육 이론이 일반 LLM을 교육 도메인에 적용하는 방법을 탐색합니다.
- 도메인 지식을 학습하기 위해 대규모 교육 코퍼스에서 사전 학습합니다.
- 교육 기능을 활성화하기 위한 작업별 지침으로 파인튜닝합니다.
- 지식을 최신 상태로 유지하고 환각을 줄이기 위해 검색을 통합합니다.
- 교육 중심 AI 연구를 가속화하기 위해 EduChat을 오픈 소스 시스템으로 공개합니다.]
- method
- [
- Pre-train on educational books and questions plus poetry and psychology texts to encode domain knowledge.
- Fine-tune on 500k high-quality customized instructions to activate educational functions.
- Use retrieval-augmented open QA to access up-to-date information and self-check for relevance.
- Design diverse system prompts to control tool usage and enable skills like Socratic teaching and emotional support.
- Translate and curate data for emotional support dialogues (ESConv-zh) and develop fine-grained essay assessment datasets.
- Manual data cleaning and semantic deduplication using sentence-transformers to ensure data quality.]
- research_questions
- [
- How can EduChat align LLMs with educational abilities like essay assessment and Socratic teaching?
- Does retrieval-augmented QA improve accuracy and reduce hallucinations in educational contexts?
- Can psychology-based emotional support be effectively provided by fine-tuned LLMs in education?
- What is the impact of specialized educational fine-tuning on performance compared to general LLMs of similar size?]
- key_findings
- [
- EduChat (with retrieval) outperforms similar-sized baselines on C-Eval Chinese evaluation across disciplines.
- EduChat with retrieval achieves higher average scores than EduChat without retrieval (49.3 vs 40.7 in the reported metrics).
- Retrieval-augmented QA significantly improves performance on multi-discipline benchmarks and reduces fabrication risk.
- Fine-grained essay assessment and Socratic teaching capabilities are enhanced through targeted educational data fine-tuning.
- Psychology-based emotional support enables EduChat to act as a compassionate counselor in education contexts.]
- table_headers
- ["STEM", "Social Science", "Humanities", "Others", "Avg(hard)", "Avg"]
- table_rows
- [["EduChat", "36.2", "50.7", "42.9", "37.7", "28.3", "40.7"], ["EduChat (w Retrieval)", "43.5", "59.3", "53.7", "46.6", "33.1", "49.3"]]
더 나은 연구,지금 바로 시작하세요
연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.
카드 등록 없음 · 무료 플랜 제공
이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.