Funding: Supported by the Universidad de Alcalá (UAH) under Grant PIUAH21/IA-010 and by the Comunidad Autónoma de Madrid under Grant CM/JIN/2021-034.
Abstract: Generating good-quality distractors is a key and time-consuming task associated with multiple-choice questions (MCQs), one of the assessment items that have dominated the educational field for years. Recent advances in language models and architectures present an opportunity to help teachers generate and update these elements at the speed and scale demanded by the widespread growth of online education. This study focuses on a text-to-text approach for the joint generation of distractors for MCQs, where the context, question, and correct answer are used as input, while the set of distractors corresponds to the output, allowing the generation of three distractors in a single model inference. By fine-tuning FlanT5 models and LongT5 with TGlobal attention on a RACE-based dataset, the potential of this approach is explored, demonstrating an improvement in the BLEU and ROUGE-L metrics compared to previous works and a GPT-3.5 baseline. Additionally, BERTScore is introduced in the evaluation, showing that the fine-tuned models generate distractors semantically close to the references, although the GPT-3.5 baseline still outperforms them in this respect. A tendency toward duplicating distractors is noted, although models fine-tuned with Low-Rank Adaptation (LoRA) and 4-bit quantization showed a significant reduction in duplicated distractors.
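To make the text-to-text setup concrete, the following is a minimal sketch of one plausible serialization for joint distractor generation, in which a single target sequence carries all three distractors. The field markers (`question:`, `answer:`, `context:`) and the `<sep>` separator are illustrative assumptions; the abstract does not specify the paper's exact formatting.

```python
# Minimal sketch of a plausible text-to-text serialization for joint
# distractor generation. The field markers and the "<sep>" separator are
# illustrative assumptions; the paper's exact format may differ.

def build_example(context: str, question: str, answer: str,
                  distractors: list[str]) -> dict:
    """Pack one MCQ into a single input/target pair so that one model
    inference yields all three distractors jointly."""
    source = f"question: {question} answer: {answer} context: {context}"
    target = " <sep> ".join(distractors)  # three distractors in one sequence
    return {"input_text": source, "target_text": target}

example = build_example(
    context="The RACE dataset contains English reading comprehension exams...",
    question="What is the passage mainly about?",
    answer="English exams for Chinese students.",
    distractors=[
        "The history of standardized testing.",
        "How to write multiple-choice questions.",
        "A comparison of online learning platforms.",
    ],
)
print(example["input_text"])
print(example["target_text"])
```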
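The LoRA-with-4-bit-quantization setup mentioned above could look roughly like the sketch below, using the Hugging Face transformers, peft, and bitsandbytes libraries. The checkpoint name and the LoRA hyperparameters (r, alpha, dropout, target modules) are assumptions for illustration, not the study's reported settings.

```python
# Hedged sketch: LoRA fine-tuning of a Flan-T5 checkpoint loaded in 4-bit.
# Hyperparameters here are illustrative, not the paper's reported values.
import torch
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

model_name = "google/flan-t5-large"  # assumed checkpoint size
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,                      # 4-bit weight quantization
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSeq2SeqLM.from_pretrained(
    model_name, quantization_config=bnb_config
)

model = prepare_model_for_kbit_training(model)  # cast norms, enable input grads
lora_config = LoraConfig(
    task_type="SEQ_2_SEQ_LM",
    r=16, lora_alpha=32, lora_dropout=0.05,    # illustrative values
    target_modules=["q", "v"],                  # T5 attention query/value projections
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()              # only the LoRA adapters train
```

Training the low-rank adapters on top of frozen 4-bit weights keeps the trainable parameter count small, which is consistent with the reported reduction in duplicated distractors coming from the fine-tuning regime rather than from a larger model.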
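On the evaluation side, scoring generated distractors against references with BERTScore can be sketched as follows using the bert-score package; the one-to-one candidate/reference alignment shown here is an assumed pairing scheme, since the abstract does not state how generated and reference distractors were matched.

```python
# Sketch of BERTScore evaluation (pip install bert-score). The aligned
# candidate/reference pairing below is an assumption for illustration.
from bert_score import score

candidates = [
    "The history of standardized testing.",
    "How to write multiple-choice questions.",
    "A comparison of online learning platforms.",
]
references = [
    "A history of exams in China.",
    "Tips for writing good questions.",
    "An overview of e-learning tools.",
]

P, R, F1 = score(candidates, references, lang="en", verbose=False)
print(f"BERTScore F1 (mean): {F1.mean().item():.4f}")
```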