Funded by the European Union. Views and opinions expressed are however those of the author(s) only and do not necessarily reflect those of the European Union or the European Education and Culture Executive Agency (EACEA). Neither the European Union nor EACEA can be held responsible for them.

Tips

Use Large Language Models (LLM) generate question in which LLM fails or makes mistakes that are not easy to detect

Large Language Models (LLMs ), such as GPT-4, can fail or make mistakes in answering a question that are not easily detected. Such scenarios often involve nuanced contextual understanding, the interpretation of ambiguous statements, or the generation of inherently biased or misleading content. For example, LLMs can struggle with the task of reliably distinguishing between valid and invalid reasoning or may generate content based on spurious correlations in the training data. They might also make factual errors as they generate information based on patterns learned during training, not by accessing real-time or post-training-cutoff data. These cases highlight the limitations of LLMs and underscore the need for better understanding and further improvement of these models. Finding questions where LLM fails can be useful to generate more complex questions or check for student misbehavior in answering the questions.

Example:
Here the example of a question and the wrong answer of using ChatGPT (https://chat.openai.com/ ). It has been asked to ChatGPT to solve the following problem: a line parallel to y= 4x 6 passes through (5,10 ). What is the y-coordinate of the point where this line crosses the y-axis?
The answer has been the following: “A line parallel to y=4x 6 will have the same slope as y=4x 6 which is 4. Thus, the equation of the line will be of the form y=4x b, where b is the y-intercept. We cab find the value of b by substituting the coordinates of the given point, (5,10 ), into the equation y=4x b. This gives us 10=4*5 b, so b=-20. Therefore, the equation of the line is y=4x-20. The y-coordinate of the point where this line crosses the y-axis can be found by setting x=0 in the equation y=4x-20. This gives us y=4*0-20=-20. Therefore, the y-coordinate of the point where this line crosses the y-axis is -20.”

Reference:
[1] Emily M. Bender, Timnit Gebru, Angelina McMillan-Major, and Shmargaret Shmitchell. 2021. On the Dangers of Stochastic Parrots: Can Language Models Be Too Big? In Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency (FAccT '21 ). Association for Computing Machinery, New York, NY, USA, 610–623. https://doi.org/10.1145/3442188.3445922

Author of the tip:
Giulia Cademartori
University of Genoa