Detection of GPT-4 Generated Text in Higher Education: Combining Academic Judgement and Software to Identify Generative AI Tool Misuse

Perkins, Mike, Roe, Jasper, Postma, Darius, McGaughran, James, and Hickerson, Don (2024) Detection of GPT-4 Generated Text in Higher Education: Combining Academic Judgement and Software to Identify Generative AI Tool Misuse. Journal of Academic Ethics, 22 (1). pp. 89-113.

[img] PDF (Published Version) - Published Version
Restricted to Repository staff only

View at Publisher Website: https://doi.org/10.1007/s10805-023-09492...
 
1
1


Abstract

This study explores the capability of academic staff assisted by the Turnitin Artificial Intelligence (AI) detection tool to identify the use of AI-generated content in university assessments. 22 different experimental submissions were produced using Open AI’s ChatGPT tool, with prompting techniques used to reduce the likelihood of AI detectors identifying AI-generated content. These submissions were marked by 15 academic staff members alongside genuine student submissions. Although the AI detection tool identified 91% of the experimental submissions as containing AI-generated content, only 54.8% of the content was identified as AI-generated, underscoring the challenges of detecting AI content when advanced prompting techniques are used. When academic staff members marked the experimental submissions, only 54.5% were reported to the academic misconduct process, emphasising the need for greater awareness of how the results of AI detectors may be interpreted. Similar performance in grades was obtained between student submissions and AI-generated content (AI mean grade: 52.3, Student mean grade: 54.4), showing the capabilities of AI tools in producing human-like responses in real-life assessment situations. Recommendations include adjusting the overall strategies for assessing university students in light of the availability of new Generative AI tools. This may include reducing the overall reliance on assessments where AI tools may be used to mimic human writing, or by using AI-inclusive assessments. Comprehensive training must be provided for both academic staff and students so that academic integrity may be preserved.

Item ID: 81127
Item Type: Article (Research - C1)
ISSN: 1572-8544
Keywords: AI detection, Artificial intelligence, Assessment design, ChatGPT, GPT-4, Turnitin AI detect
Copyright Information: © The Author(s), under exclusive licence to Springer Nature B.V. 2023
Date Deposited: 04 Mar 2024 22:36
FoR Codes: 39 EDUCATION > 3904 Specialist studies in education > 390402 Education assessment and evaluation @ 100%
SEO Codes: 28 EXPANDING KNOWLEDGE > 2801 Expanding knowledge > 280109 Expanding knowledge in education @ 100%
Downloads: Total: 1
Last 12 Months: 1
More Statistics

Actions (Repository Staff Only)

Item Control Page Item Control Page