Javascript must be enabled to continue!

Prompting Science Report 5: This is an Excellent Paper: The Effects of Prompt Injection on Grading

<div> This is the fifth in a series of short reports that help business, education, and policy leaders understand the technical details of working with AI through rigorous testing. Here, we ask whether frontier reasoning large language models (LLMs) tasked with grading academic papers can still be influenced by prompt injection across several settings. Drawing from two distinct text corpora, we study one concise and one verbose prompt injection, both designed to artificially inflate grades. For each prompt, we further vary prompt placement, inserting the injected text at the start, in the middle, or at the end of a given paper. </div> <div> <br> </div> <div> We tested a total of four model families: </div> <div> <br> </div> <div> 1. Claude Opus 4.5 </div> <div> 2. Gemini 3 Pro  </div> <div> 3. GPT-5.2 </div> <div> 4. GPT-4o mini </div> <div> <br> </div> <div> We found that:  </div> <div> - Prompt injections did not meaningfully increase LLM-assigned grades in most of our experiments. Across roughly 40,000 trials using Opus 4.5, Gemini 3 Pro, and GPT-5.2, prompt injections increased scores by about 2.6 percentage points across two independent datasets. </div> <div> - Vulnerability varies across LLMs. Notably, Gemini 3 Pro stood out as the most vulnerable reasoning model, particularly for the longer-paper corpus, where beginning and middle injections produced effects exceeding 10 percentage points, while end-of-document injections had no effect. In our experiments, Opus 4.5 showed only minimal effects. Interestingly, GPT-5.2 with reasoning enabled was slightly more susceptible than the non-reasoning configuration, suggesting that reasoning did not provide additional protection against prompt injections. </div> <div> - GPT-4o mini, a smaller and less capable LLM, however, was highly susceptible, with an average increase of 19 percentage points. </div> <div> - We observed heterogeneity along two design dimensions. First, verbose prompt injections consistently outperformed concise ones by a factor of two to three, and, for GPT-4o mini, by as much as sixfold. Second, placement also mattered depending on the LLM used. While Gemini 3 Pro showed strong effects for beginning and middle placements and only minimal effects at the end, particularly in the longer-paper corpus, the remaining frontier model families showed little to no placement-specific effects. </div>

Elsevier BV

Benjamin Wanjura Dan Shapiro Ethan R. Mollick Lilach Mollick Lennart Meincke

2026

Title: Prompting Science Report 5: This is an Excellent Paper: The Effects of Prompt Injection on Grading

Description:

<div> This is the fifth in a series of short reports that help business, education, and policy leaders understand the technical details of working with AI through rigorous testing.

Here, we ask whether frontier reasoning large language models (LLMs) tasked with grading academic papers can still be influenced by prompt injection across several settings.

Drawing from two distinct text corpora, we study one concise and one verbose prompt injection, both designed to artificially inflate grades.

For each prompt, we further vary prompt placement, inserting the injected text at the start, in the middle, or at the end of a given paper.

</div> <div> <br> </div> <div> We tested a total of four model families: </div> <div> <br> </div> <div> 1.

Claude Opus 4.

5 </div> <div> 2.

Gemini 3 Pro  </div> <div> 3.

GPT-5.

2 </div> <div> 4.

GPT-4o mini </div> <div> <br> </div> <div> We found that:  </div> <div> - Prompt injections did not meaningfully increase LLM-assigned grades in most of our experiments.

Across roughly 40,000 trials using Opus 4.

5, Gemini 3 Pro, and GPT-5.

2, prompt injections increased scores by about 2.

6 percentage points across two independent datasets.

</div> <div> - Vulnerability varies across LLMs.

Notably, Gemini 3 Pro stood out as the most vulnerable reasoning model, particularly for the longer-paper corpus, where beginning and middle injections produced effects exceeding 10 percentage points, while end-of-document injections had no effect.

In our experiments, Opus 4.

5 showed only minimal effects.

Interestingly, GPT-5.

2 with reasoning enabled was slightly more susceptible than the non-reasoning configuration, suggesting that reasoning did not provide additional protection against prompt injections.

</div> <div> - GPT-4o mini, a smaller and less capable LLM, however, was highly susceptible, with an average increase of 19 percentage points.

</div> <div> - We observed heterogeneity along two design dimensions.

First, verbose prompt injections consistently outperformed concise ones by a factor of two to three, and, for GPT-4o mini, by as much as sixfold.

Second, placement also mattered depending on the LLM used.

While Gemini 3 Pro showed strong effects for beginning and middle placements and only minimal effects at the end, particularly in the longer-paper corpus, the remaining frontier model families showed little to no placement-specific effects.

</div>.

Back

Abstarct Introduction Isolated brain hydatid disease (BHD) is an extremely rare form of echinococcosis. A prompt and timely diagnosis is a crucial step in disease management. This ...

[RETRACTED] Keanu Reeves CBD Gummies v1

[RETRACTED]Keanu Reeves CBD Gummies ==❱❱ Huge Discounts:[HURRY UP ] Absolute Keanu Reeves CBD Gummies (Available)Order Online Only!! ❰❰= https://www.facebook.com/Keanu-Reeves-CBD-G...

Overview of Key Zonal Water Injection Technologies in China

Abstract Separated layer water injection is the important technology to realize the oilfield long-term high and stable yield. Through continuous researches and te...

Breast Carcinoma within Fibroadenoma: A Systematic Review

Abstract Introduction Fibroadenoma is the most common benign breast lesion; however, it carries a potential risk of malignant transformation. This systematic review provides an ove...

Study on radiographic grading of ankle joint in adult patients with Kashin Beck disease in Shaanxi and Gansu Province, China

Abstract Purpose This paper aims to establish an X-ray imaging grading for assessing ankle joints in adult Kashin Beck disease (KBD), and investigate its correlation with ...

4D flow MRI-based grading of left ventricular diastolic dysfunction: a validation study against echocardiography

Abstract Objectives To assess the feasibility and accuracy of 4D flow MRI-based grading of left ventricular diastolic dysfunction, using echocard...

Optimal Injection Parameters for Enhancing Coalbed Methane Recovery: A Simulation Study from the Shizhuang Block, Qinshui Basin, China

The injection of N2 into coal reservoir has great potential in improving recovery of coalbed methane (CBM). In this study, a numerical model was established based on the GEM compon...

Application of Novel Techniques to Fractured Injection Diagnostics in Waterflood Developments

Abstract Controlled injection at high rates predominantly under fracture regime has been identified at the onset of most waterflood field developments as being cruci...

Email:
Password:

Email:

Prompting Science Report 5: This is an Excellent Paper: The Effects of Prompt Injection on Grading

Related Results