Javascript must be enabled to continue!

Weaponising Generative AI Through Data Poisoning: Analysing Various Data Poisoning Attacks on Large Language Models (LLMs) and Their Countermeasures

Large Language Models (LLMs) and most modern AI models profoundly rely on the quantity, quality and integrity of training data, which ultimately determines the overall success of these LLMs or AI models. This enormous amount of training data is collected from diverse sources and in various formats, then undergoes multiple transformation processes to train the LLM or AI model. This enormous training data poses significant challenges in the data management supply chain of the LLM or AI model, making it difficult to detect anomalies or corrupted data before they impact the training of the LLM or AI model, thereby increasing the risk of data-related attacks on the LLM or AI model. Data poisoning occurs when attackers intentionally manipulate or corrupt the training data used to develop an LLM or AI model. It is also challenging to detect data poisoning due to the subtlety of manipulation or corruption and the enormous volume of training data. Therefore, data poisoning attacks are one of the most prevalent attacks on LLMs and AI models that can adversely affect the learning process, behaviour, functionality or performance of the LLM or AI model. This paper will present an in-depth analysis of data poisoning attacks on LLMs or AI models covering their types, attack vectors, risks and mitigation techniques. Initially, it will classify data poisoning attacks on LLMs or AI models into various types based on the nature, target and aim of poisoning. Next, it will analyse each type of data poisoning attack with a clear distinction from other types of data poisoning attacks and its specific attack vectors. Subsequently, it will analyse several risks associated with data poisoning attacks on LLMs or AI models. Finally, it will analyse several mitigation techniques for data poisoning attacks on LLMs or AI models.

Institute of Electrical and Electronics Engineers (IEEE)

Dishita Naik Ishita Naik Nitin Naik

2025

Title: Weaponising Generative AI Through Data Poisoning: Analysing Various Data Poisoning Attacks on Large Language Models (LLMs) and Their Countermeasures

Description:

This enormous amount of training data is collected from diverse sources and in various formats, then undergoes multiple transformation processes to train the LLM or AI model.

This enormous training data poses significant challenges in the data management supply chain of the LLM or AI model, making it difficult to detect anomalies or corrupted data before they impact the training of the LLM or AI model, thereby increasing the risk of data-related attacks on the LLM or AI model.

Data poisoning occurs when attackers intentionally manipulate or corrupt the training data used to develop an LLM or AI model.

It is also challenging to detect data poisoning due to the subtlety of manipulation or corruption and the enormous volume of training data.

Therefore, data poisoning attacks are one of the most prevalent attacks on LLMs and AI models that can adversely affect the learning process, behaviour, functionality or performance of the LLM or AI model.

This paper will present an in-depth analysis of data poisoning attacks on LLMs or AI models covering their types, attack vectors, risks and mitigation techniques.

Initially, it will classify data poisoning attacks on LLMs or AI models into various types based on the nature, target and aim of poisoning.

Next, it will analyse each type of data poisoning attack with a clear distinction from other types of data poisoning attacks and its specific attack vectors.

Subsequently, it will analyse several risks associated with data poisoning attacks on LLMs or AI models.

Finally, it will analyse several mitigation techniques for data poisoning attacks on LLMs or AI models.

Back

<p><em><span style="font-size: 11.0pt; font-family: 'Times New Roman',serif; mso-fareast-font-family: 'Times New Roman'; mso-ansi-language: EN-US; mso-fareast-langua...

Exploring Large Language Models Integration in the Histopathologic Diagnosis of Skin Diseases: A Comparative Study

Abstract Introduction The exact manner in which large language models (LLMs) will be integrated into pathology is not yet fully comprehended. This study examines the accuracy, bene...

Manipulating Recommender Systems: A Survey of Poisoning Attacks and Countermeasures

Recommender systems have become an integral part of online services due to their ability to help users locate specific information in a sea of data. However, existing studies show ...

Učinak poučavanja razrednomu jeziku u izobrazbi nastavnika njemačkoga

The actual use of classroom language is principally limited to the classroom environment. As far as foreign language learning is concerned, the classroom often turns out to be the ...

A Systematic Review of ChatGPT and Other Conversational Large Language Models in Healthcare

Abstract Background The launch of the Chat Generative Pre-trained Transformer (ChatGPT) in November 2022 has attracted public a...

Perspectives and Experiences With Large Language Models in Health Care: Survey Study (Preprint)

BACKGROUND Large language models (LLMs) are transforming how data is used, including within the health care sector. However, frameworks including the Unifie...

Perspectives and Experiences With Large Language Models in Health Care: Survey Study

Background Large language models (LLMs) are transforming how data is used, including within the health care sector. However, frameworks including the Unified Th...

RingChains Graph-based Summarizer and Enhanced Large Language Models for Summarizing Long Documents

Large language models (LLMs) have influenced real-world applications after ChatGPT appeared. Although powerful LLMs produce high quality summaries, it remains challenging for LLMs ...

Email:
Password:

Email:

Weaponising Generative AI Through Data Poisoning: Analysing Various Data Poisoning Attacks on Large Language Models (LLMs) and Their Countermeasures

Related Results