Javascript must be enabled to continue!

GraphICL: Unlocking Graph Learning Potential in LLMs through Structured Prompt Design

The growing importance of textual and relational systems has driven interest in enhancing large language models (LLMs) for graph-structured data, particularly Text-Attributed Graphs (TAGs), where samples are represented by textual descriptions interconnected by edges. While research has largely focused on developing specialized graph LLMs through task-specific instruction tuning, _a comprehensive benchmark for evaluating LLMs solely through prompt design_ remains surprisingly absent. Without such a carefully crafted evaluation benchmark, most if not all, tailored graph LLMs are compared against general LLMs using simplistic queries (e.g., zero-shot reasoning with LLaMA), which can potentially camouflage many advantages as well as unexpected predicaments of them. To achieve more general evaluations and unveil the true potential of LLMs for graph tasks, we introduce GRAPH IN-CONTEXT LEARNING (GRAPHICL) BENCHMARK, a comprehensive benchmark comprising novel prompt templates designed to capture graph structure and handle limited label knowledge. Our systematic evaluation shows that general-purpose LLMs equipped with our GraphICL outperform state-of-the-art specialized graph LLMs and graph neural network models in resource-constrained settings and out-of-domain tasks. These findings highlight the significant potential of prompt engineering to enhance LLM performance on graph learning tasks without training and offer a strong baseline for advancing research in graph LLMs.

Qeios Ltd

Yuanfu Sun Zhengnan Ma Yi Fang Jing Ma Qiaoyu Tan

2025

Title: GraphICL: Unlocking Graph Learning Potential in LLMs through Structured Prompt Design

Description:

While research has largely focused on developing specialized graph LLMs through task-specific instruction tuning, _a comprehensive benchmark for evaluating LLMs solely through prompt design_ remains surprisingly absent.

Without such a carefully crafted evaluation benchmark, most if not all, tailored graph LLMs are compared against general LLMs using simplistic queries (e.

, zero-shot reasoning with LLaMA), which can potentially camouflage many advantages as well as unexpected predicaments of them.

To achieve more general evaluations and unveil the true potential of LLMs for graph tasks, we introduce GRAPH IN-CONTEXT LEARNING (GRAPHICL) BENCHMARK, a comprehensive benchmark comprising novel prompt templates designed to capture graph structure and handle limited label knowledge.

Our systematic evaluation shows that general-purpose LLMs equipped with our GraphICL outperform state-of-the-art specialized graph LLMs and graph neural network models in resource-constrained settings and out-of-domain tasks.

These findings highlight the significant potential of prompt engineering to enhance LLM performance on graph learning tasks without training and offer a strong baseline for advancing research in graph LLMs.

Back

Abstract Introduction The exact manner in which large language models (LLMs) will be integrated into pathology is not yet fully comprehended. This study examines the accuracy, bene...

Perspectives and Experiences With Large Language Models in Health Care: Survey Study (Preprint)

BACKGROUND Large language models (LLMs) are transforming how data is used, including within the health care sector. However, frameworks including the Unifie...

Perspectives and Experiences With Large Language Models in Health Care: Survey Study

Background Large language models (LLMs) are transforming how data is used, including within the health care sector. However, frameworks including the Unified Th...

LLMs and AI: Understanding Its Reach and Impact

Large Language Models (LLMs) have revolutionized the field of Artificial Intelligence with their ability to understand and generate natural language discourse. This has led to the ...

Design

Conventional definitions of design rarely capture its reach into our everyday lives. The Design Council, for example, estimates that more than 2.5 million people use design-related...

Ask and You Shall Receive (a Graph Drawing): Testing ChatGPT’s Potential to Apply Graph Layout Algorithms

Large language models (LLMs) have recently taken the world by storm. They can generate coherent text, hold meaningful conversations, and be taught concepts and basic sets of instru...

Ask and You Shall Receive (a Graph Drawing): Testing ChatGPT’s Potential to Apply Graph Layout Algorithms

Large language models (LLMs) have recently taken the world by storm. They can generate coherent text, hold meaningful conversations, and be taught concepts and basic sets of instru...

Designing and deploying scalable intelligent tutoring systems to enhance adult education

Intelligent tutoring systems have consistently been shown to be effective in enhancing student learning outcomes. However, despite their demonstrated benefits, these systems have n...

Email:
Password:

Email:

GraphICL: Unlocking Graph Learning Potential in LLMs through Structured Prompt Design

Related Results