Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Efficient Visual Prompt Engineering for Creative Story Writing

View through CrossRef
Large Language Models (LLMs) are extensively utilized for generating stories, showcasing their ability to handle complex, creative tasks. To begin the process of story generation, an initial textual prompt is required. The prompt is iteratively refined such that the discrepancy between the user’s expectations and the story generated from the prompt is minimized. Each iteration is a time-consuming process; the user needs to read and analyze the story in order to refine the prompt. A key insight from cognitive research suggests that analyzing visual data is 60,000 times faster than textual analysis. This paper proposes visual prompt engineering for story generation wherein textual prompts are transformed into images using a diffusion model, then refined based on the discrepancy between the user’s expectations and the generated image. This refined prompt is then used to generate a story. The entire process is repeated until the user is satisfied with the story. This method leverages the relative speed of image processing to enhance the quality of text generation per iteration. Experiments show that for the same number of iterations, stories generated by visual prompt engineering outperformed those generated by text-based prompts in terms of story quality.
Title: Efficient Visual Prompt Engineering for Creative Story Writing
Description:
Large Language Models (LLMs) are extensively utilized for generating stories, showcasing their ability to handle complex, creative tasks.
To begin the process of story generation, an initial textual prompt is required.
The prompt is iteratively refined such that the discrepancy between the user’s expectations and the story generated from the prompt is minimized.
Each iteration is a time-consuming process; the user needs to read and analyze the story in order to refine the prompt.
A key insight from cognitive research suggests that analyzing visual data is 60,000 times faster than textual analysis.
This paper proposes visual prompt engineering for story generation wherein textual prompts are transformed into images using a diffusion model, then refined based on the discrepancy between the user’s expectations and the generated image.
This refined prompt is then used to generate a story.
The entire process is repeated until the user is satisfied with the story.
This method leverages the relative speed of image processing to enhance the quality of text generation per iteration.
Experiments show that for the same number of iterations, stories generated by visual prompt engineering outperformed those generated by text-based prompts in terms of story quality.

Related Results

KAJIAN KESIAPAN PENERAPAN KONSEP KOTA KREATIF DESAIN DI SURAKARTA
KAJIAN KESIAPAN PENERAPAN KONSEP KOTA KREATIF DESAIN DI SURAKARTA
<p><em>City plays important role making the city owns its high enchantment. This could effect on the emerging of city’s problems, where the city could not accommodate t...
Prompt Engineering For ChatGPT: A Quick Guide To Techniques, Tips, And Best Practices
Prompt Engineering For ChatGPT: A Quick Guide To Techniques, Tips, And Best Practices
<p>In the rapidly evolving landscape of natural language processing (NLP), ChatGPT has emerged as a powerful tool for various industries and applications. To fully harness th...
Prompt Engineering For ChatGPT: A Quick Guide To Techniques, Tips, And Best Practices
Prompt Engineering For ChatGPT: A Quick Guide To Techniques, Tips, And Best Practices
<p>In the rapidly evolving landscape of natural language processing (NLP), ChatGPT has emerged as a powerful tool for various industries and applications. To fully harness th...
Like Lady Godiva
Like Lady Godiva
Introducing Lady Godiva through a Fan-Historical Lens The legend of Lady Godiva, who famously rode naked through the streets of Coventry, veiled only by her long, flowing hair, has...
Hydatid Cyst of The Orbit: A Systematic Review with Meta-Data
Hydatid Cyst of The Orbit: A Systematic Review with Meta-Data
Abstarct Introduction Orbital hydatid cysts (HCs) constitute less than 1% of all cases of hydatidosis, yet their occurrence is often linked to severe visual complications. This stu...
Western Mesoamerican Calendars and Writing Systems
Western Mesoamerican Calendars and Writing Systems
<i>Western Mesoamerican Calendars and Writing Systems</i> draws together studies by some of the world’s leading experts presented at a conference held in December 2020,...
Chronological Evolution of the Urashima Taro Story and its Interpretation
Chronological Evolution of the Urashima Taro Story and its Interpretation
<p>The present thesis examines the evolution of the Urashima story. In modern Japan traditional Japanese tales have been presented in the form of illustrated books for young ...
Researching the Facts, Writing the Fiction: A Creative Writing Practice Study - Herself Alone in Orange Rain
Researching the Facts, Writing the Fiction: A Creative Writing Practice Study - Herself Alone in Orange Rain
Writing a novel about recent controversial and tragic events is a difficult, even potentially daunting challenge, for an author. The ethical considerations of including such facts ...

Back to Top