
Composing New Urban Futures with AI: Speculative Design and AI Text-to-Image Synthesis
Jamie Littlefield Texas Tech University
Introduction
In 2010, I co-founded a local organization dedicated to creating safer streets in Provo, Utah—a mid-sized city about an hour South of Salt Lake. The primary goal of our grassroots group was to encourage the construction of more walkable, bicycle-friendly, and accessible infrastructure in the built environment. As the organization matured and eventually became a 501(c)(3) non-profit, I often found myself debating developers and city officials in the public forum, in both textual exchanges (online forums, newspaper op-eds, mailers) and in face-to-face environments (town halls, engineering open houses, city council meetings). I soon discovered that citizen advocates are disadvantaged in debates about city infrastructure due to our lack of access to expensive, specialized technologies.
The professionals designing U.S. cities access industry software to extract place-based data and compose rhetorically persuasive visual renderings of the infrastructural futures they propose. Architects have AutoCAD and Revit. Urban planners rely on ArcGIS and CityEngine. City engineers use Infraworks and MicroStation. While these technologies are often described as helping professionals plan projects, they also work in tandem to help professionals communicate potentialities. Through polished renderings of street re-designs, skyscrapers, pedestrian crossings, and bus stops, members of the public are prompted to visualize the city not as it is but as it could be. City planning professionals are engaged in a perpetual act of storytelling in which “alternative versions of city space are explored, evaluated, and enacted over time” and “designers and architects envision new possibilities…then work to build them in physical form” (Hoffman, 2022, p. 11).
Without access to professional tools, citizens and grassroots advocates often struggle to convey alternative possibilities. For example, street safety organizations may have to pay outside consultants significant fees to create alternative images of sidewalk designs that prioritize vulnerable road users, such as people using wheelchairs or families with strollers. Visual rhetoricians have also attempted to demonstrate urban possibilities through free or low-cost design programs such as SketchUp, the method used by Gries et al. (2020) to counter gentrification by re-imagining a civic space at the University of Nevada, Reno. However, these tools can be time-consuming, and the results are generally more oriented toward communicating concrete design details than constructing rhetorically persuasive, stylized imagery.
Now, with the popularization of text-to-image tools, the disparity between professionals and grassroots communicators is beginning to decrease. AI is threatening to disrupt the monopoly urban professionals hold on a number of communicative affordances. Text-to-image synthesis (the processes used by programs such as DALL-E, Midjourney, and Stable Diffusion) is an emergent AI technology offering underresourced communicators a way to produce customized renderings with a lower price tag and a lower educational barrier to entry. A grassroots urbanist group might generate an image that adds a streetcar to an existing road, re-imagines an entire neighborhood around different values, or displays a visual prototype to demonstrate the consequences of a particular decision.

In my professional communications work, I’ve engaged with a number of non-profit leaders and community advocates who are beginning to explore how AI tools can help them more effectively communicate with the public. In 2022, social media saw an influx of AI-generated imagery showing cities not as they were but as they could be. That summer, I released a short prompt book for this niche audience, “Re-Imagining Urban Spaces with DALL-E AI,” which has been downloaded by just over 1,500 early adopters as of this writing. A second wave of AI-generated cities appeared on social media in late 2023, with the release of Microsoft Bing’s no-cost image generator powered by DALL-E 3 and ChatGPT. Social sites like X (formerly Twitter) circulated user-created images such as a Golden Gate Bridge favoring trolleys and pedestrians, a Baltimore Inner Harbor with extensively remodeled public spaces, and dozens of public beaches adjacent to muti-use paths rather than highways. For example, using DALL-E 2’s outpainting feature, I was able to take photos of real streets in my own community and edit them to include AI-generated elements such as highways, walking paths with native foliage, and water features (See Figure 2).

This chapter critically reflects on my experiences with AI text-to-image synthesis as a practitioner composing materials for grassroots advocacy and examines how this practical knowledge might translate to a classroom setting. I outline the essentials of how these tools work and how they offer new communicative possibilities to underresourced groups such as fledging non-profits and young political movements. I also describe some of the risks and challenges of AI text-to-image synthesis, including the possibility of value-locking the future. Finally, I suggest a method for critically engaging with AI text-to-image synthesis: speculative design.
Speculative design is a method used by designers, writers, and artists to envision and create concepts for possible futures rather than practical solutions for the present (Hoffman, 2022; Galloway & Caudwell, 2018; Dunne & Raby, 2013). It goes beyond traditional design by questioning societal norms and considering the broader impacts of technology and innovation on our lives. Speculative design involves crafting artifacts (such as AI-generated images) that demonstrate or visualize potential futures and encourage us to think critically about what could be (Opel & Rhodes, 2018; Wilkie et al., 2017). In their constitutive text, Dunne and Raby (2013) argue that speculative practices can function to generate greater elasticity for human futures: “...by speculating more, at all levels of society…reality will become more malleable and, although the future cannot be predicted, we can help set in place today factors that will increase the probability of more desirable futures happening…” (p. 6).
The tools of speculative design are uniquely equipped to help us critically consider the ways that AI is bound up in time, continually replicating the discourses of the past and artificially projecting limitations onto the ways humans will experience the future. In the case of urban communications, speculative design projects can help non-profits and grassroots organizations persuade the public by presenting images that deviate from the detrimental patterns of the past. In the writing classroom, speculative design practices can provide students with concrete approaches to analyzing and composing AI generated images that demonstrate awareness of the ways AI is entangled with the past.
How Text-to-Image Synthesizers Generate Content
In order to discuss critical approaches to AI, it may be helpful to review the processes text-to-image synthesizers use (See: Ramesh et al., 2021; OpenAI, 2021; Reed et al., 2016). With an eye toward balancing concision with clarity (and an acknowledgment that many details are not included here), this section presents a generalized snapshot of how OpenAI's DALL-E generates synthetic images. While the specifics of these processes differ between applications (programs like Midjourney use GANs or generative adversarial networks while DALL-E does not, for example), many features are similar.
As a broad overview, DALL-E generates synthetic images through a five-stage process:
- DALL-E is trained on image/text pairs. DALL-E is trained on an extensive dataset containing pairs of text descriptions and corresponding images. This foundational training allows the model to learn the relationship between textual descriptions and their visual representations. Both the images and the text descriptors are broken into small chunks of meaning called "tokens."
- A user inputs a prompt, which is processed as contextualized tokens. After a user submits a textual prompt describing the desired image, DALL-E breaks down the prompt into tokens and contextualizes these tokens. This means it derives a dense representation for each token that understands not only its standalone meaning but also its significance in relation to the entire prompt description. DALL-E analyzes those tokens based on its past exposure to the tokenized pairs of images and text within the training data.
- DALL-E sequentially generates a synthetic image. With the contextually enriched textual representation as a foundation, DALL-E begins the image generation process. This is not done in one attempt; rather, it is a sequential process, similar to how ChatGPT generates text. DALL-E predicts parts of the image, often patches, in a sequence, ensuring that the evolving visual content aligns with the input description's intent. The initial output becomes input as the image completes.
- DALL-E refines the synthetic image. After the initial synthesis, DALL-E leverages techniques like temperature sampling to refine and diversify the generated image. By adjusting the "temperature" or randomness parameter, DALL-E can produce a spectrum of outputs for a single prompt. This variability ensures that the user can select the most contextually relevant and visually cohesive image from the generated set of image options.
- The synthetic image (eventually) becomes a part of a new data training set. Although companies have been reluctant to provide details of what output becomes input for AI systems, it is widely expected that a large number of synthetic images will become a part of future training data. This might happen from formal processes to directly capture and include AI output or through the publication of synthetic images on webpages that are later scraped to form new datasets.
As an introduction to this process, students may find it helpful to think of text-to-image data training sets as old photo albums in which every picture is accompanied by a caption. As it studies this album, DALL-E learns to create its own pictures when given new captions. By breaking down these captions into smaller pieces (like individual words), DALL-E generates new images by remixing visual and textual meanings from past data.
Working with the Past to Compose the Future
While the purpose of generative AI is to produce something new, the process itself is inextricably bound to the old. Generating a new image through AI is an act of engaging with textual history. In practice, Large Language Models (LLMs) like ChatGPT and text-to-image synthesis tools such as Midjourney are fundamentally rooted in the past, trained on vast datasets that encapsulate historical texts, images, and multimedia. This historical orientation allows AI systems to access a rich mosaic of human conversation and creativity, generating images that resonate with cultural, historical, stylistic, discursive, or artistic contexts. These tools have the potential to amplify the intelligence of previous generations, offering a synthesized form of collective human experience. They allow for an interaction with a broad swath of human history and culture, potentially making the past more readily accessible and interpretable to the present.
However, AI systems also present new risks: they may inadvertently perpetuate the prejudices and limitations of past data by parroting "encoded biases without understanding the significant harm of the language it produces" (Johnson, 2023, p. 170). Historical stereotypes, outdated norms, and factual inaccuracies can be unwittingly regurgitated, potentially reinforcing harmful social constructs or misleading users (Anderson, 2023). When exposed to extensive training sets, generative AI tools have been shown to exhibit bias favoring the people and subjects most prevalent in the data (Byrd, 2023; Getchell et al., 2022). The overall size of a dataset is not an indicator of its proportionality or inclusivity.
As it re-mixes meaning from the past, generative AI does not compose on a blank slate. Instead, it generates content overtop a virtual palimpsest—a layered canvas of data, code, and pre-existing human input. This palimpsest is not neutral; it carries the biases, assumptions, and limitations of its human developers in addition to the dataset it was trained on. Because of this, the output of AI is a complex interplay between machine learning and the past socio-cultural contexts from which its training data originates. Both humans and generative AI are engaged in a continual process of layering new information over old, negotiating between past influences and present intentions. Understanding the complexities of AI-generated content requires recognizing it as part of a broader communicative ecosystem, where human and machine are influenced by layers of pre-existing context.
AI's entrenchment in the past generates a sort of algorithmic resistance to change, creating synthetic output that is subject to "value lock." Bender et al. (2021) describe "value lock" as the way in which "the LM-reliant technology reifies older, less-inclusive understandings" (p. 614). Rapidly spreading social shifts in the way people discuss an issue—be it the discourse surrounding the Black Lives Matter movement, the #MeToo movement's focus on sexual harassment, or the global conversation on energy production—may not be fully captured by LLMs reliant on training data from years ago. Accelerated changes in public opinion, such as Western attitudes towards same-sex marriage between the late 1990s and the mid-2010s, may be attributed, in part, to what Ridolfo and DeVoss (2009) refer to as "rhetorical velocity." These significant social shifts can occur as texts are rapidly circulated, re-mixed, and re-circulated throughout a population.
Faced with a kairotic situation, a human communicator may choose to give more weight to a new concept in the creation of content, while an algorithm may favor the replication of patterns more widely represented in decades of past training materials. Rather than facilitating a change in discourse or even remaining neutral, AI systems may actively obstruct social change by generating content that is untouched by the ideological shifts spreading rapidly through human networks.
Essentially, when it generates synthetic content, AI re-assembles units of meaning drawn from the materials of the past. While the historical orientation of generative AI tools offers certain benefits, deliberate approaches are needed to address its limitations. At this critical moment, we might stop to ask ourselves: If new texts continue stochastically (randomly) parroting the meanings found in the past, what kind of futures will result? When generative AI is designed to shoulder the weight of history, how does it impact discourse and design? What interventional strategies can we use to change course now?
Speculative Design as a Future-Oriented Tool
I suggest speculative design as a method uniquely equipped to help compositionists and other communicators respond to the challenges of emergent AI technologies. Speculative design is a practice that focuses on exploring and illustrating potential futures rather than designing for immediate practical needs. Speculative design stimulates thought, discussion, and critical reflection about potential societal, cultural, ethical, and environmental futures. It challenges existing assumptions and conventions about the role that products or services play in our lives, fostering new ideas about possible alternatives (Hoffman, 2022; Wilkie et al., 2017; Dunne & Raby, 2013). Speculations may center around hopeful possibilities or dystopian consequences resulting from present choices (Jagoda, 2023). Speculation grows from "a sensibility concerned with resisting a future that presents itself as probable or plausible," and contends that "no matter how pervasive the impasse may be, [the probable future] can never exhaust the unrealized potential of the present" (Wilkie et al., 2017, p. 7).
Practicing speculative design often involves crafting scenarios, stories, technological prototypes, or artifacts—like the "photos" of future streets and cities I generated in my non-profit work. These creations help demonstrate potential futures, making them more material for public audiences. Essentially, speculative design encourages us to think critically about what could be. It is closely related to critical design, a term coined by designers Anthony Dunne and Fiona Raby (2013), authors of the text Speculative Everything: Design, Fiction, and Social Dreaming. Both critical design and speculative design use critical, maker-style methods to challenge the status quo and encourage people to consider their relationship with the designed world (Faris & Holmes, 2022).
Speculative design offers several benefits that make it a valuable approach in both academic and practitioner contexts. First, it fosters creative thinking by encouraging communicators (designers) and researchers to step outside conventional boundaries and explore alternative futures. This imaginative exercise can lead to innovative solutions that might not be discovered through traditional design methods. Speculative design often evokes what Johnson & Johnson (2020) refer to as "ambient imaginaries," the sensory details that "focus the attention of publics deciding which parts of the material environment to carry into the future" (p. 15). These sensory-based approaches help audiences form affective connections with an unlived future. Second, speculative design serves as a tool for social critique, allowing us to question existing systems and norms by presenting alternative scenarios. This can spark meaningful discussions about societal values, ethics, and directions for future development. The artifacts created by speculative design are not end products; instead, they are a means for facilitating the kind of participatory public processes that result in the usable, citizen-created data discussed by Moore and Elliott (2016).
Third, speculative design enhances interdisciplinary collaboration by integrating insights from fields such as technology, social sciences, and the humanities, potentially creating a richer, more nuanced expression of human intent. At its best, it invites broader participation in the design field rather than reinforcing disciplinary boundaries. Finally, speculative design acts as a catalyst for public engagement. By presenting provocative or visionary scenarios, this method invites the broader community to participate in shaping their own futures, a move towards democratizing the design process. Research suggests that when communities work together to imagine new futures, they develop social resilience—a necessary element of navigating challenge and change (Hoffman, 2022, p. 5).
While AI is entrenched in the meanings of the past, speculative design asks us to put our intentionality into projecting possible futures with the hope that doing so will serve as a method of reflecting on and rectifying present action. This chapter does not propose that all AI-generated projects should be speculative in nature. Rather, I am suggesting that human users may be able to mitigate the sociocultural consequences of generative AI by acquiring and implementing the critical skills that emerge from engaging with speculative design practices.
Three AI-Generated Examples of Speculative Design
To examine the potential uses of speculative design in text-to-image synthesis, I will consider three generative examples. Each of these small, exploratory projects turns a critical eye on the work I've conducted in the area of human-scale, eco-friendly city development as a non-profit communications consultant. These synthetic images are not intended to be polished pieces; instead, they are examples of tactical communicative interventions that can be rapidly created by underresourced groups.
Generative Example #1: Tracing the Projected Past
This first project demonstrates the ways that AI text-to-image synthesis re-creates the conventions of the past and provides suggestions for interventions based on speculative design. To begin this project, I examined how generative AI uses the cities of the past to create new images. Over 14 consecutive days, I entered the same four-word prompt in Midjourney's image-generating Discord server:
ideal neighborhood. photorealistic.
Out of the four resulting image options from each of the 14 initial generations, I consistently upgraded the first image (on the top left of the image results). I archived these synthetic images and analyzed the results, as shown in Video 1. By using the same prompt, I was able to develop a small corpus of synthetic images. Rather than analyzing images individually, examining a collection enabled me to begin identifying patterns.
Users may be tempted to assume that the resulting synthetic images would be widely varied, considering Midjourney's extensive training datasets. "Neighborhoods'' have long existed in a wide variety of countries, cultures, and decades. "Ideal" is similarly a vague descriptor that could apply to many subjective contexts. However, in this experiment, Midjourney recreated many of the same details for the prompt in different configurations. Its "new" output reflected patterns from particular subsets of the past.
Descriptive Video Transcript
Onscreen Text | Image Description | |
---|---|---|
Tracing the Past with AI
Asking AI text-to-image synthesizer Midjourney to generate a new image every day for the prompt: "/imagine: an ideal neighborhood. photorealistic" | ||
Day 01 Midjourney Prompt: "/imagine: an ideal neighborhood. photorealistic." | A residential street lined with two- and three-story houses featuring classic architectural details such as bay windows, front porches, and decorative trim. The houses are painted in muted greens, blues, and reds. Trees with autumn-colored leaves line both sides of the street, casting shadows on the pavement. The asphalt street has concrete sidewalks directly alongside it, with steps leading up to the houses. A single car is visible in the distance. The sky is blue with scattered clouds. | |
Day 02 Midjourney Prompt: "/imagine: an ideal neighborhood. photorealistic." | A picturesque residential street lined with colorful Victorian-style houses, featuring ornate trim, bay windows, and decorative railings. The houses are painted in deep blues, greens, and reds, with lush landscaping spilling onto the sidewalks. Trees with vibrant autumn leaves create a canopy over the street. The asphalt road is narrow, with a few cars parked along the curb. Sunlight filters through the trees, casting golden light and long shadows. Overhead power lines run diagonally across the sky. The street gently curves in the background, leading to more houses and trees. | |
Day 03 Midjourney Prompt: "/imagine: an ideal neighborhood. photorealistic." | A residential street lined with two-story houses featuring pitched roofs, front porches, and wooden siding. The houses are painted in muted greens, blues, and tans. The street is paved with asphalt, with concrete sidewalks separated by narrow grassy strips. Cars are parked along the curb. The sky is golden, suggesting early morning or late afternoon. | |
Day 04 Midjourney Prompt: "/imagine: an ideal neighborhood. photorealistic." | A tree-lined residential street with large, two- and three-story houses featuring pitched roofs, front porches, and decorative trim. The houses are painted in shades of green, brown, and navy blue. The street has grassy front yards, concrete sidewalks, and cars parked along the curb. Shadows from tall, mature trees create a dappled effect on the road. The background shows more houses and trees. The sky is blue with wispy clouds. | |
Day 05 Midjourney Prompt: "/imagine: an ideal neighborhood. photorealistic." | A suburban residential street with two-story houses featuring pitched roofs, front porches, and decorative trim. The houses are painted in soft blues, greens, and yellows, with well-maintained landscaping. A red car is parked along the curb, and a few people are walking on the sidewalks. A group is gathered near a house. Trees with green and autumn-colored leaves cast dappled shadows. The sky is bright blue with scattered white clouds, and a small airplane is visible. | |
Day 06 Midjourney Prompt: "/imagine: an ideal neighborhood. photorealistic." | A tree-lined residential street with two-story houses painted blue, green, and brown. The street is paved with asphalt, with concrete sidewalks and grassy front yards. Trees with green and red foliage cast shadows. Cars are parked along the curb, and a few vehicles are driving down the street. Sunlight filters through the branches, creating a warm glow. The background shows the road continuing into the distance. | |
Day 07 Midjourney Prompt: "/imagine: an ideal neighborhood. photorealistic." | A residential street at dusk, lined with two-story houses painted deep green, red, and blue. The street is paved with asphalt, with concrete sidewalks and grassy front yards. Streetlights cast a warm glow, illuminating the sidewalks and parked cars. A few cars drive down the street, their taillights glowing. Birds fly overhead against a golden-orange sky with scattered clouds. Trees with dense foliage partially frame the scene. | |
Day 08 Midjourney Prompt: "/imagine: an ideal neighborhood. photorealistic." | A row of closely spaced, two- and three-story houses painted green, blue, and tan. Each house has a small front yard with neatly trimmed grass. The asphalt street has concrete sidewalks. A few cars are parked, and one vehicle drives down the street. Overhead power lines run diagonally. Trees with green and red autumn foliage cast shadows. The warm sunlight suggests early morning or late afternoon. | |
Day 09 Midjourney Prompt: "/imagine: an ideal neighborhood. photorealistic." | A residential street lined with two-story houses painted navy blue, beige, and light brown. Each house has well-maintained yards with flower beds. The asphalt street has concrete sidewalks and grassy front yards. A red car is parked along the curb. Power lines and wooden utility poles run along the street. The golden sunlight suggests late afternoon or early evening. The sky is blue with wispy clouds. | |
Day 10 Midjourney Prompt: "/imagine: an ideal neighborhood. photorealistic." | A quiet residential street lined with modest two-story houses painted beige, red, and green. The street is paved with asphalt, with concrete sidewalks. Several cars are parked, including a blue sedan. Overhead power lines stretch down the street. Trees with golden leaves cast dappled shadows. The sky is bright blue with white clouds. The background shows the street continuing into the distance. | |
Day 11 Midjourney Prompt: "/imagine: an ideal neighborhood. photorealistic." | A residential street with two-story houses painted navy blue, beige, and brown. Each home has flower beds and hedges. The street is paved with asphalt and lined with trees casting dappled shadows. Power lines and wooden poles extend along the street. The warm sunlight suggests late afternoon or early evening. The background shows more houses and trees. | |
Day 12 Midjourney Prompt: "/imagine: an ideal neighborhood. photorealistic." | A street lined with tall, Victorian-style houses painted green, beige, and blue. The asphalt street has concrete sidewalks. Trees with red and golden autumn leaves arch over the street, casting shadows. Several cars are parked, including a red car. Overhead power lines crisscross above. The street continues into the distance, framed by houses and trees under a bright blue sky. | |
Day 13 Midjourney Prompt: "/imagine: an ideal neighborhood. photorealistic." | A tree-lined street with Victorian-style houses painted red, beige, and blue. The street is paved with asphalt and has concrete sidewalks. Cars are parked along the curb, including a red car driving down the street. Trees provide shade, casting shadows. The sky is bright blue with fluffy white clouds. The street extends into the distance, lined with more houses and trees. | |
Day 14 Midjourney Prompt: "/imagine: an ideal neighborhood. photorealistic." | A vibrant residential street with colorful Victorian-style houses painted red, orange, blue, and green. Trees with golden leaves line the street. Some cars are parked and some cars are driving. Overhead power lines crisscross above. A modern city skyline with skyscrapers rises in the background. The sky is bright blue with scattered clouds. | |
Audio Note: This video contains no spoken dialogue. The background track features calm, upbeat music with a synthetic quality, resembling piano sounds. |
After compiling these synthetic images, I examined the threads that connected them. I found that Midjourney was projecting a very specific interpretation of “ideal” and “neighborhood.” The corpus of synthetic images tended towards:
- Suburbanism. All of the images showed houses in suburban settings. There are no mixed-use options (i.e., combinations of different types of building uses such as commercial spaces, apartments. or local restaurants). Only one image displayed an urban area in the background.
- Americanism. Although the specific locale is unclear, the general aesthetics of the streets suggest that they are located in the United States (or possibly Canada). The architecture appears to be Victorian or Edwardian, a style popular in the U.S. from the late 1800s to the early 1900s and characterized by integrated details, varied rooflines, bay windows, and ornamental trims.
- Car-centricity. 100% of the synthetic images include cars using the public space in the street or sidewalk. By contrast, only 7% of the images included humans using these public spaces. No alternative transportation methods, such as light rail, scooters, or buses were identified. Additionally, only 42% of the synthetic images included usable sidewalks on both sides of the street (or one side if only one side was pictured) that would be unobstructed to a person using a wheelchair or pushing a stroller.
- Classism. All synthetic images featured large houses (generally two or three stories) in good condition with front porches and mature trees in what could be considered historic areas.
- Racial Exclusivity and Redlining. While the images do not explicitly depict people, they evoke the aesthetics of redlined communities and mid-20th-century suburban idealism. The visual elements—large homes, manicured lawns, and detached housing—align with the historical American Dream imagery that was explicitly marketed to white, middle-class families in the post-WWII era. The federal government’s redlining policies and exclusionary zoning laws contributed to the development of these suburban enclaves, shaping neighborhood access based on race and economic status. AI’s reliance on past datasets means that it unintentionally perpetuates these historical patterns, reinforcing a limited and exclusionary vision of what an "ideal neighborhood" should be.
By drawing out the patterns in these images, we can start to identify how the AI text-to-image synthesizer is replicating a certain interpretation of the past. We can also begin to imagine how using these synthesized images in communicative contexts could produce value-lock by limiting the way we think about the future of neighborhoods. From the position of my own grassroots advocacy, I am concerned that using these images might contribute to the replication of the same type of exclusionary spaces that are already present in the built environment.
Applying a speculative design approach can help users imagine alternative ways to think about the same concept. Speculative design asks us to consider what alternative futures might be available to us. What if we were to create and circulate images of ideal neighborhoods that were not suburban, American, car-centric, and upper-class? What other types of neighborhoods might be considered "ideal" in the future? Speculative design practices can help users imagine built environments that are less predictable and less inclined to perpetuate value-lock.
One approach to generating more speculative synthetic images is to use AI as a tool for what Deleuze and Guattari (2013) call "rhizomatic thinking." Rhizomatic thinking is a non-hierarchical, decentralized cognitive approach that allows for multiple entry and exit points, much like the underground stems of rhizomatic vegetation (like bamboo, ginger, or mint). Unlike traditional, linear ways of thinking that seek a single "correct" solution or approach to an answer, rhizomatic thinking embraces complexity, multiplicity, and interconnectedness. In the context of AI-generated images, this means moving beyond mere replication or simulation of existing visual forms derived from the past. Instead, AI can be prompted to explore a larger array of possibilities, each branching off from the other, to create images that are not just novel but also speculative. These images can challenge our preconceived notions of reality, aesthetics, and even the limitations of human imagination, opening up new avenues for inquiry.
After noticing the patterns that emerged in the first corpus of generated images, I worked to create prompts that challenged text-to-image synthesizers to connect the concept of "neighborhood" to concepts that might be considered more distant and less linear. I experimented with new nodes like "Afrofuturism," "design for women," and "agricultural urbanism." The key phrases used in these prompts were intended to serve as nodes for rhizomatic thinking, asking AI to tap into knowledge systems that were further from its original concept of "neighborhoods."

The resulting images (Figure 6) showed visual evidence of meaning drawn from alternative branches of thinking. In these images, we see: neighborhoods being used by many different people, bicyclists and pedestrians, shared public spaces, waterways, communal playgrounds, and different housing types. The synthesizers are still drawing from the past, but the generated content shows greater novelty because it is working with additional units of meaning. It is being asked to create connections between somewhat disparate pasts, between branches of thought that are less related. AI text-to-image synthesis allows users to intentionally and rapidly experiment with this style of rhizomatic thinking by generating content from multiple "nodes" through textual input, providing new possibilities for projecting alternative futures.
While connecting to alternative nodes through rhizomatic thinking helps generate novel combinations, each new conceptual branch carries its own set of value-locked patterns that deserve critical examination. When connecting to "Afrofuturism" as a node, the AI draws from existing visual conventions that often frame Black futures through a lens of technological dystopia and architectural monumentalism, replicating patterns from science fiction media rather than the full spectrum of Afrofuturist thought. The "design for women" node activates deeply embedded visual patterns from historically gendered spaces and marketing, defaulting to stereotypical feminine coding through color and ornamental choices that reflect decades of gender-segregated design practices. Agricultural urbanism visualizations tend to reproduce the aesthetic conventions of contemporary "eco-luxury" developments, pulling from marketing imagery that presents urban farming through a lens of white, middle-class lifestyle rather than connecting to the diverse cultural histories of community agriculture and food sovereignty movements.
Prompt Node | Description | Prompt Phrases |
---|---|---|
Spatial Concepts | Foundational elements that anchor the design scenario | Neighborhood, Public space, Urban village, Housing district, City block, Street network, Town square, Transit corridor, Cultural district, Innovation hub, Waterfront development, Mixed-use complex, Educational campus, Healthcare precinct |
Adjective Modifiers | Qualities that shape the character and atmosphere | Sustainable, Futuristic, Vibrant, Market-driven, Inclusive, Minimalist, Playful, Communal, Resilient, Adaptable, Biophilic, Dynamic, Harmonious, Interactive, Regenerative |
Styles or Movements | Design philosophies and cultural influences | Bauhaus-inspired, Afrofuturist, Postmodern, Indigenous architecture, Brutalist, Cyberpunk, Art Deco, Metabolist, Solarpunk, Biomimetic, Neo-vernacular, Deconstructivist, Eco-gothic, Retro-futuristic, Feminist, New urbanist |
Material Features | Physical elements that add functionality and character | Green rooftop gardens, Integrated solar panels, Walkable streets, Community gathering spaces, Vertical farming, Underground transit hubs, Public art installations, Rainwater harvesting systems, Smart street furniture, Urban food forests, Microclimate zones, Elevated pedestrian networks, Wildlife corridors, Energy microgrids |
Environmental Contexts | Setting and surrounding conditions | Coastal adaptation, Desert oasis, Mountain terrain, Urban forest, River delta, Arctic settlement, Floating city, Underground development, Sky city, Island ecosystem, Canyon dwelling, Wetland integration |
Social Dynamics | Community interactions and behavioral patterns | Intergenerational living, Creative commons, Learning landscapes, Care networks, Free enterprise zones, Cultural exchange hubs, Maker spaces, Democratic forums, Healing environments, Play zones, Knowledge sharing centers, Ritual spaces, Community kitchens |
These patterns reveal how, even when generating novelty through rhizomatic connections, AI systems reconstruct value-locked elements from each conceptual branch they access. This suggests that speculative design practices must involve not just making new connections between nodes but must critically examine what historical patterns each node brings with it. Practitioners can use this awareness to deliberately identify and challenge these inherited conventions—perhaps by introducing additional nodes that specifically counteract recognized patterns, or by crafting prompts that explicitly reject certain visual tropes. The goal is not only to generate novel combinations, but to thoughtfully curate which elements from each knowledge branch we want to carry forward into our speculative futures.
Composition instructors can guide students through rhizomatic prompt creation by teaching them to strategically combine conceptual nodes (see Table 1). Like assembling blocks of meaning, students can start with a Base Spatial Concept (a "neighborhood" or "public space"), then thoughtfully layer on Adjective Modifiers that shape its character ("sustainable," "vibrant"). By adding a Style or Movement node ("solarpunk," "indigenous architecture"), students begin making more distant conceptual connections. Additional Features ("green rooftops," "community spaces"), Environmental Context ("coastal," "urban forest"), and Social Dynamics ("intergenerational living") can then be mixed in deliberately, with students examining how each new node might introduce its own historical patterns or biases. For example, rather than simply accepting "Design a vibrant neighborhood with indigenous-inspired architecture featuring rooftop gardens in a mountain terrain for intergenerational living," students should interrogate what assumptions about class, culture, and accessibility each element brings. This process teaches critical awareness of how different knowledge branches interact while encouraging experimentation with unconventional combinations that might challenge value-locked futures.
Generative Example #2: Speculative Design as a Tool for Value Negotiation and Identity Formation
As a practice, speculative design challenges communicators to tell stories that prompt discussion about who we are and what we value. By crafting narratives that exist beyond the present moment, communicators invite audiences to confront their own assumptions, biases, and ethical frameworks. This form of storytelling catalyzes dialogue, encouraging people to negotiate their values and reconsider their identities in the face of hypothetical yet plausible situations. In doing so, speculative design transcends aesthetic or functional considerations; it becomes a tool for social critique and identity exploration (Snow et al., 2021).
A place can be understood as a space that is filled with identity, culture, and shared references, bound together in time. Non-places, on the other hand, tend to promote the solitary identity of individuals disconnected from the people immediately around them (Varnelis & Friedberg, 2008). Examining the way places are connected to human values can help us identify what Rai (2016) refers to as "the competing rhetorical frames that circulate within and are tied to literal places" (p. 34).
For this example, I used DALL-E 3 to generate images that promote discussion about shared values around vacant property within my community. This project was done in response to local debate around the need for more convenient parking versus the need for more public spaces.
To generate the first synthetic image, I attempted to create distance between the subject and the anticipated audience. I chose the Eiffel Tower due to its status as a cultural symbol and its physical and ideological distance from Provo, Utah (i.e., it is known and loved but is distant and not considered "ours"). I used DALL-E 3 to generate a synthetic image, removing the public space that picnickers and tourists enjoy below the tower and replacing it with a parking lot for electric vehicles.

For the second synthetic image, I attempted to close the gap between the place pictured and the anticipated audience. Using an iPhone, I took a simple photograph of an empty lot in a downtown Provo neighborhood. I then used DALL-E's outpainting tool to generate three images representing the materialization of different values: the construction of a parking lot, a bicycle-friendly community garden, and a neighborhood restaurant (Figure 8).

The purpose of these generations is not to needlessly provoke but to encourage community discussion. In these examples, I attempted to elicit responses that called for a deeper reflection on the ways public space is used and how that use reflects community values. Speculative design can draw people together around new kinds of inquiry: What do we gain when we use publicly owned space for parking? What do we lose? What does that say about us and our values? Are there specific circumstances where we might choose to trade the convenience of parking for another value, and how do we identify those circumstances?
Generative Example #3: Speculative Design & The Construction of Publics
This final example demonstrates how speculative design has the potential to create publics—groups of strangers who are united around a common text (written, visual, or multimedia). Drawing on the work of Dewey (1927), Habermas (1989), and Latour & Weibel (2005), writing researchers have long been interested in the ways that the creation of a text can construct a group of people and things empowered to take conjoint action (Gries, 2019; Moore, 2023, Preston, 2015; Boyle & Rivers, 2018). Such texts often prompt action through their circulation as a part of a larger rhetorical ecology, generating force through their association with other texts, rhetors, readers, histories, and materials (Rivers & Weber, 2011; Edbauer, 2005). By assembling a public with intentionality, communicators invite readers to see themselves and their involvement in an issue in a particular way and, potentially, work with each other to change the course of the future (Warner, 2002; Rice, 2012).
For this project, I wanted to communicate the urgency of responding to the environmental threats facing Utah due to climate change and highlight the need for alternative transportation methods like infrastructure that supports walking and bicycling. I chose to focus on the Great Salt Lake, a rapidly declining body of water receding due to recent droughts. For generations, the lakebed was used to dispose of toxic chemicals, which now threaten to become airborne as dust storms. I used Midjourney to generate a synthetic image displaying a dystopian possibility for the future of Salt Lake—a dried, cracked lakebed releasing toxins toward the city (Figure 9).

Following the practices of speculative design, I generated a companion image to suggest potential technologies that might be needed to mitigate this dystopian future (Figure 10). Refined in Canva, the image displays a prototype of a colorful mask combined with an AI-enabled air quality sensor that could be distributed to Utah children in areas experiencing toxic dust exposure from the declining lake. While K-12 students in our area are already required to stay indoors during recess on days with dangerous particulate matter readings, such a device could allow children access to outdoor spaces if air quality continues to decrease. These corresponding images invite readers to consider how the actions taken now may impact the nature of our community and the ways we exist within it in the future.

When combined with narrative, speculative imagery can serve as a rhetorical tool to gather the people and materials needed to enact change. In the case above, the images are intended to evoke emotion around an undesirable future and its technological implications and, potentially, generate the affective energy required to initiate public action. Here, again, the purpose is not to needlessly provoke but to identify ways that AI text-to-image synthesis can help us speculate about the future in order to change the present. In this case, generative AI is a valuable tool for rapidly creating narrative-based images and prototypes. It allows users to experiment with variations, pulling together materials of the past to imagine solutions for problems in the future. Most importantly, it offers new tools for communicators to pull together groups of actants around shared concerns.
In my non-profit work, the theories behind the construction of publics have proven particularly relevant. Our task is to assemble groups of people and things (publics) who are inspired to work together to reassemble the material elements of the built environment (streets, bike lanes, parks, etc.). Our recent air quality failures might be viewed as a breakdown in our attempts to effectively form publics—we have not yet been able to assemble the kinds of groups that are capable and motivated to address the environmental problems we face. By providing communicators the chance to rapidly experiment with different types of content creation, we may be able to more effectively assemble the types of publics we need.
It is important to note here that emergent technology may end up impacting not only what publics are formed but how publicness itself is experienced. That is, technology has the potential not only to change the membership of a public but to change the way people/things are drawn into a public and the way they experience that association itself. In Composing Place: Digital Rhetorics for a Mobile World, Greene (2023) points to the ways that technology may be used to create new kinds of public orientations:
Rather than reducing the function of a digital text to its ability (or inability) to persuade a pre-given group of people, we should also attempt to understand how emerging technological infrastructures facilitate new formations of public-ness itself. (p. 40).
As technologies like augmented reality (AR) gain traction, humans may begin experiencing material places (like bus stops or public plazas) through interfaces that restrict or emphasize particular aspects of their experience with the built environment. Wearing AR headsets, one user may see factoids about local art in his visual field, while another may see evidence of local homelessness as a nondescript blur. Such applications function to "not only orient our physical position within a space but also our rhetorical position" (Crider et al., 2020, p. 9). Speculative design offers avenues for drawing users away from their individualized AR experiences and toward shared, material ways of addressing community challenges.
Conclusion
This chapter has considered how speculative design can serve as a tool for the critical examination and use of AI text-to-image generation. Although the focus has been on visual texts, a number of the concepts are transferable to other modes of communication. In many ways, the emergence of technologies reliant on large language models is an opportunity for writers to think more deeply about their own craft. As Walter Ong (2012) resonantly explained, writing itself is a technology. The act of writing has always existed in a space of tension between the past and the future. Through the manipulation of material tools and human symbols, we generate texts that encourage new understandings by remixing past understandings.
Beyond writing itself, many of the tools professionals use to gather and communicate data are also entangled with the past. In the field of urbanism, professional tools used by city planners and engineers are explicitly designed to recreate (rather than re-imagine) the existing elements of the built environment. It seems only sensible to design new streets based on our knowledge of past streets rather than rethink the nature of the street itself each time a road is called for. However, this way of engaging with the world creates entrenched patterns that can replicate bias and exclusion (like sidewalks without wheelchair access or public spaces only accessible to car owners).
Emergent technologies do not fundamentally change the purpose of communication. However, they do pose new questions, especially as AI begins to act as a mediating influence between the communicative acts of the past and the communicative acts of the future. Generative AI technology allows us to shift some of the labor of symbolic work to algorithmic functions. But, by doing so, we may risk offloading the work that is most important: our ability to imagine and communicate new futures—possibilities that loosen our entanglements to the past and propel humanity forward. Speculative design can help human communicators re-consider not only what kind of work they do but also what kind of beings they are. Specifically, in Gries' (2019) terms, it can help us remember that we are "assembling beings" with the "ability to assemble and distribute discourse that can, in turn, assemble and reassemble bodies around a shared concern" (p. 336).
A speculative approach to generative AI can help us further examine what it means to exist as an assembling being and how new things (especially new futures) emerge from the fragments of the old. Such a perspective can help us identify new ways to work with AI systems serving as mediators between ourselves and a collective, partial past.
References
Anderson, Salena S. (2023). Places to stand: Multiple metaphors for framing ChatGPT's corpus. Computers and Composition , 68, 102778. doi:10.1016/j.compcom.2023.102778
Bender, Emily M., Gebru, Timnit, McMillan-Major, Angelina, & Shmitchell, Shmargaret. (2021). On the dangers of stochastic parrots: Can language models be too big? 🦜. Proceedings of the 2021 AC Conference on Fairness, Accountability, and Transparency 610–623. doi:10.1145/3442188.3445922
Boyle, Casey, & Rivers, Nathaniel A. (2018). Augmented publics. In Laurie Gries & Collin Brooke (Eds.), Circulation, writing, and rhetoric (pp. 83&nash;101). Utah State University Press. doi:10.2307/j.ctt21668mb
Byrd, Antonio. (2023). Truth-telling: Critical inquiries on LLMs and the corpus texts that train them. Composition Studies , 51(1), 135–142.
Crider, Jason, Greene, Jacob, & Morey, Sean. (2020). Digital daimons: Algorithmic rhetorics of augmented reality. Computers and Composition, 57, 102579. doi:10.1016/j.compcom.2020.102579
Deleuze, Gilles, & Guattari, Félix. (2013). A thousand plateaus. Bloomsbury Academic.
Dewey John. (1927). The public and its problems. H. Holt and Company.
Dunne, Anthony, & Raby, Fiona. (2013). Speculative everything: Design, fiction, and social dreaming. MIT Press.
Edbauer, Jenny. (2005). Unframing models of public distribution: From rhetorical situation to rhetorical ecologies. Rhetoric Society Quarterly, 35(4), 5–24. doi:10.1080/02773940509391320
Faris, Michael J., & Holmes, Steve. (Eds.). (2022). Reprogrammable rhetoric: Critical making theories and methods in rhetoric and composition. University Press of Colorado.
Galloway, Anne, & Caudwell, Catherine. (2018). Speculative design as research method. In Gretchen Coombs, Andrew McNamara, & Gavin Sade (Eds.), Undesign (pp. 85–96). Routledge. doi:10.4324/9781315526379-8
Getchell, Kristin M., Carradini, Stephen, Cardon, Peter. W., Fleischmann, Carolin., Ma, Haibing., Aritz, Jolanta., & Stapp, James. (2022). Artificial intelligence in business communication: The changing landscape of research and teaching. Business and Professional Communication Quarterly, 85(1), 7–33. doi:doi.org/10.1177/23294906221074311
Greene, Jacob. (2023). Composing place: Digital rhetorics for a mobile world. Utah State University Press.
Gries, Laurie, Watson, Blake, Kalin, Jason P., Pratt, Jaqui, & Dighton, Desiree. (2020). (Re)designing Innovation Alley: Fostering civic living and learning through visual rhetoric and urban design. Review of Communication, 20(2), 170–177. doi:10.1080/15358593.2020.1737190
Gries, Laurie E. (2019). Writing to assemble publics: Making writing activate, making writing matter. College Composition & Communication, 70(3), 327–355. doi:10.58680/ccc201929986
Habermas, Jürgen. (1989). The structural transformation of the public sphere: An inquiry into a category of bourgeois society. MIT Press.
Hoffman, Johanna. (2022). Speculative futures: Design approaches to navigate change, foster resilience, and co-create the cities we need. North Atlantic Books.
Jagoda, Patrick. (2023). Speculative design. In Patrick Jagoda, with Ireashia Bennett & Ashlyn Sparrow. Transmedia stories: Narrative methods for public health and social justice. Stanford University Press. http://transmediastories.org. doi:10.21627/2022ts
Johnson, Gavin P. (2023). Don't act like you forgot: Approaching another "literacy crisis" by (re) considering what we know about teaching writing with and through technologies. Composition Studies, 51(1), 169–218.
Johnson, Meredith A., & Johnson, Nathan R. (2020). The resilience of sensation in urban planning. Poroi, 15(1). doi:10.13008/2151-2957.1295
Latour Bruno. & Weibel Peter. (2005). Making things public: Atmospheres of democracy. MIT Press.
Moore, Kristen R. (2023). Public. In Han Yu & Jonathan Buehl (Eds.), Keywords in technical and professional communication (pp. 233–239). WAC Clearinghouse/University Press of Colorado. doi:10.37514/TPC-B.2023.1923
Moore, Kristen R., & Elliott, Timothy J. (2016). From participatory design to a listening infrastructure: A case of urban planning and participation. Journal of Business and Technical Communication, 30(1), 59–84. doi:10.1177/1050651915602294
Ong, Walter J., Hartley, John. (2012). Orality and literacy: The technologizing of the word. Routledge.
OpenAI (2021). DALL·E: Creating images from text. Retrieved October 5, 2023, from https://openai.com/research/dall-e
Opel, Dawn S., & Rhodes, Jacqueline. (2018). Beyond student as user: Rhetoric, multimodality, and user-centered design. Computers and Composition, 49, 71–81. doi:10.1016/j.compcom.2018.05.008
Preston, Jacqueline. (2015). Project(ing) literacy: Writing to assemble in a postcomposition FYW classroom. College Composition and Communication, 67(1), 35–63.
Rai, Candice. (2016). Democracy's lot: Rhetoric, publics, and the places of invention. University of Alabama Press.
Ramesh, Aditya, Pavlov, Mikhail, Goh, Gabriel, Gray, Scott, Voss, Chelsea, Radford, Alec, Chen, Mark, & Sutskever, Ilya. (2021). Zero-shot text-to-image generation. http://arxiv.org/abs/2102.12092
Reed, Scott, Akata, Zeynep, Yan, Xinchen, Logeswaran, Lajanugen, Schiele, Bernt, & Lee, Honglak. (2016). Generative adversarial text to image synthesis. Proceedings of the 33rd International Conference on Machine Learning.
Rice, Jennifer. (2012). Distant publics: Development rhetoric and the subject of crisis. University of Pittsburgh Press.
Ridolfo, Jim R. & DeVoss, Dànielle N. (2009). Composing for recomposition: rhetorical velocity and delivery. Kairos: A Journal of Rhetoric, Technology, and Pedagogy, 13(2). https://kairos.technorhetoric.net/13.2/topoi/ridolfo_devoss/
Rivers, Nathaniel A., & Weber, Ryan P. (2011). Ecological, pedagogical, public rhetoric. College Composition and Communication 63(2), 187–218.
Snow, Stephen, Khan, Awais H., Glencross, Mashhuda, & Horrocks, Neil. (2021). Neighbourhood wattch: Using speculative design to explore values around curtailment and consent in household energy interactions. Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems, 1–12. doi:10.1145/3411764.3445095
Varnelis, Kazys, & Friedberg, Anne. (2008). Place: The networking of public space. In Kazys Varnelis (Ed.), Networked publics (pp. 15–42). MIT Press.
Warner, Mark. (2002). Publics and counterpublics. Zone Books.
Wilkie, Alex, Savransky, Martin, & Rosengarten, Marsha. (2017). Speculative research: The lure of possible futures. Taylor & Francis.