Djali

May 17, 2023

Basis Research Institute, a nonprofit applied research organization dedicated to understanding and building intelligence from first principles, collaborates with GKZ, a creative collective led by brothers Gaika, Kibwe, and Zenna Tavares. Together with a diverse collective of independent artists, technicians, and researchers, they present Djali at the 2023 Venice Biennale of Architecture. The Biennale Architettura 2023, titled "The Laboratory of the Future," explores themes of equity, race, hope, and fear from a global perspective, incorporating the unique talents and expertise of this multifaceted group of collaborator

Children playing with toys on a futuristic laboratory

At the heart of Djali is an exploration into counterfactual and
imaginative reasoning about possible worlds and lives, mediated by
experimentation in both narrative structure and technology. The exhibit
integrates advances in computer vision, computer graphics, automated
reasoning, and artificial intelligence generally, with innovation into
the structure of storytelling itself.

As a visitor moves through the exhibit, they encounter a structure that
embodies Djali, a ruminating artificial intelligence. Djali detects
visitors through an array of sensors, and communicates with them via
sound and visuals on a large display that covers much of its surface.

Narratives

Djali takes the visitor through a sequence of vignettes, short stories
that serve as windows into imaginary worlds and the lives within them,
loosely linked together by a backdrop that touches on questions of race,
class, technology and the climate. Each window offers a glimpse into a
different counterfactual branch, presenting stories of worlds that might
exist in the future, that could have existed but didn’t, or that could
never exist in reality. Djali guides visitors through these different
worlds and encourages them to explore and reflect on the choices and
circumstances that created and continue to shape the world we inhabit.

Mediums

Djali employs an array of storytelling mediums, both traditional and
new. Traditional static imagery, video, and sound come together to
depict the detailed environments and narratives of each hypothetical
world.

Beyond these established technologies, Djali uses “Figments”, a new
technology developed for the exhibit, to give visitors a novel sense of
presence within the alternative realities. The Figments, enabled by
composing very recent advances in computer vision, computer graphics,
and artificial intelligence, create a viewing experience goes beyond the
sum of its parts. Experientially, as a visitor shifts their perspective
by moving their head or gaze, the computer reconstructs the displayed
image correspondingly, creating the perception that the visitor is
genuinely looking into an alternative world through a window, or portal.

Interactions

Visitors are active participants within the exhibit. Their presence and
movements guide Djali through the counterfactual branches, giving them
an influential role in shaping the narratives. As visitors engage with
the windows and explore the presented alternative realities, they are
encouraged to reflect on the choices, consequences, and possibilities
that exist within each world.

The Venice Biennale Context

The 2023 Venice Biennale of Architecture, titled "The Laboratory of the
Future," provides an ideal setting for Djali. The Biennale challenges
architects, designers, and artists to push the boundaries of their
disciplines, exploring new technologies and ideas that have the
potential to reshape the future.

Djali aligns with this vision, embodying a spirit of innovation and
exploration. By merging the technical and creative talents of Basis and
GKZ, the exhibition offers a unique perspective on the potential futures
that lie before us.

Intended Impact

The intended impact of Djali is multifaceted, aiming to challenge
visitors’ assumptions about the real world and its properties. By
presenting alternative realities, the exhibit encourages visitors to
reflect on which aspects of our world are immutable, as opposed to those
that are coincidental or subject to change.

Djali seeks to showcase the richness of everyday lives in scenarios
where roles are altered and hierarchies are upended, while also
prompting visitors to question the limitations of their imagination. By
mediating these worlds through artificial intelligence, the exhibit
raises questions about the creative potential of artificial
intelligences, and the degree to which these cognitive faculties can or
will remain distinctly human.

Furthermore, the exhibition honors the role of Jali, also known as
Griot, the traditional West African storyteller responsible for
preserving and sharing cultural knowledge and wisdom. By incorporating
the concept of Jali, which shares its root with "Djali," into the
exhibition, the creators emphasize the significance of storytelling in
promoting empathy, understanding, and connection across diverse cultures
and societies.

Ultimately, Djali aims to transcend the boundaries of conventional
storytelling by using artificial intelligence as a medium to deliver
imaginative narratives that inspire visitors to contemplate the
possibilities of our shared future and the role each individual plays in
shaping it.

imag

System at a Glance

The Djali architecture and its creation process consist of several
components:

The physical structure: Visitors to the exhibit encounter a
physical structure—a large display within an enclosure. Equipped
with high-speed cameras, microphones, and embedded high-performance
computers, the structure can sense its environment and detect the
presence of people.
Narrative: The structure embodies an observant, archival, and
ruminating artificial intelligence named Djali. Djali presents its
thoughts, both real and imaginary, through various forms of
expression.
Library of works: Djali conveys the narrative by sequencing
together various pieces of work.
Photography: Images and visual content form an integral part of
the narrative and the visitor’s experience.
Figments: The most unique experiences are presented through
Figments. For the visitor, viewing a Figment feels like looking
through a window or portal into another environment frozen in time.
Unlike conventional videos or images on a screen, a Figment
dynamically changes in response to the visitor’s viewpoint, creating
the sensation of peering through a window. Constructing this
experience requires (i) capturing 3D representations of scenes from
images, (ii) tracking the viewpoint of the visitor, and (iii)
real-time rendering of the 3D scene on the display with the correct
perspective relative to the estimated viewer’s viewpoint.
Interactive compositions: Visitors actively engage with the
exhibit, influencing the narratives and experiences presented to
them.
The AI Architecture: The sensing, rendering, composition, and
interactivity are all mediated by a unified artificial intelligence
architecture. In other words, Djali functions as an artificial
intelligence both within the narrative and in actuality.

Physical Construction

The physical structure of the Djali exhibition is composed of
extruded anodized aluminum, providing a robust and functional framework.
At the center, the lenticular display is placed, while the GPU computers
are concealed at the back.

The frame features multiple panels, filled with translucent
polycarbonate sheets. The structure also supports the sensing
infrastructure, with cameras, microphones, and speakers fixed to the
frame, ensuring seamless integration of technology and design.

AI Architecture

The AI architecture for the Djali exhibition consists of
interconnected software modules. Input data from cameras and microphones
is processed by a GPU-powered machine running these modules.

The gaze estimation module uses computer vision techniques to track
the visitor’s irises and triangulate their gaze in 3D space. The gaze
prediction module predicts the visitor’s gaze several milliseconds
into the future, ensuring 3D models are rendered at the appropriate
perspective.

Cognitive components, such as text-to-speech, speech-to-text,
and the NERF 3D rendering module, are read from and written to by
the cognition core, which serves as the central controlling module,
deciding what to present to the visitor.

The audio source separation module processes audio streams,
isolating the visitor’s voice from background noise. The asset
library stores 3D NERF models, videos, still images, and other media
assets for use in the exhibit.

Finally, the data sinks, including the lenticular display and
speakers for positional audio, present visual and auditory elements of
alternative realities to the visitor, mediated by the cognitive core.

Figments

Figments are a unique and central aspect of the Djali exhibit, providing
visitors with a snapshot into alternative worlds. The experience is made
possible by a combination of off-axis projection and Neural Radiance
Fields (NeRF), which capture high-fidelity scenes in 3D. This
cutting-edge technology enables the Figments to offer an immersive and
captivating glimpse into the richly detailed environments of each
hypothetical world.

As visitors engage with the Figments, they are guided by audio vignettes
and, which in some cases, are narrated by Djali itself. This
multi-sensory experience fosters a deeper understanding and connection
to the alternative realities presented.

Narrative

In a world reshaped by revolution and technology, old borders and
nations have faded, replaced by a new order of interconnected
city-states governed by benevolent artificial intelligences. In this
realm, idealized collectivism triumphs over individualism, while the
Earth’s ecosystems are revered as a sentient and sacred deity.
Citizenship, no longer bound by physical borders, is granted and
maintained through constant surveillance and adherence to neo-religious
ecological beliefs.

Within the densely populated urban polities, digital citizenship grants
individuals access to communal resources, generous basic income, and the
freedom to traverse allied city-states. These urban hubs, meticulously
designed to minimize humanity’s impact on the Earth, house ninety
percent of the world’s population within a land area no larger than the
Tokyo metro area.

Despite the system’s efficiency, a predictable number of individuals
find themselves denied or stripped of their citizenship. Cast out from
the city-states, they are left with no choice but to seek refuge in the
forgotten relics of the old world, joining the unrehabilitated
criminals, capitalist ideologues, and religious zealots. In these
chaotic, class-driven principalities, the remaining ten percent of the
global population resides, living under the rule of warlords who thrive
on cyber warfare, terrorism, and contraband trafficking within the
city-states.

This is a world divided by ideology, where the future of humanity
teeters between the pursuit of Earth preservation and the dangerous
allure of unchecked ambition. It is a tale of revolution, technology,
and the complexities of a society shaped by artificial intelligence,
ecological reverence, and the remnants of the old world order.

References and Concept Work

image/djali/images/

![image](/djali/images/wistful.png

Contributors

Research Development: Zenna Tavares, Ria Das, Shaiyan Keshvari, Karen Schroder, Emily Mackevicius, Wyatt Garfield, Michelle Yi

Creative Development: Zenna Tavares, Kibwe Tavares, Gaika Tavares, Lullyn Tavares, Jessica Au, Ying Suen

Article: Karen Schroeder, Zenna Tavares

Acknowledgments: APOC Store, Zuckerman Institute, The Venice Biennale of Architecture, Jessica Wimbart