
Team SAFE DREAMS:


Created March 5, 2026

SAFE SPACE. A Home Office Intervention.

A canopy in your home office. Speak your ideal workspace. AI renders live image distortion in TouchDesigner. A relationship made visible.

Things used in this project

Hardware components

Smartphone (camera + microphone)

Laptop or desktop computer

Portable projector

Audio / Video Cable Assembly, Ultra Slim RedMere HDMI to HDMI

Assorted fabric (curtains, bedsheets, offcuts)

Wood dowels, plastic pipes, or broom handles (~1m each)

Screws or duct tape

Rope or strong string

Software apps and online services

TouchDesigner

Camo App

Typeform

OpenAI Whisper

DreamDiffusion Real-time image generator

Microsoft Windows 10

Hand tools and fabrication machines

Sewing Machine

Story

We work from home. We are surveilled at work. The line between the two has collapsed.

SAFE SPACE makes that relationship visible. It's a handmade fabric canopy, suspended in the home office, large enough for one person and a chair. Inside, a projector casts AI-generated images onto the walls. A microphone listens to your answers to the interview questions.

The piece grows from two research questions. Armin asks: What are we willing to give up to enjoy the benefits of AI at work?
Madlen asks: How does the meaning of home change in times of impermanence? Together they point at the same problem: the home office is both refuge and workplace, both private and monitored.

The canopy exaggerates this contradiction. It offers a temporary shelter. It also records everything you say.

For the Cognitive Orgies II challenge, this project defines intelligences and communication as follows:

Human Spatial Intelligence (embodied knowledge of dwelling) communicates with computational intelligence (LLM processing) through voice to generate spatial visuals and images.
Voice descriptions of a workspace at home are processed by an LLM that generates real-time 3D visualizations in TouchDesigner.

This setup communicates the hypothesis "We shape space and space shapes us."

Testing the full experience.

How it works

When a participant enters the canopy, they find a phone on a small table or chair. The screen greets them:

"You're in a private space. Speak freely. Your voice will be heard, but your image is shielded. Answer each question aloud."

They speak their answers. As they talk, two things happen simultaneously.

Their words are transcribed by AI and used as a prompt to generate a room image, projected live onto the canopy fabric around them. Describe a forest cabin: a forest cabin appears. Describe your actual desk: that appears too.

At the same time, the volume and pace of their voice control the pixelation of a live camera feed. The more they speak, the more their image breaks apart.

Six questions guide the experience. They start warm with "What does your ideal workspace look like?" and move toward colder, more transactional territory. The final question is the ethical one:

"If your employer would pay for your apartment, would you trade your privacy for financial security?"

Then: "You may exit. The canopy provided temporary refuge. Thank you."
The participant leaves. The monitored workspace is still there.

This intervention makes visible the tension and discrepancies between lived realities and the physical built environment. It goes back to the core question: "How does our behaviour adapt to space, and how does space adapt to our behaviour?"

System Overview

The system has three components working together:

1. The Experience: a smartphone running a Typeform that guides the participant through the experience and triggers the audio recording and visual projection.

2. The Canopy: the physical fabric structure, which is both projection surface and shelter.

3. The Projection: a live composite of AI-generated room images and a pixelated webcam feed, driven by the participant's voice.

First ideation on the systems setup and experience flow.

Software & Pipeline

Systems architecture diagram.

Step 1 Capture Audio and Video
The smartphone connects to the laptop wirelessly via the Camo app, which presents it as both a webcam and an audio input device.

Inside TouchDesigner, an Audio Device In CHOP captures the live audio stream. An Audio File Out CHOP saves the stream in chunks, closing each one when the participant stops talking, into a local recordings/ folder. Each audio file is sent to OpenAI Whisper (model whisper-1) through a direct API call, which returns the transcript into a Text DAT. That Text DAT serves as the input for a DreamDiffusion node that generates images in real time.
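The chunking behaviour described above (start a take when the voice rises, close it after a pause) can be tried offline, without TouchDesigner. The function below is a minimal sketch under stated assumptions, not the project's script: `segment_takes` is a hypothetical helper, the input is a made-up list of RMS readings, and the default threshold and timeout simply mirror the values in the script in the Code section. It leaves out that script's restart cooldown.

```python
def segment_takes(rms_values, dt, threshold=0.08, silence_timeout=2.0):
    """Return (start_s, end_s) for each take found in a series of RMS readings.

    rms_values: RMS levels sampled every `dt` seconds (hypothetical input).
    A take starts at the first reading >= threshold and ends once the level
    has stayed below threshold for `silence_timeout` seconds.
    """
    takes = []
    start = None          # start time of the take in progress, if any
    last_sound = 0.0      # time of the most recent loud reading
    for i, rms in enumerate(rms_values):
        now = i * dt
        if rms >= threshold:
            last_sound = now
            if start is None:
                start = now
        elif start is not None and now - last_sound >= silence_timeout:
            takes.append((start, now))
            start = None
    if start is not None:  # input ended while a take was still open
        takes.append((start, len(rms_values) * dt))
    return takes


# One reading every 0.5 s: 1 s of speech, 2.5 s of silence, 1 s of speech.
levels = [0.2, 0.3] + [0.01] * 5 + [0.25, 0.2]
print(segment_takes(levels, dt=0.5))  # [(0.0, 2.5), (3.5, 4.5)]
```

Running recorded level data through a function like this is one way to pick a silence timeout that does not cut participants off mid-sentence.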

Step 2 Audio Reactivity (Pixelation)
While the audio is being recorded, it is also analyzed in real time.

An Analyze CHOP (Function set to RMS Power) measures the volume level and outputs a normalized 0–1 value. This value is exported directly as a parameter into a Pixelate TOP applied to the live video feed.

The result: silence shows a clear image of the participant. Speech breaks it apart. The visual metaphor matches the conceptual one: the more you reveal, the more you disappear.
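In the installation this mapping lives entirely in TouchDesigner operators. As an illustration only, the same idea can be written in plain Python: measure the RMS of a block of samples, map it to a block size, and average each tile of the image. The names `pixel_block_size` and `max_block`, and the linear mapping itself, are assumptions made for this sketch; the project's actual parameter range is not documented here.

```python
import math


def rms(samples):
    """Root-mean-square level of a block of audio samples in the range -1..1."""
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def pixel_block_size(level, max_block=64):
    """Map a 0..1 loudness level to a pixel block size (1 = sharp image)."""
    level = min(max(level, 0.0), 1.0)  # clamp out-of-range input
    return 1 + round(level * (max_block - 1))


def pixelate(image, block):
    """Pixelate a 2D grid of grey values by averaging each block x block tile."""
    height, width = len(image), len(image[0])
    out = [row[:] for row in image]
    for y0 in range(0, height, block):
        for x0 in range(0, width, block):
            ys = range(y0, min(y0 + block, height))
            xs = range(x0, min(x0 + block, width))
            tile = [image[y][x] for y in ys for x in xs]
            mean = sum(tile) / len(tile)
            for y in ys:
                for x in xs:
                    out[y][x] = mean
    return out


frame = [[0, 2], [4, 6]]
quiet = pixel_block_size(rms([0.0, 0.0]), max_block=2)  # silence -> block size 1
loud = pixel_block_size(rms([1.0, -1.0]), max_block=2)  # full level -> block size 2
print(pixelate(frame, quiet))  # pixel values unchanged
print(pixelate(frame, loud))   # every pixel becomes the tile mean, 3.0
```

With a block size of 1 the image passes through untouched, which corresponds to the clear image shown during silence.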

References and Examples Explored:

https://github.com/modem-works/dream-recorder?tab=readme-ov-file

https://github.com/cumulo-autumn/StreamDiffusion

https://labs.google/projectgenie

https://github.com/openai/shap-e

https://www.instagram.com/aview.fromabridge/

Build the Canopy

This canopy is designed to be made by anyone. No matching fabrics required. Imperfection is the point.

DIY instructions to make your own!

Step 1: Gather Your Fabrics
Collect mismatched fabrics: old curtains, bedsheets, upholstery offcuts, whatever you have. Patchwork is not just acceptable here, it's intentional. The canopy should look handmade and lived-in.

First prototype of the canopy.

Step 2: Cut the Panels
Cut 4 panels at 1.5m × 3m each. Cut 1 top square at 1m × 1m.
✂️ Add 1cm seam allowance on all edges before cutting.
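For anyone adapting the dimensions, the cut sizes can be worked out quickly. This is a small sketch that assumes the measurements above are finished sizes and that the 1 cm allowance is added on all four edges; `cut_size_cm` is a hypothetical helper name.

```python
SEAM_CM = 1  # seam allowance per edge, from the step above


def cut_size_cm(width_cm, height_cm, seam_cm=SEAM_CM):
    """Finished size plus a seam allowance on all four edges."""
    return (width_cm + 2 * seam_cm, height_cm + 2 * seam_cm)


panel = cut_size_cm(150, 300)  # each of the 4 side panels
top = cut_size_cm(100, 100)    # the top square
area_m2 = (4 * panel[0] * panel[1] + top[0] * top[1]) / 10_000

print(panel, top, round(area_m2, 2))  # (152, 302) (102, 102) 19.4
```

So each panel is cut at 152 cm × 302 cm and the top square at 102 cm × 102 cm, roughly 19.4 m² of fabric in total before offcuts for the ties.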

Sewing the panels together.

Step 4: Attach the Top Square
Pin the 1m × 1m square centred to the top edge of the joined panels. Sew with right sides together.

Step 5: Sew the Ties
Make 8 fabric ties from strips of leftover fabric. Attach them on the inside:

4 ties, one at each corner
4 ties, one at the midpoint of each side

These lash the canopy to the internal frame.

Step 6: Build the Frame
Cut 4 pieces to 1m each from wood dowels, plastic pipes, or broom handles. Join them into a square using screws, cable ties, or duct tape.


Step 7: Attach Frame to Canopy
Thread the ties through the frame and knot them firmly. The frame sits inside the top of the canopy, creating the open shape.

Step 8: Reinforce the Top Hole
Cut a small hole in the centre of the top square. Your hanging rope will pass through here.
✂️ Reinforce the edge with a buttonhole stitch by hand, or use iron-on interfacing and a metal grommet for extra strength.

Step 9: Find a High Point
Find a ceiling hook, overhead beam, cable bridge, or strong tree branch. Thread your rope through the top hole and tie a secure knot. A bowline works well.

Step 10: Raise It
Pull the rope. Adjust until it hangs evenly and the fabric falls open around a person-sized space inside.

Step 11: Get In. Be Safe.
Crawl inside. Sit down. You're home. ♥

Prototype Evolution - From Physical Space to Spatial Experience

The process evolved across three interconnected design pillars: entering a physical space, interacting with the space and its devices, and using projection to transform the spatial experience.

Day 1: Establishing the Foundation

Projection Layer
Initial exploration focused on audio dynamics as the input. From there we brainstormed what the audio could alter visually, and which attributes of the audio beyond the literal words spoken (volume, rhythm, frequency) we could use. The initial idea was to turn speech into text, and text into 3D world modelling. Camera and microphone input were introduced as the primary sensing modalities, capturing presence, movement, and sound as the raw inputs that drive the system.

Entering the Space - The Physical Artefact
We explored the idea of using textile both as a projection surface and as an enclosure that focuses attention on the visual alterations.

At this point we did not yet think of the project as an installation. But we were clear that the artefact would be valuable for research and data collection.

Day 2: First Prototype & First Testing

Projection Layer
First testing with textile projection, projecting onto fabric surfaces rather than flat walls, opened up new possibilities for how light, texture, and space could be combined.

Entering the Space
The first physical prototype revealed an immediate spatial constraint: we needed a larger space to project onto, so the circular frame had to become a square.

Interaction Layer
We started thinking about the experience the user would have in the canopy. A complete interaction experience was designed around a tablet interface, mapping user input from the device to projected outputs. This represented the first full end-to-end feedback loop.

⚠️ Note: The arrow in the process diagram indicates that the tablet-based interaction fed directly back into the spatial entry experience — a key design insight.

Day 3: Pivoting Based on User Testing

Entering the Space
Testing revealed that users needed an enclosed experience: the interaction needed to happen inside a defined space. The decision was made to introduce furniture to define and bound the space, and to start the experience inside the canopy rather than outside before stepping in.

Interaction Layer
First external user testing exposed a critical UX problem: participants were distracted by the tablet interface itself, which also blocked the view of the projection on the textile wall. We decided to replace the tablet with a phone to control the focus and give more visual space to the projection.

Projection Layer
In response to the tablet distraction finding, we decided to:

Incorporate spoken words as a primary interaction modality
Simplify the visual output to reduce cognitive load and refocus attention on the spatial experience

Day 4: Pivoting Based on User Testing

Entering the Space
We placed a chair to support the projection at the right level. For the design dialogues, it would be nice to build a small nightstand that fits the theme and makes the space feel more cozy and whole.

Interaction Layer
The computer's hardware is not powerful enough to run the AI together with the video and audio coming through the Camo app. So we decided to use the laptop inside for presentation purposes.

Projection Layer
The projection works well now with the AI integrated. The only thing is that when there is no continuous

Prototype Setup

Ethics & Limitations

This project is not a solution. It doesn't protect anyone from actual surveillance. It can't scale. It is least accessible to those who are most surveilled. What it is: a temporary intervention. A space for recognition. A question, not an answer. The canopy offers refuge for a few minutes. Then you exit. The surveillance continues. That tension is the work.

Future Development

This project opens several directions for further exploration:

Portability: Can the canopy become truly portable, replicable and adaptable to different contexts? How would the methodology shift if tested in different cultural contexts?
Participant Agency: What happens if participants design their own questions, shape the archive, co-create the methodology? Can research become genuine collaboration?
The Scaling Paradox: Intimacy requires smallness, slowness, care. Can or should this intervention scale? Or is its power precisely in its unreplicability?

Schematics

Code

Real-time speech-to-text node script

"""
================================================================================
TouchDesigner: RMS Voice Transcription + Image Capture
================================================================================

A CHOP Execute DAT that listens to an Analyze CHOP (RMS Power) and automatically:
  - Starts recording when sound exceeds a threshold
  - Stops recording after a period of silence
  - Sends the WAV to OpenAI Whisper for transcription
  - Saves the transcript as a .txt file alongside the .wav
  - Saves a snapshot of a TOP (e.g. your visual output) as a .jpg

--------------------------------------------------------------------------------
NETWORK SETUP
--------------------------------------------------------------------------------

  audiodevin1
      |
      +--> audiofileout1        (Audio File Out CHOP, Record = Off)
      |
      +--> analyze1             (Analyze CHOP, Function: RMS Power)
                |
                v
           chopexec1            (THIS DAT: CHOP Execute)
                                 CHOPs        = analyze1
                                 Value Change = On

  button1
      |
      v
  chopexec_button              (optional CHOP Execute for manual override)
                                CHOPs     = button1
                                Off to On = On
                                paste the same script; both callbacks are included

  transcript_output            (Text DAT, displays latest transcription)
  moviefileout1                (Movie File Out TOP, connected to your visual output)

--------------------------------------------------------------------------------
REQUIREMENTS
--------------------------------------------------------------------------------

  - TouchDesigner 2023.x or later
  - Python 3.x (bundled with TouchDesigner)
  - OpenAI API key set as environment variable:
      Windows:  setx OPENAI_API_KEY "sk-..."  (restart TD after)
      macOS:    export OPENAI_API_KEY="sk-..."

  No external Python packages required; uses only stdlib + TD built-ins.

--------------------------------------------------------------------------------
LICENSE
--------------------------------------------------------------------------------

  MIT License
  Free to use, modify, and distribute with attribution.

================================================================================
"""

import os
import io
import json
import time
import urllib.error    # HTTPError is caught explicitly in _transcribe()
import urllib.request


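# ------------------------------------------------------------------------------
# CONFIGURATION: edit these to match your TouchDesigner network
# ------------------------------------------------------------------------------

# OpenAI API key: read from an environment variable (recommended)
# Alternatively hardcode: OPENAI_API_KEY = "sk-..."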
OPENAI_API_KEY = os.environ.get("OPENAI_API_KEY", "")

# Path to your Audio File Out CHOP
RECORDER_PATH  = "/project1/audiofileout1"

# Folder where .wav / .txt / .jpg files will be saved
# Defaults to Desktop/recordings; change to any absolute path
RECORDINGS_DIR = os.path.join(os.path.expanduser("~"), "Desktop", "recordings")

# Text DAT that will display the latest transcription
TRANSCRIPT_DAT = "/project1/transcript_output"

# Button COMP for manual start/stop override (optional)
BUTTON_PATH    = "/project1/button1"

# Movie File Out TOP for saving image snapshots (optional; set to "" to disable)
IMAGE_OUT_PATH = "/project1/moviefileout1"

# OpenAI model: "whisper-1" or "gpt-4o-transcribe"
MODEL          = "whisper-1"

# Language hint for Whisper; improves accuracy (e.g. "en", "es", "fr", "de")
# Set to "" for auto-detection
LANGUAGE       = "en"

# Maximum file size in MB before rejecting (OpenAI limit is 25MB)
MAX_FILE_MB    = 24


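# ------------------------------------------------------------------------------
# RMS TRIGGER SETTINGS
# ------------------------------------------------------------------------------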

# RMS level that triggers recording start
# Too low = background noise triggers it
# Too high = quiet speech gets missed
# Tip: watch your analyze1 CHOP values while silent to find your noise floor,
#      then set this to roughly double that value
RMS_THRESHOLD    = 0.08

# Seconds of silence before recording stops and transcription begins
SILENCE_TIMEOUT  = 2.0

# Seconds to wait after a take ends before allowing a new auto-recording
# Prevents immediate re-triggering from residual noise
RESTART_COOLDOWN = 3.0


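# ------------------------------------------------------------------------------
# INTERNAL STATE: do not edit
# ------------------------------------------------------------------------------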

_last_sound_time = 0.0
_last_stop_time  = 0.0
_image_saved     = False


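# ------------------------------------------------------------------------------
# INTERNAL HELPERS
# ------------------------------------------------------------------------------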

def _rec():
    node = op(RECORDER_PATH)
    if node is None:
        debug("WHISPER: recorder node not found at " + RECORDER_PATH)
    return node


def _set_label(label):
    if not BUTTON_PATH:
        return
    btn = op(BUTTON_PATH)
    if btn:
        btn.par.label = label


def _is_recording():
    node = _rec()
    return bool(node.par.record.val) if node else False


def _save_transcript(wav_path, text):
    # splitext only swaps the extension, even if ".wav" appears in a folder name
    txt_path = os.path.splitext(wav_path)[0] + ".txt"
    try:
        with open(txt_path, "w", encoding="utf-8") as f:
            f.write(text)
        debug("WHISPER: transcript saved -> " + txt_path)
    except Exception as e:
        debug("WHISPER: could not save transcript - " + str(e))


def _save_image(wav_path):
    if not IMAGE_OUT_PATH:
        return
    top = op(IMAGE_OUT_PATH)
    if top is None:
        debug("IMAGE: node not found at " + IMAGE_OUT_PATH)
        return
    # splitext only swaps the extension, even if ".wav" appears in a folder name
    img_path = os.path.splitext(wav_path)[0] + ".jpg"
    try:
        top.par.file = img_path
        top.save(img_path)
        debug("IMAGE: saved -> " + img_path)
    except Exception as e:
        debug("IMAGE: save failed - " + str(e))


def _start_recording():
    global _image_saved
    node = _rec()
    if node is None:
        return ""
    node.par.record = 0
    time.sleep(0.1)
    os.makedirs(RECORDINGS_DIR, exist_ok=True)
    ts       = time.strftime("%Y%m%d_%H%M%S")
    wav_path = os.path.join(RECORDINGS_DIR, "take_" + ts + ".wav")
    node.par.file   = wav_path
    node.par.record = 1
    _image_saved    = False
    tdat = op(TRANSCRIPT_DAT)
    if tdat:
        tdat.text = ""
    _set_label("[ REC ]")
    debug("WHISPER: recording started -> " + wav_path)
    return wav_path


def _stop_recording():
    global _last_stop_time
    node = _rec()
    if node is None:
        return ""
    wav_path        = str(node.par.file)
    node.par.record = 0
    _last_stop_time = time.time()
    _set_label("Sending...")
    debug("WHISPER: recording stopped -> " + wav_path)
    return wav_path


def _multipart_body(fields, files, boundary):
    body = io.BytesIO()
    enc  = boundary.encode()
    for name, value in fields.items():
        body.write(b"--" + enc + b"\r\n")
        body.write(('Content-Disposition: form-data; name="' + name + '"\r\n\r\n').encode())
        body.write(str(value).encode())
        body.write(b"\r\n")
    for name, (filename, ctype, data) in files.items():
        body.write(b"--" + enc + b"\r\n")
        body.write(('Content-Disposition: form-data; name="' + name + '"; filename="' + filename + '"\r\n').encode())
        body.write(('Content-Type: ' + ctype + '\r\n\r\n').encode())
        body.write(data)
        body.write(b"\r\n")
    body.write(b"--" + enc + b"--\r\n")
    return body.getvalue()


def _transcribe(wav_path):
    if not OPENAI_API_KEY:
        debug("WHISPER: OPENAI_API_KEY not set - set env var and restart TouchDesigner")
        _set_label("Press to record")
        return

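    # Note: this sleep and the HTTP request below run on TouchDesigner's main
    # thread, so the network stops cooking until the transcription returns.
    debug("WHISPER: waiting for file flush...")
    time.sleep(0.8)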

    if not os.path.isfile(wav_path):
        debug("WHISPER: file not found: " + wav_path)
        _set_label("Press to record")
        return

    size_bytes = os.path.getsize(wav_path)
    size_mb    = size_bytes / (1024 * 1024)

    if size_bytes < 1000:
        debug("WHISPER: file too small - no speech detected")
        _set_label("Press to record")
        return

    if size_mb > MAX_FILE_MB:
        debug("WHISPER: file too large (" + str(round(size_mb, 1)) + " MB) - keep under 2 minutes")
        _set_label("Press to record")
        return

    debug("WHISPER: sending " + str(round(size_mb, 2)) + " MB to OpenAI...")

    with open(wav_path, "rb") as f:
        wav_bytes = f.read()

    fields   = {"model": MODEL, "response_format": "json"}
    if LANGUAGE:
        fields["language"] = LANGUAGE

    boundary = "TDBoundary7MA4YWxkTrZu0gW"
    body     = _multipart_body(
        fields,
        {"file": ("audio.wav", "audio/wav", wav_bytes)},
        boundary,
    )
    req = urllib.request.Request(
        "https://api.openai.com/v1/audio/transcriptions",
        data    = body,
        method  = "POST",
        headers = {
            "Authorization": "Bearer " + OPENAI_API_KEY,
            "Content-Type" : "multipart/form-data; boundary=" + boundary,
        },
    )

    try:
        with urllib.request.urlopen(req, timeout=60) as resp:
            raw  = resp.read().decode()
            debug("WHISPER: response -> " + raw)
            text = (json.loads(raw).get("text") or "").strip()
    except urllib.error.HTTPError as e:
        debug("WHISPER: HTTP " + str(e.code) + " - " + e.read().decode("utf-8", errors="replace"))
        _set_label("Press to record")
        return
    except Exception as e:
        debug("WHISPER: error - " + str(e))
        _set_label("Press to record")
        return

    if not text:
        debug("WHISPER: empty result - speak clearly and close to mic")
        _set_label("Press to record")
        return

    debug("WHISPER: result -> " + text)
    _save_transcript(wav_path, text)

    tdat = op(TRANSCRIPT_DAT)
    if tdat is None:
        debug("WHISPER: transcript DAT not found at " + TRANSCRIPT_DAT)
        _set_label("Press to record")
        return

    tdat.text = text
    debug("WHISPER: transcript_output updated")
    _set_label("Press to record")


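# ------------------------------------------------------------------------------
# MANUAL TOGGLE: triggered by button CHOP Execute
# ------------------------------------------------------------------------------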

def toggle():
    """Start or stop recording manually via button press."""
    if _is_recording():
        debug("WHISPER: manual STOP")
        wav_path = _stop_recording()
        if wav_path:
            _save_image(wav_path)
            _transcribe(wav_path)
    else:
        debug("WHISPER: manual START")
        _start_recording()


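# ------------------------------------------------------------------------------
# RMS AUTO-TRIGGER: triggered by Analyze CHOP Execute
# ------------------------------------------------------------------------------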

def on_rms(rms_value):
    """
    Called on every RMS value change from the Analysis CHOP.
    Starts recording when sound exceeds RMS_THRESHOLD.
    Stops recording after SILENCE_TIMEOUT seconds of silence.
    """
    global _last_sound_time

    now = time.time()

    if rms_value >= RMS_THRESHOLD:
        _last_sound_time = now

        if not _is_recording():
            if now - _last_stop_time >= RESTART_COOLDOWN:
                debug("WHISPER: RMS " + str(round(rms_value, 4)) + " - auto starting")
                _start_recording()
            else:
                remaining = round(RESTART_COOLDOWN - (now - _last_stop_time), 1)
                debug("WHISPER: cooldown " + str(remaining) + "s remaining")
    else:
        if _is_recording():
            silence_duration = now - _last_sound_time
            if silence_duration >= SILENCE_TIMEOUT:
                debug("WHISPER: " + str(round(silence_duration, 1)) + "s silence - auto stopping")
                wav_path = _stop_recording()
                if wav_path:
                    _save_image(wav_path)
                    _transcribe(wav_path)


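# ------------------------------------------------------------------------------
# TOUCHDESIGNER CALLBACKS
# ------------------------------------------------------------------------------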

def onOffToOn(channel, sampleIndex, val, prev):
    """Fires when button value goes from 0 to 1 (button pressed)."""
    toggle()


def onValueChange(channel, sampleIndex, val, prev):
    """Fires on every RMS value change from the Analysis CHOP."""
    on_rms(val)
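The tip in the script's RMS trigger settings (find the noise floor while silent, then roughly double it) can be turned into a small calibration helper. This is a sketch under assumptions: `suggest_threshold` is a hypothetical function that is not part of the script above, the readings are example numbers rather than measurements, and taking the highest silent reading as the noise floor is one possible choice.

```python
def suggest_threshold(silent_rms_readings, factor=2.0):
    """Suggest an RMS_THRESHOLD as `factor` times the noise floor.

    silent_rms_readings: RMS values noted from the analyze1 CHOP while nobody
    is speaking. The noise floor is taken as the highest of those readings.
    """
    if not silent_rms_readings:
        raise ValueError("need at least one reading taken while silent")
    noise_floor = max(silent_rms_readings)
    return round(noise_floor * factor, 3)


# Example readings (made up): the room hums at around 0.03 to 0.04.
print(suggest_threshold([0.03, 0.04, 0.035]))  # 0.08
```

A noisier room, such as one with a projector fan close to the microphone, would push the suggested value up, which in turn means participants need to speak louder to start a take.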

Credits

Madlen Elise von Wulffen


Armin Gulbert
