In the last 250 years, we have made great strides in automation. This has provided us with more luxury and convenience than we could have ever dreamed of. Soon we will even have generated music, food, entertainment, warfare, and love. Currently, just one stone remains unturned: philosophy.
Cue the Philosotron, here to take care of this truly dreadful activity. It's a robot that stares out of a window and provides cryptic messages to be misunderstood and fought over for centuries.
This article consists of the following steps:
- Introduction
- Dataflow
- Raspberry Pi Setup
- Camera Setup
- Pan & Tilt (optional)
- The Box
- Observing
- Deep Analysis
- Website
- Full Code & Installation
- Results
- Notes & Considerations
Dataflow
- 1. The Raspberry Pi directs the servos via a chip to move to a random position (optional).
- 2. The camera takes a picture and sends it back to the Raspberry Pi.
- 3. This picture is passed to Moondream via their API.
- 4. Moondream briefly describes what it sees and replies with a caption.
- 5. This caption is combined with the latest stored thought, to be passed onto the Gemini API.
- 6. A new piece of wisdom is generated and sent back.
- 7. This thought and the accompanying picture are stored locally.
- 8. For viewing, a local website is displayed.
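To make the flow above concrete, here is a minimal sketch of the main loop. The helper names (move_camera_to_random_position, take_picture, describe_image, ponder, save_entry) and the two-minute interval are hypothetical placeholders for illustration, not the actual names used in codeBaseClean.py:
import time

def philosotron_loop(previous_thought=""):
    while True:
        move_camera_to_random_position()             # step 1 (optional): pan & tilt to a new pose
        image_path = take_picture()                  # step 2: grab a frame from the webcam
        caption = describe_image(image_path)         # steps 3-4: Moondream captions the scene
        thought = ponder(caption, previous_thought)  # steps 5-6: Gemini generates new wisdom
        save_entry(image_path, caption, thought)     # step 7: store the picture and the thought
        previous_thought = thought                   # chain the thoughts for the next cycle
        time.sleep(120)                              # rest between revelations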
Raspberry Pi Setup
We are opting for a headless setup. This means it won't have a desktop environment or run any visual components. Here's a good tutorial on Getting started with your Raspberry Pi and a good tutorial on remote access.
Setting up remote access is optional; the desktop version is great as well. I usually go for a headless setup because I prefer to work from my laptop and avoid the need for a separate screen, mouse, and keyboard.
Once the MicroSD card is written and ready, we can slide the card into the designated slot, plug in the power supply and the Pi should come to life.
Update, Upgrade, Autoremove, and Clean
Once the hardware is ready and we have access directly or via SSH, we run the command below to make sure everything is up to date and clean:
sudo apt update && sudo apt full-upgrade -y && sudo apt autoremove -y && sudo apt clean
Camera Setup
Vision for our sage will be provided by a USB webcam. All we need to do is plug it into the USB port of our Raspberry Pi. Once done, we can check if it is connected by running the following command:
lsusb
(env) user@Philosotron:~/Philosotron $ lsusb
Bus 001 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
Bus 001 Device 002: ID 2109:3431 VIA Labs, Inc. Hub
Bus 001 Device 003: ID 1a40:0801 Terminus Technology Inc. USB 2.0 Hub
Bus 001 Device 004: ID 26e0:3c13 Sonix Technology Co., Ltd. HDF Webcam USB
Bus 001 Device 005: ID 1b3f:2008 Generalplus Technology Inc. Usb Audio Device
Bus 002 Device 001: ID 1d6b:0003 Linux Foundation 3.0 root hub
As you can see our webcam is registered as USB device 004.
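To double-check that OpenCV can actually grab frames from the webcam (and not just that the USB device is registered), a quick test along these lines should do; the device index 0 is an assumption and may differ on your setup:
import cv2

video = cv2.VideoCapture(0)      # 0 is usually the first USB webcam
success, frame = video.read()    # try to grab a single frame
video.release()
print("Webcam OK" if success else "Webcam not found")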
Pan & Tilt (Optional)
This step is completely optional; if you prefer a simpler setup, you can keep the camera static and power the Raspberry Pi with a standard power supply.
Pan-Tilt Parts
What you see defines your reality. To allow our guru to have a changing perception, we use this 3D-printed pan-tilt setup by Tommy Zhang to move the camera into different positions before each snap.
This setup uses two MG996R 180-degree servos for movement, a PCA9685 chip to power and control these servos, and an LM2596S step-down converter. The converter takes the 12V coming from the V88 battery, steps it down to 6.4V, and feeds it into the supply side of the servo control chip (PCA9685). To connect the chip to our Pi, you can follow this tutorial by Adafruit.
To attach the webcam to the pan-tilt setup, we glue some Velcro onto both the baseplate and the underside of the webcam (see picture). This allows us to remove and reposition the webcam easily. It's sturdy enough to hold everything in place, yet flexible enough to allow calibration; as an added bonus, it also dampens some of the vibrations coming from the servos.
As we already have the Voltaic V88 battery to power the pan-tilt setup, we might as well use it to power the Raspberry Pi itself. All we need to do is run a USB-C cable from the battery to the Pi, and it's all sorted. This has an extra benefit of making the robot completely portable.
See the pictures attached for an overview of the wiring and assembly.
Centering Servos
Once set up, we can run the calibration script to put both servos into their default position (90 degrees), the center point of their range of motion. Then we screw on the parts of the pan, and do the same for the tilt. This makes sure the setup looks straight ahead and dead center. If this is not done, moving the servos might make the assembly crash into itself or look too far to either side.
The calibration script, and necessary configuration are discussed in step Full Code & Installation.
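Until then, here is a minimal sketch of what such a calibration script might look like, assuming the PCA9685 is wired as in the Adafruit tutorial and the pan and tilt servos sit on channels 0 and 1 (your channel numbers may differ):
from adafruit_servokit import ServoKit

kit = ServoKit(channels=16)   # the PCA9685 breakout exposes 16 channels

PAN_CHANNEL = 0               # assumed channel for the pan servo
TILT_CHANNEL = 1              # assumed channel for the tilt servo

# Move both servos to 90 degrees, the center of their 180-degree range
kit.servo[PAN_CHANNEL].angle = 90
kit.servo[TILT_CHANNEL].angle = 90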
The Box
To keep everything neat and tidy, we opt to house our electronics in a classy cardboard box. The camera setup is mounted on the lid, the wires are fed through the side openings, and the electronic bits and bobs are placed inside.
To attach the camera to the lid, and to manage the electronics inside, we use stick-on Velcro pads. These are strong, small, and make sure everything stays in place.
Observing
With the camera in place we can start observing the world. We use a Python script and the OpenCV library to access the camera and take a picture.
This picture is then sent to the Moondream API, which will provide us with a short caption of the image. Moondream has a free tier where you receive $5 each month to experiment with, which is plenty for this project.
If the pan-tilt setup is part of the robot, we also add some code to initialise the control chip, and move the servos into a random position before each photograph is taken, changing the scenery dynamically.
The complete code, and necessary configuration are discussed in step "Full Code & Installation".
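In the meantime, a rough sketch of the capture-and-caption step could look like the following. The servo channels, the random angle ranges, the file name, and the exact Moondream caption call are assumptions for illustration, not necessarily what codeBaseClean.py does:
import random
import time
import cv2
import moondream as md
from PIL import Image
from adafruit_servokit import ServoKit

# Optional: pan & tilt to a random position before the snapshot
kit = ServoKit(channels=16)
kit.servo[0].angle = random.randint(30, 150)   # pan servo (assumed channel 0)
kit.servo[1].angle = random.randint(60, 120)   # tilt servo (assumed channel 1)
time.sleep(1)                                  # give the servos a moment to settle

# Grab a frame from the USB webcam
video = cv2.VideoCapture(0)
for _ in range(60):                            # flush stale frames from the buffer
    video.read()
success, frame = video.read()
video.release()
cv2.imwrite("snapshot.jpg", frame)

# Ask Moondream for a short caption of the snapshot
model = md.vl(api_key="XXX")
caption = model.caption(Image.open("snapshot.jpg"))["caption"]
print("->", caption)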
Deep Analysis
With this short caption available, we now collect the previous thought, combine the two, and send the result to an external brain for processing. By including the previous thought we create an interlinked chain of thoughts, somewhat similar to a stream of consciousness.
The thinking power is provided by Google AI Studio. They offer a free tier, but the number of requests per day is too limited (12), so I opted to upgrade. The cost involved will be less than 2 cents a day if run 24/7. The free tier suffices, but will run out after ~24 minutes.
The core of our deep thinker lies in the instructions we give it. We prompt Gemini to take the description of the image and the previous platitude, and combine them into a new insight, using the following:
prompt = f"""
Internal Monologue History: '{previousThought}'
Current Sensory Report: '{imageSummary}'.
As the Philosotron, you must ignore the mundane.
Translate this visual data into a philosophical platitude that builds upon your previous thought.
Do not repeat yourself. Sound like a guru who has seen too much. Be concise.
"""The complete code, and necessary configuration are discussed in step "Full Code & Installation".
Website
It would be nice to browse the Philosotron's process and considerations in a user-friendly way. So, to get away from JSON files and folder structures, we are building a small local website to fetch all the relevant data and display it neatly. To do this we will use Flask for the backend and Tailwind for the frontend.
This frontend will function as an archive, with a date selector and a chronological list to highlight the chained logic of our great thinker.
The complete code, and necessary configuration are discussed in step Full Code & Installation.
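As a rough idea of the backend's shape, here is a minimal sketch; the /api/thoughts endpoint and the thoughts.json file are assumptions for illustration, not the exact structure of backend.py:
import json
from flask import Flask, render_template, jsonify
from flask_cors import CORS

app = Flask(__name__)
CORS(app)

@app.route("/")
def index():
    # Flask looks for index.html inside the templates folder
    return render_template("index.html")

@app.route("/api/thoughts")
def thoughts():
    # Hypothetical storage file holding the captions and thoughts
    with open("thoughts.json") as f:
        return jsonify(json.load(f))

if __name__ == "__main__":
    app.run(host="0.0.0.0", port=5500, debug=True)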
Full Code & Installation
Environment Creation & Libraries Installation
Below are the commands needed to create the environment and install all required components.
# Create a project folder
mkdir Philosotron
# Navigate into the created folder
cd Philosotron
# Create a python environment, required for the installation of libraries
python3 -m venv env
# Activate the created environment
source env/bin/activate
# Install the needed libraries and packages
pip install opencv-python moondream pillow google-genai adafruit-circuitpython-servokit flask flask-cors
API Keys
In the attached code (codeBaseClean.py) we use API keys to access both Moondream and Google AI Studio. You need to replace those with your own, which you can create by following the Google AI Studio quick start and this Moondream Quick Start.
Once configured you can replace the XXX in these lines of code with the corresponding keys:
# Gemini model setup
client = genai.Client(api_key="XXX")
# Moondream model
model = md.vl(api_key="XXX")
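If you would rather not hard-code the keys in the script, you could read them from environment variables instead; this is an optional variation, not how codeBaseClean.py is written:
import os
from google import genai
import moondream as md

# Export the keys once in your shell, e.g. export GEMINI_API_KEY="..."
client = genai.Client(api_key=os.environ["GEMINI_API_KEY"])
model = md.vl(api_key=os.environ["MOONDREAM_API_KEY"])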
File Transfer
With the keys in place and saved, we now transfer the codeBaseClean.py file into the newly created folder. Here is a tutorial on how to transfer files with FileZilla.
With everything in place, we can run it with the following command:
python3 codeBaseClean.py
(env) user@Philosotron:~/Philosotron $ python3 codeBaseClean.py
Awakening
Moving the Camera.
Captured: image_13-56-37.jpg
-> A white refrigerator is positioned against a light gray wall, with a painting of flowers on its right side and a wooden scroll hanging to its left.
-> The stark white void, a monument to preserved emptiness, reminds me that awareness is merely the first step out of oblivion.
Archive
Inside the Philosotron folder create another folder called templates:
# Create a templates folder
mkdir templates
Download the index.html file, and move it into the templates folder.
We can run the backend and website with the following command:
python3 backend.py
(env) user@EyeBot:~/Philosotron $ python3 backend.py
* Serving Flask app 'backend'
* Debug mode: on
WARNING: This is a development server. Do not use it in a production deployment. Use a production WSGI server instead.
* Running on all addresses (0.0.0.0)
* Running on http://127.0.0.1:5500
* Running on http://192.168.1.5:5500
Press CTRL+C to quit
* Restarting with stat
* Debugger is active!
* Debugger PIN: 178-027-370
Next we use our web browser to navigate to the IP address of our Raspberry Pi, adding the port 5500. In my case this looks like:
192.168.1.5:5500
IMPORTANT: Flask will not find your website unless index.html is inside a folder named templates. Your folder structure should look like: Philosotron > templates > index.html.
Results
Breakable Arms
In the process of handling the 3D-printed servo arms, I broke quite a few, so you have been warned.
First Thought
On line 66 of the codeBaseClean.py file there is the following line:
return data[-1].get("thought", "")
This makes sure that if there is no previous thought stored, a default value is used, in this case an empty string (""). You can steer the thought process by setting this seed thought to whatever you like.
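For example, replacing the empty default with a short seed sentence gives the very first reflection a direction; the sentence below is just an illustration:
# On line 66 of codeBaseClean.py, swap the empty default for a seed of your choosing
return data[-1].get("thought", "All appliances dream of usefulness.")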
Webcam Buffer
On line 140 of the codeBaseClean.py file there is the following line:
for _ in range(60):
video.read()
Webcams often have a buffer that holds old images. We read the camera 60 times quickly to clear out the old frames so the Philosotron is looking at what is happening right now, not what happened 2 seconds ago.
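Depending on the webcam driver and OpenCV backend, you may also be able to shrink the buffer directly instead of flushing it; support for this property varies, so treat it as an optional experiment rather than a guaranteed fix:
import cv2

video = cv2.VideoCapture(0)
video.set(cv2.CAP_PROP_BUFFERSIZE, 1)   # ask the backend to keep only the newest frame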
VPN
The Raspberry Pi is only reachable on your local network by default, so you might need to turn off your VPN to access it.
Special Thanks
Thanks to Gemini for moral, code and proofreading support.
Thanks to my garden gnome wife for proofreading.