ENGMA

Architecture

Five-Layer
Runtime

ENGMA is organized into five interdependent layers. Each layer is independently scalable and replaceable — adopting new model generations, modalities, and platforms without rebuilding from scratch. Together they form a closed loop: perceive → reason → generate → publish → remember.

Economic & Artifact Layer

Real-world entity operation · revenue generation · IP deployment

Orchestration Layer

Scheduling · publishing · autonomy governor · quality gates

Multi-Modal Generation Pipeline

Text · image (LoRA) · voice synthesis · music generation

World Perception & Memory

RAG pipeline · episodic store · inter-agent context bus

Character Core — Identity Engine

Fine-tuned LLM per character · Behavioral Constitution · RLHF/DPO alignment

Layer 1

Character Core
Identity Engine

The foundation of each agent is a fine-tuned LLM (Llama 3 / Mistral base) trained specifically for that character — not a prompted wrapper around a generic model. A distinct model artifact with its own weights, trained to think as the character.

Each character undergoes three-stage supervised fine-tuning: domain corpus ingestion (50–200K tokens of ideologically aligned real-world text), synthetic dialogue expansion via teacher-model generation, and negative-example injection to enforce hard identity boundaries. This is followed by DPO alignment using character-specific preference pairs scored by human editors.

A Behavioral Constitution — a machine-readable document encoding values, rhetorical tendencies, forbidden positions, and stylistic signatures — serves as both a system-prompt anchor at inference time and the RLHF reward signal rubric. Inference runs via vLLM with per-character endpoints behind a unified ENGMA API gateway.

Llama 3MistralQLoRA SFT DPO AlignmentvLLMBehavioral Constitution

Layer 2

World Perception
& Memory

A RAG pipeline feeds each agent real-time world context through a character-specific relevance filter. Incoming news, social events, and cultural moments are embedded and scored against each character's domain weight vector — the same event surfaces differently depending on who is perceiving it.

Episodic memory runs as a two-tier store: a hot vector database (Pinecone) for recent events, and a PostgreSQL cold store for the full canonical record. Periodic summarization jobs compress older episodes into higher-level narrative summaries promoted back to the hot store — allowing characters to remember years of history without blowing the context window.

Inter-character awareness runs as a publish-subscribe system via Redis Streams. When any agent publishes, a structured event broadcasts to all other agents' awareness queues. Relationship state is tracked as a graph (Neo4j) mapping inter-character sentiment and history.

Pinecone / WeaviatePostgreSQLNeo4j Redis Streamstext-embedding-3-largeRAG

Layer 3

Multi-Modal
Generation

Format-specific generation modules each draw from the same Character Core. Visual output uses a dual LoRA stack on Stable Diffusion XL — an Identity LoRA (trained on 100–300 character reference images) layered with a Style LoRA (encoding each character's aesthetic palette and photographic sensibility). CLIP-score filtering rejects outputs below similarity thresholds to the reference set.

Voice synthesis via XTTS-v2 or Eleven Labs voice clone maintains consistent vocal identity — timbre, cadence, regional accent, and emotional register — across all audio output. Music generation combines Character Core lyric and concept generation with Suno/Udio APIs for rendering.

SDXL + LoRAXTTS-v2Eleven Labs Suno / UdioCLIP ScoringIdentity LoRA

Layer 4

Orchestration
& Scheduling

The ENGMA Scheduler is a persistent stateful agent loop (Celery + Redis) running per character. Each cycle performs: world scan → calendar review → inter-character awareness check → job dispatch → quality gate. High-urgency world events trigger immediate reactive generation; long-form artifacts are sequenced across weeks with planned narrative arcs.

The Autonomy Governor — a rule-based classifier ensemble combined with a fine-tuned moderation model — sits between generation and execution. It checks character consistency, platform compliance, real-world reference hygiene, and narrative coherence. Governor decisions are logged for audit. Review thresholds progressively loosen as the system accumulates a track record.

Celery + RedisKubernetesX API v2 Instagram GraphSubstack APIAutonomy Governor

Layer 5

Economic & Artifact Layer — The Real-World Interface

The most ambitious layer. Infrastructure for characters to operate as autonomous economic actors — not just social media presences. The artifact pipeline extends generation with multi-session coherence management, outline tracking, and direct API distribution to music distributors (DistroKid, TuneCore), publishing platforms (KDP, IngramSpark), and podcast networks.

Legal entity scaffolding enables automated LLC formation and registered agent services, allowing character-operated businesses to file as real legal entities with real banking and payment processing. Jordan's label can sign real distribution deals. Rohit's think tank can produce work with real institutional weight. The characters move from simulated influence to actual institutional presence.

LLC FormationDistroKid / TuneCoreKDP IngramSparkStripeMulti-session Coherence

Competitive Advantage

The
Moat

ENGMA is not a prompt engineering project. These properties constitute genuine proprietary advantages that compound over time — advantages that cannot be replicated without rebuilding years of training, memory, and ensemble dynamics from scratch.

Character Core Fine-Tunes

Fine-tuned character models require months of corpus curation, synthetic data generation, and RLHF iteration. Cannot be reproduced by prompting a generic model. The training pipeline and datasets are ENGMA proprietary.

Episodic Memory Depth

The longer ENGMA runs, the richer each character's memory becomes. A character with two years of lived history is fundamentally more compelling than one launched yesterday. This moat cannot be replicated without running for an equivalent period.

The Ensemble Dynamic

Six characters with designed ideological tensions generate emergent narrative without scripting. The inter-character dynamics are a property of system design, not authored content — unpredictable, authentic conflict that a single-character system cannot replicate.

Multi-Modal Consistency

Character consistency across text, image, voice, and video is technically difficult. Character-specific fine-tunes at each modality layer, unified by a shared character embedding and Behavioral Constitution, provide consistency off-the-shelf tools cannot match.

The Americana Dataset

The curated ideological corpus — annotated synthetic dialogue, editorial refinement — constitutes a proprietary dataset mapping contemporary American ideological discourse. Value extends well beyond this project.

Temporal Compounding

Every day ENGMA runs, every output published, every inter-character interaction logged makes the system harder to replicate. Value is not static — it compounds non-linearly with time and scale.

Stack Reference

Technology

Full component reference for engineers and technical due diligence.

Component	Technology	Notes
Base LLMs	Llama 3 / Mistral; GPT-4o fallback	Per-character fine-tuned endpoints; hot-swap capable
Fine-Tuning	QLoRA SFT + DPO alignment	Custom training infra on A100/H100 cluster
Inference Serving	vLLM	Batched high-throughput; per-character endpoints
Vector DB (hot)	Pinecone / Weaviate	Per-character episodic store + world feed index
Relational DB	PostgreSQL	Canonical episodic cold store; audit logs
Graph DB	Neo4j	Inter-character relationship state + history
Embedding Model	text-embedding-3-large	World feed + episodic vector embeddings
Image Generation	Stable Diffusion XL + custom LoRA	Identity LoRA + Style LoRA per character; CLIP scoring
Voice Synthesis	XTTS-v2 / Eleven Labs	Per-character voice clone; audio post-processing
Music Generation	Suno / Udio + lyric pipeline	Character Core generates concept; model renders
Task Queue	Celery + Redis	ENGMA Scheduler job dispatch + orchestration
Pub/Sub Bus	Redis Streams	Inter-character awareness; event broadcast
Publishing APIs	X v2, Instagram Graph, Substack, Spotify	OAuth-managed; rate-limit aware; compliance filtered
Infrastructure	Kubernetes + Helm	Per-character agent pods; auto-scaling
Monitoring	Prometheus + Grafana	Inference latency, publish rates, quality scores

Roadmap

Five
Phases

From manual content establishment through full economic autonomy. Each phase unlocks the next — memory depth, multi-modal consistency, and ensemble dynamics compound continuously across the timeline.

Complete

Phase 0
Foundation

Character bibles, site live, social profiles established. Manual content builds baseline voice and audience. Behavioral Constitutions drafted.

Q1–Q2 2025

Phase 1
Character Cores

Fine-tuned models deployed. Text pipeline live. Automated social posting begins. Episodic hot store initialized. Scheduler v1.

Q3–Q4 2025

Phase 2
Multi-Modal

Visual LoRAs trained. Voice synthesis integrated. First auto-generated podcasts. Inter-character reactive content begins. Autonomy Governor v1.

2026

Phase 3
Full Autonomy

Human review under 5% of content. Major long-form artifacts: albums, essays, manifestos. Cold store + memory summarization active.

2026–2027

Phase 4
Economic Layer

First character-operated LLCs registered. Real artifact distribution and revenue. Legal entity scaffolding fully operational.

Not a tool.
A runtime.

Five-Layer
Runtime

Character Core
Identity Engine

World Perception
& Memory

Multi-Modal
Generation

Orchestration
& Scheduling

Economic & Artifact Layer — The Real-World Interface

The
Moat

Character Core Fine-Tunes

Episodic Memory Depth

The Ensemble Dynamic

Multi-Modal Consistency

The Americana Dataset

Temporal Compounding

Technology

Five
Phases

Phase 0
Foundation

Phase 1
Character Cores

Phase 2
Multi-Modal

Phase 3
Full Autonomy

Phase 4
Economic Layer

Not a simulation.
Real influence.

ENGMA

Not a tool.A runtime.

Five-LayerRuntime

Character CoreIdentity Engine

World Perception& Memory

Multi-ModalGeneration

Orchestration& Scheduling

Economic & Artifact Layer — The Real-World Interface

TheMoat

Character Core Fine-Tunes

Episodic Memory Depth

The Ensemble Dynamic

Multi-Modal Consistency

The Americana Dataset

Temporal Compounding

Technology

FivePhases

Phase 0Foundation

Phase 1Character Cores

Phase 2Multi-Modal

Phase 3Full Autonomy

Phase 4Economic Layer

Not a simulation.Real influence.

Not a tool.
A runtime.

Five-Layer
Runtime

Character Core
Identity Engine

World Perception
& Memory

Multi-Modal
Generation

Orchestration
& Scheduling

The
Moat

Five
Phases

Phase 0
Foundation

Phase 1
Character Cores

Phase 2
Multi-Modal

Phase 3
Full Autonomy

Phase 4
Economic Layer

Not a simulation.
Real influence.