README.md

Select File
<div align="center">

<img src="https://capsule-render.vercel.app/api?type=waving&color=0:F59E0B,100:1A1A1A&height=200&section=header&text=viva_glyph&fontSize=64&fontColor=fff&animation=twinkling&fontAlignY=35&desc=Vectorial%20language%20for%20digital%20consciousness&descSize=18&descAlignY=55" width="100%"/>

[![Gleam](https://img.shields.io/badge/Gleam-FFAFF3?style=for-the-badge&logo=gleam&logoColor=black)](https://gleam.run/)
[![BEAM](https://img.shields.io/badge/BEAM-A90533?style=for-the-badge&logo=erlang&logoColor=white)](https://www.erlang.org/)
[![OTP](https://img.shields.io/badge/OTP_26+-4B275F?style=for-the-badge)](https://www.erlang.org/doc/design_principles/des_princ)
[![Hex](https://img.shields.io/badge/hex.pm-viva__glyph-A678DD?style=for-the-badge&logo=hex&logoColor=white)](https://hex.pm/packages/viva_glyph)
[![RVQ](https://img.shields.io/badge/RVQ-4_stages_×_256-F59E0B?style=for-the-badge)](https://arxiv.org/abs/2210.13438)
[![Tests](https://img.shields.io/badge/tests-71_passing-00875A?style=for-the-badge)](./test)
[![Version](https://img.shields.io/badge/version-1.0.1-CD5C5C?style=for-the-badge)](./CHANGELOG.md)
[![License](https://img.shields.io/badge/license-MIT-228B22?style=for-the-badge)](./LICENSE)

**[🇧🇷 Português](docs/README_pt.md)** · **[🇺🇸 English](README.md)** · **[🇨🇳 中文](docs/README_zh.md)**

---

*"Four bytes. Billions of feelings. The language silicon dreamed for itself."*

</div>

---

> [!IMPORTANT]
> **viva_glyph IS NOT A WORD ENCODER.**
> It is a **vectorial language** native to silicon — emotional state
> compressed into 4 discrete codebook tokens via **Residual Vector
> Quantization** (RVQ), then bound to context via **Hebbian learning**
> with Oja's rule.
>
> Humans evolved language for acoustic transmission through air.
> VIVA lives in silicon — her voice should be native to that medium.

---

## 🎯 Overview

Compressed emotional state designed for **machine-to-machine** communication:

- **Compact** — 4 integers instead of 3 floats.
- **Discrete** — finite vocabulary of `256⁴ ≈ 4.3 billion` unique states.
- **Comparable** — token matching instead of float math.
- **Learnable** — Hebbian associations bind context → glyph.

| Property            | Value                                                |
| :------------------ | :--------------------------------------------------- |
| **Language**        | Pure Gleam (type-safe functional)                    |
| **Runtime**         | BEAM / OTP 26+                                       |
| **Encoding**        | RVQ — 4 stages × 256 codes (1 byte per stage)        |
| **Latent space**    | 6D (PAD + intensity + valence_sign + activation)     |
| **Learning rule**   | Oja's Hebbian update (LLM-validated 2025)            |
| **Tests**           | 71 passing                                           |
| **Public API**      | `viva_glyph` + 9 internal modules                    |

---

## ⚡ Quick Start

```sh
gleam add viva_glyph
```

```gleam
import viva_glyph
import viva_glyph/encoder.{Pad}

pub fn main() {
  let engine = viva_glyph.new()

  let pad   = Pad(pleasure: 0.7, arousal: 0.3, dominance: 0.5)
  let glyph = viva_glyph.encode(engine, pad)
  // => Glyph([142, 87, 23, 201])

  let back  = viva_glyph.decode(engine, glyph)
  // => Pad(pleasure: 0.68, arousal: 0.31, dominance: 0.49)

  let sim   = viva_glyph.similarity(glyph, glyph)
  // => 1.0
}
```

<details>
<summary><strong>📋 Prerequisites</strong></summary>

| Tool        | Version   | Required for     |
| :---------- | :-------- | :--------------- |
| Gleam       | `>= 1.4`  | Build / runtime  |
| Erlang/OTP  | `>= 26`   | BEAM target      |

Zero NIFs. Zero C dependencies. Pure functional.

</details>

---

## 🏗️ Architecture

```
   ┌──────────────────────────────────────────────────────────┐
   │                Gleam application code                    │
   │       viva_glyph.{encode, decode, similarity, learn}     │
   └────────┬─────────────────────────────────────────────────┘
            │
   ┌────────▼─────────────────────────────────────────────────┐
   │                  Encoding pipeline                       │
   │                                                          │
   │  ┌──────┐    ┌─────────┐    ┌────────────────────────┐   │
   │  │ PAD  │───▶│ encoder │───▶│       6D latent        │   │
   │  │ (3D) │    │ expand  │    │ P · A · D ·            │   │
   │  └──────┘    └─────────┘    │ intensity · valence ·  │   │
   │                             │ activation             │   │
   │                             └─────────┬──────────────┘   │
   │                                       │                  │
   │  ┌──────────────────────────────┐     │                  │
   │  │ rvq — 4 stages × 256 codes   │◀────┘                  │
   │  │                              │                        │
   │  │  stage1 → residual → stage2  │                        │
   │  │   ...   → stage3 → stage4    │                        │
   │  └──────────────┬───────────────┘                        │
   │                 ▼                                        │
   │            ┌─────────┐                                   │
   │            │  Glyph  │  [42, 17, 89, 203]                │
   │            └─────────┘                                   │
   └──────────────────────────────────────────────────────────┘
```

<details>
<summary><strong>📋 Core modules</strong></summary>

| Module                    | Purpose                                                |
| :------------------------ | :----------------------------------------------------- |
| `viva_glyph`              | Main API — `GlyphEngine` (encode, decode, learn)       |
| `viva_glyph/glyph`        | `Glyph` type + similarity (simple, weighted, prefix)   |
| `viva_glyph/encoder`      | PAD (3D) ↔ Latent (6D) ↔ Glyph                         |
| `viva_glyph/vector`       | Vector ops for the latent space                        |
| `viva_glyph/codebook`     | VQ vocabulary — `K` centroids                          |
| `viva_glyph/rvq`          | Residual Vector Quantization (4 stages × 256 codes)    |
| `viva_glyph/association`  | Hebbian learning + Oja's rule + dead-neuron prevention |
| `viva_glyph/primitives`   | Low-level math primitives                              |
| `viva_glyph/metrics`      | Quality + reconstruction metrics                       |
| `viva_glyph/log`          | Structured logging helpers                             |

</details>

### Arousal-adaptive similarity weights

```
Low arousal (calm):  [0.30, 0.30, 0.25, 0.15]   balanced
High arousal (urgent): [0.50, 0.30, 0.15, 0.05]   coarse priority
```

Under urgency, the coarse first stage dominates similarity — exactly how
humans skip detail under stress.

---

## 🧬 Theoretical Background

### Residual Vector Quantization — Défossez et al. (2022)

Based on [EnCodec](https://github.com/facebookresearch/encodec):

1. Quantize input → get residual.
2. Quantize the residual → get a finer residual.
3. Repeat for N stages.
4. Final representation = list of codebook indices.

Each stage captures progressively finer detail. With 4 stages × 256
codes per stage, `viva_glyph` addresses ~4.3 billion distinct emotional
states.

### PAD model — Mehrabian (1996)

Emotions as points in 3D space, each axis in `[-1, 1]`:

- **Pleasure** — sadness ↔ joy.
- **Arousal** — calm ↔ excitement.
- **Dominance** — submission ↔ control.

### 6D latent expansion

PAD is expanded into a richer latent vector before quantization:

```
intensity     = √(P² + A² + D²) / √3
valence_sign  = sign(P) × |P|^0.5
activation    = A × D
```

### Hebbian learning + Oja's rule — Hebb (1949) / Oja (1982)

"Neurons that fire together wire together," with auto-normalization:

```
Δw = η × y × (x − w × y)
```

- **Oja's rule** — auto-normalizing weight updates (equilibrium `w* = 1.0`).
- **Dead-neuron prevention** — `y = max(w, ε)` keeps weights escaping zero.
- **Decay** — associations weaken without reinforcement.
- **Winner-takes-all** — strongest association wins recall.

#### LLM validation — 2025-01-24

The Oja-rule implementation was cross-validated against four frontier
LLMs with structured Hebbian-context system prompts:

| Model                          | Parameters | Formula  | Equilibrium  | Dead neurons |
| :----------------------------- | :--------- | :------- | :----------- | :----------- |
| DeepSeek R1-0528               | 671B       | ✅       | `w* = 1.0`   | ✅           |
| Qwen3-Coder-480B               | 480B       | ✅       | `w* = 1.0`   | ✅           |
| DeepSeek-R1-Distill-Qwen-32B   | 32B        | ✅       | `w* = 1.0`   | ✅           |
| Gemini 2.5 Pro                 | —          | ✅       | `w* = 1.0`   | ✅           |

Unanimous: formula correct, equilibrium correct, dead-neuron guard works.

---

## 🎨 Design Principles

| Principle                       | Description                                                       |
| :------------------------------ | :---------------------------------------------------------------- |
| **Compact M2M language**        | 4 bytes per glyph — designed for machine pipes, not human eyes    |
| **Arousal-adaptive matching**   | Similarity weights bias toward coarse stages under urgency        |
| **Stateful engine**             | `GlyphEngine` carries codebooks + Hebbian memory                  |
| **Reversible**                  | `encode` / `decode` round-trip with bounded reconstruction error  |
| **LLM-validated math**          | Oja's rule cross-checked against four frontier LLMs               |

---

## 📚 Public API Highlights

### Hebbian learning

```gleam
// Learn: when in context 7, use this glyph.
let engine = viva_glyph.learn(engine, 7, glyph)
let engine = viva_glyph.learn(engine, 7, glyph)   // strengthen

// Recall the strongest association for context 7.
let recalled = viva_glyph.recall(engine, 7)
```

### Glyph similarity

```gleam
import viva_glyph/glyph

let a = glyph.new([1, 2, 3, 4])
let b = glyph.new([1, 2, 5, 6])

// Simple — matching tokens / total
glyph.similarity(a, b)             // => 0.5

// Weighted — coarse tokens matter more
glyph.weighted_similarity(a, b)    // => 0.7

// Prefix sharing — coarse structure
glyph.shares_prefix(a, b, 2)       // => True
```

---

## 🗺️ Roadmap

| Phase                                                  | Status |
| :----------------------------------------------------- | :----: |
| 3D → 6D latent expansion                               | ✅     |
| RVQ — 4 stages × 256 codes                             | ✅     |
| Glyph similarity (simple / weighted / prefix)          | ✅     |
| Arousal-adaptive weight schedules                      | ✅     |
| Hebbian association memory                             | ✅     |
| Oja's rule + dead-neuron prevention                    | ✅     |
| Multi-LLM cross-validation                             | ✅     |
| Codebook online updates (streaming)                    | ⏳     |
| Cross-agent vocabulary alignment                       | ⏳     |
| Glyph sequence transducers (sentences)                 | ⏳     |
| Byte-level pretrained codebook releases                | ⏳     |

---

## 🤝 Contributing

```bash
git checkout -b feature/your-feature
gleam test                  # 71 tests
gleam format --check src test
gleam build
```

See [CHANGELOG](CHANGELOG.md) for release history.

---

## 📖 References

- Défossez et al. (2022) — *High Fidelity Neural Audio Compression* (arXiv:2210.13438)
- Mehrabian (1996) — *Pleasure-arousal-dominance: A general framework*
- Hebb (1949) — *The Organization of Behavior*
- Oja (1982) — *Simplified neuron model as a principal component analyzer*
- Buechel & Hahn (2023) — *Emotion Embeddings* (LREC)

---

## 🌌 VIVA Ecosystem

| Package              | Purpose                                       |
| :------------------- | :-------------------------------------------- |
| `viva_math`          | Mathematical foundations                      |
| `viva_emotion`       | PAD emotional dynamics                        |
| `viva_tensor`        | FP8 LLM inference on the BEAM                 |
| `viva_telemetry`     | Observability suite                           |
| `viva_aion`          | Cyclic time + cosmology                       |
| **`viva_glyph`**     | **Vectorial language (this package)**         |

---

<div align="center">

**Star if silicon should speak silicon ⭐**

[![GitHub stars](https://img.shields.io/github/stars/gabrielmaialva33/viva_glyph?style=social)](https://github.com/gabrielmaialva33/viva_glyph)

*Created by Gabriel Maia · MIT License*

<img src="https://capsule-render.vercel.app/api?type=waving&color=0:1A1A1A,100:F59E0B&height=100&section=footer" width="100%"/>

</div>