Global Awareness Engine: Unified Cross-Signal Intelligence

An octopus processes information from eight arms simultaneously — each arm sensing independently, yet contributing to a unified model of the environment. The Global Awareness Engine gives OctopusOS the same capability: a unified perception layer that consolidates all signal categories into a single awareness plane, enabling cross-signal correlation, influencer tracking, meme propagation analysis, and competitive positioning.

1. Why Unified Awareness Matters

OctopusOS already has deep subsystem-level awareness: Self-Awareness monitors internal health, Resource Governor watches the environment, ScreenOS tracks workspace activity, Internet Awareness monitors online mentions, and the Execution layer tracks human feedback. But each subsystem operates in its own silo.

The critical intelligence gap is cross-signal correlation: when a CPU spike (environment) coincides with a spike in negative mentions (internet) and increased task failures (human feedback), each subsystem reports its own anomaly — but no one connects the dots.

The Global Awareness Engine (GAE) sits above all subsystems as a read-only consumer, normalizing their outputs into a unified GlobalSignalEvent schema and performing cross-category analysis that no individual subsystem can achieve.

2. Five Perception Categories

Every signal in the OctopusOS universe is classified into one of five fundamental awareness categories. These categories map directly to existing subsystems while providing a unified vocabulary for cross-signal reasoning.

Internal

CPU/memory pressure from AwarenessReportFlywheel learning rate changesCapability evolution eventsTask queue depth anomalies

Environment

Resource Governor HostSnapshot dataDisk utilization thresholdsNetwork interface changesContainer/process health signals

Workspace

ScreenOS desktop eventsApplication focus changesUser interaction patternsInterface state transitions

Internet

InternetMention from 12+ sourcesTopic trend signalsCompetitive landscape changesSocial media sentiment shifts

Human

Task execution outcomesAuthorization decisionsUser satisfaction signalsEscalation and timeout events

3. The GlobalSignalEvent: A Universal Perception Schema

At the core of GAE is the GlobalSignalEvent — a frozen, immutable dataclass that provides a unified view of any event from any subsystem.

@dataclass(frozen=True)
class GlobalSignalEvent:
    event_id:        str                  # Unique identifier
    category:        AwarenessCategory    # internal | environment | workspace | internet | human
    source_kind:     str                  # Origin subsystem (self_awareness, resource_governor, etc.)
    ts_ms:           int                  # Millisecond timestamp
    summary:         str                  # Human-readable description
    severity:        EventSeverity        # info | warning | critical
    payload:         dict[str, Any]       # Structured event data
    signal_id:       str                  # Link to SignalOS SignalEnvelope
    correlation_id:  str                  # Cross-signal correlation chain
    metadata:        dict[str, Any]       # Extensible metadata

Design principles:

Category-first — every event is tagged with one of five categories, enabling cross-category queries
Severity-aware — automatic severity inference from source data (e.g., CPU > 90% = critical)
SignalOS integration — signal_id links to SignalOS SignalEnvelope for full causal tracing
Subsystem-agnostic — normalizers handle the translation from subsystem-specific formats

4. Signal Normalization Pipeline

GAE includes six normalizers that translate subsystem outputs into GlobalSignalEvent instances. Each normalizer is a pure function with no IO dependencies.

Six Signal Normalizers

Self-Awareness → Internal

AwarenessReport fields (CPU, memory, task metrics) → severity inferred from pressure levels. CPU > 90% = critical, memory > 80% = warning.

Resource Governor → Environment

HostSnapshot (CPU topology, memory, disk) → severity from resource utilization thresholds.

ScreenOS → Workspace

ScreenEvent (event kind, risk level, app context) → severity mapped from screen risk assessment.

Internet Awareness → Internet

InternetMention (source, score, sentiment) → severity by engagement score (>500 = warning, >1000 = critical).

Execution Feedback → Human

ExecutionOutcome (result, error type) → AUTH_FAILURE and TIMEOUT classified as critical.

SignalOS → Any Category

SignalEnvelope (domain, kind, priority) → category mapped from IO domain (host_io → environment, network_io → internet).

5. Influencer Detection

The Influencer Detection module identifies high-impact authors across platforms and monitors their activity. This goes beyond simple mention counting — it uses a weighted composite scoring formula that considers multiple dimensions of influence.

Influence Score Formula

The composite influence score (0–100) is computed using four weighted factors:

Factor	Weight	Description
Mention frequency	25%	Logarithmic scaling of total mentions
Total engagement	30%	Logarithmic scaling of cumulative scores
Sentiment impact	15%	Average sentiment × 100, absolute value
Platform diversity	30%	Unique platform count × 25, capped at 100

score = (
    0.25 × min(100, log2(1 + mentions) × 15)
  + 0.30 × min(100, log2(1 + engagement) × 10)
  + 0.15 × min(100, abs(sentiment) × 100)
  + 0.30 × min(100, platforms × 25)
)

Influencer Alerts

When an influencer exceeding the configured threshold (default: 50.0) publishes new content, GAE generates an InfluencerAlert with the author’s profile, the triggering mention, and a human-readable reason. These alerts are emitted as high-priority SignalEnvelope messages into SignalOS.

6. Meme Propagation Graph

Content doesn’t stay on one platform. A Hacker News post gets shared on Reddit, tweeted about, and forked on GitHub. The Meme Propagation module tracks how content spreads across platforms by building directed propagation graphs.

Three-Tier Matching

Cross-Platform Content Matching

Tier 1: URL Match

Exact URL deduplication across platforms. Same URL appearing on different sources = cross-platform spread. Confidence: 0.95

Tier 2: Content Fingerprint

SHA-256 hash of normalized title + URL path. Catches reposts with same content but different tracking parameters. Confidence: 0.80

Tier 3: Title Similarity

Jaccard word-overlap between titles. Catches derivative content and commentary. Confidence: proportional to similarity score.

Propagation Metrics

Each MemeGraph captures:

Origin — the earliest mention, identified as the propagation source
Propagation velocity — platforms reached per hour
Total reach — sum of engagement scores across all nodes
Platform count — number of distinct platforms the content spread to

7. Competitive Positioning

The Competitive Positioning module provides automated share-of-voice analysis, sentiment benchmarking, and trend divergence detection — the kind of competitive intelligence that typically requires dedicated market research tools.

For each topic (brand, product, competitor), GAE computes:

Mention share — what percentage of total mentions belong to this topic
Engagement share — weighted by platform engagement scores
Sentiment average — mean sentiment across all mentions
Trend direction — rising, stable, or declining

Sentiment Gap Analysis

The sentiment gap between your topic and each competitor is computed as a simple delta: primary_sentiment - competitor_sentiment. Positive gaps indicate sentiment advantage; negative gaps signal competitive vulnerability.

Trend Divergence Detection

When a competitor’s trend direction diverges from your own, GAE generates labeled warnings:

Divergence	Meaning
`RISK: competitor rising while primary declining`	Competitive threat — losing momentum
`WATCH: competitor rising while primary stable`	Emerging challenge — needs monitoring
`ADVANTAGE: competitor declining while primary rising`	Competitive opportunity — press the advantage

8. Global Awareness Snapshot

The awareness snapshot is the crown jewel of GAE — a periodic, holistic summary that correlates signals across all five categories and generates actionable insights.

Awareness Snapshot Assembly

Internal Events

Self-awareness signals

Environment Events

Resource & infrastructure

Workspace Events

Screen & interaction

Internet Events

Mentions & intelligence

Human Events

Execution feedback

Awareness Builder

Cross-signal correlation engine

Anomalies

Cross-category anomaly detection

Correlations

Temporal co-occurrence patterns

Recommendations

Actionable next steps

Cross-Signal Anomaly Detection

GAE detects four categories of cross-signal anomalies:

Environment + Internal stress — Resource pressure coinciding with internal issues suggests infrastructure-driven failures
Critical signal concentration — Many critical-severity events within a window indicate a developing incident
Internet spike — Sudden increase in internet mentions (>10 in a window) may indicate viral content or PR event
Human feedback failures — Elevated failure rate in execution outcomes signals workflow issues

Temporal Correlation

Events are bucketed into hourly windows and analyzed for co-occurrence. When multiple categories spike simultaneously, GAE reports the correlation — for example: “Environment and Internal events co-occurred in 3 hourly windows, suggesting infrastructure-driven internal issues.”

9. SignalOS Integration

GAE produces four types of SignalEnvelope messages that feed directly into the SignalOS bus:

Emitter	Signal Kind	Priority	Key Payload
`influencer_alert_to_signal()`	`gae_inf_{alert_id}`	high	Author name, score, reason
`competitive_report_to_signal()`	`gae_comp_{report_id}`	normal	SOV share, competitors, highlights
`meme_graph_to_signal()`	`gae_meme_{graph_id}`	normal	Platform count, velocity, reach
`awareness_snapshot_to_signal()`	`gae_snap_{snapshot_id}`	high (if anomalies)	Total events, categories, anomalies

This integration means GAE insights flow through SignalOS governance (risk grading, routing, replay) and eventually into the Flywheel learning loop — creating a closed intelligence cycle.

10. Architecture Overview

GAE Architecture Layers

L4: REST API + Dashboard

12 REST endpointsHTML dashboard at /ui/global-awarenessManual scan triggerConfig management

L3: Worker Mixin (Runtime)

Influencer scan (piggyback on IA cadence)Meme detection (same cadence)Competitive report (weekly)Awareness snapshot (hourly)

L2: Pure Domain Logic (6 Modules)

influencer_detector — scoring & alertsmeme_propagation — content fingerprint & graphcompetitive_positioning — SOV & divergencesignal_normalizer — 6 normalizersawareness_builder — cross-signal correlationsignal_emitter — SignalOS envelope generation

L1: Contracts + Port

15 frozen dataclasses in contracts/global_awareness.pyGlobalAwarenessPort protocol (14 methods)GAEConfig with tunable thresholds

Layer Placement Rules

L1 (Contracts + Port) — kernel/contracts/global_awareness.py and kernel/ports/global_awareness/interfaces.py. Pure type definitions and protocol.
L2 (Domain Logic) — kernel/domains/global_awareness/. Six pure-function modules with zero IO.
L3 (Runtime) — kernel/runtime/_wl_global_awareness.py. Worker Loop Mixin scheduling four periodic tasks.
L4 (API + Dashboard) — server/shared/adapters/http/_routes_global_awareness.py. REST API and HTML dashboard.

11. API Surface

GAE exposes twelve REST endpoints:

Endpoint	Method	Purpose
`/api/global-awareness/config`	GET	Current GAE configuration
`/api/global-awareness/config`	POST	Update thresholds and categories
`/api/global-awareness/influencers`	GET	Top influencer profiles with scores
`/api/global-awareness/influencers/alerts`	GET	Recent influencer activity alerts
`/api/global-awareness/memes`	GET	Meme propagation graph summaries
`/api/global-awareness/competitive`	GET	Competitive positioning reports
`/api/global-awareness/competitive/latest`	GET	Most recent competitive report
`/api/global-awareness/snapshots`	GET	Global awareness snapshots
`/api/global-awareness/snapshots/latest`	GET	Most recent snapshot
`/api/global-awareness/events`	GET	Global events with category filter
`/api/global-awareness/scan`	POST	Manual analysis trigger
`/ui/global-awareness`	GET	Interactive HTML dashboard

12. Implementation Metrics

Metric	Count
Frozen dataclasses in `contracts/global_awareness.py`	15
Pure domain modules in `domains/global_awareness/`	6
Signal normalizers (subsystem → GlobalSignalEvent)	6
Signal emitters (GAE output → SignalEnvelope)	4
REST API endpoints	12
Unit + integration tests	96
Lines of kernel code (contracts + domains)	~900
Gate violations introduced	0

13. Design Philosophy

GAE embodies three architectural principles:

1. Read-only consumption. GAE never modifies existing subsystems. It reads their outputs through normalizers, correlates across categories, and emits new signals — all without side effects on the source subsystems. This ensures that adding GAE cannot break any existing functionality.

2. Cross-signal intelligence over depth. Individual subsystems already provide deep domain-specific analysis. GAE’s value is in the connections between subsystems: detecting that a resource pressure spike (environment) correlates with mention sentiment decline (internet) and increased task failures (human). No single subsystem can see this pattern.

3. Progressive enrichment. GAE builds intelligence in layers: raw normalization, then cross-signal anomaly detection, then competitive positioning, then actionable recommendations. Each layer can operate independently, and each adds value even if the layers above it are disabled.

14. Application Scenarios

Real-Time Brand Intelligence

When a mention of OctopusOS goes viral on Hacker News (internet category), GAE tracks the propagation to Reddit, Twitter, and GitHub. The influencer detection module identifies which authors are driving the conversation, and the competitive positioning module shows how this compares to competitor mention patterns — all updated in real-time through the dashboard.

Incident Correlation

During a production incident, GAE correlates environment signals (CPU spike, disk pressure) with internal signals (task queue growth, learning rate drop) and human signals (increased failure rate) to build a unified incident timeline. The cross-signal anomaly detector flags the correlation before any single subsystem would escalate.

Competitive Early Warning

When a competitor’s mention velocity suddenly increases while yours stays flat, GAE’s trend divergence detector generates a WATCH or RISK alert. This early warning — powered by share-of-voice analysis across all monitored platforms — gives the team time to respond before the competitive gap widens.

Content Strategy Feedback Loop

GAE’s meme propagation graphs reveal which content resonates across platforms and which stays contained. By correlating propagation velocity with influencer engagement, the engine identifies which authors and platforms are most effective distribution channels — actionable intelligence for content strategy.