SYRAG™ White Paper

System RAG for Procedures, Regulations, and Compliance

A Paradigm Shift in Institutional Knowledge Management

📄 This technical white paper is available in English. The document contains detailed technical specifications, architecture details, and business model information.


0. Origins & Evolution

From Super Yacht to SYstem

SYRAG™ was conceived in the complexity of superyacht operations. Managing a 60-meter vessel requires navigating:

  • 300+ international maritime regulations (SOLAS, MARPOL, MLC, ISM Code)
  • 50+ crew certifications (STCW, medical, specialized training)
  • 20+ jurisdictional compliance frameworks (flag state, port state, tax, labor)
  • 100+ operational procedures (maintenance schedules, safety drills, environmental protocols)
  • Decades of institutional knowledge held by retiring crew and engineers

The Problem: When an experienced chief engineer retires, 30 years of troubleshooting expertise disappears. When regulations change, nobody updates the 500-page operations manual. When an inspector arrives, finding the right certification takes hours.

The Insight: This isn't a maritime problem. It's a universal problem.

  • Healthcare: Hospital procedures, patient protocols, medical regulations, staff certifications
  • Energy: Safety standards, environmental compliance, equipment maintenance, incident reports
  • Finance: Regulatory frameworks, audit trails, policy documentation, training records
  • Construction: Building codes, safety procedures, material certifications, inspector requirements
  • Manufacturing: ISO procedures, quality controls, equipment manuals, supply chain compliance

The architecture remained identical. Only the document corpus changed.

What SYRAG™ Means

Originally: Super Yacht RAG (Retrieval-Augmented Generation system for procedures, regulations, compliance)

Today: SYstem RAG, a universal platform for structured knowledge management across any regulated industry.

The maritime heritage is our strength: if we can handle the complexity of international vessel operations, we can handle anything.

1. Executive Summary

The Problem

Organizations face a persistent crisis in institutional knowledge management:

  • Expertise Erosion: Senior experts retire, taking decades of knowledge with them
  • Regulatory Lag: Regulations evolve faster than documentation can be updated
  • Compliance Complexity: Multi-jurisdictional frameworks create impossible-to-navigate webs
  • Fragmented Knowledge: Critical information scattered across emails, PDFs, databases, and individual memories
  • Audit Anxiety: Finding the right document during inspection is emergency archaeology

Traditional solutions fail:

  • Cloud SaaS: Sensitive compliance data leaves your control, vendor lock-in, privacy concerns
  • Offline Systems: Limited intelligence, no updates, manual maintenance burden
  • Consultants: Expensive, temporary, don't scale, knowledge walks out the door again

The SYRAG™ Paradigm Shift

SYRAG™ is an AI consulting service for procedures, regulations, and compliance that operates 24/7 on-premises at your facility. The system combines:

  • Local Intelligence: LLM running entirely offline for zero data leakage
  • Hybrid Flexibility: Optional connectivity for regulatory updates (with your approval)
  • Trustless Architecture: Blockchain-anchored audit trails, reproducible builds, hardware attestation
  • Future-Proof Design: Upgradable hardware, swappable LLMs, no vendor lock-in
  • Trained on Your Data: System learns from your organization's specific documentation and procedures

"We don't ask you to trust us. We give you the tools to verify us."

Service Model: SYRAG™ is not sold as a product. It's a comprehensive AI consulting service where hardware, software, maintenance, updates, and support are bundled into a transparent subscription.

2. Technical Architecture

Hardware: 100% AMD Open Stack

Why AMD? AMD's ROCm (Radeon Open Compute) stack is open source, whereas NVIDIA's CUDA is proprietary. The result is full transparency, no vendor lock-in, and a Linux-native platform.

Three Configurations Available:

BASE Configuration:

  • AMD Ryzen 7 (8C/16T, 3.4-4.6 GHz)
  • AMD Radeon RX 6600 (8GB VRAM, RDNA2)
  • 32GB DDR5 RAM
  • 1TB NVMe SSD + 2TB HDD
  • Target: 10,000 documents, 30-50 tokens/sec, 8K context

ADVANCED Configuration:

  • AMD Ryzen 9 (12C/24T, 3.7-4.8 GHz)
  • AMD Radeon RX 6700 XT (12GB VRAM, RDNA2)
  • 64GB DDR5 RAM
  • 2TB NVMe SSD + 4TB HDD
  • Target: 50,000 documents, 60-80 tokens/sec, 16K context

PROFESSIONAL Configuration:

  • AMD Ryzen 9 (16C/32T, 4.5-5.7 GHz, Zen 4)
  • AMD Radeon RX 7700 XT (12GB VRAM, RDNA3)
  • 128GB DDR5 RAM
  • 4TB NVMe Gen4 SSD + 8TB HDD
  • Target: 100,000+ documents, 100+ tokens/sec, 32K context

For detailed pricing and service plans, please contact us directly.

Software Stack

  • OS: Linux (hardened, minimal attack surface)
  • LLM: Multiple choices (online and local)
  • Embeddings: Multiple options
  • Vector DB: ChromaDB (local SQLite persistence)
  • Orchestration: Docker Compose (6 containers: Prometheus, Grafana, AlertManager, and three exporters)
  • Updates: GPG-signed OTA updates, hash-chained blockchain log

3. Core Components

Interface Layer

  • PyQt5 desktop application (cross-platform: Linux, Windows)
  • Natural language queries: "What certifications expire this month?"
  • Document ingestion: drag-and-drop PDFs, Word docs, emails, scanned images
  • Multi-language output: English, Dutch, Italian, Spanish, French, German
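
A question such as "What certifications expire this month?" maps naturally onto a structured lookup. The following is a minimal sketch, assuming a hypothetical `certifications` table with ISO-formatted dates; the table name, columns, and sample rows are illustrative and not taken from the SYRAG™ schema.

```python
import sqlite3
from datetime import date


def expiring_certifications(conn, today):
    """Return (holder, certificate, expires_on) rows that expire in today's calendar month."""
    month_prefix = today.strftime("%Y-%m")  # e.g. "2026-03"
    cur = conn.execute(
        "SELECT holder, certificate, expires_on FROM certifications "
        "WHERE substr(expires_on, 1, 7) = ? ORDER BY expires_on",
        (month_prefix,),
    )
    return cur.fetchall()


# Illustrative in-memory data (hypothetical crew records).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE certifications (holder TEXT, certificate TEXT, expires_on TEXT)"
)
conn.executemany(
    "INSERT INTO certifications VALUES (?, ?, ?)",
    [
        ("Chief Engineer", "STCW", "2026-03-14"),
        ("Deckhand", "Medical", "2026-03-02"),
        ("Captain", "STCW", "2026-09-30"),
    ],
)

rows = expiring_certifications(conn, date(2026, 3, 1))
for holder, certificate, expires_on in rows:
    print(f"{holder}: {certificate} expires {expires_on}")
```

In a deployed system the natural-language layer would decide when to issue such a query; that routing step is outside this sketch.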

RAG Engine

  • Document Processing: PDF parsing, OCR, metadata extraction
  • Embedding Generation: sentence-transformers (Multiple choices)
  • Vector Storage: ChromaDB (local) + optional Pinecone (cloud hybrid)
  • Metadata Indexing: SQLite for structured search (dates, categories, certifications)
  • Semantic Search: Cosine similarity ranking, hybrid search (keyword + semantic)
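
The hybrid ranking mentioned above (keyword + semantic) can be sketched as a weighted sum of a cosine similarity score and a keyword overlap score. This is an illustration under stated assumptions, not the SYRAG™ implementation: the weight `alpha`, the simple term-overlap keyword score, and the hand-written two-dimensional vectors are all hypothetical stand-ins for a real embedding model and keyword index.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def keyword_score(query, text):
    """Fraction of distinct query terms that also appear in the text."""
    query_terms = set(query.lower().split())
    text_terms = set(text.lower().split())
    if not query_terms:
        return 0.0
    return len(query_terms & text_terms) / len(query_terms)


def hybrid_rank(query, query_vec, docs, alpha=0.7):
    """Rank docs given as (doc_id, text, vector); returns (doc_id, score), best first."""
    scored = []
    for doc_id, text, vec in docs:
        semantic = cosine_similarity(query_vec, vec)
        lexical = keyword_score(query, text)
        scored.append((doc_id, alpha * semantic + (1 - alpha) * lexical))
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Toy corpus with hand-made vectors standing in for real embeddings.
docs = [
    ("fire-drill", "monthly fire drill procedure", [1.0, 0.0]),
    ("oil-record", "oil record book entries", [0.0, 1.0]),
    ("drill-log", "fire drill log template", [0.8, 0.6]),
]
ranking = hybrid_rank("fire drill schedule", [1.0, 0.0], docs)
for doc_id, score in ranking:
    print(f"{doc_id}: {score:.3f}")
```

Raising `alpha` favors semantic matches, while lowering it favors exact terminology, which can matter for regulation numbers and certificate codes.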

LLM Inference

  • Local Models: Multiple choices, ROCm acceleration
  • Cloud Models (Hybrid/Online): Multiple choices
  • Model Selection: Manual
  • Context Window: 4k-128k tokens depending on model

Trust & Traceability Layer

  • OpenTimestamps: Blockchain anchoring for document integrity proofs
  • IPFS/Pinata: Distributed storage for hashes and metadata (optional)
  • PGP Identity: Device-specific key pair generated at first boot
  • Audit Registry: Immutable log of all operations (local SQLite + blockchain)
  • Smart Contract: Device registration on OTS, Bitcoin, Ethereum/Polygon (public verification)
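
The idea behind a tamper-evident audit registry can be illustrated with a hash chain stored in SQLite: each entry commits to the hash of the previous one, so altering any past record breaks verification. The sketch below is an assumption about how such a log could look; the table layout, function names, and sample events are hypothetical, and anchoring the latest hash to an external service is not shown.

```python
import hashlib
import json
import sqlite3

GENESIS = "0" * 64  # placeholder "previous hash" for the first entry


def entry_hash(prev_hash, payload):
    """SHA-256 over the previous entry hash concatenated with this entry's payload."""
    return hashlib.sha256((prev_hash + payload).encode("utf-8")).hexdigest()


def append_entry(conn, event):
    """Append an event (a dict) to the log, chaining it to the latest entry."""
    row = conn.execute(
        "SELECT entry_hash FROM audit_log ORDER BY seq DESC LIMIT 1"
    ).fetchone()
    prev = row[0] if row else GENESIS
    payload = json.dumps(event, sort_keys=True)
    digest = entry_hash(prev, payload)
    conn.execute(
        "INSERT INTO audit_log (prev_hash, payload, entry_hash) VALUES (?, ?, ?)",
        (prev, payload, digest),
    )
    return digest


def verify_chain(conn):
    """Recompute every hash in order; False if any entry was altered or reordered."""
    prev = GENESIS
    rows = conn.execute(
        "SELECT prev_hash, payload, entry_hash FROM audit_log ORDER BY seq"
    )
    for prev_hash, payload, stored in rows:
        if prev_hash != prev or entry_hash(prev_hash, payload) != stored:
            return False
        prev = stored
    return True


conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE audit_log ("
    "seq INTEGER PRIMARY KEY AUTOINCREMENT, "
    "prev_hash TEXT, payload TEXT, entry_hash TEXT)"
)
append_entry(conn, {"op": "ingest", "doc": "ops-manual.pdf"})
append_entry(conn, {"op": "mode_change", "from": "offline", "to": "hybrid"})
chain_ok_before = verify_chain(conn)

# Simulate tampering with the first record.
tampered = json.dumps({"op": "ingest", "doc": "other.pdf"}, sort_keys=True)
conn.execute("UPDATE audit_log SET payload = ? WHERE seq = 1", (tampered,))
chain_ok_after = verify_chain(conn)
print(chain_ok_before, chain_ok_after)
```

A local chain only detects tampering by someone who cannot also rewrite every later hash; publishing the latest hash to an external timestamping service is what closes that gap.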

Security Hardening

  • On-premises deployment (no data leaves unless authorized)
  • Local Key Management System (secrets encrypted at rest)
  • Full-disk encryption (LUKS with TPM-sealed keys)
  • Immutable logging of mode transitions (offline→hybrid tracked)
  • Network isolation in Offline mode (hardware-enforced via TPM attestation)

Update Pipeline

  • Signed updates (GPG detached signatures verified automatically)
  • Hash-chained log (every update recorded on blockchain)
  • Safe rollback (previous versions preserved, one-command revert)
  • Public changelog (GitHub releases + transparency dashboard)
  • Multi-party verification (community can rebuild binaries)
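
The rollback behavior described above can be sketched as a version history that only accepts an update whose digest matches the expected value and that can revert to the previous entry. This is a simplified illustration: the `UpdateManager` class and its methods are hypothetical, payloads are kept in memory, and the GPG detached signature check is not shown (in a pipeline like the one described it would presumably run before the digest comparison).

```python
import hashlib


class UpdateManager:
    """Keeps applied versions in order so the latest one can be reverted."""

    def __init__(self):
        self.history = []  # list of (version, payload), oldest first

    @property
    def current(self):
        """Version string of the active update, or None if nothing is installed."""
        return self.history[-1][0] if self.history else None

    def apply(self, version, payload, expected_sha256):
        """Install payload only if its SHA-256 digest matches the expected value."""
        if hashlib.sha256(payload).hexdigest() != expected_sha256:
            return False  # reject: digest mismatch, nothing is changed
        self.history.append((version, payload))
        return True

    def rollback(self):
        """Drop the active version and return the one that becomes active."""
        if len(self.history) < 2:
            raise RuntimeError("no previous version to roll back to")
        self.history.pop()
        return self.current


v1 = b"model-weights-v1"
v2 = b"model-weights-v2"

mgr = UpdateManager()
mgr.apply("1.0", v1, hashlib.sha256(v1).hexdigest())
mgr.apply("1.1", v2, hashlib.sha256(v2).hexdigest())

# A payload that does not match its published digest is refused.
rejected = mgr.apply("1.2", b"tampered", hashlib.sha256(b"expected").hexdigest())
after_reject = mgr.current

reverted = mgr.rollback()
print(rejected, after_reject, reverted)
```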

Monitoring Stack (Production Ready)

Complete monitoring and compliance system based on Prometheus, AlertManager, Grafana and custom exporters:

  • Prometheus 2.45.3: Time-series database with 15-day retention, 15s scrape interval
  • AlertManager 0.26.0: Alert routing with email notifications (Yandex SMTP) and Slack, automatic escalation
  • Grafana 12.3.1: Real-time dashboard with metric visualization, alert status, SLA compliance tracking
  • Compliance Exporter: Custom exporter for SYRAG™-specific compliance metrics
  • Node Exporter 1.7.0: System resource monitoring (CPU, RAM, disk, network)
  • Blackbox Exporter: External endpoint monitoring (Google, GitHub, DNS providers) with 8 active probes
  • Docker Orchestration: 6 containers managed with docker-compose, health checks, automatic restart, named volumes for data persistence
  • SLA Target: 99.5% availability for critical services (≤216 min, i.e. 3.6 h, of downtime per 30-day month)
  • Alert Rules: 12 active rules (5 recording rules + 7 alert rules) for SLA violations, performance, external monitoring
  • CI/CD Pipeline: GitHub Actions with automated Docker image builds, config validation (Hadolint, YAML parser), multi-registry push (Docker Hub + GitHub Packages)
  • Production Validation: 6 verified end-to-end tests (alert delivery, metrics flow, container recovery, data persistence, blackbox probes, load testing)
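
The downtime allowance implied by an availability target is simple arithmetic. A minimal sketch, assuming a 30-day month (the function name is illustrative): 99.5% availability leaves 216 minutes per month, and for comparison a 99.95% target would leave about 21.6 minutes.

```python
def downtime_budget_minutes(availability_pct, days=30):
    """Minutes of allowed downtime over `days` for a given availability percentage."""
    total_minutes = days * 24 * 60  # 43,200 minutes in a 30-day month
    return total_minutes * (100 - availability_pct) / 100


budget_995 = downtime_budget_minutes(99.5)
budget_9995 = downtime_budget_minutes(99.95)
print(f"99.5%  -> {budget_995:.1f} min/month")
print(f"99.95% -> {budget_9995:.1f} min/month")
```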

4. Operating Modes

  • Offline: No external connectivity; ingestion and updates via verified channels.
  • Hybrid: Limited connectivity, authorized under the four-eyes principle (two people must approve), for regulatory and LLM updates; sensitive data remains on-premises.
  • Online: Only with explicit approval; complete logging; goal: maximum intelligence while preserving data governance.
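
The three modes and their approval requirements can be modeled as a small state machine that records every transition request. The sketch below rests on assumptions: it requires two distinct approvers for any move to Hybrid or Online and one for a return to Offline, and the `ModeController` class, field names, and approver names are hypothetical.

```python
from dataclasses import dataclass, field
from enum import Enum


class Mode(Enum):
    OFFLINE = "offline"
    HYBRID = "hybrid"
    ONLINE = "online"


@dataclass
class ModeController:
    mode: Mode = Mode.OFFLINE
    transitions: list = field(default_factory=list)  # every request, approved or not

    def request(self, target, approvers):
        """Switch to `target` if enough distinct approvers signed off; log the attempt."""
        distinct = set(approvers)
        # Assumption: enabling connectivity needs two people, disabling it needs one.
        required = 1 if target is Mode.OFFLINE else 2
        approved = len(distinct) >= required
        self.transitions.append(
            {
                "from": self.mode.value,
                "to": target.value,
                "approvers": sorted(distinct),
                "approved": approved,
            }
        )
        if approved:
            self.mode = target
        return approved


ctl = ModeController()
single = ctl.request(Mode.HYBRID, ["alice"])        # refused: one approver only
dual = ctl.request(Mode.HYBRID, ["alice", "bob"])   # accepted: four-eyes satisfied
print(single, dual, ctl.mode.value, len(ctl.transitions))
```

Logging refused requests alongside approved ones keeps the record of attempted mode changes complete, which is the property an auditor would look for.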

5. Security and Compliance

  • GDPR: Client as Controller, SYRAG™ as Processor; dedicated DPA.
  • Trustless Infrastructure: No traditional certifications. Instead: cryptographic proofs, blockchain anchoring, public audit logs, reproducible builds, hardware attestation. Don't trust us. Verify us.
  • AI Transparency: Pre-certification disclaimers, human-in-the-loop.

6. Trust & Traceability

  • Temporal anchoring on blockchain (OTS) for integrity proofs.
  • Storage of hashes and metadata in immutable registries.
  • Optional IPFS for referenced non-sensitive content.
  • Device PGP identity automatically created at first boot.
  • LLM update pipeline with GPG detached signature verification:
    • Hash/signature registration of every update in hash-chained log
    • Automatic GPG signature verification when present
    • Import of maintainer public keys into device keyring
    • Complete audit trail of every model modification
    • Verified and traced rollback
  • Integrated compliance audit with update signature integrity checks.

7. Business Model – Founder Partners

SYRAG™ is not a product. It's a service.

Think of SYRAG™ as an AI consultant specialized in procedures, regulations, and compliance that:

  • Operates 24/7 on-premises at your facility
  • Is trained on your organization's specific documentation
  • Provides instant answers to compliance questions
  • Maintains complete data sovereignty and confidentiality
  • Evolves with your needs through continuous updates

The device is not for sale. We provide an integrated AI consulting service where hardware, software, maintenance, updates, and support are bundled into a single monthly subscription.

Founder Partners Program:

  • 9 pioneer organizations selected to validate system flexibility across sectors
  • Co-development opportunity: Your feedback shapes the product roadmap
  • Deadline: Applications open until March 1, 2026

Become a Founder Partner

Join the pioneers shaping the future of institutional knowledge management.

Contact: founders@syrag.net

Website: www.syrag.net

8. Open Source Strategy (Under Evaluation)

SYRAG™ is currently proprietary software while we evaluate legal and commercial implications of open sourcing.

Why Not Immediately Open Source

  • Legal review required: Intellectual property, trademark, liability considerations
  • Commercial viability assessment: Balancing transparency with sustainable business model
  • Security implications: Device management and operational security need careful isolation
  • Community readiness: Infrastructure for contributions (CLA, governance, security disclosure) must be robust

Future Open Source Vision (pending legal clearance)

  • Open-core model: Core RAG components and SDK potentially public; device management remains proprietary
  • Possible licenses: AGPL-3.0 (core protection), Apache-2.0 (SDK adoption), commercial dual-licensing
  • Trademark policy: "SYRAG™" mark usage to be governed by dedicated policy
  • Contributor framework: CLA (Contributor License Agreement) to govern how contributions are accepted

Timeline

Open source strategy decision expected Q3-Q4 2026 after legal review and Founder Partners feedback.

Current Transparency Commitments

  • Public White Paper (this document)
  • Blockchain device registry (verifiable operations)
  • Reproducible builds (binary verification)
  • Public transparency dashboard

9. Governance & Accountability

Internal Governance (current)

  • Founders review and approve all architectural decisions
  • Security disclosures: Dedicated channel at founders@syrag.net, with a 48-hour response time
  • Quarterly White Paper updates with Founder Partners input
  • Transparent roadmap published in this document

Future Community Governance (if open sourced)

  • Maintainer review process, code of conduct
  • Public issue tracker and roadmap
  • CVE policy for security vulnerabilities
  • Community advisory board from Founder Partners

10. Call to Action

Applications open until March 1, 2026. Contact: founders@syrag.net. Website: www.syrag.net.

