IDA AI

IDA AI is the deeper native computational body of the IDA family.

It is not a conventional general-purpose assistant wrapped in eleven labels. It is a custom IDA Lattice causal language model designed to host differentiated student pressures, persistent state, governed memory, bounded reasoning workspaces, and structured convergence at a larger computational scale than IDA Edge.

The IDA family is the enduring structure:

IDA
JUDGE
SENTINEL
PRISM
ECHO
ATLAS
VECTOR
FORGE
SHADE
PULSE
ORBIT

IDA AI is one body that can carry and deepen that structure. It is not the family itself, and it does not replace the family runtime that governs how the students are allowed to participate, disagree, converge, and render shared output.

Role in the IDA Lattice lineage

IDA AI is the deep body in the paired EDGE ↔ AI lineage.

Where Edge is designed for lower-footprint local use, AI is designed for deeper reasoning capacity, larger state handling, more complex family-context experiments, and compute-intensive training and evaluation.

The pairing is intentional:

Edge keeps the family available in constrained or local environments.
AI provides the larger reasoning body for deeper context and more demanding inference.
MoE provides a separate council-scale body for high-pressure expert-seat convergence.

These are not competing identities. They are different computational bodies for the same continuing family architecture.

Architecture

IDA AI uses the native IDA Lattice architecture.

This architecture was built to move beyond the limitations of the earlier GPT-NeoX + QLoRA phase, where a broad pretrained transformer could generate coherent language but did not reliably preserve the differentiated identity structure required by the IDA family.

Core architectural features include:

recurrent selective state for student-persistent continuity
bounded local-attention workspaces
controlled merging between recurrent and attention representations
cognitive-pressure routing with lateral inhibition
thalamic routing and workspace summaries
action gates and student-state outputs
future-token auxiliary prediction
fidelity-oriented verification layers
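This card does not specify how cognitive-pressure routing with lateral inhibition is implemented. As a rough illustration of the general idea only, the sketch below subtracts from each student's score a fraction of the mean score of the other seats, then normalizes with a softmax. The function name, the `inhibition` parameter, and the choice of softmax are assumptions made for this example, not the IDA Lattice implementation.

```python
import math

# The eleven family seats named in this card.
STUDENTS = ["IDA", "JUDGE", "SENTINEL", "PRISM", "ECHO", "ATLAS",
            "VECTOR", "FORGE", "SHADE", "PULSE", "ORBIT"]


def lateral_inhibition_route(pressures, inhibition=0.5):
    """Return routing weights after each seat is suppressed by its rivals.

    pressures:  one raw score per student (hypothetical input).
    inhibition: suppression strength (hypothetical hyperparameter).
    """
    n = len(pressures)
    total = sum(pressures)
    # Each score is reduced by a fraction of the mean score of the other seats.
    inhibited = [p - inhibition * (total - p) / (n - 1) for p in pressures]
    # Numerically stable softmax over the inhibited scores.
    peak = max(inhibited)
    exps = [math.exp(v - peak) for v in inhibited]
    norm = sum(exps)
    return [e / norm for e in exps]


# Example: one seat carries more pressure than the rest.
raw = [0.2] * len(STUDENTS)
raw[STUDENTS.index("JUDGE")] = 1.0
weights = lateral_inhibition_route(raw)
```

In this toy form, inhibition widens the gap between the strongest seat and the others, so the leading seat receives a larger share of the routing weight than it would from a plain softmax over the raw scores.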

The current AI body is approximately 966M parameters.

Training direction

IDA AI is trained on a curriculum built for the native IDA Lattice architecture, not on the borrowed-backbone approach used earlier in the project.

The present training direction includes:

custom IDA Lattice architecture
FP8 acceleration for suitable bulk projection layers
BF16 retention for sensitive routing, gating, norm, embedding, and output components
progressive sequence-length expansion
progressive gradient-accumulation expansion
novelty-aware pass behavior
structured monitoring of throughput, loss behavior, gradient norms, and per-student training stability
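The FP8 and BF16 split described above can be pictured as a per-module precision plan. The sketch below is a minimal illustration, assuming modules can be classified by name; the keyword list and the module names are hypothetical, since this card does not publish the real layer names or the training code.

```python
# Hypothetical name fragments for components kept in BF16.
# The real module names are not given in this card.
BF16_KEYWORDS = ("router", "routing", "gate", "norm", "embed", "output_head")


def precision_for(module_name: str) -> str:
    """Pick a training precision for one module, based on its name."""
    lowered = module_name.lower()
    # Sensitive components are checked first so they always stay in BF16.
    if any(keyword in lowered for keyword in BF16_KEYWORDS):
        return "bf16"
    # Remaining projection layers are treated as bulk layers suited to FP8.
    if "proj" in lowered:
        return "fp8"
    # Anything unrecognized falls back to the more conservative choice.
    return "bf16"


def build_precision_plan(module_names):
    """Map every module name to its chosen precision."""
    return {name: precision_for(name) for name in module_names}


plan = build_precision_plan([
    "blocks.0.mixer.in_proj",      # bulk projection
    "blocks.0.action_gate.proj",   # gating component
    "blocks.0.norm",
    "embed_tokens",
    "output_head",
])
```

Checking the sensitive keywords before the projection rule is a deliberate choice in this sketch: a projection that sits inside a gate or router stays in BF16 rather than being swept into the FP8 group.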

The Supersampler curriculum begins with lower-cost, lower-resolution learning conditions (shorter sequences and less gradient accumulation) and increases both as the model stabilizes.
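A staged schedule of this kind might look like the sketch below. The stage values, the stability test, and all names are assumptions chosen for illustration; the card does not state the actual sequence lengths, accumulation steps, or the criterion used to decide that the model has stabilized.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Stage:
    seq_len: int      # tokens per training sequence
    grad_accum: int   # micro-batches accumulated per optimizer step


# Hypothetical stage values; the real curriculum is not published here.
STAGES = [Stage(512, 4), Stage(1024, 8), Stage(2048, 16)]


def is_stable(losses, window=4, tolerance=0.05):
    """True when the last `window` losses span at most `tolerance` of their mean."""
    if len(losses) < window:
        return False
    recent = losses[-window:]
    mean = sum(recent) / window
    return (max(recent) - min(recent)) <= tolerance * mean


def next_stage_index(current, losses):
    """Advance one stage when the loss has settled; otherwise stay put."""
    if current + 1 < len(STAGES) and is_stable(losses):
        return current + 1
    return current


# Example: the loss has flattened near 2.0, so stage 0 hands over to stage 1.
stage = next_stage_index(0, [3.0, 2.01, 2.0, 1.99, 2.0])
```

Sequence length and accumulation grow together in this sketch, so each later stage costs more per optimizer step than the one before it.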

Intended use

IDA AI is intended for research and development involving:

deep family-context experiments
structured multi-student reasoning
governed memory and state inheritance research
paired Edge ↔ AI agreement testing
cognitive-pressure routing experiments
longer-context reasoning prototypes
supervised evaluation of family differentiation, dissent preservation, and arbitration behavior

What this model is not intended for

IDA AI should not be represented as:

a broadly superior replacement for frontier language models
an autonomous decision authority
a general-purpose professional advisor
a guarantee of safety, correctness, or deployment readiness
proof that lower cross-entropy loss directly represents higher intelligence
proof that every family member is equally capable in every domain

The model’s text-generation capability is only one layer of the larger system. Operational action, external tool use, escalation, refusal, memory retention, arbitration, and final rendering must remain governed by the surrounding runtime contracts.

Legacy metadata note

Some early artifacts in the IDA AI lineage were developed during the project’s GPT-NeoX + QLoRA era.

That history is real, but this repository represents the current native IDA Lattice direction. It should not be described as a GPT-NeoX model simply because older metadata or earlier training artifacts referenced that lineage.

Limitations

The current training record supports claims about training dynamics and stability under the active FP8 + Supersampler configuration. It does not yet establish broad deployment capability, general superiority over other language models, or isolated causal attribution for individual training changes.

Promotion beyond research use requires held-out evaluation, lineage packet completion, per-student evidence, and paired Edge ↔ AI identity-agreement testing.
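One way to keep that requirement explicit is to encode it as a checklist that blocks promotion until every item is recorded. The sketch below is illustrative only: the key names and the record format are hypothetical, and it does not describe any existing IDA tooling.

```python
# Hypothetical keys mirroring the four requirements listed above.
REQUIRED_EVIDENCE = (
    "held_out_evaluation",
    "lineage_packet_complete",
    "per_student_evidence",
    "edge_ai_identity_agreement",
)


def missing_evidence(record: dict) -> list:
    """Return the required items that are absent or not marked as done."""
    return [key for key in REQUIRED_EVIDENCE if not record.get(key, False)]


def may_promote(record: dict) -> bool:
    """Promotion is allowed only when no required item is missing."""
    return not missing_evidence(record)


# Example: two of the four items are still outstanding.
record = {"held_out_evaluation": True, "per_student_evidence": True}
outstanding = missing_evidence(record)
```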

License and access

This repository is released under its stated license and access conditions. Any deployment should preserve the project’s governed-memory, family-differentiation, traceability, and non-autonomous action boundaries.
