AI & ML interests
None defined yet.
Recent Activity
THOX.ai
Edge AI computing devices and from-scratch language models for local, privacy-preserving inference. No cloud. No data uploads. Your weights, your hardware, your network.
Profile: https://huggingface.co/Thox-ai Website: https://www.thox.ai GitHub: https://github.com/ttracx Org owner: @tommytracx License default: Apache-2.0 (open source); proprietary firmware kept separate.
What we ship
THOX.ai owns the stack end to end. Three layers, all of them real today.
- Foundation models trained from scratch on THOX-defined architectures. Not fine-tunes of someone else's base.
- ThoxOS kernel, runtime, and userspace orchestration framework for the device fleet.
- Edge devices the models run on. ThoxClip, ThoxMini, ThoxNova, ThoxMicro, and the MagStack Air cluster variants.
Models on this org
| Repo | Status | Architecture | Params | Training data | Best metric | Notes |
|---|---|---|---|---|---|---|
| Thox-ai/ThoxLLM-327M | Published (private) | Decoder-only, single-projection THOX SwiGLU x*sigmoid(x)^2, 20L hidden 1024, GQA 16Q/8KV, vocab 50304, ctx 2048, bf16 |
327M | wikitext-103 | val ppl 22.03 at 50k steps | First fully THOX-owned base. Not converged at 50k; loss curve still descending. 619 MB safetensors + GPT-2 BPE tokenizer. GGUF gap (see below). |
| Thox-ai/ThoxLLM-Forge-7B | Published (private) | LoRA over Qwen 2.5 base | 7B | THOX domain corpus | train loss 0.635 at 240 steps | Fine-tune. GGUF Q4_K_M shipped. |
| Thox-ai/ThoxLLM-Mini-3B | Published (private) | LoRA over Qwen 2.5 base | 3B | THOX domain corpus | train loss 0.820 at 240 steps | Fine-tune. GGUF Q4_K_M shipped. |
| Thox-ai/ThoxLLM-Global-7B | Published (private) | LoRA over Llama base | 7B | THOX domain corpus | train loss 0.682 at 240 steps | Fine-tune. GGUF Q4_K_M shipped. |
| Thox-ai/ThoxLLM-Wave-8B-Unleashed | Published (private) | LoRA over Llama 3.1 base | 8B | THOX domain corpus | train loss 1.168 at 240 steps | Fine-tune. GGUF Q4_K_M shipped. |
| Thox-ai/ThoxLLM-Nova-12B-Unleashed | Published (private) | LoRA over Gemma base | 12B | THOX domain corpus | train loss 0.713 at 240 steps | Fine-tune. GGUF Q4_K_M shipped. |
| Thox-ai/ThoxGem-E4B-SFT | Published (private) | LoRA over Gemma-4 E4B | 4B effective | 50-example THOX persona dataset | train loss 1.330 / 4.64 epochs | SFT adapter. |
| Thox-ai/ThoxMicro-125M | Published (private) | Decoder-only THOX base | 125M | wikitext-103 | val ppl 13.6 | First published THOX base LM. |
Repos are private during the 2026-Q3 hardening window. Email tommy@thox.ai or open an issue at https://github.com/ttracx/thoxllm for access.
Foundation training roadmap
ThoxLLM-1B (target: 2026-06 to 2026-07). 24 layers, hidden 2048, GQA 32Q/8KV, ctx 4096, vocab 50304, bf16. 200k steps on wikitext-103 with grad-accum 64. Two architecture shapes scaffolded; final pick lands 2026-06-09:
| Shape | intermediate | Total params | Inference reach day-zero |
|---|---|---|---|
| Standard SwiGLU (gate + up + down) | 4480 | 1.015B | llama.cpp / Ollama / MLX / vLLM / Candle / every existing runtime |
| THOX single-projection SwiGLU | 6656 | 1.009B | HF Transformers only; llama.cpp needs a new arch |
The 327M result (val ppl 22.03 at 50k) is the floor; the 1B run is 4x params at 4x compute and should converge below 12 ppl on the same data at 200k steps.
ThoxLLM-3B (target: 2026-Q3). 32 layers, hidden 3072, GQA 24Q/8KV. Architecture identical to 1B; trained once the 1B run is published.
ThoxLLM-7B / 13B (target: 2026-Q4). Match Llama 3 dimensions. Multi-node training on a clustered MagStack Air rig.
Open-source layer
These ship on GitHub as Apache-2.0, not on HF. Linked here because the models target them.
| Project | Repo | What it is |
|---|---|---|
| ThoxOS Kernel | https://github.com/ttracx/thoxos-kernel | From-scratch async-first no_std Rust kernel. v1.0.0 boots a static thox-init ELF at CPL3 on x86_64 with a real SYSCALL trap entry and per-task page tables. Multi-arch: x86_64 (multiboot2 GRUB) + aarch64 (QEMU virt + ThoxClip CM5) + riscv64gc planned. 26 workspace crates, 18 ADRs. |
| THOX DigitalHumans | https://github.com/ttracx/thox-digitalhumans | Rust digital-human orchestration framework. 7-tier cognitive memory, persona handoff engine, ThoxOS device manifests. |
| THOX Experience Fabric (TXF) | https://github.com/ttracx/thox-experience-fabric | Cross-platform experience layer. Native SwiftUI / Compose / Rust codegen from one contract definition. |
| THOX LiteRT-LM | https://github.com/ttracx/thox-litert-lm | Pure-Rust on-device LLM runtime. Targets the THOX device fleet directly. |
| ThoxMesh | https://github.com/ttracx/thoxmesh | Multi-agent framework with tiered memory + mesh cognition. |
| Thox Portable | https://github.com/ttracx/thox-portable | Carried-device runtime: Rust thoxd + Next.js PWA + USB-gadget. |
| Thox Terminal | https://github.com/ttracx/thox-terminal | THOX-branded SwiftUI iOS17+ / macOS14+ control plane for the ecosystem. |
Device fleet (current and committed)
| Device | SoC | Status | Cluster role |
|---|---|---|---|
| ThoxClip | Raspberry Pi CM5 | Production-track | Smallest THOX endpoint |
| ThoxMini | Pi Zero W (locked 2026-06-01) | Production-track | Pocket / wearable |
| ThoxMini Air | Pi Zero W | Production-track | MagStack Air leaf node |
| ThoxNova | LattePanda N100 (Intel x86 / SYCL / Vulkan; replaces earlier Jetson plan) | Production-track | Desktop appliance |
| ThoxMicro | Luckfox Pico Mini B (Rockchip RV1103, 64 MB DDR2 / 0.5 TOPS NPU) | Prototype | Ultra-edge worker |
| MagStack Air | Pi Zero 2 W cluster, 4-8 nodes, magnetic pogo stack | Prototype | Distributed inference for ThoxLLM-1B+ |
The August 2026 Kickstarter ships ThoxClip ($39+), ThoxMini ($69), ThoxAir ($79), ThoxNova ($499). Goal: $250K baseline / $3M ceiling.
Open known gaps
Honest list of what's not done.
- GGUF for ThoxLLM-327M is blocked. llama.cpp expects three FFN tensors per layer (gate / up / down). The current THOX SwiGLU has two (fc_in / fc_out). Either we (a) add a
LLM_ARCH_THOXarch + kernel to a llama.cpp fork (2-3 days) or (b) switch ThoxLLM-1B+ to standard two-projection SwiGLU. Decision lands 2026-06-09; Path B is the recommended call indocs/decisions/2026-06-07-swiglu-shape-decision.md. - ThoxLLM-327M is not converged. Val loss was still descending at 50k. The 1B run targets 200k+ steps for a real converged baseline.
- Repos are private. Hardening window ends 2026-Q3; expect public re-publication of ThoxLLM-327M and ThoxLLM-1B after the SwiGLU decision is committed.
- No HF Inference API endpoint for the THOX base architecture yet. Single-projection THOX SwiGLU requires the
thoxllmpackage; standard-SwiGLU 1B+ will work in vanilla Transformers.
Quickstart (once a model goes public)
from huggingface_hub import snapshot_download
from thoxllm import ThoxLLM, ThoxTokenizer
# Download via huggingface-cli login + access grant
local = snapshot_download("Thox-ai/ThoxLLM-327M", token=HF_TOKEN)
tok = ThoxTokenizer.from_pretrained(local)
model = ThoxLLM.from_pretrained(local, dtype="bfloat16", device_map="auto")
out = model.generate(tok("Edge AI means", return_tensors="pt").to(model.device),
max_new_tokens=64)
print(tok.decode(out[0], skip_special_tokens=True))
For the fine-tuned adapter family (Forge / Mini / Global / Wave / Nova), a GGUF Q4_K_M is shipped alongside the safetensors and runs in Ollama directly:
ollama create thox-forge-7b -f Modelfile.thox-forge-7b
ollama run thox-forge-7b "Explain the THOX edge architecture in two paragraphs."
Compliance and posture
THOX devices run a stripped Linux-compatible userspace on top of ThoxOS Kernel. The whole pipeline is auditable:
- All training data is documented in repo
dataset.mdfiles. - No telemetry from devices. ThoxOS Kernel has zero outbound network calls in the boot path.
- Compliance gates (HIPAA / GDPR / SOC2 / PCI / ISO27001) are enforced at the kernel syscall layer via
thox-compliance. See ADR-0006 in the kernel repo. - Persona binding (THOXY / Mira / Sera) constrains what each model context can read or write per device session.
Roadmap (only what is committed)
| Quarter | Target |
|---|---|
| 2026-06 | ThoxLLM-1B SwiGLU decision committed. 200k converged run starts on RTX 4060 Ti. ThoxOS Kernel v1.0.0 verified on QEMU. |
| 2026-Q3 | ThoxLLM-1B published. THOX device fleet Kickstarter (August 2026). ThoxOS v1.1.0 with multi-task scheduler. |
| 2026-Q4 | ThoxLLM-3B published. MagStack Air cluster ships. THOX LiteRT-LM v1.0 on every device. |
| 2027-Q1 | ThoxLLM-7B published. ThoxOS v2.0 with distributed mesh inference. |
Contact
- Tommy Xaypanya (@tommytracx) - founder and engineer of record
- tommy@thox.ai for org member requests and partnership inquiries
- Issues + discussion: https://github.com/ttracx/thoxllm and https://github.com/ttracx/thoxos-kernel
Updated 2026-06-07. Last 30 days: shipped ThoxLLM-327M trained run, six fine-tune adapter pairs, ThoxOS Kernel v0.1.0 -> v1.0.0 (18 ADRs), THOX DigitalHumans v0.1.0, THOX Experience Fabric v0.3.0, ThoxLiteRT-LM, ThoxMesh, Thox Portable v0.2.0, Thox Terminal v0.1.0, MagStack Air family (3 repos), ThoxMicro.