Building on HF

13 1 45

Dean Byrne PRO

Quazim0t0

dean-byrne-02a28b191

AI & ML interests

Joined HuggingFace: 1-20-2025 / DaisyChainAI🌼 / SmallLM's / San Francisco

Recent Activity

updated a Space about 1 hour ago

Quazim0t0/Escarda-86M-Chat

updated a model about 1 hour ago

Quazim0t0/Escarda-86M-Identity

published a model about 1 hour ago

Quazim0t0/Escarda-86M-Identity

View all activity

Organizations

reacted to Kurapika993's post with 👀 about 5 hours ago

Post

128

🚀 Released two Responsible AI lightweight instruction-tuned models focused on toxicity, bias, and safety analysis

Model 1: Responsible AI Safety Assistant (Qwen 2.5)

Kurapika993/qwen2.5-7b-responsible-ai-qlora
Base Model: Qwen2.5-7B-Instruct
Method: QLoRA
Training Data: BeaverTails + Wiki Toxic + custom Responsible AI instruction dataset

Model 2: Responsible AI Assistant (Llama)

Kurapika993/llama-3.1-8b-responsible-ai-safety-lora
Base Model: Llama-3.1-8b Instruct
Method: QLoRA
Training Data: BeaverTails + Wiki Toxic + custom curated examples

This model follows the same structured output format but explores the impact of a different base architecture on safety-analysis tasks.

Intended Use

These models are designed for:

✅ Responsible AI research
✅ Moderation decisions
✅ Safety and bias analysis
✅ Human-in-the-loop moderation workflows
✅ Dataset generation and annotation assistance

reacted to TinyKat's post with 👍 about 5 hours ago

Post

🍲 Progress Update - Build Small Hackathon

I just built my first working AI tool: **Team Lunch Order Collector** for my organization!

Features so far:
- Fixed menu with prices (Kenyan favorites)
- Easy ordering (Individual or Group)
- Live summary + total cost
- AI suggestions for the organizer

Still polishing images and a few things, but it's already usable.

Try it here 👇
build-small-hackathon/team_lunch_app_v1

Any feedback is very welcome!

#BuildSmallHackathon

reacted to marinarosa's post with 🔥 about 5 hours ago

Post

137

I'm participating in the Build Small Hackathon! 💖

I chose the Backyard AI chapter: solve a real problem for someone you know.

My idea is to help a business owner extract valuable information from their data. She operates, negotiates and provides support for customers all through Whatsapp. She doesn't use Notion, Obsidian, Spreadsheets and does not like doing repetitive data-entry tasks. It's a messy, chaotic messaging workflow that kind of works for her now.

What if she could open a Space, upload her exported WhatsApp data, and a fine-tuned model tailored to her business domain and conversation style extracted, classified, organized all her customers and deals into a coherent dashboard, along with a chatbot for her to ask questions about her business? That's my current approach.

5 replies

reacted to mmhamdy's post with 🚀 about 5 hours ago

Post

1012

Human brains don't recreate every pixel to understand the world!

Most current models in genomics, proteomics, and single-cell transcriptomics rely on generative objectives like masked language modeling or next token prediction. While effective, these architectures waste significant capacity reconstructing raw, noisy sequence details that may not carry functional biological meaning.

But a promising, more efficient alternative is emerging: Joint-Embedding Predictive Architecture (JEPA)

Originally introduced by Yann LeCun for computer vision, JEPA is a non-generative, self-supervised learning (SSL) framework. Instead of predicting raw inputs, it operates as a world model that predicts abstract semantic embeddings in latent space.

Recently, the JEPA framework (and its more efficient LeJEPA variant) has been adapted into the biological sciences to develop performing foundation models and to improve on already existing ones.

It's interesting how each adaptation modified and tailored JEPA to suit its specific biological domain, whether by experimenting with different backbones or complementing the objective with other loss terms.

For example, JEPA-DNA and ProteinJEPA used JEPA as a continual pre-training framework to enhance existing foundation models without training from scratch, while Cell-JEPA and JEPA-DNA employed a hybrid objective that combines the JEPA loss with a traditional language modeling loss.

The article below provides an overview of these implementations, along with others that came out this year. As always, your thoughts and feedback are welcome and highly appreciated!

Link to the article is in the first comment 👇

3 replies

reacted to azettl's post with 👍 about 5 hours ago

Post

921

𝗕𝘂𝗶𝗹𝗱 𝗦𝗺𝗮𝗹𝗹 𝗛𝗮𝗰𝗸𝗮𝘁𝗵𝗼𝗻 - 𝗪𝗲𝗲𝗸 𝗶𝗻 𝗣𝗿𝗼𝗴𝗿𝗲𝘀𝘀

After Consilium, I wanted to build something entirely different. Smaller. Weirder.

The idea came from a single Asterix scene—Asterix and Obelix trying to obtain Permit A38, only to discover you need Permit A38 to apply for Permit A38. That's the whole game.

𝗪𝗵𝗮𝘁 𝗜 𝗯𝘂𝗶𝗹𝘁: A bureaucratic text adventure powered by one fine-tuned 1.7B model playing five different characters—each with their own system prompt, their own rules, and their own way of sending you in circles.

𝗧𝗵𝗲 𝗳𝗶𝘃𝗲 𝗽𝗲𝗿𝘀𝗼𝗻𝗮𝗹𝗶𝘁𝗶𝗲𝘀:

✅ Clerk Vitalstatistix—requires Form 27b/6. Has never issued a permit in 23 years.

✅ Supervisor Caligula Minus—perpetually at lunch. Invented A38 in 1987, forgot why.

✅ SYSTEMA v2.3—last updated 1994. Every response starts with an error code.

✅ Form 27b/6—sent. Not happy about it. Page 3 is always missing.

✅ Ombudsman Panoramix—investigates complaints about the process. It is also the process.

𝗧𝗵𝗲 𝘁𝗲𝗰𝗵:

- Fine-tuned SmolLM2-1.7B on ~1000 synthetic NPC examples generated with Claude.
- Converted to GGUF Q4_K_M for fast CPU inference
- Streaming responses so you feel the bureaucracy arriving in real time
- Custom beige government office UI

➡️ Try it: azettl/the-place-that-sends-you-mad

Built for the Build Small Hackathon—Track 2: Thousand Token Wood. Model at azettl/permit-a38-npc and Dataset azettl/permit-a38-npcs.

reacted to FlameF0X's post with 🔥 about 5 hours ago

Post

189

My models on the Intel Low-Bit LLM Leaderboard

Figured I'd share where my quantized models landed on Intel/low_bit_open_llm_leaderboard since I hadn't posted about it yet.

FlameF0X/Qwen3-4B-Distilled-Claude-4.6 (NVFP4 and MXFP4) sit at ranks 23 and 24 with 62.68% and 61.18% average, right below the base Qwen3-4B. Not bad considering they were distilled from Claude 4.6 rather than trained from scratch.

FlameF0X/LFM2.5-1.2B-Distilled-Claude-4.6 and FlameF0X/LFM2.5-1.2B-Thinking-CodeX land around rank 47-49, competitive with MiniCPM5-1B and the Qwen3 sub-1B models despite being a larger base architecture.

The funny one is FlameF0X/Qwen2-0.2B-pt and FlameF0X/Qwen2-0.2B-it. They're not properly trained — genuinely undertrained, basically undefined — and they still beat openai/gpt-oss-20b at rank 66. The 20B model. Not sure what that says but it's something.

FlameF0X/LFM2-Research is at the bottom of my lineup but it's a research artifact, not meant to be competitive.

Chart below showing my models vs nearby competitors, with size vs performance on the left.

Chart made by Claude

1 reply

reacted to evillegasgarcia's post with 👍🚀 about 5 hours ago

Post

141

Introducing ESM3-PPISites! 🧬🤖 We leveraged the multimodal ESM3 to predict protein-protein interaction interfaces with state-of-the-art accuracy—using only sequence data! By feeding these predictions into HADDOCK 🧩, we can accurately reconstruct protein complexes while reducing computation time by an order of magnitude.
🚀 Test the live model on spaces: area-science-park/esm3-ppisites
📄 Read the preprint: https://doi.org/10.64898/2026.05.29.728739
#Bioinformatics #MachineLearning #ProteinDocking

reacted to Jiaqi-hkust's post with 🔥 about 5 hours ago

Post

3952

🚀 Introducing Robust-U1: Teaching MLLMs to Self-Recover Corrupted Visual Content

Multimodal Large Language Models (MLLMs) have achieved impressive visual understanding, yet they remain highly brittle under real-world corruptions—noise, blur, compression artifacts, adverse weather.

Standard MLLMs suffer dramatic performance drops, and existing robustness solutions come with fundamental limits: black‑box feature alignment lacks interpretability, while white‑box text reasoning cannot restore the lost pixel‑level visual details. This raises a crucial question:

🧐 Can MLLMs recover corrupted visual content by themselves?

If the answer is yes, we can move beyond merely “compensating” for corruption and instead build a more intrinsic, generalizable form of resilience. Robust-U1 is our answer to that question.

💡 Paper: https://arxiv.org/abs/2606.08063
🔗 Code: github.com/jqtangust/Robust-U1
🌍 Demo: Jiaqi-hkust/Robust-U1

1 reply

reacted to andyolivers's post with 🔥 about 5 hours ago

Post

149

→ Try it live: build-small-hackathon/lolaby
→ Watch the demo: https://youtu.be/eY_JnijT62E
→ Read the build journal: https://huggingface.co/blog/build-small-hackathon/lolaby-blog

Every parent, teacher, or babysitter knows the moment. The lights go dim. Blankets come out. Your child asks for a song. Then another. Then suddenly you’re improvising lyrics about dinosaurs, fire trucks, and princesses while trying to convince a little one that it’s actually bedtime.

That’s exactly the problem my partner’s sister faces as a kindergarten teacher. Every day she runs nap time for fifteen 4-year-olds, and ever since they learned about music and instruments in class, it starts the same way: "sing a song for me." She'd love to give each child their own song, built from whatever they love that week, but she doesn't have the time, the musical training, or a tool that could do it. So @volivers and I built one.

Introducing Lolaby — our submission to the Hugging Face Build Small Hackathon 2026, hosted by Gradio and backed by OpenBMB, OpenAI, NVIDIA, Modal, Cohere, JetBrains, and Black Forest Labs.

A child draws something they love (on screen or on paper), a name is entered, and a tiny AI watches the drawing, writes a personalised lullaby, and sings it back.

Everything runs locally. No cloud LLMs. No per-song API cost. No child's drawing or name ever leaves the device.

The full pipeline:
🖼️ MiniCPM-V 4.6 (1.3B) reads the drawing.
✍️ A fine-tuned Llama 3.2 3B writes the lyrics — trained on 1,500 lullabies with strict anti-boilerplate gates.
🎵 Kokoro 82M sings the result over custom DSP instruments.

Drop a like, upvote or comment. Feedback is welcome! 🙏

reacted to YMRohit's post with 🔥 about 5 hours ago

Post

3004

A 1B model that writes GPU kernels you can trust

I fine-tuned OpenBMB's MiniCPM5-1B to write Triton GPU kernels, then let an immutable referee decide if they are real: compile, check correctness against PyTorch on adversarial inputs, time against eager, torch.compile, and torch.compile max-autotune, then block the known ways of gaming the benchmark.

The 1B setup beat torch.compile max-autotune in 12/12 independently seeded runs. The larger Qwen3.6-27B smith pushed the same referee loop further: 76 verified compiler-beating kernels on H200, with 69 surviving a 5-run stability gate and 7 kept as single-shot probes on unseen problems. On a 376-cell shape/dtype grid, the stability-gated kernels keep a 1.49x geomean, with about 10% of cells losing and reported per cell.

Honest bound: these are scheduling wins on memory-bound ops, not new algorithms or wins over cuBLAS/FlashAttention. The scarce thing is not the big model, it is the verifier it cannot fool.

Full write-up: https://huggingface.co/blog/YMRohit/ouroboros-kernel-mint
Try it: build-small-hackathon/ouroboros-kernel-mint
2-min demo: https://youtu.be/ViicZHktb-A

Built for #BuildSmallHackathon with MiniCPM, Qwen, Triton, Gradio, Codex, and Modal H200s.

1 reply

reacted to kingkw1's post with 👍 about 5 hours ago

Post

2962

I built Read-Along AI for the Hugging Face Build Small Hackathon.

It is an offline-capable reading practice app for early readers: one short sentence at a time, tap-to-hear word help, record a read-aloud attempt, then get gentle feedback.

The goal is Backyard AI in the literal sense: a tool for real home reading practice, where feedback needs to be patient, developmentally fair, and private. A child’s voice should not need to leave the app just to practice “The dog ran fast.”

What makes it small-model native:

- Exact clean readings pass immediately.
- Close or ambiguous child-speech transcripts get a second look from a fine-tuned MiniCPM phonetic evaluator.
- Meaning-changing mistakes still fail closed, e.g. “blue hat” should not pass for “red hat.”
- Off the Grid Mode runs local ASR plus the MiniCPM GGUF evaluator through llama.cpp.
- Turbo Mode uses Modal endpoints for lower-latency ASR/TTS/evaluation.
- The UI is custom Gradio with a child-facing reading canvas, clickable words, progress feedback, and celebration on success.

Targeted tracks and badges:
Backyard AI, Off-Brand, Off the Grid, Llama Champion, Well-Tuned, Tiny Titan, Sharing is Caring, Field Notes.

Space:
build-small-hackathon/read-along-ai

Demo video:
https://youtu.be/4bpbwhipLU4

Repo:
https://github.com/kingkw1/read-along-ai

Built with Codex as the lead development partner.

5 replies

reacted to danielhanchen's post with 🔥 about 5 hours ago

Post

3965

Google's new DiffusionGemma can now run at 2000+ tokens/sec! ⚡

We made local DiffusionGemma inference 1.8× faster.
Run it on 18GB RAM via Unsloth Studio.

GitHub: https://github.com/unslothai/unsloth
Guide: https://unsloth.ai/docs/models/diffusiongemma

4 replies

reacted to KingNish's post with 👀 about 5 hours ago

Post

4034

We trained an open-source Mythos like cybersecurity LLM for the Build Small Hackathon meet OpenMythos

Trained in two stages: SFT on ~1.84K filtered ArXiv cs.CR papers + real CVE data, then RLVR using paired with past vulnerabilities GitHub repos with a verifier model checking outputs against ground truth.

Trained on: H100s from Modal

The RLVR stage made the biggest difference responses got more precise and less prone to confusing similar vulnerability classes.

Everything is open:
🤖 Demo → build-small-hackathon/OpenMythos
🧠 Model → build-small-hackathon/OpenMythos
📦 CVE Dataset → build-small-hackathon/CVE_Vulnerailities_Detailed
📄 ArXiv Dataset → himanshu17HF/ArvixImport-Filtered-Final

Try it out and let us know where it breaks 🙏

reacted to daanhoekstra's post with 👍 about 5 hours ago

Post

122

Just published my first model on the Hub: daanhoekstra/sds-ner-compliance
A token classification model for extracting structured fields from GHS Safety Data Sheets, signal words, H-codes, P-codes, CAS numbers, built on SciBERT.
Comes with a full write-up on automating GHS and supply chain label compliance with open-source NLP. Feedback welcome!

reacted to ProCreations's post with 👍 about 5 hours ago

Post

112

Here is my submission for the huggingface hackathon! build-small-hackathon/tutori

Heart it if you want, it helps

reacted to dronefreak's post with 🔥 about 5 hours ago

Post

393

Excited to open-source the VisDrone Aerial Object Detection Model Zoo on Hugging Face.

The collection includes multiple YOLO variants trained and evaluated on the VisDrone benchmark for aerial object detection, with accompanying documentation and performance metrics.

If you're working on drones, aerial surveillance, robotics, or small-object detection, I hope these models save you some time.

Model Zoo: https://huggingface.co/collections/dronefreak/visdrone-detection-model-zoo

Feedback, issues, and contributions are welcome.

reacted to Anran-MLLM's post with 🔥 about 5 hours ago

Post

248

🚀 Introducing PerceptionDLM — the first multimodal diffusion LLM for parallel region perception!

Most MLLMs are autoregressive, so captioning N regions costs N sequential passes. PerceptionDLM instead describes ALL masked regions in a single denoising process. 🧩

✨ Highlights
• ⚡ Up to 3.4× faster on dense multi-region captioning, with stable per-image latency
• 🏆 PerceptionDLM-Base beats LLaDA-V on 15/16 multimodal benchmarks (new SOTA among open diffusion VLMs)
• 📊 New benchmark: ParaDLC-Bench — jointly evaluates caption quality AND inference efficiency
• 🔓 Code, models & benchmark all open-sourced

🤖 Models
MSALab/PerceptionDLM-Base
MSALab/PerceptionDLM

📊 Benchmark
MSALab/ParaDLC-Bench

📄 Paper: PerceptionDLM: Parallel Region Perception with Multimodal Diffusion Language Models (2606.19534)
💻 Code: https://github.com/MSALab-PKU/PerceptionDLM

Diffusion LLMs aren't just for text — they unlock efficient, parallel visual perception. 👁️✨

#multimodal #diffusion #VLM #perception

reacted to Reubencf's post with 🔥 about 5 hours ago

Post

233

Shadows of Tomorrow is finally live on Hugging Face Spaces with Gradio.

It’s a browser-playable RPG built with Godot, set in a post-nuclear future where players explore Magnus Province, collect medicinal plants, craft medicine, and help cure NPCs.

Play it here: Reubencf/Shadows_of_Tomorrow

1 reply

reacted to Hari5115's post with 🔥 about 5 hours ago

Post

Bit addictive. Fair warning !!!
Chain combos, fever mode, daily leaderboard. Free, runs in your browser.
Beat the score if you can 🫧

🎮 Hari5115/neon-pop

#SendHelp #JustOneMoreGame #NeonPop #NotAddicted

Dean Byrne PRO

AI & ML interests

Recent Activity

Organizations

Quazim0t0's activity