Recent Activity

quangnd58  updated a model 4 days ago
vrfai/Qwen3-ASR-1.7B-int8
quangnd58  updated a model 4 days ago
vrfai/Qwen3-ASR-1.7B-int4
quangnd58  updated a model 4 days ago
vrfai/Qwen3-ASR-1.7B-fp8

Organization Card

VinRobotics - Edge AI & Model Optimization

We optimize and deploy LLM, ASR, VLM, and VLA (Vision-Language-Action) models on real-world systems.

What we do

  • Optimization: quantization (INT8/INT4/FP8/NVFP4), pruning, distillation, and related techniques
  • Deployment: vLLM, TensorRT, ONNX Runtime, edge runtimes
  • Systems: real-time pipelines (vision, audio, language, action)
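
As a rough illustration of the quantization item above, here is a minimal sketch of symmetric per-tensor INT8 quantization in plain Python. It is a textbook scheme, not necessarily the method used for the models published here; the function names `quantize_int8` and `dequantize_int8` and the sample weights are hypothetical.

```python
def quantize_int8(values):
    """Symmetric per-tensor INT8 quantization of a list of floats.

    Returns the integer codes and the scale needed to map them back.
    """
    max_abs = max((abs(v) for v in values), default=0.0)
    # Map the largest magnitude to 127; fall back to 1.0 for an all-zero tensor.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    codes = [max(-127, min(127, round(v / scale))) for v in values]
    return codes, scale


def dequantize_int8(codes, scale):
    """Recover approximate float values from INT8 codes."""
    return [c * scale for c in codes]


# Hypothetical weights, for illustration only.
weights = [0.5, -1.0, 0.25, 0.0]
codes, scale = quantize_int8(weights)
restored = dequantize_int8(codes, scale)
max_error = max(abs(w - r) for w, r in zip(weights, restored))
```

The round-trip error per value is bounded by half the scale, which is why per-channel or per-group scales are often preferred when a tensor has a few large outliers.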

Focus

  • Edge devices (Jetson, SoCs)
  • Robotics & VLA systems
  • Latency, stability, deployability
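
Since latency is one of the focus areas above, the following sketch shows one common way to measure it: warm up, time repeated calls, and report tail percentiles rather than only the mean. The helper `measure_latency` and the stand-in workload `fake_inference` are hypothetical, and the p95 value uses a simple index-based approximation.

```python
import statistics
import time


def measure_latency(fn, warmup=3, runs=20):
    """Time repeated calls to fn and summarize latency in milliseconds."""
    # Warm-up calls are discarded so one-time setup cost does not skew results.
    for _ in range(warmup):
        fn()
    samples_ms = []
    for _ in range(runs):
        start = time.perf_counter()
        fn()
        samples_ms.append((time.perf_counter() - start) * 1000.0)
    samples_ms.sort()
    # Simple index-based approximation of the 95th percentile.
    p95_index = min(len(samples_ms) - 1, int(0.95 * len(samples_ms)))
    return {
        "mean_ms": statistics.fmean(samples_ms),
        "p50_ms": statistics.median(samples_ms),
        "p95_ms": samples_ms[p95_index],
        "max_ms": samples_ms[-1],
    }


def fake_inference():
    """Stand-in workload; replace with a real model call."""
    sum(i * i for i in range(10_000))


stats = measure_latency(fake_inference)
```

On an edge device, the gap between `p50_ms` and `p95_ms` is a quick indicator of stability: a large gap suggests jitter from thermal throttling, memory pressure, or contention with other processes.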

Philosophy

Optimization = model + runtime + system
