Kyle PRO
iky1e
AI & ML interests
None yet
Recent Activity
liked a dataset about 1 hour ago
kyutai/interactivity-alignment-samples liked a model about 1 hour ago
kyutai/personaplex-rl-seamless upvoted a paper about 1 hour ago
Multi-Faceted Interactivity Alignment in Full-Duplex Speech ModelsOrganizations
Micro-LLM
Gender Detection
Embedding Models
Audio Analysis
-
codelion/whisper-age-estimator
Automatic Speech Recognition • 72.6M • Updated • 232 • 3 -
blackhole33/uzbek-speaker-verification-v4
Updated • 66 • 1 -
alefiury/wav2vec2-large-xlsr-53-gender-recognition-librispeech
Audio Classification • 0.3B • Updated • 175k • 47 -
fronx/Fast-FullSubNet
Audio-to-Audio • Updated • 5
Text to Speech
-
parler-tts/parler-tts-mini-expresso
Text-to-Speech • 0.6B • Updated • 1.37k • 117 -
parler-tts/parler-tts-large-v1
Text-to-Speech • 2B • Updated • 9.41k • 273 -
parler-tts/parler-tts-mini-v1
Text-to-Speech • 0.9B • Updated • 18.7k • 153 -
OuteAI/OuteTTS-0.2-500M
Text-to-Speech • 0.5B • Updated • 249 • 311
Music Models
Open Datasets
Language Models
Interesting LLM Models
-
mzbac/llama-3-8B-Instruct-function-calling
Text Generation • 8B • Updated • 15 • • 30 -
hjhj3168/Llama-3-8b-Orthogonalized-exl2
Text Generation • Updated • 63 • 91 -
failspy/kappa-3-phi-abliterated
Text Generation • 4B • Updated • 23 • • 46 -
failspy/kappa-3-phi-3-4k-instruct-abliterated-GGUF
4B • Updated • 55 • 12
DeepFilterNet-MLX
MLX ports of the DeepFilterNet speech enhancement models for Apple Silicon
Audio Encoder
3D
- Configuration errorAgentsFeatured4.78k
TRELLIS
🏢4.78kScalable and Versatile 3D Generation from images
- Running on ZeroAgents191
PSHuman
🏃191PHOTOREALISTIC HUMAN RECONSTRUCTION w/ CROSS-SCALE DIFF
-
microsoft/TRELLIS-image-large
Image-to-3D • Updated • 1.23M • 653 - Runtime errorAgentsFeatured90
GaussianAnything-AIGC3D
🌖90Generate 3D models from 2D images
Speech to Text
-
UsefulSensors/moonshine
Automatic Speech Recognition • Updated • 94 -
UsefulSensors/moonshine-tiny
Automatic Speech Recognition • 27.1M • Updated • 44.1k • 39 -
UsefulSensors/moonshine-base
Automatic Speech Recognition • 61.5M • Updated • 9.51k • 45 -
openai/whisper-large-v3
Automatic Speech Recognition • 2B • Updated • 5.05M • • 5.8k
Text to Image
structured information extraction
Translation
Video Models
Audio Models
Image Models
Super-Resolution
DeepFilterNet-MLX
MLX ports of the DeepFilterNet speech enhancement models for Apple Silicon
Micro-LLM
Audio Encoder
Gender Detection
3D
- Configuration errorAgentsFeatured4.78k
TRELLIS
🏢4.78kScalable and Versatile 3D Generation from images
- Running on ZeroAgents191
PSHuman
🏃191PHOTOREALISTIC HUMAN RECONSTRUCTION w/ CROSS-SCALE DIFF
-
microsoft/TRELLIS-image-large
Image-to-3D • Updated • 1.23M • 653 - Runtime errorAgentsFeatured90
GaussianAnything-AIGC3D
🌖90Generate 3D models from 2D images
Embedding Models
Speech to Text
-
UsefulSensors/moonshine
Automatic Speech Recognition • Updated • 94 -
UsefulSensors/moonshine-tiny
Automatic Speech Recognition • 27.1M • Updated • 44.1k • 39 -
UsefulSensors/moonshine-base
Automatic Speech Recognition • 61.5M • Updated • 9.51k • 45 -
openai/whisper-large-v3
Automatic Speech Recognition • 2B • Updated • 5.05M • • 5.8k
Audio Analysis
-
codelion/whisper-age-estimator
Automatic Speech Recognition • 72.6M • Updated • 232 • 3 -
blackhole33/uzbek-speaker-verification-v4
Updated • 66 • 1 -
alefiury/wav2vec2-large-xlsr-53-gender-recognition-librispeech
Audio Classification • 0.3B • Updated • 175k • 47 -
fronx/Fast-FullSubNet
Audio-to-Audio • Updated • 5
Text to Image
Text to Speech
-
parler-tts/parler-tts-mini-expresso
Text-to-Speech • 0.6B • Updated • 1.37k • 117 -
parler-tts/parler-tts-large-v1
Text-to-Speech • 2B • Updated • 9.41k • 273 -
parler-tts/parler-tts-mini-v1
Text-to-Speech • 0.9B • Updated • 18.7k • 153 -
OuteAI/OuteTTS-0.2-500M
Text-to-Speech • 0.5B • Updated • 249 • 311
structured information extraction
Music Models
Translation
Open Datasets
Video Models
Language Models
Audio Models
Interesting LLM Models
-
mzbac/llama-3-8B-Instruct-function-calling
Text Generation • 8B • Updated • 15 • • 30 -
hjhj3168/Llama-3-8b-Orthogonalized-exl2
Text Generation • Updated • 63 • 91 -
failspy/kappa-3-phi-abliterated
Text Generation • 4B • Updated • 23 • • 46 -
failspy/kappa-3-phi-3-4k-instruct-abliterated-GGUF
4B • Updated • 55 • 12
Image Models