januverma/mdlm-owt-from-scratch
januverma/qwen2.5-1b-tool-use-gsm8k-grpo
2B • Updated • 4
januverma/qwen2.5-1b-retool-sft
Text Generation
• 2B • Updated • 4
januverma/Qwen2.5-7B-Instruct-GSM-GRPO
Text Generation
• 8B • Updated • 3
januverma/GenRec-7B-Instruct
Text Generation
• 8B • Updated • 5
• 2
Text Generation
• 8B • Updated • 9
• 1
januverma/Qwen-RecSys-GRPO
Text Generation
• 2B • Updated • 5
januverma/Qwen2.5-1.5B-s1K
2B • Updated • 2
januverma/Qwen2.5-1.5B-simplescaling-SFT
Text Generation
• 2B • Updated • 2
januverma/Qwen2.5-1.5B-s1K-cot-SFT
Text Generation
• 2B • Updated • 3
januverma/Qwen2.5-1.5B-s1K-SFT
Text Generation
• 2B • Updated • 5
januverma/Qwen2.5-1.5B-SFT-simplescaling-20K
Text Generation
• 2B • Updated • 1
januverma/Qwen2.5-7B-s1K-SFT
Text Generation
• 8B • Updated • 7
• 1
januverma/Qwen2.5-1.5B-GRPO
Text Generation
• 2B • Updated • 2
januverma/Qwen2.5-3B-GRPO
Text Generation
• 3B • Updated • 4
januverma/MovieRecGRPO-3B
Updated
januverma/MovieRecGRPO-1.5B
Updated
januverma/MovieRecV4-1.5B
Updated
januverma/MovieRecV3-1.5B
Updated
januverma/MovieRecV2-1.5B
Updated
januverma/MovieRecQwen7B_V2
Updated
januverma/MovieRecQwen1.5B
Updated
januverma/MovieRecLlama-1B
Updated
januverma/QwenMath0.5B_GRPO
Text Generation
• 0.5B • Updated • 4
• 1
januverma/MovieRecV1-1.5B
Updated
januverma/Qwen7B_movierec
Updated