arxiv:2508.10925
Harshit Sikchi
hsikchi
AI & ML interests
None yet
Organizations
models 36
hsikchi/pythia-6.9b-goldrm_tldr-dpo-beta-0.0175-alpha-0-step-79872
Text Generation • 7B • Updated
• 1
hsikchi/pythia-6.9b-goldrm_tldr-dpo-beta-0.0175-alpha-0-step-59904
Text Generation • 7B • Updated
• 1
hsikchi/pythia-6.9b-goldrm_tldr-dpo-beta-0.0175-alpha-0-step-19968
Text Generation • 7B • Updated
• 1
hsikchi/pythia-6.9b-goldrm_tldr-dpo-beta-0.0375-alpha-0-step-59904
Text Generation • 7B • Updated
• 1
hsikchi/pythia-6.9b-goldrm_tldr-dpo-beta-0.0375-alpha-0-step-39936
Text Generation • 7B • Updated
• 1
hsikchi/pythia-6.9b-goldrm_tldr-dpo-beta-0.0375-alpha-0-step-79872
Text Generation • 7B • Updated
• 1
hsikchi/pythia-6.9b-goldrm_tldr-dpo-beta-0.0175-alpha-0-step-39936
Text Generation • 7B • Updated
• 1
hsikchi/pythia-6.9b-goldrm_tldr-dpo-beta-0.0175-alpha-0-LATEST
Text Generation • 7B • Updated
• 1
hsikchi/pythia-6.9b-goldrm_tldr-dpo-beta-0.0375-alpha-0-step-19968
Text Generation • 7B • Updated
• 1
hsikchi/pythia-6.9b-goldrm_tldr-dpo-beta-0.025-alpha-0-step-59904
Text Generation • 7B • Updated
• 1