2026-06-20
i fine-tuned qwen3-4b on indonesian government documents and it actually works
1966 Q&A pairs, 30 minutes on kaggle, and it now cites specific law numbers correctly.
2026-06-20what i learned building an indonesian slm as a non-cs student
i'm a digital PR major. here's what three models taught me that no course would.
2026-06-20i trained a crisis PR model and it said 'bitch investigations' once
630K params, character-level tokenizer, val loss 1.03. it works if you squint.
2026-06-20why i went from 500M to 100M parameters (it's not what you think)
DFD-1 wasn't a failure. it was a tradeoff. here's the math.