Training
Teacher traces into student behavior
- Fine-tuned Qwen3-8B with QLoRA using teacher-generated reasoning traces.
- Trained the student model to generate reasoning traces and sentiment labels for financial slang such as rug pull and diamond hands.
- Used Unsloth and 4-bit quantization to stay within limited GPU memory.