Refining Conversational Models: Supervised Tuning vs. Reinforcement Learning | Payloop Community