Has anyone tried training large models on consumer hardware? My experience with gradient checkpointing + ZeRO | Payloop Community