Show HN: RLHF and Lora finetuning to mistralai 7B with DeepSpeed learning

Heykuki News

1 point

10 months ago

Using multiple gpus, training 7B model with lora and RLHF with external dataset.

Show HN: RLHF and Lora finetuning to mistralai 7B with DeepSpeed learning | Heykuki News