Self-Instruct: Aligning Language Models with Self-Generated Instructions

Using base LLMs to self-generate instruction data for alignment, approaching InstructGPT performance with near-zero human annotation.
Alignment
Author: Imad Dabbura

Published: March 21, 2024


#nlp #fine-tuning
