Label data and build a meme classifier? Does not have to be perfect to be useful. But yeah, data curation is probably a huge endeavor at the companies making Language model that are fit for production. Like in practically all applications of Machine Learning.
But the Reinforcement Learning from Human Feedback (RLHF) is also one of the key tools to getting useful outputs.
But the Reinforcement Learning from Human Feedback (RLHF) is also one of the key tools to getting useful outputs.