STake the quiz
Menu

Short answer

What is RLHF work?

RLHF work usually means helping improve model behavior by comparing, rating, or rewriting outputs according to a rubric. Applicants may need strong writing, domain judgment, or technical reasoning.

Context

RLHF stands for reinforcement learning from human feedback. In public listings, the label can cover several task types, so the current role page matters.

Related pages