The use of reinforcement learning from human feedback (RLHF) in AI training is not as effective as true reinforcement learning, as it relies on subjective human judgments and does not have a clear reward function. Some experts argue that RLHF is not true reinforcement learning and may not be as successful in tasks with open-ended goals.