ArCHer is a hierarchical reinforcement learning framework for training large language models on multi-turn decision-making tasks. It splits decision-making into hierarchical levels: a high level that optimizes strategy across the turns of an interaction (macro decisions) and a low level that optimizes the individual choices made within each turn (micro decisions). The aim is for fine-grained choices to be made with long-horizon outcomes in mind.
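
To make the macro/micro split concrete, here is a minimal toy sketch of a two-level update. It is not ArCHer's implementation: it assumes a tabular value function over hand-labeled dialogue states, a single TD(0) step as the high-level update, and a low level that simply reuses the resulting TD error as a per-token weight. The names `Turn`, `td_update`, and `token_weights` are hypothetical and exist only for this illustration.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple


@dataclass
class Turn:
    state: str                 # hypothetical label for the state before the utterance
    tokens: List[str]          # tokens of the utterance the agent produced
    reward: float              # reward observed after the whole utterance
    next_state: Optional[str]  # None marks the end of the episode


def td_update(values: Dict[str, float], turn: Turn,
              gamma: float = 0.9, lr: float = 0.5) -> float:
    """High level (macro): one TD(0) step on an utterance-level value table."""
    bootstrap = 0.0 if turn.next_state is None else values.get(turn.next_state, 0.0)
    target = turn.reward + gamma * bootstrap
    old = values.get(turn.state, 0.0)
    values[turn.state] = old + lr * (target - old)
    # Return the TD error, used below as a crude advantage estimate (an assumption).
    return target - old


def token_weights(turn: Turn, advantage: float) -> List[Tuple[str, float]]:
    """Low level (micro): each token of the utterance shares the utterance-level signal."""
    return [(tok, advantage) for tok in turn.tokens]


# Toy two-turn episode; states, tokens, and rewards are made up for illustration.
values: Dict[str, float] = {}
episode = [
    Turn("start", ["ask", "color"], 0.0, "asked"),
    Turn("asked", ["guess", "apple"], 1.0, None),
]
for turn in episode:
    advantage = td_update(values, turn)
    weights = token_weights(turn, advantage)
```

In a real system the table would be replaced by a learned critic and the per-token weights would scale a policy-gradient loss on the language model; the sketch only shows how a turn-level learning signal can be handed down to token-level decisions.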
