Self-consistency samples multiple reasoning paths from the model and selects the most common answer through majority voting. Instead of relying on a single chain-of-thought, this technique leverages the diversity of reasoning approaches to improve accuracy.
This method significantly improves performance on complex reasoning tasks by reducing the impact of individual reasoning errors.