Skeleton-of-Thought (SoT) is a prompting technique that first generates a skeleton outline of the answer, then expands each point in parallel. This dramatically reduces latency by enabling parallel generation of independent sections.
Unlike sequential Chain-of-Thought, SoT can achieve 2x+ speedup while maintaining answer quality for suitable tasks.