Sudoku-Bench: Unlocking Creative Reasoning in LLMs
Large language models (LLMs) often struggle with truly creative reasoning, instead relying on pattern recognition and memorization. Sudoku-Bench, a novel benchmark, aims to change…