For https://github.com/huggingface/open-r1/pull/158, to handle situations where `math_verify` throws an error, we just skip it with existing behavior. But what is the reasoning behind awarding reward 1, the same as a correct answer? I'd assume we just return `None` and _actually_ skip it.