Critique Ability of Large Language Models: CriticBench: Statistics and Examples