OpenAI’s ChatGPT 3.5 and Anthropic’s Claude failing at absolutely basic evaluations of schedules.