Why exams intended for humans might not be good benchmarks for LLMs like GPT-4 March 30, 2023 Training data contamination and other factors mean LLMs like GPT-4 succeeding on human exams might not be a good measure of their abilities.https://bit.ly/3lUAnES Share Get link Facebook X Pinterest Email Other Apps Share Get link Facebook X Pinterest Email Other Apps