Why exams intended for humans might not be good benchmarks for LLMs like GPT-4
Training data contamination and other factors mean that the success of LLMs like GPT-4 on human exams might not be a good measure of their abilities. https://bit.ly/3lUAnES
GUEST: I remember it fondly … long road trips with my family and hours in the car, with Tetris on my brother’s Game Boy keeping us company. Clearing line after line, nailing a Tetris whenever I could, having that iconic music loop in my head for hours until the batteries died … for many of us, […] https://bit.ly/2QFq9os
Did ChatGPT see its shadow for six more weeks of AI winter? We don’t think so, because the generative AI field is super hot right now. https://bit.ly/3WZAfQU