Explore why LLMs struggle in non-English languages and how frameworks like Menlo and medical exams are reshaping global AI evaluation.