A framework and benchmark to evaluate LLMs multilingual capabilities in healthcare queries, revealing significant performance gaps across languages.
Dec 10, 2024