The latest version of Google DeepMind, Gemini Exp 1114, has achieved significant milestones on the Chatbot Arena, soaring to the top of the overall leaderboard with over 6,000 community votes and excelling in multiple domains:
First, we need to understand what LLM Arena is. LLM Arena (or Chatbot Arena) is a platform for evaluating LLMs, primarily aimed at promoting community-driven LLM performance assessments. It is one of the most prestigious evaluation platforms.
From the overall leaderboard, Google's new model Gemini (Exp 1114) saw a score increase of over 40, reaching a score of 1344, while the latest version of ChatGPT 4.0 scored 1340. This seems to be the first time a model from Google has achieved such results.
Gemini-Exp-1114 is tied for first place in the math arena, performing on par with o1:
Currently, Gemini-Exp-1114 can be experienced in conversation at Google AI Studio.
The Terminator is coming.