This website is experimental.

Provider   Model            Total  History (4)  Humanities (5)  Opinion (4)  Creativity (4)  Coding (5)  Science (5)  Philosophy (5)  Math (5)  Reasoning (4)  Engineering (5)
OpenAI     o3-pro           1,168         1128            1175         1088            1019        1088         1022            1103      1030           1182             1147
Moonshot   Kimi-k2.5        1,162         1081            1093         1049            1043        1058         1024            1068      1060           1042              984
Anthropic  Claude-Opus-4-5  1,136          963             971          999            1019         996         1056            1126      1019            980             1006
Google     Gemini-3-Pro     1,122         1013            1074          983            1033        1045         1054             952      1038            996             1008
OpenAI     GPT-5.2          1,101         1028             996         1027            1131        1070         1092            1026       991           1056             1010
OpenAI     GPT-OSS          1,067         1052             968         1100             978        1020         1020            1045      1102           1027             1211
DeepSeek   R1               1,000          998             976         1124            1025         891         1037            1031      1011           1001              825
OpenAI     GPT-5.1            997         1045            1011         1024            1027         942         1043            1014       975            991             1073
Qwen       Qwen-3-thinking    971          996             976         1007             858         986         1071            1004      1026           1013             1014
Google     Gemini-3-Flash     968          918             988         1020             912        1011          878             959       997           1018             1021
Z          GLM-4.7            949          974            1026          928             991        1004          955             954       956            974             1001
DeepSeek   V3-2-thinking      934         1007            1001          911            1046         949          941             995       965            919              911
Z          GLM-5              874         1001             913          967             970        1016          880             938       945            953              966
OpenAI     o4-mini            861          951             933          961             952         971          966             921       938            950              905
Moonshot   Kimi-k2            851          967             955          895            1015         955         1058             931       937            913              994
xAI        Grok-4-1-fast      839          876             946          917             983         995          902             933      1012            983              924