This website is experimental.

Provider    Model                  Total  Reasoning (4)  History (4)  Humanities (5)  Science (5)  Coding (5)  Creativity (4)  Opinion (4)  Math (5)  Philosophy (5)  Engineering (5)
OpenAI      o3-pro                 1,262  1170           1148         1160            1046         1097        1102            1136         1042      1129            1124
OpenAI      GPT-5-mini             1,242  1060           1084         1119            1188         1093        1115            1067         1107      1147            1105
OpenAI      GPT-OSS                1,226  1039           1078         1073            1084         1098        984             1123         1144      1136            1202
OpenAI      GPT-5                  1,167  1107           1121         1127            1066         1072        1205            1157         1025      1134            1161
OpenAI      o3                     1,135  1227           1124         1291            1054         953         1059            1249         1064      1057            1089
Qwen        Qwen-3-thinking        1,114  1083           939          1114            968          1030        976             1036         1144      1008            955
OpenAI      GPT-5-nano             992    1030           1087         1013            1050         1032        1015            988          977       1013            1060
Google      Gemini-2.5-pro         980    945            879          776             854          1034        1013            885          998       787             874
Anthropic   Claude-opus-4-1        959    953            1021         921             927          1173        962             898          956       1056            911
DeepSeek    R1                     946    1034           1015         1001            1101         848         1119            1097         1039      1066            968
DeepSeek    V3-2-thinking          934    969            982          1011            1032         988         1004            989          967       1033            963
Zai         GLM-4-6                933    1000           1000         1000            1000         1000        963             1000         1000      979             987
MoonshotAI  kimi-k2                919    902            985          957             1035         958         977             880          887       1035            997
OpenAI      o4-mini                901    1002           979          957             979          927         1075            1084         984       978             953
xAI         Grok-4-fast-reasoning  866    922            967          997             844          932         832             837          956       812             839
Google      Gemini-2.5-flash       848    786            870          871             938          1067        870             844          869       841             911
Qwen        Qwen-3-coder           835    827            848          785             912          836         852             840          974       899             996
Anthropic   Claude-sonnet-4        743    943            872          826             922          860         879             889          865       892             906