| Provider | Model | Total | Reasoning (4) | History (4) | Humanities (5) | Science (5) | Coding (5) | Creativity (4) | Opinion (4) | Math (5) | Philosophy (5) | Engineering (5) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| OpenAI | o3-pro | 1262 | 1170 | 1148 | 1160 | 1046 | 1097 | 1102 | 1136 | 1042 | 1129 | 1124 |
| OpenAI | GPT-5-mini | 1242 | 1060 | 1084 | 1119 | 1188 | 1093 | 1115 | 1067 | 1107 | 1147 | 1105 |
| OpenAI | GPT-OSS | 1226 | 1039 | 1078 | 1073 | 1084 | 1098 | 984 | 1123 | 1144 | 1136 | 1202 |
| OpenAI | GPT-5 | 1167 | 1107 | 1121 | 1127 | 1066 | 1072 | 1205 | 1157 | 1025 | 1134 | 1161 |
| OpenAI | o3 | 1135 | 1227 | 1124 | 1291 | 1054 | 953 | 1059 | 1249 | 1064 | 1057 | 1089 |
| Qwen | Qwen-3-thinking | 1114 | 1083 | 939 | 1114 | 968 | 1030 | 976 | 1036 | 1144 | 1008 | 955 |
| OpenAI | GPT-5-nano | 992 | 1030 | 1087 | 1013 | 1050 | 1032 | 1015 | 988 | 977 | 1013 | 1060 |
| Google | Gemini-2.5-pro | 980 | 945 | 879 | 776 | 854 | 1034 | 1013 | 885 | 998 | 787 | 874 |
| Anthropic | Claude-opus-4-1 | 959 | 953 | 1021 | 921 | 927 | 1173 | 962 | 898 | 956 | 1056 | 911 |
| DeepSeek | R1 | 946 | 1034 | 1015 | 1001 | 1101 | 848 | 1119 | 1097 | 1039 | 1066 | 968 |
| DeepSeek | V3-2-thinking | 934 | 969 | 982 | 1011 | 1032 | 988 | 1004 | 989 | 967 | 1033 | 963 |
| Zai | GLM-4-6 | 933 | 1000 | 1000 | 1000 | 1000 | 1000 | 963 | 1000 | 1000 | 979 | 987 |
| MoonshotAI | kimi-k2 | 919 | 902 | 985 | 957 | 1035 | 958 | 977 | 880 | 887 | 1035 | 997 |
| OpenAI | o4-mini | 901 | 1002 | 979 | 957 | 979 | 927 | 1075 | 1084 | 984 | 978 | 953 |
| xAI | Grok-4-fast-reasoning | 866 | 922 | 967 | 997 | 844 | 932 | 832 | 837 | 956 | 812 | 839 |
| Google | Gemini-2.5-flash | 848 | 786 | 870 | 871 | 938 | 1067 | 870 | 844 | 869 | 841 | 911 |
| Qwen | Qwen-3-coder | 835 | 827 | 848 | 785 | 912 | 836 | 852 | 840 | 974 | 899 | 996 |
| Anthropic | Claude-sonnet-4 | 743 | 943 | 872 | 826 | 922 | 860 | 879 | 889 | 865 | 892 | 906 |