| Provider | Model | Total | History (4) | Humanities (5) | Opinion (4) | Creativity (4) | Coding (5) | Science (5) | Philosophy (5) | Math (5) | Reasoning (4) | Engineering (5) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| OpenAI | GPT-5.2 | 1,176 | 1039 | 1009 | 1040 | 1131 | 1081 | 1134 | 1041 | 1007 | 1056 | 1010 |
| Qwen | Qwen-3-thinking | 934 | 979 | 976 | 1007 | 858 | 986 | 1066 | 993 | 997 | 1013 | 1014 |
| Gemini-3-Pro | 1,045 | 1013 | 1020 | 983 | 1025 | 1027 | 1054 | 952 | 1063 | 996 | 1023 | |
| OpenAI | GPT-5.1 | 1,026 | 1074 | 1011 | 1011 | 1027 | 962 | 1050 | 1014 | 975 | 1005 | 1050 |
| DeepSeek | R1 | 992 | 998 | 988 | 1136 | 1039 | 891 | 1049 | 1046 | 975 | 978 | 814 |
| Moonshot | Kimi-k2 | 874 | 967 | 955 | 916 | 1015 | 944 | 1045 | 931 | 927 | 922 | 979 |
| Anthropic | Claude-Opus-4-5 | 1,085 | 963 | 971 | 979 | 1033 | 996 | 1042 | 1126 | 1000 | 993 | 1030 |
| OpenAI | o3-pro | 1,232 | 1128 | 1182 | 1111 | 1019 | 1088 | 1037 | 1114 | 1030 | 1205 | 1147 |
| OpenAI | GPT-OSS | 1,037 | 1037 | 985 | 1114 | 978 | 1020 | 1006 | 1045 | 1084 | 1017 | 1191 |
| Moonshot | Kimi-k2.5 | 1,113 | 1081 | 1088 | 1036 | 1043 | 1069 | 993 | 1048 | 1071 | 1058 | 1001 |
| Z | GLM-4.7 | 890 | 970 | 1006 | 928 | 976 | 988 | 955 | 954 | 977 | 955 | 984 |
| OpenAI | o4-mini | 904 | 951 | 920 | 961 | 972 | 960 | 954 | 922 | 946 | 950 | 928 |
| DeepSeek | V3-2-thinking | 953 | 1007 | 1002 | 900 | 1046 | 949 | 941 | 984 | 1001 | 897 | 936 |
| xAI | Grok-4-1-fast | 892 | 897 | 966 | 917 | 983 | 1012 | 916 | 949 | 995 | 983 | 902 |
| Z | GLM-5 | 826 | 987 | 901 | 944 | 956 | 1015 | 880 | 921 | 934 | 953 | 948 |
| Gemini-3-Flash | 1,020 | 907 | 1021 | 1020 | 901 | 1011 | 878 | 959 | 1018 | 1018 | 1043 |