| Provider | Model | Total | History (4) | Humanities (5) | Opinion (4) | Creativity (4) | Coding (5) | Science (5) | Philosophy (5) | Math (5) | Reasoning (4) | Engineering (5) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| OpenAI | GPT-OSS | 1,011 | 1052 | 968 | 1100 | 978 | 1020 | 1006 | 1045 | 1074 | 1018 | 1191 |
| OpenAI | o3-pro | 1,224 | 1128 | 1182 | 1098 | 1019 | 1088 | 1037 | 1114 | 1030 | 1205 | 1147 |
| OpenAI | GPT-5.1 | 1,041 | 1059 | 1011 | 1024 | 1027 | 962 | 1050 | 1014 | 975 | 1005 | 1073 |
| Google | Gemini-3-Flash | 1,001 | 907 | 1009 | 1020 | 901 | 1011 | 878 | 959 | 1018 | 1018 | 1032 |
| Anthropic | Claude-Opus-4-5 | 1,111 | 963 | 971 | 999 | 1033 | 996 | 1042 | 1126 | 1000 | 993 | 1030 |
| Google | Gemini-3-Pro | 1,082 | 1013 | 1038 | 983 | 1025 | 1045 | 1054 | 952 | 1049 | 996 | 1023 |
| Qwen | Qwen-3-thinking | 953 | 979 | 976 | 1007 | 858 | 986 | 1080 | 993 | 1011 | 1013 | 1014 |
| OpenAI | GPT-5.2 | 1,158 | 1039 | 1009 | 1040 | 1131 | 1070 | 1106 | 1041 | 1007 | 1056 | 1010 |
| Moonshot | Kimi-k2.5 | 1,113 | 1081 | 1088 | 1049 | 1043 | 1058 | 993 | 1048 | 1071 | 1042 | 1001 |
| Z | GLM-4.7 | 890 | 970 | 1006 | 928 | 976 | 988 | 955 | 954 | 977 | 955 | 984 |
| Moonshot | Kimi-k2 | 850 | 967 | 955 | 895 | 1015 | 955 | 1058 | 931 | 937 | 906 | 979 |
| Z | GLM-5 | 809 | 987 | 913 | 944 | 956 | 997 | 880 | 921 | 934 | 953 | 948 |
| DeepSeek | V3-2-thinking | 963 | 1007 | 1001 | 900 | 1046 | 949 | 941 | 984 | 1001 | 913 | 936 |
| xAI | Grok-4-1-fast | 885 | 897 | 966 | 917 | 983 | 1012 | 916 | 933 | 995 | 983 | 912 |
| OpenAI | o4-mini | 904 | 951 | 920 | 961 | 972 | 971 | 954 | 938 | 946 | 950 | 905 |
| DeepSeek | R1 | 1,008 | 998 | 988 | 1136 | 1039 | 891 | 1049 | 1046 | 975 | 993 | 814 |