This website is experimental.

Provider   Model            Total  History (4)  Humanities (5)  Opinion (4)  Creativity (4)  Coding (5)  Science (5)  Philosophy (5)  Math (5)  Reasoning (4)  Engineering (5)
DeepSeek   R1               992    998          988             1136         1039            891         1049         1046            975       978            814
OpenAI     GPT-OSS          1037   1037         985             1114         978             1020        1006         1045            1084      1017           1191
OpenAI     o3-pro           1232   1128         1182            1111         1019            1088        1037         1114            1030      1205           1147
OpenAI     GPT-5.2          1176   1039         1009            1040         1131            1081        1134         1041            1007      1056           1010
Moonshot   Kimi-k2.5        1113   1081         1088            1036         1043            1069        993          1048            1071      1058           1001
Google     Gemini-3-Flash   1020   907          1021            1020         901             1011        878          959             1018      1018           1043
OpenAI     GPT-5.1          1026   1074         1011            1011         1027            962         1050         1014            975       1005           1050
Qwen       Qwen-3-thinking  934    979          976             1007         858             986         1066         993             997       1013           1014
Google     Gemini-3-Pro     1045   1013         1020            983          1025            1027        1054         952             1063      996            1023
Anthropic  Claude-Opus-4-5  1085   963          971             979          1033            996         1042         1126            1000      993            1030
OpenAI     o4-mini          904    951          920             961          972             960         954          922             946       950            928
Z          GLM-5            826    987          901             944          956             1015        880          921             934       953            948
Z          GLM-4.7          890    970          1006            928          976             988         955          954             977       955            984
xAI        Grok-4-1-fast    892    897          966             917          983             1012        916          949             995       983            902
Moonshot   Kimi-k2          874    967          955             916          1015            944         1045         931             927       922            979
DeepSeek   V3-2-thinking    953    1007         1002            900          1046            949         941          984             1001      897            936