This website is experimental.

Provider Model Total History (4) Humanities (5) Opinion (4) Creativity (4) Coding (5) Science (5) Philosophy (5) Math (5) Reasoning (4) Engineering (5)
Anthropic Claude-Opus-4-5 1,109 963 971 999 1019 996 1042 1126 1000 980 1030
OpenAI o3-pro 1,209 1128 1182 1088 1019 1088 1022 1114 1030 1205 1147
Moonshot Kimi-k2.5 1,108 1081 1072 1049 1043 1058 1009 1048 1071 1042 1001
DeepSeek R1 996 998 968 1136 1039 891 1049 1046 991 993 814
OpenAI GPT-OSS 1,011 1052 968 1100 978 1020 1006 1045 1074 1018 1191
OpenAI GPT-5.2 1,142 1039 996 1040 1131 1070 1106 1041 991 1056 1010
OpenAI GPT-5.1 1,032 1059 1011 1024 1027 962 1050 1014 975 1004 1073
Qwen Qwen-3-thinking 939 979 976 1007 858 986 1071 993 1011 1013 1014
DeepSeek V3-2-thinking 963 1007 1001 900 1046 949 941 984 1001 913 936
Google Gemini-3-Flash 1,012 907 1009 1020 912 1011 878 959 1018 1018 1032
Z GLM-4.7 907 991 1006 928 976 988 955 954 977 955 984
Google Gemini-3-Pro 1,108 1013 1074 983 1013 1045 1054 952 1049 996 1023
OpenAI o4-mini 910 951 933 961 972 971 954 938 946 950 905
xAI Grok-4-1-fast 882 876 966 917 983 1012 926 933 995 983 912
Moonshot Kimi-k2 856 967 955 895 1015 955 1058 931 937 919 979
Z GLM-5 817 987 913 954 970 997 880 921 934 953 948