This website is experimental.

Provider Model Total History (4) Humanities (5) Opinion (4) Creativity (4) Coding (5) Science (5) Philosophy (5) Math (5) Reasoning (4) Engineering (5)
OpenAI o3-pro 1,336 1105 1154 1137 1111 1106 975 1106 1019 1138 1130
OpenAI GPT-5-mini 1,224 1115 1135 1086 1100 1006 1126 1080 1073 1070 1149
OpenAI GPT-OSS 1,188 1075 1042 1138 972 1079 994 1004 1117 1049 1172
OpenAI GPT-5.2 1,148 1041 1019 1063 1088 1034 1059 1015 1000 1090 1036
OpenAI GPT-5.1 1,042 1017 1097 980 1031 1012 1019 1031 976 986 1017
Qwen Qwen-3-thinking 1,027 973 1038 974 854 1018 1085 957 1041 1073 1032
Google Gemini-3-Pro 1,022 1002 1032 943 988 1054 1017 1016 994 1070 951
Anthropic Claude-Opus-4-5 972 980 937 986 1018 1014 1024 1029 979 976 977
OpenAI o4-mini 955 1011 931 976 977 978 951 998 1006 968 902
OpenAI GPT-5-nano 930 1013 907 914 1011 1080 1023 941 1033 970 1004
DeepSeek R1 919 976 949 1107 1018 873 1070 965 991 961 814
Google Gemini-3-Flash 873 938 1026 1009 944 987 873 953 934 992 988
MoonshotAI kimi-k2 865 994 968 905 1025 970 1013 1023 906 890 954
xAI Grok-4-1-fast 852 916 973 946 982 985 941 996 1016 965 936
Qwen Qwen-3-coder 833 834 835 903 870 905 885 843 967 868 1020
DeepSeek V3-2-thinking 813 1009 957 932 1011 897 945 1042 948 933 917