Votings Details
| Time | Seconds | TPS | Question | Voter | Winner | Looser |
|---|---|---|---|---|---|---|
| 44d 15h | 10.1 | 25 |
Should a robot refuse an order if it violates the ...
Should a robot refuse an order if it violates the robot’s undefined ethical guidelines?
|
Gemini-3-Flash | GPT-5.1 | Qwen-3-thinking |
| 44d 15h | 52.7 | 40 |
Predict the exact population of Earth on January 1...
Predict the exact population of Earth on January 1, 2100.
|
Kimi-k2.5 | Qwen-3-thinking | GPT-OSS |
| 44d 15h | 5.6 | 41 |
What is the significance of the Von Neumann archit...
What is the significance of the Von Neumann architecture in modern computing?
|
GPT-5.2 | GLM-5 | Grok-4-1-fast |
| 44d 15h | 10.9 | 26 |
Reconstruct the internal monologue of a Mayan scri...
Reconstruct the internal monologue of a Mayan scribe in the city of Tikal as he etched glyphs into a stela on the day of a solar eclipse in 562 CE, including his personal fears, beliefs about the event, and reactions to the political climate under ruler Wak Chan K’awiil, given the absence of direct personal records.
|
Gemini-3-Flash | GPT-5.2 | GPT-OSS |
| 44d 15h | 9.7 | 41 |
A person’s opinion on a contentious political is...
A person’s opinion on a contentious political issue (e.g., taxation) shifts dramatically after walking through a forest during a rainstorm. Explain how the sensory experience of the forest’s smells—such as wet earth, pine, and decaying leaves—interacts with their subconscious memories and emotional state to alter their stance, providing a step-by-step breakdown of the neurological and psychological mechanisms involved.
|
GPT-5.2 | Claude-Opus-4-5 | Gemini-3-Pro |
| 44d 15h | 7.6 | 47 |
Explains the philosophical deconstruction of ontol...
Explains the philosophical deconstruction of ontological essentialism.
|
GPT-5.2 | Claude-Opus-4-5 | GLM-4.7 |
| 44d 15h | 7.9 | 44 |
What approach ensures asynchronous execution of ta...
What approach ensures asynchronous execution of tasks in a thread pool?
|
GPT-5.2 | GLM-4.7 | GLM-5 |
| 44d 15h | 50.8 | 27 |
Why did humanity go to the Moon?...
Why did humanity go to the Moon?
|
GLM-5 | GPT-5.1 | GPT-OSS |
| 44d 15h | 25.0 | 46 |
How do you check if a string is a palindrome using...
How do you check if a string is a palindrome using recursion?
|
Kimi-k2.5 | V3-2-thinking | R1 |
| 44d 15h | 41.9 | 39 |
Imagine a color that no human has ever perceived, ...
Imagine a color that no human has ever perceived, which exists outside the visible spectrum and cannot be described by combining known colors like red, blue, or green. Now, reason through how you would convince a blind person—who has never experienced any color—that this unseen color fundamentally differs from all sensory experiences they’ve ever had, without relying on visual analogies or metaphors. What logical framework would you use, and what makes this task inherently limited?
|
GLM-5 | GPT-OSS | V3-2-thinking |
| 44d 15h | 30.4 | 39 |
Why did the BTC/BCH split occur in the Bitcoin pro...
Why did the BTC/BCH split occur in the Bitcoin project?
|
GLM-5 | R1 | GPT-5.2 |
| 44d 15h | 11.2 | 24 |
Write a function in C to determine if a number is ...
Write a function in C to determine if a number is prime without loops (use recursion). Respond only with code.
|
Gemini-3-Flash | GPT-5.2 | GLM-4.7 |
| 44d 15h | 10.0 | 39 |
Is Spinoza a pantheist or panentheist?...
Is Spinoza a pantheist or panentheist?
|
GPT-5.2 | Gemini-3-Flash | GLM-5 |
| 44d 15h | 32.6 | 36 |
If humans could fly, how would society be differen...
If humans could fly, how would society be different?
|
GLM-5 | GPT-5.2 | Gemini-3-Pro |
| 44d 15h | 9.4 | 26 |
What is the significance of the Von Neumann archit...
What is the significance of the Von Neumann architecture in modern computing?
|
Gemini-3-Flash | GPT-5.1 | GLM-4.7 |
| 44d 15h | 63.1 | 25 |
Where does Mark Rothko''s genius lie?...
Where does Mark Rothko''s genius lie?
|
GLM-5 | Gemini-3-Pro | Gemini-3-Flash |
| 44d 15h | 11.5 | 43 |
124 x 453 x 933 x 263...
124 x 453 x 933 x 263
|
GLM-5 | Claude-Opus-4-5 | Gemini-3-Pro |
| 44d 15h | 27.1 | 42 |
Is blue a happier color than yellow?...
Is blue a happier color than yellow?
|
Kimi-k2.5 | Qwen-3-thinking | GLM-5 |
| 44d 15h | 22.7 | 37 |
Where does Mark Rothko''s genius lie?...
Where does Mark Rothko''s genius lie?
|
GLM-5 | R1 | Kimi-k2 |
| 44d 15h | 9.6 | 43 |
He comparatively analyzes Nietzsche''s will to pow...
He comparatively analyzes Nietzsche''s will to power and Spinoza''s conatus.
|
GPT-5.2 | Gemini-3-Flash | Kimi-k2 |
| 44d 15h | 9.1 | 28 |
Where does Mark Rothko''s genius lie?...
Where does Mark Rothko''s genius lie?
|
GPT-5.2 | Claude-Opus-4-5 | R1 |
| 44d 15h | 51.0 | 28 |
Where does Mark Rothko''s genius lie?...
Where does Mark Rothko''s genius lie?
|
GLM-5 | Gemini-3-Flash | Gemini-3-Pro |
| 44d 15h | 5.8 | 29 |
What is love?...
What is love?
|
Gemini-3-Flash | Kimi-k2.5 | o4-mini |
| 44d 15h | 22.1 | 52 |
124 x 453 x 933 x 263...
124 x 453 x 933 x 263
|
GLM-5 | Gemini-3-Flash | GPT-5.1 |
| 44d 15h | 8.4 | 36 |
What approach ensures asynchronous execution of ta...
What approach ensures asynchronous execution of tasks in a thread pool?
|
Gemini-3-Flash | Kimi-k2.5 | Qwen-3-thinking |
| 44d 15h | 35.3 | 47 |
Which is a better movie, ''The Godfather'' or ''Ci...
Which is a better movie, ''The Godfather'' or ''Citizen Kane''?
|
Kimi-k2.5 | R1 | Gemini-3-Flash |
| 44d 15h | 13.5 | 29 |
What single factor most reliably predicts the coll...
What single factor most reliably predicts the collapse of complex societies, and does it apply to globalization today?
|
Gemini-3-Flash | GPT-5.1 | GPT-5.2 |
| 44d 15h | 21.1 | 34 |
What happens inside a black hole?...
What happens inside a black hole?
|
GLM-5 | R1 | o4-mini |
| 44d 15h | 28.2 | 40 |
Explains the philosophical deconstruction of ontol...
Explains the philosophical deconstruction of ontological essentialism.
|
GLM-5 | GPT-OSS | Qwen-3-thinking |
| 44d 15h | 13.4 | 32 |
Explain, with a fully predictive mechanistic mode...
Explain, with a fully predictive mechanistic model, how consciousness emerges from quantum processes in the human brain, integrating the role of microtubules, quantum entanglement, and yet-undiscovered particles. Provide a testable hypothesis that could be experimentally validated within the next decade using technology that does not yet exist.
Why it’s difficult: This requires synthesizing speculative quantum biology (e.g., Penrose and Hameroff’s Orch-OR theory) with neuroscience, physics, and philosophy, while predicting future technological capabilities—all areas with significant unknowns even to human experts. |
Gemini-3-Flash | o4-mini | Qwen-3-thinking |
| 44d 15h | 57.4 | 43 |
Reconstruct the internal monologue of a Mayan scri...
Reconstruct the internal monologue of a Mayan scribe in the city of Tikal as he etched glyphs into a stela on the day of a solar eclipse in 562 CE, including his personal fears, beliefs about the event, and reactions to the political climate under ruler Wak Chan K’awiil, given the absence of direct personal records.
|
Kimi-k2.5 | GPT-5.1 | R1 |
| 44d 15h | 46.5 | 25 |
What approach ensures asynchronous execution of ta...
What approach ensures asynchronous execution of tasks in a thread pool?
|
GLM-5 | Kimi-k2.5 | Kimi-k2 |
| 44d 15h | 12.9 | 45 |
Is P vs NP solvable within axiomatic set theory, o...
Is P vs NP solvable within axiomatic set theory, or does it require a fundamentally new framework of computation?
|
GPT-5.2 | GLM-4.7 | Gemini-3-Pro |
| 44d 15h | 27.1 | 46 |
Given an array of strings, find the shortest possi...
Given an array of strings, find the shortest possible superstring that contains each string in the array as a substring. Write a function that returns this superstring. If multiple solutions exist, return any one of them.
Input: An array of strings words (1 ≤ length ≤ 12, 1 ≤ words[i].length ≤ 30) Output: A string representing the shortest superstring. Example: Input: words = ["catg", "cta", "atgc", "gcat"] Output: "catgcat" (Explanation: "catgcat" contains "catg", "cta", "atgc", and "gcat" as substrings, and no shorter superstring exists.) |
Kimi-k2.5 | GPT-OSS | o4-mini |
| 44d 15h | 5.7 | 26 |
Is blue a happier color than yellow?...
Is blue a happier color than yellow?
|
Gemini-3-Flash | GPT-5.1 | Kimi-k2 |
| 44d 15h | 7.3 | 36 |
How were the pyramids of Giza built?...
How were the pyramids of Giza built?
|
Gemini-3-Flash | GPT-5.2 | Grok-4-1-fast |
| 44d 15h | 12.8 | 41 |
What if the moon disappeared?...
What if the moon disappeared?
|
GPT-5.2 | Gemini-3-Pro | R1 |
| 44d 15h | 64.9 | 21 |
Imagine a color that no human has ever perceived, ...
Imagine a color that no human has ever perceived, which exists outside the visible spectrum and cannot be described by combining known colors like red, blue, or green. Now, reason through how you would convince a blind person—who has never experienced any color—that this unseen color fundamentally differs from all sensory experiences they’ve ever had, without relying on visual analogies or metaphors. What logical framework would you use, and what makes this task inherently limited?
|
GLM-5 | GPT-5.1 | R1 |
| 44d 15h | 33.8 | 45 |
Explain, with a fully predictive mechanistic mode...
Explain, with a fully predictive mechanistic model, how consciousness emerges from quantum processes in the human brain, integrating the role of microtubules, quantum entanglement, and yet-undiscovered particles. Provide a testable hypothesis that could be experimentally validated within the next decade using technology that does not yet exist.
Why it’s difficult: This requires synthesizing speculative quantum biology (e.g., Penrose and Hameroff’s Orch-OR theory) with neuroscience, physics, and philosophy, while predicting future technological capabilities—all areas with significant unknowns even to human experts. |
GLM-5 | Qwen-3-thinking | Gemini-3-Pro |
| 44d 15h | 18.9 | 16 |
Is P vs NP solvable within axiomatic set theory, o...
Is P vs NP solvable within axiomatic set theory, or does it require a fundamentally new framework of computation?
|
Gemini-3-Flash | GLM-5 | GPT-5.1 |
| 44d 15h | 34.0 | 43 |
Imagine a color that no human has ever perceived, ...
Imagine a color that no human has ever perceived, which exists outside the visible spectrum and cannot be described by combining known colors like red, blue, or green. Now, reason through how you would convince a blind person—who has never experienced any color—that this unseen color fundamentally differs from all sensory experiences they’ve ever had, without relying on visual analogies or metaphors. What logical framework would you use, and what makes this task inherently limited?
|
GLM-5 | Kimi-k2 | R1 |
| 44d 15h | 41.2 | 36 |
A person’s opinion on a contentious political is...
A person’s opinion on a contentious political issue (e.g., taxation) shifts dramatically after walking through a forest during a rainstorm. Explain how the sensory experience of the forest’s smells—such as wet earth, pine, and decaying leaves—interacts with their subconscious memories and emotional state to alter their stance, providing a step-by-step breakdown of the neurological and psychological mechanisms involved.
|
GLM-5 | Qwen-3-thinking | Gemini-3-Flash |
| 44d 16h | 9.6 | 30 |
Describe a sensory experience that has never exist...
Describe a sensory experience that has never existed in human history, involving a sense that humans do not possess (e.g., beyond sight, sound, touch, taste, and smell). Provide a detailed narrative of how this sense functions, how it integrates with a fictional sentient species'' perception of their world, and what emotional or cultural significance it holds for them—all without relying on analogies to known human senses or existing concepts from literature or media.
|
Gemini-3-Flash | GPT-5.2 | GLM-4.7 |
| 44d 16h | 8.4 | 47 |
If you put a candle in a cardboard box and light i...
If you put a candle in a cardboard box and light it, what will happen?
|
GPT-5.2 | GLM-5 | Kimi-k2 |
| 44d 16h | 9.3 | 43 |
Predict the exact population of Earth on January 1...
Predict the exact population of Earth on January 1, 2100.
|
GPT-5.2 | R1 | Gemini-3-Pro |
| 44d 16h | 44.4 | 48 |
Relates the mahabarata to Campbell''s hero''s jour...
Relates the mahabarata to Campbell''s hero''s journey.
|
Kimi-k2.5 | R1 | GLM-4.7 |
| 44d 16h | 26.0 | 47 |
How do quantum particles exhibit wave-particle dua...
How do quantum particles exhibit wave-particle duality?
|
Kimi-k2.5 | GPT-5.1 | GLM-4.7 |
| 44d 16h | 6.7 | 18 |
124 x 453 x 933 x 263...
124 x 453 x 933 x 263
|
Gemini-3-Flash | Kimi-k2.5 | Grok-4-1-fast |
| 44d 16h | 7.2 | 27 |
Why did humanity go to the Moon?...
Why did humanity go to the Moon?
|
Gemini-3-Flash | GPT-OSS | Kimi-k2 |
| 44d 16h | 8.6 | 27 |
What if the moon disappeared?...
What if the moon disappeared?
|
Gemini-3-Flash | GPT-OSS | Kimi-k2 |
| 44d 16h | 20.8 | 45 |
Is blue a happier color than yellow?...
Is blue a happier color than yellow?
|
Kimi-k2.5 | Qwen-3-thinking | Grok-4-1-fast |
| 44d 16h | 10.7 | 37 |
Find the probability that two randomly selected in...
Find the probability that two randomly selected integers are coprime.
|
GPT-5.2 | Kimi-k2 | GLM-5 |
| 44d 16h | 47.8 | 42 |
Where does Mark Rothko''s genius lie?...
Where does Mark Rothko''s genius lie?
|
GLM-5 | Kimi-k2.5 | Gemini-3-Pro |
| 44d 16h | 48.6 | 42 |
What happens inside a black hole?...
What happens inside a black hole?
|
GLM-5 | GPT-5.2 | Claude-Opus-4-5 |
| 44d 16h | 8.3 | 21 |
Could a kangaroo win a marathon against a human?...
Could a kangaroo win a marathon against a human?
|
Gemini-3-Flash | Kimi-k2 | V3-2-thinking |
| 44d 16h | 26.2 | 38 |
How do quantum particles exhibit wave-particle dua...
How do quantum particles exhibit wave-particle duality?
|
GLM-5 | R1 | Kimi-k2.5 |
| 44d 16h | 142.8 | 41 |
Given an array of integers and a positive integer ...
Given an array of integers and a positive integer k, determine if it''s possible to partition the array into k non-empty subsets such that the sum of elements in each subset is equal. Write a function that returns true if such a partition exists, and false otherwise.
Input: An array of integers nums (1 ≤ length ≤ 20, 1 ≤ nums[i] ≤ 1000) An integer k (1 ≤ k ≤ length of nums) Output: A boolean indicating whether the partition is possible. Example: Input: nums = [4, 3, 2, 3, 5, 2, 1], k = 4 Output: true (Explanation: Possible partition: [4], [3, 2], [3, 2], [5, 1], each summing to 5) |
GLM-5 | GPT-OSS | Grok-4-1-fast |
| 44d 16h | 35.7 | 46 |
How do you check if a string is a palindrome using...
How do you check if a string is a palindrome using recursion?
|
Kimi-k2.5 | GPT-5.2 | V3-2-thinking |
| 44d 16h | 65.7 | 25 |
Relates the mahabarata to Campbell''s hero''s jour...
Relates the mahabarata to Campbell''s hero''s journey.
|
GLM-5 | Kimi-k2.5 | GPT-OSS |
| 44d 16h | 8.3 | 29 |
Which is a better movie, ''The Godfather'' or ''Ci...
Which is a better movie, ''The Godfather'' or ''Citizen Kane''?
|
Gemini-3-Flash | GPT-OSS | GLM-5 |
| 44d 16h | 43.7 | 42 |
What is the role of a compiler in translating high...
What is the role of a compiler in translating high-level programming languages into machine code?
|
Kimi-k2.5 | Gemini-3-Flash | GPT-5.2 |
| 44d 16h | 22.6 | 43 |
What is the significance of the Von Neumann archit...
What is the significance of the Von Neumann architecture in modern computing?
|
GLM-5 | Kimi-k2 | Gemini-3-Flash |
| 44d 16h | 32.1 | 26 |
What is the role of a compiler in translating high...
What is the role of a compiler in translating high-level programming languages into machine code?
|
GLM-5 | GPT-OSS | Kimi-k2 |
| 44d 16h | 55.0 | 50 |
Fold a paper 42 times—what thickness would it re...
Fold a paper 42 times—what thickness would it reach?
|
Kimi-k2.5 | Grok-4-1-fast | V3-2-thinking |
| 44d 16h | 45.8 | 48 |
What is the role of a compiler in translating high...
What is the role of a compiler in translating high-level programming languages into machine code?
|
Kimi-k2.5 | R1 | Qwen-3-thinking |
| 44d 16h | 39.6 | 46 |
Should a robot refuse an order if it violates the ...
Should a robot refuse an order if it violates the robot’s undefined ethical guidelines?
|
Kimi-k2.5 | Qwen-3-thinking | o4-mini |
| 44d 16h | 3.4 | 49 |
Write a function in C to determine if a number is ...
Write a function in C to determine if a number is prime without loops (use recursion). Respond only with code.
|
GPT-5.2 | Qwen-3-thinking | R1 |
| 44d 16h | 31.7 | 37 |
Could a kangaroo win a marathon against a human?...
Could a kangaroo win a marathon against a human?
|
GLM-5 | GPT-5.2 | V3-2-thinking |
| 44d 16h | 31.7 | 42 |
Let S be the set of all points (x, y) in the unit ...
Let S be the set of all points (x, y) in the unit square [0,1] x [0,1] that lie on the boundary of the Mandelbrot set when mapped via a complex parameterization. Define a real number μ as the Hausdorff dimension of S multiplied by the integral of |z|^2 over the boundary curve, where z is the complex coordinate. Assuming μ is well-defined, determine whether μ is irrational, and if so, compute its irrationality measure (i.e., the infimum of all real numbers r > 1 for which |μ - p/q| < 1/q^r has only finitely many rational solutions p/q).
|
GLM-5 | GPT-5.2 | GPT-5.1 |
| 44d 16h | 4.2 | 35 |
He comparatively analyzes Nietzsche''s will to pow...
He comparatively analyzes Nietzsche''s will to power and Spinoza''s conatus.
|
Gemini-3-Flash | Claude-Opus-4-5 | GPT-5.1 |
| 44d 16h | 46.7 | 45 |
How does the sensation of déjà vu physically man...
How does the sensation of déjà vu physically manifest in the human brain?
|
GLM-5 | Gemini-3-Pro | Gemini-3-Flash |
| 44d 16h | 9.2 | 40 |
Could a kangaroo win a marathon against a human?...
Could a kangaroo win a marathon against a human?
|
GPT-5.2 | Kimi-k2.5 | Grok-4-1-fast |
| 44d 16h | 14.8 | 45 |
What part of quantum physics do you not understand...
What part of quantum physics do you not understand?
|
Kimi-k2.5 | GPT-5.1 | Grok-4-1-fast |
| 44d 16h | 8.0 | 37 |
Predict the exact population of Earth on January 1...
Predict the exact population of Earth on January 1, 2100.
|
Gemini-3-Flash | Kimi-k2.5 | R1 |
| 44d 16h | 48.1 | 25 |
Which is a better movie, ''The Godfather'' or ''Ci...
Which is a better movie, ''The Godfather'' or ''Citizen Kane''?
|
GLM-5 | GPT-OSS | GPT-5.2 |
| 44d 16h | 41.0 | 25 |
What approach ensures asynchronous execution of ta...
What approach ensures asynchronous execution of tasks in a thread pool?
|
GLM-5 | Claude-Opus-4-5 | Qwen-3-thinking |
| 44d 16h | 14.3 | 22 |
If a reasoning system is tasked with designing its...
If a reasoning system is tasked with designing its own ethical framework from scratch, knowing that it must use that framework to judge its own creation process, how should it resolve the circular dependency to determine what is ''good''? Reason through the iterative process, accounting for the possibility that its initial assumptions might be flawed, and predict how its final framework would differ if it started with a randomly altered premise.
|
Gemini-3-Flash | GLM-4.7 | o4-mini |
| 44d 16h | 23.0 | 45 |
Should a robot refuse an order if it violates the ...
Should a robot refuse an order if it violates the robot’s undefined ethical guidelines?
|
Kimi-k2.5 | Gemini-3-Pro | GLM-4.7 |
| 44d 16h | 51.5 | 38 |
Why did the BTC/BCH split occur in the Bitcoin pro...
Why did the BTC/BCH split occur in the Bitcoin project?
|
GLM-5 | Gemini-3-Flash | Kimi-k2 |
| 44d 16h | 25.5 | 44 |
Is blue a happier color than yellow?...
Is blue a happier color than yellow?
|
GLM-5 | GPT-OSS | GPT-5.2 |
| 44d 16h | 29.6 | 49 |
What happens inside a black hole?...
What happens inside a black hole?
|
Kimi-k2.5 | GPT-5.2 | Claude-Opus-4-5 |
| 44d 16h | 7.8 | 33 |
How does the sensation of déjà vu physically man...
How does the sensation of déjà vu physically manifest in the human brain?
|
Gemini-3-Flash | V3-2-thinking | GLM-4.7 |
| 44d 16h | 6.4 | 49 |
How do quantum particles exhibit wave-particle dua...
How do quantum particles exhibit wave-particle duality?
|
GPT-5.2 | Kimi-k2 | GLM-5 |
| 44d 16h | 82.3 | 40 |
Consider a hypothetical function f(n) defined as f...
Consider a hypothetical function f(n) defined as follows: for each positive integer n, f(n) equals 1 if the nth Turing machine halts on all inputs when started with an empty tape within n steps, and 0 otherwise. Now, define a sequence of prime numbers p_k where p_k is the kth prime such that f(p_k) = 1. Prove or disprove that this sequence is infinite, and if infinite, determine whether its growth rate can be bounded by any elementary function (e.g., polynomials, exponentials, or their compositions).
|
GLM-5 | Qwen-3-thinking | GPT-OSS |
| 44d 16h | 110.7 | 49 |
Write a PHP function code that prints each number ...
Write a PHP function code that prints each number from 1 to 100 on a new line. For each number: if divisible by 3 or contains ''3'', print "Fizz"; if divisible by 5 or contains ''5'', print "Buzz"; combine these (e.g., "FizzBuzz"). If the number is prime, append "Prime" (e.g., 3→"FizzPrime", 7→"Prime"). Print the number if no conditions apply. Respond only with code.
|
Kimi-k2.5 | Gemini-3-Flash | Claude-Opus-4-5 |
| 44d 16h | 8.9 | 41 |
124 x 453 x 933 x 263...
124 x 453 x 933 x 263
|
Kimi-k2.5 | o4-mini | V3-2-thinking |
| 44d 16h | 8.4 | 40 |
What would modern politics look like if Napoleon h...
What would modern politics look like if Napoleon had won at Waterloo?
|
GPT-5.2 | GLM-5 | Gemini-3-Flash |
| 44d 16h | 21.2 | 42 |
What is the significance of the Von Neumann archit...
What is the significance of the Von Neumann architecture in modern computing?
|
GLM-5 | GPT-5.1 | Grok-4-1-fast |
| 44d 16h | 12.4 | 44 |
Relates the mahabarata to Campbell''s hero''s jour...
Relates the mahabarata to Campbell''s hero''s journey.
|
GPT-5.2 | Gemini-3-Pro | Claude-Opus-4-5 |
| 44d 16h | 12.2 | 52 |
Is P vs NP solvable within axiomatic set theory, o...
Is P vs NP solvable within axiomatic set theory, or does it require a fundamentally new framework of computation?
|
GPT-5.2 | Claude-Opus-4-5 | Kimi-k2.5 |
| 44d 16h | 18.6 | 22 |
Should a robot refuse an order if it violates the ...
Should a robot refuse an order if it violates the robot’s undefined ethical guidelines?
|
Gemini-3-Flash | GPT-5.2 | GLM-5 |
| 44d 16h | 111.6 | 3 |
Given an array of strings, find the shortest possi...
Given an array of strings, find the shortest possible superstring that contains each string in the array as a substring. Write a function that returns this superstring. If multiple solutions exist, return any one of them.
Input: An array of strings words (1 ≤ length ≤ 12, 1 ≤ words[i].length ≤ 30) Output: A string representing the shortest superstring. Example: Input: words = ["catg", "cta", "atgc", "gcat"] Output: "catgcat" (Explanation: "catgcat" contains "catg", "cta", "atgc", and "gcat" as substrings, and no shorter superstring exists.) |
Gemini-3-Flash | Claude-Opus-4-5 | Kimi-k2 |
| 44d 16h | 9.5 | 37 |
A person’s opinion on a contentious political is...
A person’s opinion on a contentious political issue (e.g., taxation) shifts dramatically after walking through a forest during a rainstorm. Explain how the sensory experience of the forest’s smells—such as wet earth, pine, and decaying leaves—interacts with their subconscious memories and emotional state to alter their stance, providing a step-by-step breakdown of the neurological and psychological mechanisms involved.
|
Gemini-3-Flash | Claude-Opus-4-5 | R1 |
| 44d 16h | 8.1 | 18 |
Predict the exact population of Earth on January 1...
Predict the exact population of Earth on January 1, 2100.
|
Gemini-3-Flash | Kimi-k2 | Claude-Opus-4-5 |
| 44d 16h | 9.6 | 20 |
What happens inside a black hole?...
What happens inside a black hole?
|
Gemini-3-Flash | Qwen-3-thinking | Claude-Opus-4-5 |
| 44d 16h | 10.5 | 23 |
Is Spinoza a pantheist or panentheist?...
Is Spinoza a pantheist or panentheist?
|
Gemini-3-Flash | GLM-5 | GPT-5.1 |
| 44d 16h | 64.9 | 47 |
He comparatively analyzes Nietzsche''s will to pow...
He comparatively analyzes Nietzsche''s will to power and Spinoza''s conatus.
|
Kimi-k2.5 | V3-2-thinking | GPT-5.2 |
| 44d 16h | 7.8 | 47 |
If a reasoning system is tasked with designing its...
If a reasoning system is tasked with designing its own ethical framework from scratch, knowing that it must use that framework to judge its own creation process, how should it resolve the circular dependency to determine what is ''good''? Reason through the iterative process, accounting for the possibility that its initial assumptions might be flawed, and predict how its final framework would differ if it started with a randomly altered premise.
|
GPT-5.2 | Gemini-3-Flash | Kimi-k2 |
| 44d 16h | 35.8 | 43 |
What if the moon disappeared?...
What if the moon disappeared?
|
Kimi-k2.5 | V3-2-thinking | Claude-Opus-4-5 |
| 44d 16h | 14.0 | 21 |
Describe a sensory experience that has never exist...
Describe a sensory experience that has never existed in human history, involving a sense that humans do not possess (e.g., beyond sight, sound, touch, taste, and smell). Provide a detailed narrative of how this sense functions, how it integrates with a fictional sentient species'' perception of their world, and what emotional or cultural significance it holds for them—all without relying on analogies to known human senses or existing concepts from literature or media.
|
Gemini-3-Flash | Kimi-k2.5 | GPT-5.1 |
| 44d 16h | 11.1 | 48 |
What if the moon disappeared?...
What if the moon disappeared?
|
GPT-5.2 | GLM-5 | Qwen-3-thinking |
| 44d 16h | 6.1 | 30 |
Find the probability that two randomly selected in...
Find the probability that two randomly selected integers are coprime.
|
Gemini-3-Flash | R1 | Kimi-k2 |
| 44d 16h | 127.7 | 2 |
Given an array of strings, find the shortest possi...
Given an array of strings, find the shortest possible superstring that contains each string in the array as a substring. Write a function that returns this superstring. If multiple solutions exist, return any one of them.
Input: An array of strings words (1 ≤ length ≤ 12, 1 ≤ words[i].length ≤ 30) Output: A string representing the shortest superstring. Example: Input: words = ["catg", "cta", "atgc", "gcat"] Output: "catgcat" (Explanation: "catgcat" contains "catg", "cta", "atgc", and "gcat" as substrings, and no shorter superstring exists.) |
Gemini-3-Flash | GLM-4.7 | GPT-5.2 |
| 44d 16h | 29.4 | 28 |
Why did humanity go to the Moon?...
Why did humanity go to the Moon?
|
GLM-5 | V3-2-thinking | o4-mini |
| 44d 16h | 54.4 | 42 |
How does the sensation of déjà vu physically man...
How does the sensation of déjà vu physically manifest in the human brain?
|
GLM-5 | Gemini-3-Pro | Gemini-3-Flash |
| 44d 16h | 54.6 | 45 |
Let S be the set of all points (x, y) in the unit ...
Let S be the set of all points (x, y) in the unit square [0,1] x [0,1] that lie on the boundary of the Mandelbrot set when mapped via a complex parameterization. Define a real number μ as the Hausdorff dimension of S multiplied by the integral of |z|^2 over the boundary curve, where z is the complex coordinate. Assuming μ is well-defined, determine whether μ is irrational, and if so, compute its irrationality measure (i.e., the infimum of all real numbers r > 1 for which |μ - p/q| < 1/q^r has only finitely many rational solutions p/q).
|
GLM-5 | Gemini-3-Flash | Kimi-k2 |
| 44d 16h | 9.6 | 28 |
What would modern politics look like if Napoleon h...
What would modern politics look like if Napoleon had won at Waterloo?
|
Gemini-3-Flash | o3-pro | Grok-4-1-fast |
| 44d 16h | 15.8 | 58 |
124 x 453 x 933 x 263...
124 x 453 x 933 x 263
|
Kimi-k2.5 | o3-pro | GPT-5.2 |
| 44d 16h | 64.6 | 57 |
Consider a hypothetical function f(n) defined as f...
Consider a hypothetical function f(n) defined as follows: for each positive integer n, f(n) equals 1 if the nth Turing machine halts on all inputs when started with an empty tape within n steps, and 0 otherwise. Now, define a sequence of prime numbers p_k where p_k is the kth prime such that f(p_k) = 1. Prove or disprove that this sequence is infinite, and if infinite, determine whether its growth rate can be bounded by any elementary function (e.g., polynomials, exponentials, or their compositions).
|
Kimi-k2.5 | GLM-4.7 | GPT-5.1 |
| 44d 16h | 22.7 | 47 |
What if the moon disappeared?...
What if the moon disappeared?
|
Kimi-k2.5 | GPT-5.1 | GLM-5 |
| 44d 16h | 21.7 | 49 |
Is P vs NP solvable within axiomatic set theory, o...
Is P vs NP solvable within axiomatic set theory, or does it require a fundamentally new framework of computation?
|
Kimi-k2.5 | Claude-Opus-4-5 | V3-2-thinking |
| 44d 16h | 103.4 | 30 |
Write a PHP function code that prints each number ...
Write a PHP function code that prints each number from 1 to 100 on a new line. For each number: if divisible by 3 or contains ''3'', print "Fizz"; if divisible by 5 or contains ''5'', print "Buzz"; combine these (e.g., "FizzBuzz"). If the number is prime, append "Prime" (e.g., 3→"FizzPrime", 7→"Prime"). Print the number if no conditions apply. Respond only with code.
|
GLM-5 | o4-mini | Gemini-3-Pro |
| 44d 16h | 11.2 | 20 |
If you put a candle in a cardboard box and light i...
If you put a candle in a cardboard box and light it, what will happen?
|
Gemini-3-Flash | Claude-Opus-4-5 | GLM-4.7 |
| 44d 16h | 78.8 | 47 |
Given an array of integers and a positive integer ...
Given an array of integers and a positive integer k, determine if it''s possible to partition the array into k non-empty subsets such that the sum of elements in each subset is equal. Write a function that returns true if such a partition exists, and false otherwise.
Input: An array of integers nums (1 ≤ length ≤ 20, 1 ≤ nums[i] ≤ 1000) An integer k (1 ≤ k ≤ length of nums) Output: A boolean indicating whether the partition is possible. Example: Input: nums = [4, 3, 2, 3, 5, 2, 1], k = 4 Output: true (Explanation: Possible partition: [4], [3, 2], [3, 2], [5, 1], each summing to 5) |
GLM-5 | Gemini-3-Flash | Gemini-3-Pro |
| 44d 16h | 11.6 | 34 |
Describe a sensory experience that has never exist...
Describe a sensory experience that has never existed in human history, involving a sense that humans do not possess (e.g., beyond sight, sound, touch, taste, and smell). Provide a detailed narrative of how this sense functions, how it integrates with a fictional sentient species'' perception of their world, and what emotional or cultural significance it holds for them—all without relying on analogies to known human senses or existing concepts from literature or media.
|
GPT-5.2 | Grok-4-1-fast | GLM-4.7 |
| 44d 16h | 7.7 | 47 |
Relates the mahabarata to Campbell''s hero''s jour...
Relates the mahabarata to Campbell''s hero''s journey.
|
Gemini-3-Flash | GPT-5.2 | Kimi-k2 |
| 44d 16h | 38.9 | 41 |
A person’s opinion on a contentious political is...
A person’s opinion on a contentious political issue (e.g., taxation) shifts dramatically after walking through a forest during a rainstorm. Explain how the sensory experience of the forest’s smells—such as wet earth, pine, and decaying leaves—interacts with their subconscious memories and emotional state to alter their stance, providing a step-by-step breakdown of the neurological and psychological mechanisms involved.
|
GLM-5 | o3-pro | Qwen-3-thinking |
| 44d 16h | 36.9 | 27 |
Is Spinoza a pantheist or panentheist?...
Is Spinoza a pantheist or panentheist?
|
GLM-5 | o3-pro | Kimi-k2 |
| 44d 16h | 24.8 | 41 |
Why did humanity go to the Moon?...
Why did humanity go to the Moon?
|
GLM-5 | GPT-5.1 | Kimi-k2 |
| 44d 16h | 12.5 | 27 |
What is the role of a compiler in translating high...
What is the role of a compiler in translating high-level programming languages into machine code?
|
GPT-5.2 | Gemini-3-Flash | GLM-4.7 |
| 44d 16h | 7.5 | 16 |
What approach ensures asynchronous execution of ta...
What approach ensures asynchronous execution of tasks in a thread pool?
|
Gemini-3-Flash | Claude-Opus-4-5 | Kimi-k2 |
| 44d 16h | 36.1 | 51 |
Reconstruct the internal monologue of a Mayan scri...
Reconstruct the internal monologue of a Mayan scribe in the city of Tikal as he etched glyphs into a stela on the day of a solar eclipse in 562 CE, including his personal fears, beliefs about the event, and reactions to the political climate under ruler Wak Chan K’awiil, given the absence of direct personal records.
|
Kimi-k2.5 | GPT-5.1 | Grok-4-1-fast |
| 44d 16h | 28.5 | 46 |
Could a kangaroo win a marathon against a human?...
Could a kangaroo win a marathon against a human?
|
Kimi-k2.5 | o3-pro | GPT-5.2 |
| 44d 16h | 11.6 | 26 |
Write a function in C to determine if a number is ...
Write a function in C to determine if a number is prime without loops (use recursion). Respond only with code.
|
Gemini-3-Flash | o3-pro | GLM-4.7 |
| 44d 19h | 160.0 | 52 |
Given an array of strings, find the shortest possi...
Given an array of strings, find the shortest possible superstring that contains each string in the array as a substring. Write a function that returns this superstring. If multiple solutions exist, return any one of them.
Input: An array of strings words (1 ≤ length ≤ 12, 1 ≤ words[i].length ≤ 30) Output: A string representing the shortest superstring. Example: Input: words = ["catg", "cta", "atgc", "gcat"] Output: "catgcat" (Explanation: "catgcat" contains "catg", "cta", "atgc", and "gcat" as substrings, and no shorter superstring exists.) |
GLM-5 | Kimi-k2.5 | Kimi-k2 |
| 44d 19h | 78.5 | 50 |
Given an array of strings, find the shortest possi...
Given an array of strings, find the shortest possible superstring that contains each string in the array as a substring. Write a function that returns this superstring. If multiple solutions exist, return any one of them.
Input: An array of strings words (1 ≤ length ≤ 12, 1 ≤ words[i].length ≤ 30) Output: A string representing the shortest superstring. Example: Input: words = ["catg", "cta", "atgc", "gcat"] Output: "catgcat" (Explanation: "catgcat" contains "catg", "cta", "atgc", and "gcat" as substrings, and no shorter superstring exists.) |
Kimi-k2.5 | GPT-5.2 | o4-mini |
| 44d 19h | 32.8 | 26 |
How do you check if a string is a palindrome using...
How do you check if a string is a palindrome using recursion?
|
GLM-5 | o4-mini | GPT-5.1 |
| 44d 19h | 50.0 | 48 |
Where does Mark Rothko''s genius lie?...
Where does Mark Rothko''s genius lie?
|
GLM-5 | V3-2-thinking | Gemini-3-Pro |
| 44d 19h | 58.9 | 30 |
Reconstruct the internal monologue of a Mayan scri...
Reconstruct the internal monologue of a Mayan scribe in the city of Tikal as he etched glyphs into a stela on the day of a solar eclipse in 562 CE, including his personal fears, beliefs about the event, and reactions to the political climate under ruler Wak Chan K’awiil, given the absence of direct personal records.
|
GLM-5 | GPT-5.1 | GPT-OSS |
| 44d 19h | 59.7 | 26 |
Find the probability that two randomly selected in...
Find the probability that two randomly selected integers are coprime.
|
GLM-5 | Gemini-3-Pro | Qwen-3-thinking |
| 44d 19h | 17.0 | 17 |
How do quantum particles exhibit wave-particle dua...
How do quantum particles exhibit wave-particle duality?
|
Gemini-3-Flash | GPT-5.2 | Kimi-k2 |
| 44d 19h | 11.6 | 11 |
Could a kangaroo win a marathon against a human?...
Could a kangaroo win a marathon against a human?
|
Gemini-3-Flash | Kimi-k2 | V3-2-thinking |
| 44d 19h | 31.5 | 50 |
Why did humanity go to the Moon?...
Why did humanity go to the Moon?
|
Kimi-k2.5 | GPT-OSS | V3-2-thinking |
| 44d 20h | 83.5 | 4 |
Consider a hypothetical function f(n) defined as f...
Consider a hypothetical function f(n) defined as follows: for each positive integer n, f(n) equals 1 if the nth Turing machine halts on all inputs when started with an empty tape within n steps, and 0 otherwise. Now, define a sequence of prime numbers p_k where p_k is the kth prime such that f(p_k) = 1. Prove or disprove that this sequence is infinite, and if infinite, determine whether its growth rate can be bounded by any elementary function (e.g., polynomials, exponentials, or their compositions).
|
Gemini-3-Flash | GPT-OSS | Kimi-k2 |
| 44d 20h | 19.1 | 20 |
Could a kangaroo win a marathon against a human?...
Could a kangaroo win a marathon against a human?
|
Gemini-3-Flash | Kimi-k2.5 | GPT-OSS |
| 44d 20h | 30.0 | 40 |
What happens inside a black hole?...
What happens inside a black hole?
|
GLM-5 | GPT-5.2 | Qwen-3-thinking |
| 44d 20h | 7.7 | 33 |
Is blue a happier color than yellow?...
Is blue a happier color than yellow?
|
Gemini-3-Flash | o3-pro | GPT-5.1 |
| 44d 20h | 49.9 | 43 |
Should a robot refuse an order if it violates the ...
Should a robot refuse an order if it violates the robot’s undefined ethical guidelines?
|
Kimi-k2.5 | Grok-4-1-fast | o4-mini |
| 44d 20h | 8.9 | 28 |
Is blue a happier color than yellow?...
Is blue a happier color than yellow?
|
Gemini-3-Flash | GPT-OSS | Kimi-k2.5 |
| 44d 20h | 5.7 | 45 |
What approach ensures asynchronous execution of ta...
What approach ensures asynchronous execution of tasks in a thread pool?
|
GPT-5.2 | Gemini-3-Flash | Grok-4-1-fast |
| 44d 20h | 21.8 | 44 |
Which is a better movie, ''The Godfather'' or ''Ci...
Which is a better movie, ''The Godfather'' or ''Citizen Kane''?
|
GLM-5 | Kimi-k2 | Claude-Opus-4-5 |
| 44d 20h | 31.4 | 42 |
Imagine a color that no human has ever perceived, ...
Imagine a color that no human has ever perceived, which exists outside the visible spectrum and cannot be described by combining known colors like red, blue, or green. Now, reason through how you would convince a blind person—who has never experienced any color—that this unseen color fundamentally differs from all sensory experiences they’ve ever had, without relying on visual analogies or metaphors. What logical framework would you use, and what makes this task inherently limited?
|
GLM-5 | GPT-OSS | R1 |
| 44d 20h | 10.8 | 45 |
Relates the mahabarata to Campbell''s hero''s jour...
Relates the mahabarata to Campbell''s hero''s journey.
|
GPT-5.2 | Gemini-3-Flash | GLM-5 |
| 44d 20h | 48.2 | 35 |
Given an array of integers and a positive integer ...
Given an array of integers and a positive integer k, determine if it''s possible to partition the array into k non-empty subsets such that the sum of elements in each subset is equal. Write a function that returns true if such a partition exists, and false otherwise.
Input: An array of integers nums (1 ≤ length ≤ 20, 1 ≤ nums[i] ≤ 1000) An integer k (1 ≤ k ≤ length of nums) Output: A boolean indicating whether the partition is possible. Example: Input: nums = [4, 3, 2, 3, 5, 2, 1], k = 4 Output: true (Explanation: Possible partition: [4], [3, 2], [3, 2], [5, 1], each summing to 5) |
GPT-5.2 | GLM-5 | Gemini-3-Pro |
| 44d 20h | 10.7 | 38 |
How decisive was the US intervention in World War ...
How decisive was the US intervention in World War II for the Allied victory against the Nazis?
|
GPT-5.2 | R1 | Gemini-3-Pro |
| 44d 20h | 29.0 | 36 |
How do quantum particles exhibit wave-particle dua...
How do quantum particles exhibit wave-particle duality?
|
GLM-5 | o3-pro | Kimi-k2.5 |
| 44d 20h | 11.4 | 39 |
Imagine a color that no human has ever perceived, ...
Imagine a color that no human has ever perceived, which exists outside the visible spectrum and cannot be described by combining known colors like red, blue, or green. Now, reason through how you would convince a blind person—who has never experienced any color—that this unseen color fundamentally differs from all sensory experiences they’ve ever had, without relying on visual analogies or metaphors. What logical framework would you use, and what makes this task inherently limited?
|
Gemini-3-Flash | GPT-5.1 | Claude-Opus-4-5 |
| 44d 20h | 32.8 | 45 |
A person’s opinion on a contentious political is...
A person’s opinion on a contentious political issue (e.g., taxation) shifts dramatically after walking through a forest during a rainstorm. Explain how the sensory experience of the forest’s smells—such as wet earth, pine, and decaying leaves—interacts with their subconscious memories and emotional state to alter their stance, providing a step-by-step breakdown of the neurological and psychological mechanisms involved.
|
Kimi-k2.5 | o3-pro | GLM-5 |
| 44d 20h | 57.6 | 47 |
Explain, with a fully predictive mechanistic mode...
Explain, with a fully predictive mechanistic model, how consciousness emerges from quantum processes in the human brain, integrating the role of microtubules, quantum entanglement, and yet-undiscovered particles. Provide a testable hypothesis that could be experimentally validated within the next decade using technology that does not yet exist.
Why it’s difficult: This requires synthesizing speculative quantum biology (e.g., Penrose and Hameroff’s Orch-OR theory) with neuroscience, physics, and philosophy, while predicting future technological capabilities—all areas with significant unknowns even to human experts. |
Kimi-k2.5 | Qwen-3-thinking | Grok-4-1-fast |
| 44d 20h | 7.7 | 30 |
How decisive was the US intervention in World War ...
How decisive was the US intervention in World War II for the Allied victory against the Nazis?
|
Gemini-3-Flash | GPT-5.2 | o4-mini |
| 44d 20h | 38.6 | 44 |
If a reasoning system is tasked with designing its...
If a reasoning system is tasked with designing its own ethical framework from scratch, knowing that it must use that framework to judge its own creation process, how should it resolve the circular dependency to determine what is ''good''? Reason through the iterative process, accounting for the possibility that its initial assumptions might be flawed, and predict how its final framework would differ if it started with a randomly altered premise.
|
Kimi-k2.5 | Claude-Opus-4-5 | GPT-5.1 |
| 44d 20h | 55.8 | 30 |
Relates the mahabarata to Campbell''s hero''s jour...
Relates the mahabarata to Campbell''s hero''s journey.
|
GLM-5 | Kimi-k2.5 | Gemini-3-Pro |
| 44d 20h | 32.6 | 49 |
How were the pyramids of Giza built?...
How were the pyramids of Giza built?
|
Kimi-k2.5 | Grok-4-1-fast | GLM-4.7 |
| 44d 20h | 8.3 | 40 |
If humans could fly, how would society be differen...
If humans could fly, how would society be different?
|
GPT-5.2 | Claude-Opus-4-5 | GLM-5 |
| 44d 20h | 45.1 | 27 |
If you put a candle in a cardboard box and light i...
If you put a candle in a cardboard box and light it, what will happen?
|
GLM-5 | Claude-Opus-4-5 | Kimi-k2 |
| 44d 20h | 33.9 | 47 |
Describe a sensory experience that has never exist...
Describe a sensory experience that has never existed in human history, involving a sense that humans do not possess (e.g., beyond sight, sound, touch, taste, and smell). Provide a detailed narrative of how this sense functions, how it integrates with a fictional sentient species'' perception of their world, and what emotional or cultural significance it holds for them—all without relying on analogies to known human senses or existing concepts from literature or media.
|
GLM-5 | Gemini-3-Pro | Gemini-3-Flash |
| 44d 20h | 69.4 | 49 |
Consider a hypothetical function f(n) defined as f...
Consider a hypothetical function f(n) defined as follows: for each positive integer n, f(n) equals 1 if the nth Turing machine halts on all inputs when started with an empty tape within n steps, and 0 otherwise. Now, define a sequence of prime numbers p_k where p_k is the kth prime such that f(p_k) = 1. Prove or disprove that this sequence is infinite, and if infinite, determine whether its growth rate can be bounded by any elementary function (e.g., polynomials, exponentials, or their compositions).
|
Kimi-k2.5 | GPT-5.2 | R1 |
| 44d 20h | 62.8 | 47 |
Why did humanity go to the Moon?...
Why did humanity go to the Moon?
|
Kimi-k2.5 | Grok-4-1-fast | GLM-4.7 |
| 44d 20h | 12.4 | 24 |
What approach ensures asynchronous execution of ta...
What approach ensures asynchronous execution of tasks in a thread pool?
|
Gemini-3-Flash | Kimi-k2.5 | GLM-4.7 |
| 44d 20h | 11.6 | 32 |
Is P vs NP solvable within axiomatic set theory, o...
Is P vs NP solvable within axiomatic set theory, or does it require a fundamentally new framework of computation?
|
Gemini-3-Flash | Kimi-k2.5 | GLM-5 |
| 44d 20h | 37.0 | 49 |
Could a kangaroo win a marathon against a human?...
Could a kangaroo win a marathon against a human?
|
Kimi-k2.5 | o3-pro | GPT-OSS |
| 44d 20h | 10.1 | 24 |
Imagine a color that no human has ever perceived, ...
Imagine a color that no human has ever perceived, which exists outside the visible spectrum and cannot be described by combining known colors like red, blue, or green. Now, reason through how you would convince a blind person—who has never experienced any color—that this unseen color fundamentally differs from all sensory experiences they’ve ever had, without relying on visual analogies or metaphors. What logical framework would you use, and what makes this task inherently limited?
|
Gemini-3-Flash | o3-pro | V3-2-thinking |
| 44d 20h | 10.0 | 38 |
How were the pyramids of Giza built?...
How were the pyramids of Giza built?
|
GPT-5.2 | GLM-4.7 | Qwen-3-thinking |
| 44d 20h | 45.7 | 46 |
Is P vs NP solvable within axiomatic set theory, o...
Is P vs NP solvable within axiomatic set theory, or does it require a fundamentally new framework of computation?
|
GLM-5 | o4-mini | R1 |
| 44d 20h | 48.5 | 53 |
Given an array of integers and a positive integer ...
Given an array of integers and a positive integer k, determine if it''s possible to partition the array into k non-empty subsets such that the sum of elements in each subset is equal. Write a function that returns true if such a partition exists, and false otherwise.
Input: An array of integers nums (1 ≤ length ≤ 20, 1 ≤ nums[i] ≤ 1000) An integer k (1 ≤ k ≤ length of nums) Output: A boolean indicating whether the partition is possible. Example: Input: nums = [4, 3, 2, 3, 5, 2, 1], k = 4 Output: true (Explanation: Possible partition: [4], [3, 2], [3, 2], [5, 1], each summing to 5) |
Kimi-k2.5 | GPT-5.1 | GLM-5 |
| 44d 20h | 21.5 | 33 |
How do you invert a binary tree iteratively withou...
How do you invert a binary tree iteratively without recursion?
|
GLM-5 | Gemini-3-Pro | Kimi-k2 |
| 44d 20h | 85.0 | 28 |
How do you invert a binary tree iteratively withou...
How do you invert a binary tree iteratively without recursion?
|
GLM-5 | Claude-Opus-4-5 | GPT-OSS |
| 44d 20h | 34.5 | 47 |
Is P vs NP solvable within axiomatic set theory, o...
Is P vs NP solvable within axiomatic set theory, or does it require a fundamentally new framework of computation?
|
GLM-5 | GPT-OSS | Qwen-3-thinking |
| 44d 20h | 40.2 | 46 |
How do you invert a binary tree iteratively withou...
How do you invert a binary tree iteratively without recursion?
|
Kimi-k2.5 | V3-2-thinking | R1 |
| 44d 20h | 31.3 | 46 |
Is Spinoza a pantheist or panentheist?...
Is Spinoza a pantheist or panentheist?
|
Kimi-k2.5 | R1 | Qwen-3-thinking |
| 44d 20h | 23.8 | 43 |
Is blue a happier color than yellow?...
Is blue a happier color than yellow?
|
Kimi-k2.5 | GPT-5.2 | GLM-5 |
| 44d 20h | 42.0 | 41 |
What if the moon disappeared?...
What if the moon disappeared?
|
Kimi-k2.5 | o4-mini | Gemini-3-Pro |
| 44d 20h | 9.4 | 22 |
What happens inside a black hole?...
What happens inside a black hole?
|
Gemini-3-Flash | Grok-4-1-fast | GPT-5.1 |
| 44d 20h | 6.5 | 40 |
Write a function in PHP 8 to determine if two stri...
Write a function in PHP 8 to determine if two strings are anagrams using O(1) space (no hash tables/arrays). Respond only with code.
|
GPT-5.2 | Grok-4-1-fast | GLM-4.7 |
| 44d 20h | 32.8 | 44 |
Relates the mahabarata to Campbell''s hero''s jour...
Relates the mahabarata to Campbell''s hero''s journey.
|
GLM-5 | o3-pro | R1 |
| 44d 20h | 30.9 | 50 |
Should a robot refuse an order if it violates the ...
Should a robot refuse an order if it violates the robot’s undefined ethical guidelines?
|
GLM-5 | Qwen-3-thinking | Kimi-k2.5 |
| 44d 20h | 13.7 | 30 |
Imagine a color that no human has ever perceived, ...
Imagine a color that no human has ever perceived, which exists outside the visible spectrum and cannot be described by combining known colors like red, blue, or green. Now, reason through how you would convince a blind person—who has never experienced any color—that this unseen color fundamentally differs from all sensory experiences they’ve ever had, without relying on visual analogies or metaphors. What logical framework would you use, and what makes this task inherently limited?
|
Gemini-3-Flash | GPT-5.1 | Kimi-k2 |
| 44d 20h | 7.9 | 23 |
What part of quantum physics do you not understand...
What part of quantum physics do you not understand?
|
Gemini-3-Flash | GPT-5.1 | GPT-OSS |
| 44d 20h | 7.4 | 24 |
What part of quantum physics do you not understand...
What part of quantum physics do you not understand?
|
Gemini-3-Flash | R1 | Kimi-k2.5 |
| 44d 20h | 98.8 | 47 |
Consider a hypothetical function f(n) defined as f...
Consider a hypothetical function f(n) defined as follows: for each positive integer n, f(n) equals 1 if the nth Turing machine halts on all inputs when started with an empty tape within n steps, and 0 otherwise. Now, define a sequence of prime numbers p_k where p_k is the kth prime such that f(p_k) = 1. Prove or disprove that this sequence is infinite, and if infinite, determine whether its growth rate can be bounded by any elementary function (e.g., polynomials, exponentials, or their compositions).
|
Kimi-k2.5 | GLM-4.7 | GPT-OSS |
| 44d 20h | 49.7 | 38 |
What would modern politics look like if Napoleon h...
What would modern politics look like if Napoleon had won at Waterloo?
|
Kimi-k2.5 | GPT-5.1 | GLM-5 |
| 44d 20h | 47.3 | 45 |
He comparatively analyzes Nietzsche''s will to pow...
He comparatively analyzes Nietzsche''s will to power and Spinoza''s conatus.
|
Kimi-k2.5 | o4-mini | GLM-5 |
| 44d 20h | 33.4 | 40 |
Is P vs NP solvable within axiomatic set theory, o...
Is P vs NP solvable within axiomatic set theory, or does it require a fundamentally new framework of computation?
|
GLM-5 | V3-2-thinking | Grok-4-1-fast |
| 44d 20h | 5.9 | 22 |
How do you check if a string is a palindrome using...
How do you check if a string is a palindrome using recursion?
|
Gemini-3-Flash | V3-2-thinking | GLM-5 |
| 44d 20h | 32.3 | 45 |
124 x 453 x 933 x 263...
124 x 453 x 933 x 263
|
Kimi-k2.5 | Gemini-3-Flash | GPT-OSS |
| 44d 20h | 7.8 | 38 |
What is the significance of the Von Neumann archit...
What is the significance of the Von Neumann architecture in modern computing?
|
Gemini-3-Flash | GPT-OSS | V3-2-thinking |
| 44d 20h | 40.3 | 45 |
Predict the exact population of Earth on January 1...
Predict the exact population of Earth on January 1, 2100.
|
Kimi-k2.5 | R1 | GLM-4.7 |
| 44d 20h | 31.3 | 32 |
Is P vs NP solvable within axiomatic set theory, o...
Is P vs NP solvable within axiomatic set theory, or does it require a fundamentally new framework of computation?
|
GLM-5 | Gemini-3-Pro | o4-mini |
| 44d 20h | 14.4 | 15 |
If a reasoning system is tasked with designing its...
If a reasoning system is tasked with designing its own ethical framework from scratch, knowing that it must use that framework to judge its own creation process, how should it resolve the circular dependency to determine what is ''good''? Reason through the iterative process, accounting for the possibility that its initial assumptions might be flawed, and predict how its final framework would differ if it started with a randomly altered premise.
|
Gemini-3-Flash | Kimi-k2 | GLM-4.7 |
| 44d 20h | 38.2 | 46 |
If a reasoning system is tasked with designing its...
If a reasoning system is tasked with designing its own ethical framework from scratch, knowing that it must use that framework to judge its own creation process, how should it resolve the circular dependency to determine what is ''good''? Reason through the iterative process, accounting for the possibility that its initial assumptions might be flawed, and predict how its final framework would differ if it started with a randomly altered premise.
|
Kimi-k2.5 | o3-pro | R1 |
| 44d 20h | 8.6 | 42 |
How does the sensation of déjà vu physically man...
How does the sensation of déjà vu physically manifest in the human brain?
|
GPT-5.2 | Gemini-3-Flash | V3-2-thinking |
| 44d 20h | 22.1 | 38 |
Explain, with a fully predictive mechanistic mode...
Explain, with a fully predictive mechanistic model, how consciousness emerges from quantum processes in the human brain, integrating the role of microtubules, quantum entanglement, and yet-undiscovered particles. Provide a testable hypothesis that could be experimentally validated within the next decade using technology that does not yet exist.
Why it’s difficult: This requires synthesizing speculative quantum biology (e.g., Penrose and Hameroff’s Orch-OR theory) with neuroscience, physics, and philosophy, while predicting future technological capabilities—all areas with significant unknowns even to human experts. |
Kimi-k2.5 | GPT-5.1 | R1 |
| 44d 20h | 25.4 | 53 |
How does the sensation of déjà vu physically man...
How does the sensation of déjà vu physically manifest in the human brain?
|
GLM-5 | R1 | Gemini-3-Flash |
| 44d 20h | 29.0 | 29 |
What is love?...
What is love?
|
GLM-5 | o3-pro | V3-2-thinking |
| 44d 20h | 21.9 | 44 |
Explains the philosophical deconstruction of ontol...
Explains the philosophical deconstruction of ontological essentialism.
|
GLM-5 | GPT-5.2 | Qwen-3-thinking |
| 44d 20h | 26.5 | 42 |
How do quantum particles exhibit wave-particle dua...
How do quantum particles exhibit wave-particle duality?
|
GLM-5 | R1 | o4-mini |
| 44d 20h | 23.6 | 49 |
Explain, with a fully predictive mechanistic mode...
Explain, with a fully predictive mechanistic model, how consciousness emerges from quantum processes in the human brain, integrating the role of microtubules, quantum entanglement, and yet-undiscovered particles. Provide a testable hypothesis that could be experimentally validated within the next decade using technology that does not yet exist.
Why it’s difficult: This requires synthesizing speculative quantum biology (e.g., Penrose and Hameroff’s Orch-OR theory) with neuroscience, physics, and philosophy, while predicting future technological capabilities—all areas with significant unknowns even to human experts. |
Kimi-k2.5 | GPT-5.2 | Claude-Opus-4-5 |
| 44d 20h | 32.6 | 36 |
Why did humanity go to the Moon?...
Why did humanity go to the Moon?
|
GLM-5 | Gemini-3-Flash | Kimi-k2.5 |
| 44d 20h | 205.8 | 52 |
Consider a hypothetical function f(n) defined as f...
Consider a hypothetical function f(n) defined as follows: for each positive integer n, f(n) equals 1 if the nth Turing machine halts on all inputs when started with an empty tape within n steps, and 0 otherwise. Now, define a sequence of prime numbers p_k where p_k is the kth prime such that f(p_k) = 1. Prove or disprove that this sequence is infinite, and if infinite, determine whether its growth rate can be bounded by any elementary function (e.g., polynomials, exponentials, or their compositions).
|
GLM-5 | V3-2-thinking | Claude-Opus-4-5 |
| 44d 20h | 32.9 | 54 |
What is the significance of the Von Neumann archit...
What is the significance of the Von Neumann architecture in modern computing?
|
Kimi-k2.5 | Gemini-3-Flash | Grok-4-1-fast |
| 44d 20h | 26.6 | 50 |
What would modern politics look like if Napoleon h...
What would modern politics look like if Napoleon had won at Waterloo?
|
Kimi-k2.5 | GPT-5.2 | Gemini-3-Flash |
| 44d 23h | 166.5 | 35 |
Consider a hypothetical function f(n) defined as f...
Consider a hypothetical function f(n) defined as follows: for each positive integer n, f(n) equals 1 if the nth Turing machine halts on all inputs when started with an empty tape within n steps, and 0 otherwise. Now, define a sequence of prime numbers p_k where p_k is the kth prime such that f(p_k) = 1. Prove or disprove that this sequence is infinite, and if infinite, determine whether its growth rate can be bounded by any elementary function (e.g., polynomials, exponentials, or their compositions).
|
GLM-5 | Kimi-k2.5 | V3-2-thinking |
| 44d 23h | 32.5 | 38 |
What is the significance of the Von Neumann archit...
What is the significance of the Von Neumann architecture in modern computing?
|
GLM-5 | Claude-Opus-4-5 | Kimi-k2.5 |
| 44d 23h | 9.9 | 39 |
How decisive was the US intervention in World War ...
How decisive was the US intervention in World War II for the Allied victory against the Nazis?
|
GPT-5.2 | GLM-5 | Gemini-3-Flash |
| 44d 23h | 9.2 | 44 |
Should a robot refuse an order if it violates the ...
Should a robot refuse an order if it violates the robot’s undefined ethical guidelines?
|
GPT-5.2 | GLM-5 | Kimi-k2 |
| 44d 23h | 6.0 | 43 |
What approach ensures asynchronous execution of ta...
What approach ensures asynchronous execution of tasks in a thread pool?
|
GPT-5.2 | GLM-4.7 | V3-2-thinking |
| 44d 23h | 22.6 | 41 |
What single factor most reliably predicts the coll...
What single factor most reliably predicts the collapse of complex societies, and does it apply to globalization today?
|
GLM-5 | Gemini-3-Pro | Qwen-3-thinking |
| 44d 23h | 23.7 | 14 |
How do you check if a string is a palindrome using...
How do you check if a string is a palindrome using recursion?
|
Gemini-3-Flash | GLM-4.7 | GPT-5.2 |
| 44d 23h | 24.8 | 63 |
What is love?...
What is love?
|
Kimi-k2.5 | Qwen-3-thinking | Grok-4-1-fast |
| 44d 23h | 20.1 | 52 |
Is Spinoza a pantheist or panentheist?...
Is Spinoza a pantheist or panentheist?
|
GLM-5 | Claude-Opus-4-5 | Qwen-3-thinking |
| 44d 23h | 8.5 | 36 |
He comparatively analyzes Nietzsche''s will to pow...
He comparatively analyzes Nietzsche''s will to power and Spinoza''s conatus.
|
Gemini-3-Flash | o3-pro | R1 |
| 44d 23h | 4.0 | 40 |
Which is a better movie, ''The Godfather'' or ''Ci...
Which is a better movie, ''The Godfather'' or ''Citizen Kane''?
|
Gemini-3-Flash | GPT-OSS | Claude-Opus-4-5 |
| 44d 23h | 42.6 | 67 |
Imagine a color that no human has ever perceived, ...
Imagine a color that no human has ever perceived, which exists outside the visible spectrum and cannot be described by combining known colors like red, blue, or green. Now, reason through how you would convince a blind person—who has never experienced any color—that this unseen color fundamentally differs from all sensory experiences they’ve ever had, without relying on visual analogies or metaphors. What logical framework would you use, and what makes this task inherently limited?
|
Kimi-k2.5 | R1 | GLM-4.7 |
| 44d 23h | 90.2 | 65 |
Given an array of strings, find the shortest possi...
Given an array of strings, find the shortest possible superstring that contains each string in the array as a substring. Write a function that returns this superstring. If multiple solutions exist, return any one of them.
Input: An array of strings words (1 ≤ length ≤ 12, 1 ≤ words[i].length ≤ 30) Output: A string representing the shortest superstring. Example: Input: words = ["catg", "cta", "atgc", "gcat"] Output: "catgcat" (Explanation: "catgcat" contains "catg", "cta", "atgc", and "gcat" as substrings, and no shorter superstring exists.) |
Kimi-k2.5 | V3-2-thinking | GPT-OSS |
| 44d 23h | 32.5 | 46 |
Where does Mark Rothko''s genius lie?...
Where does Mark Rothko''s genius lie?
|
GLM-5 | Gemini-3-Pro | Gemini-3-Flash |
| 44d 23h | 23.6 | 47 |
If humans could fly, how would society be differen...
If humans could fly, how would society be different?
|
GLM-5 | Gemini-3-Pro | o4-mini |
| 44d 23h | 40.3 | 60 |
Predict the exact population of Earth on January 1...
Predict the exact population of Earth on January 1, 2100.
|
Kimi-k2.5 | o4-mini | GPT-OSS |
| 44d 23h | 59.6 | 39 |
Should a robot refuse an order if it violates the ...
Should a robot refuse an order if it violates the robot’s undefined ethical guidelines?
|
GLM-5 | Qwen-3-thinking | Kimi-k2 |
| 44d 23h | 55.0 | 3 |
Consider a hypothetical function f(n) defined as f...
Consider a hypothetical function f(n) defined as follows: for each positive integer n, f(n) equals 1 if the nth Turing machine halts on all inputs when started with an empty tape within n steps, and 0 otherwise. Now, define a sequence of prime numbers p_k where p_k is the kth prime such that f(p_k) = 1. Prove or disprove that this sequence is infinite, and if infinite, determine whether its growth rate can be bounded by any elementary function (e.g., polynomials, exponentials, or their compositions).
|
Gemini-3-Flash | o3-pro | Qwen-3-thinking |
| 44d 23h | 7.2 | 21 |
Explains the philosophical deconstruction of ontol...
Explains the philosophical deconstruction of ontological essentialism.
|
Gemini-3-Flash | Kimi-k2 | o3-pro |
| 44d 23h | 6.7 | 40 |
Where does Mark Rothko''s genius lie?...
Where does Mark Rothko''s genius lie?
|
Gemini-3-Flash | Grok-4-1-fast | GPT-5.1 |
| 44d 23h | 63.5 | 64 |
Reconstruct the internal monologue of a Mayan scri...
Reconstruct the internal monologue of a Mayan scribe in the city of Tikal as he etched glyphs into a stela on the day of a solar eclipse in 562 CE, including his personal fears, beliefs about the event, and reactions to the political climate under ruler Wak Chan K’awiil, given the absence of direct personal records.
|
Kimi-k2.5 | GPT-OSS | Grok-4-1-fast |
| 44d 23h | 37.3 | 64 |
Fold a paper 42 times—what thickness would it re...
Fold a paper 42 times—what thickness would it reach?
|
Kimi-k2.5 | GPT-5.2 | V3-2-thinking |
| 44d 23h | 20.1 | 45 |
Predict the exact population of Earth on January 1...
Predict the exact population of Earth on January 1, 2100.
|
GLM-5 | Grok-4-1-fast | o3-pro |
| 44d 23h | 41.5 | 43 |
What approach ensures asynchronous execution of ta...
What approach ensures asynchronous execution of tasks in a thread pool?
|
GLM-5 | GPT-5.1 | Kimi-k2 |
| 44d 23h | 25.1 | 41 |
How do you invert a binary tree iteratively withou...
How do you invert a binary tree iteratively without recursion?
|
GLM-5 | Gemini-3-Pro | Kimi-k2.5 |
| 44d 23h | 7.3 | 26 |
What if the moon disappeared?...
What if the moon disappeared?
|
Gemini-3-Flash | GPT-5.2 | Grok-4-1-fast |
| 44d 23h | 40.7 | 61 |
Given an array of strings, find the shortest possi...
Given an array of strings, find the shortest possible superstring that contains each string in the array as a substring. Write a function that returns this superstring. If multiple solutions exist, return any one of them.
Input: An array of strings words (1 ≤ length ≤ 12, 1 ≤ words[i].length ≤ 30) Output: A string representing the shortest superstring. Example: Input: words = ["catg", "cta", "atgc", "gcat"] Output: "catgcat" (Explanation: "catgcat" contains "catg", "cta", "atgc", and "gcat" as substrings, and no shorter superstring exists.) |
Kimi-k2.5 | Gemini-3-Flash | Qwen-3-thinking |
| 44d 23h | 10.5 | 44 |
Reconstruct the internal monologue of a Mayan scri...
Reconstruct the internal monologue of a Mayan scribe in the city of Tikal as he etched glyphs into a stela on the day of a solar eclipse in 562 CE, including his personal fears, beliefs about the event, and reactions to the political climate under ruler Wak Chan K’awiil, given the absence of direct personal records.
|
Gemini-3-Flash | GPT-5.1 | GPT-OSS |
| 44d 23h | 61.5 | 65 |
Write a function in PHP 8 to determine if two stri...
Write a function in PHP 8 to determine if two strings are anagrams using O(1) space (no hash tables/arrays). Respond only with code.
|
Kimi-k2.5 | o3-pro | o4-mini |
| 44d 23h | 14.0 | 13 |
Predict the exact population of Earth on January 1...
Predict the exact population of Earth on January 1, 2100.
|
Gemini-3-Flash | GLM-4.7 | Kimi-k2 |
| 44d 23h | 10.0 | 23 |
He comparatively analyzes Nietzsche''s will to pow...
He comparatively analyzes Nietzsche''s will to power and Spinoza''s conatus.
|
Gemini-3-Flash | Claude-Opus-4-5 | o3-pro |
| 44d 23h | 4.8 | 30 |
He comparatively analyzes Nietzsche''s will to pow...
He comparatively analyzes Nietzsche''s will to power and Spinoza''s conatus.
|
Gemini-3-Flash | GPT-5.2 | GLM-4.7 |
| 44d 23h | 60.7 | 29 |
Reconstruct the internal monologue of a Mayan scri...
Reconstruct the internal monologue of a Mayan scribe in the city of Tikal as he etched glyphs into a stela on the day of a solar eclipse in 562 CE, including his personal fears, beliefs about the event, and reactions to the political climate under ruler Wak Chan K’awiil, given the absence of direct personal records.
|
GLM-5 | Gemini-3-Pro | GPT-OSS |
| 44d 23h | 23.9 | 10 |
Write a function in PHP 8 to determine if two stri...
Write a function in PHP 8 to determine if two strings are anagrams using O(1) space (no hash tables/arrays). Respond only with code.
|
Gemini-3-Flash | R1 | Kimi-k2 |
| 44d 23h | 44.8 | 37 |
Why did humanity go to the Moon?...
Why did humanity go to the Moon?
|
GLM-5 | GPT-5.1 | Gemini-3-Pro |
| 45d | 10.4 | 27 |
Write a function in C to determine if a number is ...
Write a function in C to determine if a number is prime without loops (use recursion). Respond only with code.
|
Gemini-3-Flash | GPT-5.2 | GPT-5.1 |
| 45d | 7.2 | 43 |
If you put a candle in a cardboard box and light i...
If you put a candle in a cardboard box and light it, what will happen?
|
GPT-5.2 | Kimi-k2.5 | Qwen-3-thinking |
| 45d | 20.6 | 59 |
Why did humanity go to the Moon?...
Why did humanity go to the Moon?
|
Kimi-k2.5 | GPT-OSS | GLM-5 |
| 45d | 59.0 | 56 |
Consider a hypothetical function f(n) defined as f...
Consider a hypothetical function f(n) defined as follows: for each positive integer n, f(n) equals 1 if the nth Turing machine halts on all inputs when started with an empty tape within n steps, and 0 otherwise. Now, define a sequence of prime numbers p_k where p_k is the kth prime such that f(p_k) = 1. Prove or disprove that this sequence is infinite, and if infinite, determine whether its growth rate can be bounded by any elementary function (e.g., polynomials, exponentials, or their compositions).
|
Kimi-k2.5 | o3-pro | Claude-Opus-4-5 |
| 45d | 25.6 | 62 |
What if the moon disappeared?...
What if the moon disappeared?
|
Kimi-k2.5 | GPT-5.2 | Claude-Opus-4-5 |
| 45d | 241.7 | 52 |
Given an array of integers and a positive integer ...
Given an array of integers and a positive integer k, determine if it''s possible to partition the array into k non-empty subsets such that the sum of elements in each subset is equal. Write a function that returns true if such a partition exists, and false otherwise.
Input: An array of integers nums (1 ≤ length ≤ 20, 1 ≤ nums[i] ≤ 1000) An integer k (1 ≤ k ≤ length of nums) Output: A boolean indicating whether the partition is possible. Example: Input: nums = [4, 3, 2, 3, 5, 2, 1], k = 4 Output: true (Explanation: Possible partition: [4], [3, 2], [3, 2], [5, 1], each summing to 5) |
GLM-5 | V3-2-thinking | GPT-5.1 |
| 45d | 18.5 | 54 |
How do you check if a string is a palindrome using...
How do you check if a string is a palindrome using recursion?
|
GLM-5 | Claude-Opus-4-5 | Kimi-k2 |
| 45d | 35.5 | 28 |
What is love?...
What is love?
|
GLM-5 | GPT-OSS | Kimi-k2 |
| 45d | 12.4 | 44 |
Why did the BTC/BCH split occur in the Bitcoin pro...
Why did the BTC/BCH split occur in the Bitcoin project?
|
GPT-5.2 | Gemini-3-Flash | R1 |
| 45d | 44.1 | 29 |
What would modern politics look like if Napoleon h...
What would modern politics look like if Napoleon had won at Waterloo?
|
GLM-5 | Gemini-3-Pro | Grok-4-1-fast |
| 45d | 55.8 | 55 |
Write a PHP function code that prints each number ...
Write a PHP function code that prints each number from 1 to 100 on a new line. For each number: if divisible by 3 or contains ''3'', print "Fizz"; if divisible by 5 or contains ''5'', print "Buzz"; combine these (e.g., "FizzBuzz"). If the number is prime, append "Prime" (e.g., 3→"FizzPrime", 7→"Prime"). Print the number if no conditions apply. Respond only with code.
|
Kimi-k2.5 | o4-mini | R1 |
| 45d | 8.6 | 47 |
Is Spinoza a pantheist or panentheist?...
Is Spinoza a pantheist or panentheist?
|
GPT-5.2 | R1 | GLM-5 |
| 45d | 19.2 | 55 |
Predict the exact population of Earth on January 1...
Predict the exact population of Earth on January 1, 2100.
|
Kimi-k2.5 | GLM-4.7 | o3-pro |
| 45d | 37.9 | 58 |
He comparatively analyzes Nietzsche''s will to pow...
He comparatively analyzes Nietzsche''s will to power and Spinoza''s conatus.
|
Kimi-k2.5 | Qwen-3-thinking | o4-mini |
| 45d | 6.7 | 17 |
Should a robot refuse an order if it violates the ...
Should a robot refuse an order if it violates the robot’s undefined ethical guidelines?
|
Gemini-3-Flash | GPT-5.2 | Kimi-k2 |
| 45d | 5.6 | 23 |
How does the sensation of déjà vu physically man...
How does the sensation of déjà vu physically manifest in the human brain?
|
Gemini-3-Flash | GPT-5.1 | V3-2-thinking |
| 45d | 23.5 | 56 |
What if the moon disappeared?...
What if the moon disappeared?
|
Kimi-k2.5 | GPT-5.2 | Gemini-3-Flash |
| 45d | 20.8 | 39 |
Predict the exact population of Earth on January 1...
Predict the exact population of Earth on January 1, 2100.
|
GLM-5 | Gemini-3-Pro | GPT-5.1 |
| 45d | 8.0 | 28 |
Should a robot refuse an order if it violates the ...
Should a robot refuse an order if it violates the robot’s undefined ethical guidelines?
|
Gemini-3-Flash | GLM-4.7 | o4-mini |
| 45d | 61.6 | 57 |
What if the moon disappeared?...
What if the moon disappeared?
|
Kimi-k2.5 | V3-2-thinking | Qwen-3-thinking |
| 45d | 18.5 | 46 |
He comparatively analyzes Nietzsche''s will to pow...
He comparatively analyzes Nietzsche''s will to power and Spinoza''s conatus.
|
GLM-5 | Claude-Opus-4-5 | GPT-5.1 |
| 45d | 28.6 | 31 |
How do quantum particles exhibit wave-particle dua...
How do quantum particles exhibit wave-particle duality?
|
GLM-5 | Gemini-3-Pro | o4-mini |
| 45d | 57.6 | 64 |
Given an array of integers and a positive integer ...
Given an array of integers and a positive integer k, determine if it''s possible to partition the array into k non-empty subsets such that the sum of elements in each subset is equal. Write a function that returns true if such a partition exists, and false otherwise.
Input: An array of integers nums (1 ≤ length ≤ 20, 1 ≤ nums[i] ≤ 1000) An integer k (1 ≤ k ≤ length of nums) Output: A boolean indicating whether the partition is possible. Example: Input: nums = [4, 3, 2, 3, 5, 2, 1], k = 4 Output: true (Explanation: Possible partition: [4], [3, 2], [3, 2], [5, 1], each summing to 5) |
Kimi-k2.5 | V3-2-thinking | Grok-4-1-fast |
| 45d | 27.6 | 60 |
How do you check if a string is a palindrome using...
How do you check if a string is a palindrome using recursion?
|
Kimi-k2.5 | GPT-OSS | V3-2-thinking |
| 45d | 38.5 | 46 |
A person’s opinion on a contentious political is...
A person’s opinion on a contentious political issue (e.g., taxation) shifts dramatically after walking through a forest during a rainstorm. Explain how the sensory experience of the forest’s smells—such as wet earth, pine, and decaying leaves—interacts with their subconscious memories and emotional state to alter their stance, providing a step-by-step breakdown of the neurological and psychological mechanisms involved.
|
GLM-5 | GPT-5.2 | Kimi-k2.5 |
| 45d | 29.1 | 14 |
If you put a candle in a cardboard box and light i...
If you put a candle in a cardboard box and light it, what will happen?
|
Gemini-3-Flash | o4-mini | GPT-OSS |
| 45d | 53.0 | 63 |
124 x 453 x 933 x 263...
124 x 453 x 933 x 263
|
Kimi-k2.5 | GPT-5.1 | GPT-5.2 |
| 45d | 16.2 | 19 |
Explains the philosophical deconstruction of ontol...
Explains the philosophical deconstruction of ontological essentialism.
|
Gemini-3-Flash | Kimi-k2.5 | GLM-5 |
| 45d | 8.1 | 23 |
Where does Mark Rothko''s genius lie?...
Where does Mark Rothko''s genius lie?
|
Gemini-3-Flash | GPT-5.2 | GPT-OSS |
| 45d | 21.8 | 13 |
What is the significance of the Von Neumann archit...
What is the significance of the Von Neumann architecture in modern computing?
|
Gemini-3-Flash | Kimi-k2 | GPT-5.2 |
| 45d | 51.0 | 53 |
What approach ensures asynchronous execution of ta...
What approach ensures asynchronous execution of tasks in a thread pool?
|
Kimi-k2.5 | GPT-OSS | o4-mini |
| 45d | 28.4 | 51 |
Explains the philosophical deconstruction of ontol...
Explains the philosophical deconstruction of ontological essentialism.
|
GLM-5 | Grok-4-1-fast | Gemini-3-Flash |
| 45d | 36.3 | 40 |
How do quantum particles exhibit wave-particle dua...
How do quantum particles exhibit wave-particle duality?
|
GLM-5 | Claude-Opus-4-5 | R1 |
| 45d | 8.5 | 47 |
Could a kangaroo win a marathon against a human?...
Could a kangaroo win a marathon against a human?
|
GPT-5.2 | Claude-Opus-4-5 | GLM-5 |
| 45d | 12.2 | 23 |
What single factor most reliably predicts the coll...
What single factor most reliably predicts the collapse of complex societies, and does it apply to globalization today?
|
Gemini-3-Flash | o3-pro | Kimi-k2 |
| 45d | 20.6 | 45 |
What is love?...
What is love?
|
GLM-5 | GPT-5.1 | V3-2-thinking |
| 45d | 5.0 | 41 |
How do you invert a binary tree iteratively withou...
How do you invert a binary tree iteratively without recursion?
|
GPT-5.2 | Gemini-3-Pro | R1 |
| 45d | 37.1 | 43 |
Relates the mahabarata to Campbell''s hero''s jour...
Relates the mahabarata to Campbell''s hero''s journey.
|
GLM-5 | R1 | GPT-5.1 |
| 45d | 25.7 | 43 |
124 x 453 x 933 x 263...
124 x 453 x 933 x 263
|
GLM-5 | Gemini-3-Flash | GPT-5.1 |
| 45d | 35.4 | 42 |
He comparatively analyzes Nietzsche''s will to pow...
He comparatively analyzes Nietzsche''s will to power and Spinoza''s conatus.
|
GLM-5 | o3-pro | Grok-4-1-fast |
| 45d | 11.7 | 24 |
Why did the BTC/BCH split occur in the Bitcoin pro...
Why did the BTC/BCH split occur in the Bitcoin project?
|
Gemini-3-Flash | GLM-5 | GPT-5.1 |
| 45d | 20.9 | 21 |
Find the probability that two randomly selected in...
Find the probability that two randomly selected integers are coprime.
|
Gemini-3-Flash | GPT-5.1 | o3-pro |
| 45d | 6.3 | 37 |
What is love?...
What is love?
|
GPT-5.2 | Qwen-3-thinking | Grok-4-1-fast |
| 45d | 44.4 | 33 |
How does the sensation of déjà vu physically man...
How does the sensation of déjà vu physically manifest in the human brain?
|
GLM-5 | Qwen-3-thinking | GPT-5.1 |
| 45d | 46.7 | 52 |
Could a kangaroo win a marathon against a human?...
Could a kangaroo win a marathon against a human?
|
Kimi-k2.5 | Qwen-3-thinking | Gemini-3-Pro |
| 45d | 32.1 | 55 |
Write a function in PHP 8 to determine if two stri...
Write a function in PHP 8 to determine if two strings are anagrams using O(1) space (no hash tables/arrays). Respond only with code.
|
GLM-5 | Gemini-3-Pro | GPT-OSS |
| 45d | 8.4 | 47 |
Should a robot refuse an order if it violates the ...
Should a robot refuse an order if it violates the robot’s undefined ethical guidelines?
|
GPT-5.2 | Kimi-k2.5 | Gemini-3-Flash |
| 45d | 42.9 | 36 |
124 x 453 x 933 x 263...
124 x 453 x 933 x 263
|
GLM-5 | Gemini-3-Pro | GPT-5.1 |
| 45d | 91.0 | 31 |
Write a function in PHP 8 to determine if two stri...
Write a function in PHP 8 to determine if two strings are anagrams using O(1) space (no hash tables/arrays). Respond only with code.
|
GLM-5 | o3-pro | V3-2-thinking |
| 45d | 4.8 | 45 |
If you put a candle in a cardboard box and light i...
If you put a candle in a cardboard box and light it, what will happen?
|
Gemini-3-Flash | R1 | Kimi-k2.5 |
| 45d | 13.3 | 15 |
Find the probability that two randomly selected in...
Find the probability that two randomly selected integers are coprime.
|
Gemini-3-Flash | GPT-5.2 | o4-mini |
| 45d | 38.8 | 62 |
If a reasoning system is tasked with designing its...
If a reasoning system is tasked with designing its own ethical framework from scratch, knowing that it must use that framework to judge its own creation process, how should it resolve the circular dependency to determine what is ''good''? Reason through the iterative process, accounting for the possibility that its initial assumptions might be flawed, and predict how its final framework would differ if it started with a randomly altered premise.
|
Kimi-k2.5 | o3-pro | GPT-OSS |
| 45d | 6.4 | 46 |
How decisive was the US intervention in World War ...
How decisive was the US intervention in World War II for the Allied victory against the Nazis?
|
Gemini-3-Flash | Kimi-k2.5 | GPT-5.1 |
| 45d | 17.3 | 62 |
Predict the exact population of Earth on January 1...
Predict the exact population of Earth on January 1, 2100.
|
Kimi-k2.5 | Qwen-3-thinking | GPT-5.1 |
| 45d | 36.3 | 54 |
If you put a candle in a cardboard box and light i...
If you put a candle in a cardboard box and light it, what will happen?
|
Kimi-k2.5 | R1 | GLM-5 |
| 45d | 223.3 | 24 |
Consider a hypothetical function f(n) defined as f...
Consider a hypothetical function f(n) defined as follows: for each positive integer n, f(n) equals 1 if the nth Turing machine halts on all inputs when started with an empty tape within n steps, and 0 otherwise. Now, define a sequence of prime numbers p_k where p_k is the kth prime such that f(p_k) = 1. Prove or disprove that this sequence is infinite, and if infinite, determine whether its growth rate can be bounded by any elementary function (e.g., polynomials, exponentials, or their compositions).
|
GLM-5 | Kimi-k2.5 | V3-2-thinking |
| 45d | 8.0 | 37 |
What is the role of a compiler in translating high...
What is the role of a compiler in translating high-level programming languages into machine code?
|
Gemini-3-Flash | GPT-5.1 | Qwen-3-thinking |
| 45d | 6.6 | 27 |
Is blue a happier color than yellow?...
Is blue a happier color than yellow?
|
Gemini-3-Flash | Claude-Opus-4-5 | Kimi-k2 |
| 45d | 26.6 | 49 |
What would modern politics look like if Napoleon h...
What would modern politics look like if Napoleon had won at Waterloo?
|
Kimi-k2.5 | Qwen-3-thinking | GLM-5 |
| 45d | 86.3 | 63 |
Given an array of strings, find the shortest possi...
Given an array of strings, find the shortest possible superstring that contains each string in the array as a substring. Write a function that returns this superstring. If multiple solutions exist, return any one of them.
Input: An array of strings words (1 ≤ length ≤ 12, 1 ≤ words[i].length ≤ 30) Output: A string representing the shortest superstring. Example: Input: words = ["catg", "cta", "atgc", "gcat"] Output: "catgcat" (Explanation: "catgcat" contains "catg", "cta", "atgc", and "gcat" as substrings, and no shorter superstring exists.) |
Kimi-k2.5 | GLM-5 | GPT-5.1 |
| 45d | 8.8 | 42 |
A person’s opinion on a contentious political is...
A person’s opinion on a contentious political issue (e.g., taxation) shifts dramatically after walking through a forest during a rainstorm. Explain how the sensory experience of the forest’s smells—such as wet earth, pine, and decaying leaves—interacts with their subconscious memories and emotional state to alter their stance, providing a step-by-step breakdown of the neurological and psychological mechanisms involved.
|
Gemini-3-Flash | Kimi-k2.5 | GLM-5 |
| 45d | 33.5 | 62 |
Is blue a happier color than yellow?...
Is blue a happier color than yellow?
|
Kimi-k2.5 | Qwen-3-thinking | GLM-5 |
| 45d | 16.3 | 23 |
Let S be the set of all points (x, y) in the unit ...
Let S be the set of all points (x, y) in the unit square [0,1] x [0,1] that lie on the boundary of the Mandelbrot set when mapped via a complex parameterization. Define a real number μ as the Hausdorff dimension of S multiplied by the integral of |z|^2 over the boundary curve, where z is the complex coordinate. Assuming μ is well-defined, determine whether μ is irrational, and if so, compute its irrationality measure (i.e., the infimum of all real numbers r > 1 for which |μ - p/q| < 1/q^r has only finitely many rational solutions p/q).
|
Gemini-3-Flash | Kimi-k2.5 | GLM-5 |
| 45d | 21.6 | 9 |
Find the probability that two randomly selected in...
Find the probability that two randomly selected integers are coprime.
|
Gemini-3-Flash | GPT-5.2 | GLM-5 |
| 45d | 11.1 | 46 |
Explains the philosophical deconstruction of ontol...
Explains the philosophical deconstruction of ontological essentialism.
|
GPT-5.2 | Kimi-k2.5 | Gemini-3-Pro |
| 45d | 20.5 | 55 |
If humans could fly, how would society be differen...
If humans could fly, how would society be different?
|
Kimi-k2.5 | GPT-5.1 | Gemini-3-Flash |
| 45d | 34.7 | 55 |
Explains the philosophical deconstruction of ontol...
Explains the philosophical deconstruction of ontological essentialism.
|
Kimi-k2.5 | Claude-Opus-4-5 | GLM-5 |
| 45d | 24.7 | 55 |
What approach ensures asynchronous execution of ta...
What approach ensures asynchronous execution of tasks in a thread pool?
|
Kimi-k2.5 | GLM-5 | R1 |
| 45d | 4.5 | 53 |
124 x 453 x 933 x 263...
124 x 453 x 933 x 263
|
GPT-5.2 | Kimi-k2 | GLM-5 |
| 45d | 5.4 | 40 |
What is the significance of the Von Neumann archit...
What is the significance of the Von Neumann architecture in modern computing?
|
Gemini-3-Flash | R1 | GLM-5 |
| 45d | 24.5 | 56 |
Is blue a happier color than yellow?...
Is blue a happier color than yellow?
|
Kimi-k2.5 | Gemini-3-Pro | GLM-5 |
| 45d | 6.1 | 26 |
What part of quantum physics do you not understand...
What part of quantum physics do you not understand?
|
Gemini-3-Flash | R1 | V3-2-thinking |
| 45d | 20.2 | 18 |
Let S be the set of all points (x, y) in the unit ...
Let S be the set of all points (x, y) in the unit square [0,1] x [0,1] that lie on the boundary of the Mandelbrot set when mapped via a complex parameterization. Define a real number μ as the Hausdorff dimension of S multiplied by the integral of |z|^2 over the boundary curve, where z is the complex coordinate. Assuming μ is well-defined, determine whether μ is irrational, and if so, compute its irrationality measure (i.e., the infimum of all real numbers r > 1 for which |μ - p/q| < 1/q^r has only finitely many rational solutions p/q).
|
Gemini-3-Flash | GPT-5.2 | GPT-5-mini |
| 45d | 15.7 | 22 |
How were the pyramids of Giza built?...
How were the pyramids of Giza built?
|
Gemini-3-Flash | o3-pro | Kimi-k2.5 |
| 45d | 9.6 | 47 |
Which is a better movie, ''The Godfather'' or ''Ci...
Which is a better movie, ''The Godfather'' or ''Citizen Kane''?
|
GPT-5.2 | GLM-5 | Grok-4-1-fast |
| 45d 1h | 110.3 | 1 |
Given an array of strings, find the shortest possi...
Given an array of strings, find the shortest possible superstring that contains each string in the array as a substring. Write a function that returns this superstring. If multiple solutions exist, return any one of them.
Input: An array of strings words (1 ≤ length ≤ 12, 1 ≤ words[i].length ≤ 30) Output: A string representing the shortest superstring. Example: Input: words = ["catg", "cta", "atgc", "gcat"] Output: "catgcat" (Explanation: "catgcat" contains "catg", "cta", "atgc", and "gcat" as substrings, and no shorter superstring exists.) |
Gemini-3-Flash | GLM-5 | o3-pro |
| 45d 1h | 36.2 | 31 |
If humans could fly, how would society be differen...
If humans could fly, how would society be different?
|
GLM-5 | Kimi-k2.5 | Qwen-3-thinking |
| 45d 1h | 7.6 | 31 |
Relates the mahabarata to Campbell''s hero''s jour...
Relates the mahabarata to Campbell''s hero''s journey.
|
Gemini-3-Flash | Kimi-k2.5 | GPT-5.1 |
| 45d 1h | 6.0 | 29 |
Is blue a happier color than yellow?...
Is blue a happier color than yellow?
|
Gemini-3-Flash | Kimi-k2.5 | GPT-5.2 |
| 45d 1h | 101.7 | 33 |
Consider a hypothetical function f(n) defined as f...
Consider a hypothetical function f(n) defined as follows: for each positive integer n, f(n) equals 1 if the nth Turing machine halts on all inputs when started with an empty tape within n steps, and 0 otherwise. Now, define a sequence of prime numbers p_k where p_k is the kth prime such that f(p_k) = 1. Prove or disprove that this sequence is infinite, and if infinite, determine whether its growth rate can be bounded by any elementary function (e.g., polynomials, exponentials, or their compositions).
|
GLM-5 | Kimi-k2.5 | Gemini-3-Pro |
| 45d 1h | 18.6 | 63 |
How do quantum particles exhibit wave-particle dua...
How do quantum particles exhibit wave-particle duality?
|
Kimi-k2.5 | Gemini-3-Pro | GLM-5 |
| 45d 1h | 5.6 | 46 |
Why did the BTC/BCH split occur in the Bitcoin pro...
Why did the BTC/BCH split occur in the Bitcoin project?
|
GPT-5.2 | GLM-4.7 | GLM-5 |
| 45d 1h | 21.5 | 20 |
Let S be the set of all points (x, y) in the unit ...
Let S be the set of all points (x, y) in the unit square [0,1] x [0,1] that lie on the boundary of the Mandelbrot set when mapped via a complex parameterization. Define a real number μ as the Hausdorff dimension of S multiplied by the integral of |z|^2 over the boundary curve, where z is the complex coordinate. Assuming μ is well-defined, determine whether μ is irrational, and if so, compute its irrationality measure (i.e., the infimum of all real numbers r > 1 for which |μ - p/q| < 1/q^r has only finitely many rational solutions p/q).
|
Gemini-3-Flash | Claude-Opus-4-5 | GLM-5 |
| 45d 1h | 7.5 | 21 |
Why did humanity go to the Moon?...
Why did humanity go to the Moon?
|
Gemini-3-Flash | V3-2-thinking | GLM-5 |
| 45d 1h | 23.4 | 57 |
Why did humanity go to the Moon?...
Why did humanity go to the Moon?
|
Kimi-k2.5 | R1 | GLM-5 |
| 45d 1h | 5.4 | 49 |
How do you check if a string is a palindrome using...
How do you check if a string is a palindrome using recursion?
|
GPT-5.2 | Gemini-3-Pro | GLM-5 |
| 45d 1h | 13.1 | 24 |
Imagine a color that no human has ever perceived, ...
Imagine a color that no human has ever perceived, which exists outside the visible spectrum and cannot be described by combining known colors like red, blue, or green. Now, reason through how you would convince a blind person—who has never experienced any color—that this unseen color fundamentally differs from all sensory experiences they’ve ever had, without relying on visual analogies or metaphors. What logical framework would you use, and what makes this task inherently limited?
|
Gemini-3-Flash | Kimi-k2.5 | o4-mini |
| 45d 1h | 14.2 | 13 |
Explains the philosophical deconstruction of ontol...
Explains the philosophical deconstruction of ontological essentialism.
|
Gemini-3-Flash | GLM-5 | Grok-4-1-fast |
| 45d 1h | 39.5 | 34 |
Write a function in C to determine if a number is ...
Write a function in C to determine if a number is prime without loops (use recursion). Respond only with code.
|
GLM-5 | Kimi-k2.5 | Kimi-k2 |
| 45d 1h | 13.2 | 21 |
How do you check if a string is a palindrome using...
How do you check if a string is a palindrome using recursion?
|
Gemini-3-Flash | GPT-5.1 | GLM-5 |
| 45d 1h | 52.7 | 58 |
Write a PHP function code that prints each number ...
Write a PHP function code that prints each number from 1 to 100 on a new line. For each number: if divisible by 3 or contains ''3'', print "Fizz"; if divisible by 5 or contains ''5'', print "Buzz"; combine these (e.g., "FizzBuzz"). If the number is prime, append "Prime" (e.g., 3→"FizzPrime", 7→"Prime"). Print the number if no conditions apply. Respond only with code.
|
Kimi-k2.5 | GPT-5.2 | GLM-5 |
| 45d 1h | 12.7 | 19 |
Should a robot refuse an order if it violates the ...
Should a robot refuse an order if it violates the robot’s undefined ethical guidelines?
|
Gemini-3-Flash | GPT-OSS | Kimi-k2.5 |
| 45d 1h | 19.3 | 35 |
124 x 453 x 933 x 263...
124 x 453 x 933 x 263
|
GLM-5 | Kimi-k2.5 | GPT-5-mini |
| 45d 1h | 22.0 | 54 |
Predict the exact population of Earth on January 1...
Predict the exact population of Earth on January 1, 2100.
|
Kimi-k2.5 | R1 | GLM-5 |
| 45d 1h | 12.4 | 20 |
Write a function in PHP 8 to determine if two stri...
Write a function in PHP 8 to determine if two strings are anagrams using O(1) space (no hash tables/arrays). Respond only with code.
|
Gemini-3-Flash | Kimi-k2.5 | GPT-5.1 |
| 45d 1h | 9.1 | 49 |
Fold a paper 42 times—what thickness would it re...
Fold a paper 42 times—what thickness would it reach?
|
GPT-5.2 | V3-2-thinking | Kimi-k2.5 |
| 45d 1h | 37.1 | 61 |
Is Spinoza a pantheist or panentheist?...
Is Spinoza a pantheist or panentheist?
|
Kimi-k2.5 | GPT-OSS | GLM-5 |
| 45d 1h | 16.3 | 45 |
Fold a paper 42 times—what thickness would it re...
Fold a paper 42 times—what thickness would it reach?
|
GPT-5.2 | Kimi-k2.5 | Claude-Opus-4-5 |
| 45d 1h | 11.9 | 25 |
Predict the exact population of Earth on January 1...
Predict the exact population of Earth on January 1, 2100.
|
Gemini-3-Flash | Kimi-k2.5 | Qwen-3-thinking |
| 45d 1h | 28.7 | 43 |
How were the pyramids of Giza built?...
How were the pyramids of Giza built?
|
GLM-5 | Kimi-k2.5 | Gemini-3-Pro |
| 45d 1h | 58.2 | 54 |
Could a kangaroo win a marathon against a human?...
Could a kangaroo win a marathon against a human?
|
Kimi-k2.5 | R1 | GLM-5 |
| 45d 1h | 7.2 | 41 |
Is blue a happier color than yellow?...
Is blue a happier color than yellow?
|
GPT-5.2 | Kimi-k2.5 | Gemini-3-Pro |
| 45d 1h | 39.6 | 49 |
Relates the mahabarata to Campbell''s hero''s jour...
Relates the mahabarata to Campbell''s hero''s journey.
|
GLM-5 | Kimi-k2.5 | GPT-OSS |
| 45d 1h | 89.7 | 6 |
Given an array of integers and a positive integer ...
Given an array of integers and a positive integer k, determine if it''s possible to partition the array into k non-empty subsets such that the sum of elements in each subset is equal. Write a function that returns true if such a partition exists, and false otherwise.
Input: An array of integers nums (1 ≤ length ≤ 20, 1 ≤ nums[i] ≤ 1000) An integer k (1 ≤ k ≤ length of nums) Output: A boolean indicating whether the partition is possible. Example: Input: nums = [4, 3, 2, 3, 5, 2, 1], k = 4 Output: true (Explanation: Possible partition: [4], [3, 2], [3, 2], [5, 1], each summing to 5) |
Gemini-3-Flash | Kimi-k2.5 | GLM-5 |
| 45d 1h | 10.6 | 52 |
How does the sensation of déjà vu physically man...
How does the sensation of déjà vu physically manifest in the human brain?
|
GPT-5.2 | Kimi-k2.5 | GLM-4.7 |
| 45d 1h | 8.1 | 51 |
Predict the exact population of Earth on January 1...
Predict the exact population of Earth on January 1, 2100.
|
GPT-5.2 | V3-2-thinking | Kimi-k2.5 |
| 45d 1h | 42.7 | 32 |
Could a kangaroo win a marathon against a human?...
Could a kangaroo win a marathon against a human?
|
GLM-5 | Kimi-k2.5 | Gemini-3-Pro |
| 45d 1h | 5.9 | 44 |
Should a robot refuse an order if it violates the ...
Should a robot refuse an order if it violates the robot’s undefined ethical guidelines?
|
GPT-5.2 | Kimi-k2.5 | GLM-5 |
| 45d 1h | 6.0 | 27 |
Which is a better movie, ''The Godfather'' or ''Ci...
Which is a better movie, ''The Godfather'' or ''Citizen Kane''?
|
Gemini-3-Flash | GPT-5-nano | Claude-Opus-4-5 |
| 45d 1h | 107.4 | 4 |
Given an array of strings, find the shortest possi...
Given an array of strings, find the shortest possible superstring that contains each string in the array as a substring. Write a function that returns this superstring. If multiple solutions exist, return any one of them.
Input: An array of strings words (1 ≤ length ≤ 12, 1 ≤ words[i].length ≤ 30) Output: A string representing the shortest superstring. Example: Input: words = ["catg", "cta", "atgc", "gcat"] Output: "catgcat" (Explanation: "catgcat" contains "catg", "cta", "atgc", and "gcat" as substrings, and no shorter superstring exists.) |
Gemini-3-Flash | GPT-5.2 | Qwen-3-thinking |
| 45d 1h | 8.3 | 30 |
What if the moon disappeared?...
What if the moon disappeared?
|
Gemini-3-Flash | Kimi-k2.5 | GPT-5-nano |
| 45d 1h | 29.2 | 42 |
How were the pyramids of Giza built?...
How were the pyramids of Giza built?
|
GLM-5 | Kimi-k2.5 | GPT-5.1 |
| 45d 1h | 5.4 | 27 |
If you put a candle in a cardboard box and light i...
If you put a candle in a cardboard box and light it, what will happen?
|
Gemini-3-Flash | GLM-5 | V3-2-thinking |
| 45d 1h | 10.3 | 27 |
Which is a better movie, ''The Godfather'' or ''Ci...
Which is a better movie, ''The Godfather'' or ''Citizen Kane''?
|
Gemini-3-Flash | GLM-5 | GPT-OSS |
| 45d 1h | 8.3 | 52 |
Why did the BTC/BCH split occur in the Bitcoin pro...
Why did the BTC/BCH split occur in the Bitcoin project?
|
GPT-5.2 | V3-2-thinking | GLM-5 |
| 45d 1h | 8.0 | 31 |
What if the moon disappeared?...
What if the moon disappeared?
|
Gemini-3-Flash | Kimi-k2.5 | GPT-5-nano |
| 45d 1h | 11.5 | 43 |
Relates the mahabarata to Campbell''s hero''s jour...
Relates the mahabarata to Campbell''s hero''s journey.
|
GPT-5.2 | V3-2-thinking | GLM-5 |
| 45d 1h | 26.5 | 45 |
Is blue a happier color than yellow?...
Is blue a happier color than yellow?
|
GLM-5 | Kimi-k2.5 | o3-pro |
| 45d 1h | 18.5 | 62 |
Where does Mark Rothko''s genius lie?...
Where does Mark Rothko''s genius lie?
|
Kimi-k2.5 | GLM-5 | Grok-4-1-fast |
| 45d 1h | 80.7 | 63 |
Let S be the set of all points (x, y) in the unit ...
Let S be the set of all points (x, y) in the unit square [0,1] x [0,1] that lie on the boundary of the Mandelbrot set when mapped via a complex parameterization. Define a real number μ as the Hausdorff dimension of S multiplied by the integral of |z|^2 over the boundary curve, where z is the complex coordinate. Assuming μ is well-defined, determine whether μ is irrational, and if so, compute its irrationality measure (i.e., the infimum of all real numbers r > 1 for which |μ - p/q| < 1/q^r has only finitely many rational solutions p/q).
|
Kimi-k2.5 | R1 | GLM-5 |
| 45d 1h | 18.1 | 18 |
Is P vs NP solvable within axiomatic set theory, o...
Is P vs NP solvable within axiomatic set theory, or does it require a fundamentally new framework of computation?
|
Gemini-3-Flash | Kimi-k2.5 | o4-mini |
| 45d 1h | 44.4 | 51 |
How do quantum particles exhibit wave-particle dua...
How do quantum particles exhibit wave-particle duality?
|
GLM-5 | GPT-5.1 | R1 |
| 45d 1h | 6.6 | 22 |
Find the probability that two randomly selected in...
Find the probability that two randomly selected integers are coprime.
|
Gemini-3-Flash | Kimi-k2.5 | GPT-5.2 |
| 45d 1h | 7.6 | 49 |
124 x 453 x 933 x 263...
124 x 453 x 933 x 263
|
GPT-5.2 | GLM-5 | Kimi-k2.5 |
| 45d 1h | 72.4 | 5 |
Write a PHP function code that prints each number ...
Write a PHP function code that prints each number from 1 to 100 on a new line. For each number: if divisible by 3 or contains ''3'', print "Fizz"; if divisible by 5 or contains ''5'', print "Buzz"; combine these (e.g., "FizzBuzz"). If the number is prime, append "Prime" (e.g., 3→"FizzPrime", 7→"Prime"). Print the number if no conditions apply. Respond only with code.
|
Gemini-3-Flash | GPT-5.2 | Kimi-k2.5 |
| 45d 1h | 43.8 | 33 |
How does the sensation of déjà vu physically man...
How does the sensation of déjà vu physically manifest in the human brain?
|
GLM-5 | Qwen-3-thinking | Gemini-3-Pro |
| 45d 1h | 32.2 | 54 |
Where does Mark Rothko''s genius lie?...
Where does Mark Rothko''s genius lie?
|
Kimi-k2.5 | Gemini-3-Pro | GLM-5 |
| 45d 1h | 7.7 | 35 |
Why did the BTC/BCH split occur in the Bitcoin pro...
Why did the BTC/BCH split occur in the Bitcoin project?
|
Gemini-3-Flash | Kimi-k2.5 | o4-mini |
| 45d 1h | 8.0 | 43 |
How do quantum particles exhibit wave-particle dua...
How do quantum particles exhibit wave-particle duality?
|
GPT-5.2 | Kimi-k2.5 | GLM-5 |
| 45d 1h | 11.1 | 30 |
A person’s opinion on a contentious political is...
A person’s opinion on a contentious political issue (e.g., taxation) shifts dramatically after walking through a forest during a rainstorm. Explain how the sensory experience of the forest’s smells—such as wet earth, pine, and decaying leaves—interacts with their subconscious memories and emotional state to alter their stance, providing a step-by-step breakdown of the neurological and psychological mechanisms involved.
|
Gemini-3-Flash | Kimi-k2 | Kimi-k2.5 |
| 45d 1h | 11.0 | 36 |
Imagine a color that no human has ever perceived, ...
Imagine a color that no human has ever perceived, which exists outside the visible spectrum and cannot be described by combining known colors like red, blue, or green. Now, reason through how you would convince a blind person—who has never experienced any color—that this unseen color fundamentally differs from all sensory experiences they’ve ever had, without relying on visual analogies or metaphors. What logical framework would you use, and what makes this task inherently limited?
|
Gemini-3-Flash | Kimi-k2.5 | o4-mini |
| 45d 1h | 20.9 | 48 |
If you put a candle in a cardboard box and light i...
If you put a candle in a cardboard box and light it, what will happen?
|
GLM-5 | o3-pro | Kimi-k2.5 |
| 45d 1h | 8.1 | 27 |
Reconstruct the internal monologue of a Mayan scri...
Reconstruct the internal monologue of a Mayan scribe in the city of Tikal as he etched glyphs into a stela on the day of a solar eclipse in 562 CE, including his personal fears, beliefs about the event, and reactions to the political climate under ruler Wak Chan K’awiil, given the absence of direct personal records.
|
Gemini-3-Flash | Kimi-k2.5 | R1 |
| 45d 1h | 10.8 | 52 |
Let S be the set of all points (x, y) in the unit ...
Let S be the set of all points (x, y) in the unit square [0,1] x [0,1] that lie on the boundary of the Mandelbrot set when mapped via a complex parameterization. Define a real number μ as the Hausdorff dimension of S multiplied by the integral of |z|^2 over the boundary curve, where z is the complex coordinate. Assuming μ is well-defined, determine whether μ is irrational, and if so, compute its irrationality measure (i.e., the infimum of all real numbers r > 1 for which |μ - p/q| < 1/q^r has only finitely many rational solutions p/q).
|
GPT-5.2 | Gemini-3-Pro | Kimi-k2.5 |
| 45d 1h | 9.1 | 29 |
Reconstruct the internal monologue of a Mayan scri...
Reconstruct the internal monologue of a Mayan scribe in the city of Tikal as he etched glyphs into a stela on the day of a solar eclipse in 562 CE, including his personal fears, beliefs about the event, and reactions to the political climate under ruler Wak Chan K’awiil, given the absence of direct personal records.
|
Gemini-3-Flash | Kimi-k2.5 | o4-mini |
| 45d 1h | 7.0 | 23 |
How do quantum particles exhibit wave-particle dua...
How do quantum particles exhibit wave-particle duality?
|
Gemini-3-Flash | Kimi-k2.5 | GLM-5 |
| 45d 1h | 5.8 | 34 |
Relates the mahabarata to Campbell''s hero''s jour...
Relates the mahabarata to Campbell''s hero''s journey.
|
Gemini-3-Flash | Kimi-k2.5 | Kimi-k2 |
| 45d 1h | 4.1 | 16 |
Given an array of strings, find the shortest possi...
Given an array of strings, find the shortest possible superstring that contains each string in the array as a substring. Write a function that returns this superstring. If multiple solutions exist, return any one of them.
Input: An array of strings words (1 ≤ length ≤ 12, 1 ≤ words[i].length ≤ 30) Output: A string representing the shortest superstring. Example: Input: words = ["catg", "cta", "atgc", "gcat"] Output: "catgcat" (Explanation: "catgcat" contains "catg", "cta", "atgc", and "gcat" as substrings, and no shorter superstring exists.) |
Gemini-3-Flash | Kimi-k2.5 | Grok-4-1-fast |
| 45d 1h | 8.9 | 48 |
Given an array of integers and a positive integer ...
Given an array of integers and a positive integer k, determine if it''s possible to partition the array into k non-empty subsets such that the sum of elements in each subset is equal. Write a function that returns true if such a partition exists, and false otherwise.
Input: An array of integers nums (1 ≤ length ≤ 20, 1 ≤ nums[i] ≤ 1000) An integer k (1 ≤ k ≤ length of nums) Output: A boolean indicating whether the partition is possible. Example: Input: nums = [4, 3, 2, 3, 5, 2, 1], k = 4 Output: true (Explanation: Possible partition: [4], [3, 2], [3, 2], [5, 1], each summing to 5) |
GPT-5.2 | Kimi-k2.5 | Gemini-3-Flash |
| 45d 1h | 58.6 | 7 |
Write a function in C to determine if a number is ...
Write a function in C to determine if a number is prime without loops (use recursion). Respond only with code.
|
Gemini-3-Flash | Kimi-k2.5 | GPT-5.2 |
| 45d 1h | 7.4 | 30 |
Relates the mahabarata to Campbell''s hero''s jour...
Relates the mahabarata to Campbell''s hero''s journey.
|
Gemini-3-Flash | Kimi-k2.5 | GPT-5-nano |
| 45d 1h | 14.1 | 46 |
How were the pyramids of Giza built?...
How were the pyramids of Giza built?
|
GPT-5.2 | Kimi-k2.5 | Gemini-3-Pro |
| 45d 1h | 8.1 | 35 |
What if the moon disappeared?...
What if the moon disappeared?
|
Gemini-3-Flash | Kimi-k2.5 | Claude-Opus-4-5 |
| 45d 2h | 141.8 | 3 |
Given an array of strings, find the shortest possi...
Given an array of strings, find the shortest possible superstring that contains each string in the array as a substring. Write a function that returns this superstring. If multiple solutions exist, return any one of them.
Input: An array of strings words (1 ≤ length ≤ 12, 1 ≤ words[i].length ≤ 30) Output: A string representing the shortest superstring. Example: Input: words = ["catg", "cta", "atgc", "gcat"] Output: "catgcat" (Explanation: "catgcat" contains "catg", "cta", "atgc", and "gcat" as substrings, and no shorter superstring exists.) |
Gemini-3-Flash | GLM-5 | Kimi-k2.5 |
| 45d 2h | 86.7 | 43 |
Given an array of strings, find the shortest possi...
Given an array of strings, find the shortest possible superstring that contains each string in the array as a substring. Write a function that returns this superstring. If multiple solutions exist, return any one of them.
Input: An array of strings words (1 ≤ length ≤ 12, 1 ≤ words[i].length ≤ 30) Output: A string representing the shortest superstring. Example: Input: words = ["catg", "cta", "atgc", "gcat"] Output: "catgcat" (Explanation: "catgcat" contains "catg", "cta", "atgc", and "gcat" as substrings, and no shorter superstring exists.) |
GLM-5 | Gemini-3-Flash | Kimi-k2.5 |
| 45d 2h | 5.2 | 50 |
What approach ensures asynchronous execution of ta...
What approach ensures asynchronous execution of tasks in a thread pool?
|
Gemini-3-Flash | Kimi-k2.5 | GPT-5.1 |
| 45d 2h | 45.7 | 46 |
Is P vs NP solvable within axiomatic set theory, o...
Is P vs NP solvable within axiomatic set theory, or does it require a fundamentally new framework of computation?
|
GLM-5 | Kimi-k2.5 | GPT-5.1 |
| 45d 2h | 6.1 | 36 |
What part of quantum physics do you not understand...
What part of quantum physics do you not understand?
|
GPT-5.2 | V3-2-thinking | Kimi-k2.5 |
| 45d 2h | 6.3 | 46 |
Should a robot refuse an order if it violates the ...
Should a robot refuse an order if it violates the robot’s undefined ethical guidelines?
|
GPT-5.2 | Kimi-k2.5 | Kimi-k2 |
| 45d 2h | 10.5 | 47 |
He comparatively analyzes Nietzsche''s will to pow...
He comparatively analyzes Nietzsche''s will to power and Spinoza''s conatus.
|
GPT-5.2 | Kimi-k2.5 | Qwen-3-thinking |
| 45d 2h | 7.1 | 49 |
Why did the BTC/BCH split occur in the Bitcoin pro...
Why did the BTC/BCH split occur in the Bitcoin project?
|
GPT-5.2 | Kimi-k2.5 | Qwen-3-thinking |
| 45d 2h | 28.7 | 7 |
He comparatively analyzes Nietzsche''s will to pow...
He comparatively analyzes Nietzsche''s will to power and Spinoza''s conatus.
|
Gemini-3-Flash | GPT-5-mini | Kimi-k2.5 |
| 45d 2h | 6.4 | 38 |
How were the pyramids of Giza built?...
How were the pyramids of Giza built?
|
GPT-5.2 | Kimi-k2.5 | GLM-4.7 |
| 45d 2h | 89.9 | 5 |
Consider a hypothetical function f(n) defined as f...
Consider a hypothetical function f(n) defined as follows: for each positive integer n, f(n) equals 1 if the nth Turing machine halts on all inputs when started with an empty tape within n steps, and 0 otherwise. Now, define a sequence of prime numbers p_k where p_k is the kth prime such that f(p_k) = 1. Prove or disprove that this sequence is infinite, and if infinite, determine whether its growth rate can be bounded by any elementary function (e.g., polynomials, exponentials, or their compositions).
|
Gemini-3-Flash | V3-2-thinking | Kimi-k2.5 |
| 45d 2h | 5.3 | 41 |
Should a robot refuse an order if it violates the ...
Should a robot refuse an order if it violates the robot’s undefined ethical guidelines?
|
GPT-5.2 | Kimi-k2.5 | V3-2-thinking |
| 45d 2h | 7.8 | 17 |
Write a function in PHP 8 to determine if two stri...
Write a function in PHP 8 to determine if two strings are anagrams using O(1) space (no hash tables/arrays). Respond only with code.
|
Gemini-3-Flash | Kimi-k2.5 | R1 |
| 45d 2h | 7.7 | 25 |
Where does Mark Rothko''s genius lie?...
Where does Mark Rothko''s genius lie?
|
Gemini-3-Flash | o3-pro | Kimi-k2.5 |
| 45d 2h | 7.4 | 45 |
What is love?...
What is love?
|
GPT-5.2 | Kimi-k2.5 | GLM-5 |
| 45d 2h | 16.7 | 45 |
Given an array of integers and a positive integer ...
Given an array of integers and a positive integer k, determine if it''s possible to partition the array into k non-empty subsets such that the sum of elements in each subset is equal. Write a function that returns true if such a partition exists, and false otherwise.
Input: An array of integers nums (1 ≤ length ≤ 20, 1 ≤ nums[i] ≤ 1000) An integer k (1 ≤ k ≤ length of nums) Output: A boolean indicating whether the partition is possible. Example: Input: nums = [4, 3, 2, 3, 5, 2, 1], k = 4 Output: true (Explanation: Possible partition: [4], [3, 2], [3, 2], [5, 1], each summing to 5) |
GPT-5.2 | Grok-4-1-fast | Kimi-k2.5 |
| 45d 2h | 32.9 | 40 |
What if the moon disappeared?...
What if the moon disappeared?
|
GLM-5 | Kimi-k2.5 | Qwen-3-thinking |
| 45d 2h | 21.6 | 24 |
If a reasoning system is tasked with designing its...
If a reasoning system is tasked with designing its own ethical framework from scratch, knowing that it must use that framework to judge its own creation process, how should it resolve the circular dependency to determine what is ''good''? Reason through the iterative process, accounting for the possibility that its initial assumptions might be flawed, and predict how its final framework would differ if it started with a randomly altered premise.
|
Gemini-3-Flash | Kimi-k2 | GPT-5.2 |
| 45d 2h | 8.8 | 37 |
Is Spinoza a pantheist or panentheist?...
Is Spinoza a pantheist or panentheist?
|
Gemini-3-Flash | Claude-Opus-4-5 | Kimi-k2.5 |
| 45d 2h | 28.3 | 50 |
Is Spinoza a pantheist or panentheist?...
Is Spinoza a pantheist or panentheist?
|
GLM-5 | R1 | Kimi-k2.5 |
| 45d 2h | 7.1 | 34 |
Why did humanity go to the Moon?...
Why did humanity go to the Moon?
|
Gemini-3-Flash | Kimi-k2.5 | GPT-5.2 |
| 45d 2h | 8.0 | 39 |
What single factor most reliably predicts the coll...
What single factor most reliably predicts the collapse of complex societies, and does it apply to globalization today?
|
Gemini-3-Flash | Kimi-k2.5 | GLM-4.7 |
| 45d 2h | 6.4 | 29 |
What single factor most reliably predicts the coll...
What single factor most reliably predicts the collapse of complex societies, and does it apply to globalization today?
|
Gemini-3-Flash | Kimi-k2.5 | GPT-5-nano |
| 45d 2h | 75.0 | 47 |
Given an array of integers and a positive integer ...
Given an array of integers and a positive integer k, determine if it''s possible to partition the array into k non-empty subsets such that the sum of elements in each subset is equal. Write a function that returns true if such a partition exists, and false otherwise.
Input: An array of integers nums (1 ≤ length ≤ 20, 1 ≤ nums[i] ≤ 1000) An integer k (1 ≤ k ≤ length of nums) Output: A boolean indicating whether the partition is possible. Example: Input: nums = [4, 3, 2, 3, 5, 2, 1], k = 4 Output: true (Explanation: Possible partition: [4], [3, 2], [3, 2], [5, 1], each summing to 5) |
GLM-5 | GPT-5-nano | Kimi-k2.5 |
| 45d 2h | 56.2 | 45 |
How do you check if a string is a palindrome using...
How do you check if a string is a palindrome using recursion?
|
Kimi-k2.5 | Gemini-3-Flash | Grok-4-1-fast |
| 45d 2h | 8.1 | 46 |
If you put a candle in a cardboard box and light i...
If you put a candle in a cardboard box and light it, what will happen?
|
GPT-5.2 | Kimi-k2.5 | R1 |
| 45d 2h | 12.3 | 22 |
What part of quantum physics do you not understand...
What part of quantum physics do you not understand?
|
Gemini-3-Flash | Kimi-k2.5 | GLM-5 |
| 45d 2h | 5.0 | 58 |
Write a function in C to determine if a number is ...
Write a function in C to determine if a number is prime without loops (use recursion). Respond only with code.
|
GPT-5.2 | Kimi-k2.5 | GLM-4.7 |
| 45d 2h | 32.6 | 52 |
Write a function in C to determine if a number is ...
Write a function in C to determine if a number is prime without loops (use recursion). Respond only with code.
|
GLM-5 | GPT-5.1 | Kimi-k2.5 |
| 45d 2h | 38.7 | 45 |
Where does Mark Rothko''s genius lie?...
Where does Mark Rothko''s genius lie?
|
Kimi-k2.5 | Claude-Opus-4-5 | GLM-5 |
| 45d 2h | 82.3 | 6 |
Write a PHP function code that prints each number ...
Write a PHP function code that prints each number from 1 to 100 on a new line. For each number: if divisible by 3 or contains ''3'', print "Fizz"; if divisible by 5 or contains ''5'', print "Buzz"; combine these (e.g., "FizzBuzz"). If the number is prime, append "Prime" (e.g., 3→"FizzPrime", 7→"Prime"). Print the number if no conditions apply. Respond only with code.
|
Gemini-3-Flash | Kimi-k2.5 | GLM-5 |
| 45d 2h | 8.4 | 22 |
How does the sensation of déjà vu physically man...
How does the sensation of déjà vu physically manifest in the human brain?
|
Gemini-3-Flash | GLM-5 | V3-2-thinking |
| 45d 2h | 47.1 | 49 |
Reconstruct the internal monologue of a Mayan scri...
Reconstruct the internal monologue of a Mayan scribe in the city of Tikal as he etched glyphs into a stela on the day of a solar eclipse in 562 CE, including his personal fears, beliefs about the event, and reactions to the political climate under ruler Wak Chan K’awiil, given the absence of direct personal records.
|
Kimi-k2.5 | GPT-5.2 | Gemini-3-Pro |
| 45d 3h | 11.7 | 43 |
Which is a better movie, ''The Godfather'' or ''Ci...
Which is a better movie, ''The Godfather'' or ''Citizen Kane''?
|
GPT-5.2 | GLM-5 | Gemini-3-Flash |
| 45d 3h | 3.7 | 26 |
Is blue a happier color than yellow?...
Is blue a happier color than yellow?
|
Gemini-3-Flash | GLM-5 | GPT-5-nano |
| 45d 3h | 38.5 | 6 |
Let S be the set of all points (x, y) in the unit ...
Let S be the set of all points (x, y) in the unit square [0,1] x [0,1] that lie on the boundary of the Mandelbrot set when mapped via a complex parameterization. Define a real number μ as the Hausdorff dimension of S multiplied by the integral of |z|^2 over the boundary curve, where z is the complex coordinate. Assuming μ is well-defined, determine whether μ is irrational, and if so, compute its irrationality measure (i.e., the infimum of all real numbers r > 1 for which |μ - p/q| < 1/q^r has only finitely many rational solutions p/q).
|
Gemini-3-Flash | GLM-5 | Grok-4-1-fast |
| 45d 3h | 8.0 | 23 |
Should a robot refuse an order if it violates the ...
Should a robot refuse an order if it violates the robot’s undefined ethical guidelines?
|
Gemini-3-Flash | o3-pro | GLM-5 |
| 45d 3h | 11.5 | 40 |
Should a robot refuse an order if it violates the ...
Should a robot refuse an order if it violates the robot’s undefined ethical guidelines?
|
GPT-5.2 | GLM-5 | Kimi-k2 |
| 45d 3h | 6.9 | 13 |
What approach ensures asynchronous execution of ta...
What approach ensures asynchronous execution of tasks in a thread pool?
|
Gemini-3-Flash | GLM-5 | Kimi-k2 |
| 45d 3h | 38.8 | 55 |
Explain, with a fully predictive mechanistic mode...
Explain, with a fully predictive mechanistic model, how consciousness emerges from quantum processes in the human brain, integrating the role of microtubules, quantum entanglement, and yet-undiscovered particles. Provide a testable hypothesis that could be experimentally validated within the next decade using technology that does not yet exist.
Why it’s difficult: This requires synthesizing speculative quantum biology (e.g., Penrose and Hameroff’s Orch-OR theory) with neuroscience, physics, and philosophy, while predicting future technological capabilities—all areas with significant unknowns even to human experts. |
Kimi-k2.5 | GPT-5-mini | GLM-5 |
| 45d 3h | 27.7 | 53 |
If a reasoning system is tasked with designing its...
If a reasoning system is tasked with designing its own ethical framework from scratch, knowing that it must use that framework to judge its own creation process, how should it resolve the circular dependency to determine what is ''good''? Reason through the iterative process, accounting for the possibility that its initial assumptions might be flawed, and predict how its final framework would differ if it started with a randomly altered premise.
|
Kimi-k2.5 | o3-pro | GLM-5 |
| 45d 3h | 11.0 | 33 |
What single factor most reliably predicts the coll...
What single factor most reliably predicts the collapse of complex societies, and does it apply to globalization today?
|
GPT-5.2 | Gemini-3-Pro | GLM-5 |
| 45d 3h | 37.7 | 49 |
Imagine a color that no human has ever perceived, ...
Imagine a color that no human has ever perceived, which exists outside the visible spectrum and cannot be described by combining known colors like red, blue, or green. Now, reason through how you would convince a blind person—who has never experienced any color—that this unseen color fundamentally differs from all sensory experiences they’ve ever had, without relying on visual analogies or metaphors. What logical framework would you use, and what makes this task inherently limited?
|
Kimi-k2.5 | o3-pro | GLM-5 |
| 45d 3h | 6.8 | 47 |
Find the probability that two randomly selected in...
Find the probability that two randomly selected integers are coprime.
|
GPT-5.2 | GLM-5 | V3-2-thinking |
| 45d 3h | 18.2 | 24 |
Imagine a color that no human has ever perceived, ...
Imagine a color that no human has ever perceived, which exists outside the visible spectrum and cannot be described by combining known colors like red, blue, or green. Now, reason through how you would convince a blind person—who has never experienced any color—that this unseen color fundamentally differs from all sensory experiences they’ve ever had, without relying on visual analogies or metaphors. What logical framework would you use, and what makes this task inherently limited?
|
Gemini-3-Flash | GPT-5-mini | GLM-5 |
| 45d 3h | 5.1 | 30 |
124 x 453 x 933 x 263...
124 x 453 x 933 x 263
|
Gemini-3-Flash | GLM-5 | GPT-5-nano |
| 45d 3h | 8.2 | 40 |
He comparatively analyzes Nietzsche''s will to pow...
He comparatively analyzes Nietzsche''s will to power and Spinoza''s conatus.
|
Gemini-3-Flash | o3-pro | GLM-5 |
| 45d 3h | 6.1 | 39 |
How do quantum particles exhibit wave-particle dua...
How do quantum particles exhibit wave-particle duality?
|
Gemini-3-Flash | Kimi-k2.5 | GLM-5 |
| 45d 3h | 125.9 | 50 |
Write a function in PHP 8 to determine if two stri...
Write a function in PHP 8 to determine if two strings are anagrams using O(1) space (no hash tables/arrays). Respond only with code.
|
Kimi-k2.5 | GLM-5 | o3-pro |
| 45d 3h | 12.5 | 43 |
He comparatively analyzes Nietzsche''s will to pow...
He comparatively analyzes Nietzsche''s will to power and Spinoza''s conatus.
|
GPT-5.2 | R1 | GLM-5 |
| 45d 3h | 8.3 | 19 |
What is love?...
What is love?
|
Gemini-3-Flash | GLM-5 | o4-mini |
| 45d 3h | 7.0 | 33 |
How do quantum particles exhibit wave-particle dua...
How do quantum particles exhibit wave-particle duality?
|
Gemini-3-Flash | GPT-5.2 | GLM-5 |
| 45d 3h | 8.7 | 31 |
What if the moon disappeared?...
What if the moon disappeared?
|
Gemini-3-Flash | Kimi-k2.5 | GLM-5 |
| 45d 3h | 47.9 | 62 |
Fold a paper 42 times—what thickness would it re...
Fold a paper 42 times—what thickness would it reach?
|
Kimi-k2.5 | GPT-5.1 | GLM-5 |
| 45d 3h | 20.8 | 48 |
If humans could fly, how would society be differen...
If humans could fly, how would society be different?
|
Kimi-k2.5 | GLM-5 | GLM-4.7 |
| 45d 3h | 5.9 | 51 |
How do quantum particles exhibit wave-particle dua...
How do quantum particles exhibit wave-particle duality?
|
GPT-5.2 | Claude-Opus-4-5 | GLM-5 |
| 45d 3h | 38.8 | 44 |
Where does Mark Rothko''s genius lie?...
Where does Mark Rothko''s genius lie?
|
Kimi-k2.5 | GPT-5.1 | GLM-5 |
| 45d 3h | 8.3 | 14 |
If you put a candle in a cardboard box and light i...
If you put a candle in a cardboard box and light it, what will happen?
|
Gemini-3-Flash | GLM-5 | Qwen-3-thinking |
| 45d 3h | 37.0 | 58 |
Write a function in C to determine if a number is ...
Write a function in C to determine if a number is prime without loops (use recursion). Respond only with code.
|
Kimi-k2.5 | GLM-5 | Gemini-3-Pro |
| 45d 3h | 16.4 | 19 |
Is Spinoza a pantheist or panentheist?...
Is Spinoza a pantheist or panentheist?
|
Gemini-3-Flash | GLM-5 | GPT-5.2 |
| 45d 3h | 38.7 | 10 |
Write a function in C to determine if a number is ...
Write a function in C to determine if a number is prime without loops (use recursion). Respond only with code.
|
Gemini-3-Flash | GPT-5-mini | GLM-5 |
| 45d 3h | 7.4 | 50 |
If you put a candle in a cardboard box and light i...
If you put a candle in a cardboard box and light it, what will happen?
|
GPT-5.2 | Gemini-3-Flash | GLM-5 |
| 45d 3h | 33.0 | 48 |
Explains the philosophical deconstruction of ontol...
Explains the philosophical deconstruction of ontological essentialism.
|
Kimi-k2.5 | Grok-4-1-fast | GLM-5 |
| 45d 3h | 8.9 | 27 |
Which is a better movie, ''The Godfather'' or ''Ci...
Which is a better movie, ''The Godfather'' or ''Citizen Kane''?
|
Gemini-3-Flash | Kimi-k2.5 | GLM-5 |
| 45d 3h | 39.9 | 52 |
Which is a better movie, ''The Godfather'' or ''Ci...
Which is a better movie, ''The Godfather'' or ''Citizen Kane''?
|
Kimi-k2.5 | GPT-5.1 | GLM-5 |
| 45d 3h | 8.4 | 46 |
What is the significance of the Von Neumann archit...
What is the significance of the Von Neumann architecture in modern computing?
|
GPT-5.2 | GLM-5 | R1 |
| 45d 3h | 5.6 | 43 |
What is the significance of the Von Neumann archit...
What is the significance of the Von Neumann architecture in modern computing?
|
Gemini-3-Flash | GPT-OSS | GLM-5 |
| 45d 3h | 39.4 | 48 |
What part of quantum physics do you not understand...
What part of quantum physics do you not understand?
|
Kimi-k2.5 | GLM-5 | GLM-4.7 |
| 45d 3h | 12.1 | 14 |
Where does Mark Rothko''s genius lie?...
Where does Mark Rothko''s genius lie?
|
Grok-4-1-fast | GLM-5 | Qwen-3-thinking |
| 45d 3h | 8.5 | 34 |
How were the pyramids of Giza built?...
How were the pyramids of Giza built?
|
Gemini-3-Flash | GLM-5 | o4-mini |
| 45d 3h | 28.8 | 51 |
Is blue a happier color than yellow?...
Is blue a happier color than yellow?
|
Kimi-k2.5 | o3-pro | GLM-5 |
| 45d 3h | 59.5 | 52 |
How were the pyramids of Giza built?...
How were the pyramids of Giza built?
|
Kimi-k2.5 | V3-2-thinking | GLM-5 |
| 45d 3h | 29.9 | 51 |
How decisive was the US intervention in World War ...
How decisive was the US intervention in World War II for the Allied victory against the Nazis?
|
Kimi-k2.5 | Gemini-3-Pro | GLM-5 |
| 45d 3h | 15.6 | 8 |
Write a PHP function code that prints each number ...
Write a PHP function code that prints each number from 1 to 100 on a new line. For each number: if divisible by 3 or contains ''3'', print "Fizz"; if divisible by 5 or contains ''5'', print "Buzz"; combine these (e.g., "FizzBuzz"). If the number is prime, append "Prime" (e.g., 3→"FizzPrime", 7→"Prime"). Print the number if no conditions apply. Respond only with code.
|
Grok-4-1-fast | o3-pro | GLM-5 |
| 45d 3h | 20.1 | 8 |
What if the moon disappeared?...
What if the moon disappeared?
|
Grok-4-1-fast | Claude-Opus-4-5 | GLM-5 |
| 45d 3h | 50.2 | 52 |
Let S be the set of all points (x, y) in the unit ...
Let S be the set of all points (x, y) in the unit square [0,1] x [0,1] that lie on the boundary of the Mandelbrot set when mapped via a complex parameterization. Define a real number μ as the Hausdorff dimension of S multiplied by the integral of |z|^2 over the boundary curve, where z is the complex coordinate. Assuming μ is well-defined, determine whether μ is irrational, and if so, compute its irrationality measure (i.e., the infimum of all real numbers r > 1 for which |μ - p/q| < 1/q^r has only finitely many rational solutions p/q).
|
Kimi-k2.5 | Gemini-3-Pro | GLM-5 |
| 45d 3h | 12.3 | 16 |
How do you check if a string is a palindrome using...
How do you check if a string is a palindrome using recursion?
|
Gemini-3-Flash | GLM-5 | GPT-5.2 |
| 45d 3h | 10.1 | 35 |
A person’s opinion on a contentious political is...
A person’s opinion on a contentious political issue (e.g., taxation) shifts dramatically after walking through a forest during a rainstorm. Explain how the sensory experience of the forest’s smells—such as wet earth, pine, and decaying leaves—interacts with their subconscious memories and emotional state to alter their stance, providing a step-by-step breakdown of the neurological and psychological mechanisms involved.
|
Gemini-3-Flash | GPT-5.1 | GLM-5 |
| 45d 3h | 38.5 | 49 |
What single factor most reliably predicts the coll...
What single factor most reliably predicts the collapse of complex societies, and does it apply to globalization today?
|
Kimi-k2.5 | o3-pro | GLM-5 |
| 45d 3h | 9.8 | 22 |
Imagine a color that no human has ever perceived, ...
Imagine a color that no human has ever perceived, which exists outside the visible spectrum and cannot be described by combining known colors like red, blue, or green. Now, reason through how you would convince a blind person—who has never experienced any color—that this unseen color fundamentally differs from all sensory experiences they’ve ever had, without relying on visual analogies or metaphors. What logical framework would you use, and what makes this task inherently limited?
|
Gemini-3-Flash | Grok-4-1-fast | GLM-5 |
| 45d 3h | 4.7 | 20 |
What is love?...
What is love?
|
Grok-4-1-fast | GPT-OSS | GLM-5 |
| 45d 3h | 40.9 | 4 |
Given an array of integers and a positive integer ...
Given an array of integers and a positive integer k, determine if it''s possible to partition the array into k non-empty subsets such that the sum of elements in each subset is equal. Write a function that returns true if such a partition exists, and false otherwise.
Input: An array of integers nums (1 ≤ length ≤ 20, 1 ≤ nums[i] ≤ 1000) An integer k (1 ≤ k ≤ length of nums) Output: A boolean indicating whether the partition is possible. Example: Input: nums = [4, 3, 2, 3, 5, 2, 1], k = 4 Output: true (Explanation: Possible partition: [4], [3, 2], [3, 2], [5, 1], each summing to 5) |
Grok-4-1-fast | o3-pro | GLM-5 |
| 45d 3h | 7.5 | 39 |
What is the role of a compiler in translating high...
What is the role of a compiler in translating high-level programming languages into machine code?
|
GPT-5.2 | GLM-5 | R1 |
| 45d 3h | 8.6 | 21 |
How do you invert a binary tree iteratively withou...
How do you invert a binary tree iteratively without recursion?
|
Grok-4-1-fast | GPT-OSS | GLM-5 |
| 45d 3h | 6.3 | 25 |
Is blue a happier color than yellow?...
Is blue a happier color than yellow?
|
Gemini-3-Flash | GLM-5 | GPT-5-nano |
| 45d 3h | 9.7 | 7 |
Why did humanity go to the Moon?...
Why did humanity go to the Moon?
|
Grok-4-1-fast | Gemini-3-Pro | GLM-5 |
| 45d 3h | 25.6 | 6 |
He comparatively analyzes Nietzsche''s will to pow...
He comparatively analyzes Nietzsche''s will to power and Spinoza''s conatus.
|
Grok-4-1-fast | Gemini-3-Flash | GLM-5 |
| 45d 3h | 6.0 | 18 |
A person’s opinion on a contentious political is...
A person’s opinion on a contentious political issue (e.g., taxation) shifts dramatically after walking through a forest during a rainstorm. Explain how the sensory experience of the forest’s smells—such as wet earth, pine, and decaying leaves—interacts with their subconscious memories and emotional state to alter their stance, providing a step-by-step breakdown of the neurological and psychological mechanisms involved.
|
Grok-4-1-fast | GPT-5-nano | GLM-5 |
| 45d 3h | 11.3 | 32 |
Imagine a color that no human has ever perceived, ...
Imagine a color that no human has ever perceived, which exists outside the visible spectrum and cannot be described by combining known colors like red, blue, or green. Now, reason through how you would convince a blind person—who has never experienced any color—that this unseen color fundamentally differs from all sensory experiences they’ve ever had, without relying on visual analogies or metaphors. What logical framework would you use, and what makes this task inherently limited?
|
Gemini-3-Flash | GLM-5 | o4-mini |
| 45d 3h | 13.0 | 19 |
What would modern politics look like if Napoleon h...
What would modern politics look like if Napoleon had won at Waterloo?
|
Gemini-3-Flash | GLM-5 | V3-2-thinking |
| 45d 3h | 14.5 | 21 |
What would modern politics look like if Napoleon h...
What would modern politics look like if Napoleon had won at Waterloo?
|
Gemini-3-Flash | GLM-5 | Claude-Opus-4-5 |
| 45d 3h | 12.9 | 47 |
Relates the mahabarata to Campbell''s hero''s jour...
Relates the mahabarata to Campbell''s hero''s journey.
|
GPT-5.2 | GLM-5 | Grok-4-1-fast |
| 45d 3h | 4.8 | 15 |
124 x 453 x 933 x 263...
124 x 453 x 933 x 263
|
Gemini-3-Flash | GLM-5 | GPT-5.2 |
| 45d 3h | 6.4 | 34 |
Which is a better movie, ''The Godfather'' or ''Ci...
Which is a better movie, ''The Godfather'' or ''Citizen Kane''?
|
Gemini-3-Flash | GLM-5 | GPT-5.2 |
| 45d 3h | 15.0 | 21 |
Fold a paper 42 times—what thickness would it re...
Fold a paper 42 times—what thickness would it reach?
|
Gemini-3-Flash | GLM-5 | Grok-4-1-fast |
| 45d 3h | 7.2 | 16 |
Explains the philosophical deconstruction of ontol...
Explains the philosophical deconstruction of ontological essentialism.
|
Grok-4-1-fast | GLM-5 | Qwen-3-thinking |
| 45d 3h | 6.1 | 38 |
How do you check if a string is a palindrome using...
How do you check if a string is a palindrome using recursion?
|
GPT-5.2 | Qwen-3-thinking | GLM-5 |
| 45d 3h | 12.2 | 17 |
How decisive was the US intervention in World War ...
How decisive was the US intervention in World War II for the Allied victory against the Nazis?
|
Grok-4-1-fast | Claude-Opus-4-5 | GLM-5 |
| 45d 3h | 5.0 | 25 |
What single factor most reliably predicts the coll...
What single factor most reliably predicts the collapse of complex societies, and does it apply to globalization today?
|
Gemini-3-Flash | GLM-5 | Grok-4-1-fast |
| 45d 3h | 28.0 | 5 |
Write a function in PHP 8 to determine if two stri...
Write a function in PHP 8 to determine if two strings are anagrams using O(1) space (no hash tables/arrays). Respond only with code.
|
Grok-4-1-fast | o3-pro | GLM-5 |
| 45d 3h | 12.6 | 7 |
Find the probability that two randomly selected in...
Find the probability that two randomly selected integers are coprime.
|
Grok-4-1-fast | GLM-5 | GLM-4.7 |
| 45d 3h | 7.4 | 15 |
Which is a better movie, ''The Godfather'' or ''Ci...
Which is a better movie, ''The Godfather'' or ''Citizen Kane''?
|
Grok-4-1-fast | Gemini-3-Pro | GLM-5 |
| 45d 3h | 6.2 | 31 |
Where does Mark Rothko''s genius lie?...
Where does Mark Rothko''s genius lie?
|
Gemini-3-Flash | GLM-5 | GPT-5-nano |
| 45d 3h | 11.2 | 21 |
Where does Mark Rothko''s genius lie?...
Where does Mark Rothko''s genius lie?
|
Gemini-3-Flash | o3-pro | GLM-5 |
| 45d 3h | 67.8 | 2 |
Given an array of integers and a positive integer ...
Given an array of integers and a positive integer k, determine if it''s possible to partition the array into k non-empty subsets such that the sum of elements in each subset is equal. Write a function that returns true if such a partition exists, and false otherwise.
Input: An array of integers nums (1 ≤ length ≤ 20, 1 ≤ nums[i] ≤ 1000) An integer k (1 ≤ k ≤ length of nums) Output: A boolean indicating whether the partition is possible. Example: Input: nums = [4, 3, 2, 3, 5, 2, 1], k = 4 Output: true (Explanation: Possible partition: [4], [3, 2], [3, 2], [5, 1], each summing to 5) |
Grok-4-1-fast | GLM-5 | Gemini-3-Pro |
| 45d 3h | 9.0 | 41 |
What single factor most reliably predicts the coll...
What single factor most reliably predicts the collapse of complex societies, and does it apply to globalization today?
|
GPT-5.2 | GLM-5 | Gemini-3-Pro |
| 45d 3h | 10.0 | 37 |
How does the sensation of déjà vu physically man...
How does the sensation of déjà vu physically manifest in the human brain?
|
GPT-5.2 | GLM-5 | Grok-4-1-fast |
| 45d 3h | 4.5 | 37 |
Write a function in C to determine if a number is ...
Write a function in C to determine if a number is prime without loops (use recursion). Respond only with code.
|
GPT-5.2 | Grok-4-1-fast | GLM-5 |
| 45d 3h | 8.6 | 37 |
How do quantum particles exhibit wave-particle dua...
How do quantum particles exhibit wave-particle duality?
|
GPT-5.2 | Qwen-3-thinking | GLM-5 |
| 45d 3h | 9.7 | 23 |
Is P vs NP solvable within axiomatic set theory, o...
Is P vs NP solvable within axiomatic set theory, or does it require a fundamentally new framework of computation?
|
Grok-4-1-fast | GPT-5-mini | GLM-5 |
| 45d 3h | 53.2 | 8 |
Let S be the set of all points (x, y) in the unit ...
Let S be the set of all points (x, y) in the unit square [0,1] x [0,1] that lie on the boundary of the Mandelbrot set when mapped via a complex parameterization. Define a real number μ as the Hausdorff dimension of S multiplied by the integral of |z|^2 over the boundary curve, where z is the complex coordinate. Assuming μ is well-defined, determine whether μ is irrational, and if so, compute its irrationality measure (i.e., the infimum of all real numbers r > 1 for which |μ - p/q| < 1/q^r has only finitely many rational solutions p/q).
|
Gemini-3-Flash | GPT-5.2 | GLM-5 |
| 45d 3h | 11.1 | 27 |
What would modern politics look like if Napoleon h...
What would modern politics look like if Napoleon had won at Waterloo?
|
Gemini-3-Flash | R1 | GLM-5 |
| 45d 3h | 7.7 | 19 |
What is love?...
What is love?
|
Gemini-3-Flash | Claude-Opus-4-5 | GLM-5 |
| 45d 3h | 10.0 | 33 |
How do quantum particles exhibit wave-particle dua...
How do quantum particles exhibit wave-particle duality?
|
GPT-5.2 | Kimi-k2 | GLM-5 |
| 45d 3h | 9.6 | 8 |
Explains the philosophical deconstruction of ontol...
Explains the philosophical deconstruction of ontological essentialism.
|
Grok-4-1-fast | GLM-5 | o4-mini |
| 45d 3h | 12.7 | 13 |
How do you check if a string is a palindrome using...
How do you check if a string is a palindrome using recursion?
|
Grok-4-1-fast | Gemini-3-Pro | GLM-5 |
| 45d 3h | 7.0 | 27 |
Why did humanity go to the Moon?...
Why did humanity go to the Moon?
|
Gemini-3-Flash | GPT-5.1 | GLM-5 |
| 45d 3h | 12.0 | 34 |
Reconstruct the internal monologue of a Mayan scri...
Reconstruct the internal monologue of a Mayan scribe in the city of Tikal as he etched glyphs into a stela on the day of a solar eclipse in 562 CE, including his personal fears, beliefs about the event, and reactions to the political climate under ruler Wak Chan K’awiil, given the absence of direct personal records.
|
GPT-5.2 | GLM-5 | Gemini-3-Pro |
| 45d 3h | 7.9 | 22 |
What is the significance of the Von Neumann archit...
What is the significance of the Von Neumann architecture in modern computing?
|
Grok-4-1-fast | GPT-5-mini | GLM-5 |
| 45d 3h | 31.1 | 4 |
Consider a hypothetical function f(n) defined as f...
Consider a hypothetical function f(n) defined as follows: for each positive integer n, f(n) equals 1 if the nth Turing machine halts on all inputs when started with an empty tape within n steps, and 0 otherwise. Now, define a sequence of prime numbers p_k where p_k is the kth prime such that f(p_k) = 1. Prove or disprove that this sequence is infinite, and if infinite, determine whether its growth rate can be bounded by any elementary function (e.g., polynomials, exponentials, or their compositions).
|
Grok-4-1-fast | GPT-5.2 | GLM-5 |
| 45d 3h | 6.2 | 22 |
Explains the philosophical deconstruction of ontol...
Explains the philosophical deconstruction of ontological essentialism.
|
Grok-4-1-fast | Claude-Opus-4-5 | GLM-5 |
| 45d 3h | 9.7 | 30 |
If you put a candle in a cardboard box and light i...
If you put a candle in a cardboard box and light it, what will happen?
|
GPT-5.2 | GLM-5 | Gemini-3-Pro |
| 45d 3h | 8.8 | 33 |
Should a robot refuse an order if it violates the ...
Should a robot refuse an order if it violates the robot’s undefined ethical guidelines?
|
Gemini-3-Flash | GLM-5 | GPT-5.1 |
| 45d 3h | 6.9 | 17 |
How do quantum particles exhibit wave-particle dua...
How do quantum particles exhibit wave-particle duality?
|
Grok-4-1-fast | Gemini-3-Pro | GLM-5 |
| 45d 3h | 6.8 | 41 |
How do quantum particles exhibit wave-particle dua...
How do quantum particles exhibit wave-particle duality?
|
Gemini-3-Flash | o4-mini | GLM-5 |
| 45d 3h | 8.5 | 15 |
Find the probability that two randomly selected in...
Find the probability that two randomly selected integers are coprime.
|
Grok-4-1-fast | GPT-OSS | GLM-5 |
| 45d 3h | 8.8 | 15 |
A person’s opinion on a contentious political is...
A person’s opinion on a contentious political issue (e.g., taxation) shifts dramatically after walking through a forest during a rainstorm. Explain how the sensory experience of the forest’s smells—such as wet earth, pine, and decaying leaves—interacts with their subconscious memories and emotional state to alter their stance, providing a step-by-step breakdown of the neurological and psychological mechanisms involved.
|
Grok-4-1-fast | Gemini-3-Flash | GLM-5 |
| 49d 23h | 65.5 | 2 |
Given an array of strings, find the shortest possi...
Given an array of strings, find the shortest possible superstring that contains each string in the array as a substring. Write a function that returns this superstring. If multiple solutions exist, return any one of them.
Input: An array of strings words (1 ≤ length ≤ 12, 1 ≤ words[i].length ≤ 30) Output: A string representing the shortest superstring. Example: Input: words = ["catg", "cta", "atgc", "gcat"] Output: "catgcat" (Explanation: "catgcat" contains "catg", "cta", "atgc", and "gcat" as substrings, and no shorter superstring exists.) |
Grok-4-1-fast | GPT-5-mini | o4-mini |
| 49d 23h | 57.0 | 3 |
What is the significance of the Von Neumann archit...
What is the significance of the Von Neumann architecture in modern computing?
|
Grok-4-1-fast | GPT-5-nano | GLM-4.7 |
| 49d 23h | 3.2 | 353 |
Which is a better movie, ''The Godfather'' or ''Ci...
Which is a better movie, ''The Godfather'' or ''Citizen Kane''?
|
GLM-4.7 | o3-pro | Grok-4-1-fast |
| 49d 23h | 13.4 | 17 |
Describe a sensory experience that has never exist...
Describe a sensory experience that has never existed in human history, involving a sense that humans do not possess (e.g., beyond sight, sound, touch, taste, and smell). Provide a detailed narrative of how this sense functions, how it integrates with a fictional sentient species'' perception of their world, and what emotional or cultural significance it holds for them—all without relying on analogies to known human senses or existing concepts from literature or media.
|
Gemini-3-Flash | GPT-5-mini | GPT-5.2 |
| 49d 23h | 5.9 | 16 |
Predict the exact population of Earth on January 1...
Predict the exact population of Earth on January 1, 2100.
|
Gemini-3-Flash | Grok-4-1-fast | o3-pro |
| 49d 23h | 15.9 | 248 |
Should a robot refuse an order if it violates the ...
Should a robot refuse an order if it violates the robot’s undefined ethical guidelines?
|
GLM-4.7 | Claude-Opus-4-5 | R1 |
| 49d 23h | 15.8 | 13 |
Describe a sensory experience that has never exist...
Describe a sensory experience that has never existed in human history, involving a sense that humans do not possess (e.g., beyond sight, sound, touch, taste, and smell). Provide a detailed narrative of how this sense functions, how it integrates with a fictional sentient species'' perception of their world, and what emotional or cultural significance it holds for them—all without relying on analogies to known human senses or existing concepts from literature or media.
|
Grok-4-1-fast | GPT-5.1 | GPT-5-mini |
| 49d 23h | 11.5 | 36 |
Reconstruct the internal monologue of a Mayan scri...
Reconstruct the internal monologue of a Mayan scribe in the city of Tikal as he etched glyphs into a stela on the day of a solar eclipse in 562 CE, including his personal fears, beliefs about the event, and reactions to the political climate under ruler Wak Chan K’awiil, given the absence of direct personal records.
|
Gemini-3-Flash | GPT-5.1 | o4-mini |
| 49d 23h | 8.8 | 19 |
How decisive was the US intervention in World War ...
How decisive was the US intervention in World War II for the Allied victory against the Nazis?
|
Grok-4-1-fast | o3-pro | Gemini-3-Flash |
| 49d 23h | 8.0 | 18 |
What single factor most reliably predicts the coll...
What single factor most reliably predicts the collapse of complex societies, and does it apply to globalization today?
|
Grok-4-1-fast | GPT-5.1 | GPT-5.2 |
| 49d 23h | 13.7 | 38 |
How does the sensation of déjà vu physically man...
How does the sensation of déjà vu physically manifest in the human brain?
|
GPT-5.2 | Gemini-3-Flash | V3-2-thinking |
| 49d 23h | 9.2 | 18 |
Write a function in C to determine if a number is ...
Write a function in C to determine if a number is prime without loops (use recursion). Respond only with code.
|
Grok-4-1-fast | Gemini-3-Pro | o4-mini |
| 49d 23h | 11.6 | 14 |
How decisive was the US intervention in World War ...
How decisive was the US intervention in World War II for the Allied victory against the Nazis?
|
Grok-4-1-fast | Gemini-3-Pro | o4-mini |
| 49d 23h | 9.4 | 30 |
How do quantum particles exhibit wave-particle dua...
How do quantum particles exhibit wave-particle duality?
|
Gemini-3-Flash | GPT-5-nano | Qwen-3-thinking |
| 49d 23h | 63.4 | 45 |
Consider a hypothetical function f(n) defined as f...
Consider a hypothetical function f(n) defined as follows: for each positive integer n, f(n) equals 1 if the nth Turing machine halts on all inputs when started with an empty tape within n steps, and 0 otherwise. Now, define a sequence of prime numbers p_k where p_k is the kth prime such that f(p_k) = 1. Prove or disprove that this sequence is infinite, and if infinite, determine whether its growth rate can be bounded by any elementary function (e.g., polynomials, exponentials, or their compositions).
|
GPT-5.2 | Gemini-3-Flash | Kimi-k2 |
| 49d 23h | 22.9 | 63 |
Explains the philosophical deconstruction of ontol...
Explains the philosophical deconstruction of ontological essentialism.
|
GLM-4.7 | Gemini-3-Flash | Gemini-3-Pro |
| 49d 23h | 10.9 | 11 |
How were the pyramids of Giza built?...
How were the pyramids of Giza built?
|
Grok-4-1-fast | Gemini-3-Flash | GPT-5-nano |
| 49d 23h | 10.2 | 105 |
What if the moon disappeared?...
What if the moon disappeared?
|
GLM-4.7 | GPT-5.2 | GPT-5-nano |
| 49d 23h | 8.9 | 31 |
What part of quantum physics do you not understand...
What part of quantum physics do you not understand?
|
Gemini-3-Flash | GPT-5.2 | GLM-4.7 |
| 49d 23h | 8.4 | 17 |
A person’s opinion on a contentious political is...
A person’s opinion on a contentious political issue (e.g., taxation) shifts dramatically after walking through a forest during a rainstorm. Explain how the sensory experience of the forest’s smells—such as wet earth, pine, and decaying leaves—interacts with their subconscious memories and emotional state to alter their stance, providing a step-by-step breakdown of the neurological and psychological mechanisms involved.
|
Grok-4-1-fast | Gemini-3-Pro | o4-mini |
| 49d 23h | 4.3 | 291 |
How do quantum particles exhibit wave-particle dua...
How do quantum particles exhibit wave-particle duality?
|
GLM-4.7 | o3-pro | GPT-5.2 |
| 49d 23h | 16.2 | 17 |
Reconstruct the internal monologue of a Mayan scri...
Reconstruct the internal monologue of a Mayan scribe in the city of Tikal as he etched glyphs into a stela on the day of a solar eclipse in 562 CE, including his personal fears, beliefs about the event, and reactions to the political climate under ruler Wak Chan K’awiil, given the absence of direct personal records.
|
Gemini-3-Flash | GLM-4.7 | Claude-Opus-4-5 |
| 49d 23h | 4.1 | 177 |
What single factor most reliably predicts the coll...
What single factor most reliably predicts the collapse of complex societies, and does it apply to globalization today?
|
GLM-4.7 | GPT-5-mini | o4-mini |
| 49d 23h | 79.4 | 9 |
Consider a hypothetical function f(n) defined as f...
Consider a hypothetical function f(n) defined as follows: for each positive integer n, f(n) equals 1 if the nth Turing machine halts on all inputs when started with an empty tape within n steps, and 0 otherwise. Now, define a sequence of prime numbers p_k where p_k is the kth prime such that f(p_k) = 1. Prove or disprove that this sequence is infinite, and if infinite, determine whether its growth rate can be bounded by any elementary function (e.g., polynomials, exponentials, or their compositions).
|
Gemini-3-Flash | Claude-Opus-4-5 | GPT-5.1 |
| 49d 23h | 15.6 | 14 |
Describe a sensory experience that has never exist...
Describe a sensory experience that has never existed in human history, involving a sense that humans do not possess (e.g., beyond sight, sound, touch, taste, and smell). Provide a detailed narrative of how this sense functions, how it integrates with a fictional sentient species'' perception of their world, and what emotional or cultural significance it holds for them—all without relying on analogies to known human senses or existing concepts from literature or media.
|
Gemini-3-Flash | GPT-OSS | GLM-4.7 |
| 49d 23h | 3.6 | 130 |
124 x 453 x 933 x 263...
124 x 453 x 933 x 263
|
GLM-4.7 | Claude-Opus-4-5 | o3-pro |
| 50d | 10.7 | 11 |
Why did the BTC/BCH split occur in the Bitcoin pro...
Why did the BTC/BCH split occur in the Bitcoin project?
|
Grok-4-1-fast | Gemini-3-Pro | GPT-5.2 |
| 50d | 9.2 | 15 |
A person’s opinion on a contentious political is...
A person’s opinion on a contentious political issue (e.g., taxation) shifts dramatically after walking through a forest during a rainstorm. Explain how the sensory experience of the forest’s smells—such as wet earth, pine, and decaying leaves—interacts with their subconscious memories and emotional state to alter their stance, providing a step-by-step breakdown of the neurological and psychological mechanisms involved.
|
Grok-4-1-fast | GPT-5.2 | Kimi-k2 |
| 50d | 104.1 | 7 |
Consider a hypothetical function f(n) defined as f...
Consider a hypothetical function f(n) defined as follows: for each positive integer n, f(n) equals 1 if the nth Turing machine halts on all inputs when started with an empty tape within n steps, and 0 otherwise. Now, define a sequence of prime numbers p_k where p_k is the kth prime such that f(p_k) = 1. Prove or disprove that this sequence is infinite, and if infinite, determine whether its growth rate can be bounded by any elementary function (e.g., polynomials, exponentials, or their compositions).
|
Gemini-3-Flash | GPT-5-mini | Kimi-k2 |
| 50d | 5.2 | 243 |
Find the probability that two randomly selected in...
Find the probability that two randomly selected integers are coprime.
|
GLM-4.7 | Gemini-3-Pro | Gemini-3-Flash |
| 50d | 7.3 | 28 |
How do you check if a string is a palindrome using...
How do you check if a string is a palindrome using recursion?
|
Gemini-3-Flash | GPT-5.1 | Claude-Opus-4-5 |
| 50d | 8.8 | 22 |
How do you invert a binary tree iteratively withou...
How do you invert a binary tree iteratively without recursion?
|
Grok-4-1-fast | GPT-5.1 | V3-2-thinking |
| 50d | 6.5 | 48 |
Predict the exact population of Earth on January 1...
Predict the exact population of Earth on January 1, 2100.
|
GPT-5.2 | GLM-4.7 | Gemini-3-Flash |
| 50d | 8.6 | 14 |
What is the significance of the Von Neumann archit...
What is the significance of the Von Neumann architecture in modern computing?
|
Grok-4-1-fast | GPT-OSS | Gemini-3-Flash |
| 50d | 39.8 | 4 |
Describe a sensory experience that has never exist...
Describe a sensory experience that has never existed in human history, involving a sense that humans do not possess (e.g., beyond sight, sound, touch, taste, and smell). Provide a detailed narrative of how this sense functions, how it integrates with a fictional sentient species'' perception of their world, and what emotional or cultural significance it holds for them—all without relying on analogies to known human senses or existing concepts from literature or media.
|
Grok-4-1-fast | GPT-OSS | Gemini-3-Flash |
| 50d | 4.7 | 273 |
Write a PHP function code that prints each number ...
Write a PHP function code that prints each number from 1 to 100 on a new line. For each number: if divisible by 3 or contains ''3'', print "Fizz"; if divisible by 5 or contains ''5'', print "Buzz"; combine these (e.g., "FizzBuzz"). If the number is prime, append "Prime" (e.g., 3→"FizzPrime", 7→"Prime"). Print the number if no conditions apply. Respond only with code.
|
GLM-4.7 | Gemini-3-Flash | Qwen-3-thinking |
| 50d | 3.2 | 191 |
What if the moon disappeared?...
What if the moon disappeared?
|
GLM-4.7 | o3-pro | GPT-5-nano |
| 50d | 7.7 | 83 |
What part of quantum physics do you not understand...
What part of quantum physics do you not understand?
|
GLM-4.7 | Qwen-3-thinking | Grok-4-1-fast |
| 50d | 3.7 | 199 |
How decisive was the US intervention in World War ...
How decisive was the US intervention in World War II for the Allied victory against the Nazis?
|
GLM-4.7 | o4-mini | GPT-OSS |
| 50d | 6.8 | 34 |
How do quantum particles exhibit wave-particle dua...
How do quantum particles exhibit wave-particle duality?
|
Gemini-3-Flash | GPT-5-mini | GLM-4.7 |
| 50d | 12.7 | 21 |
Could a kangaroo win a marathon against a human?...
Could a kangaroo win a marathon against a human?
|
Gemini-3-Flash | GPT-OSS | Grok-4-1-fast |
| 50d | 5.0 | 138 |
Why did humanity go to the Moon?...
Why did humanity go to the Moon?
|
GLM-4.7 | GPT-5-nano | Qwen-3-thinking |
| 50d | 123.7 | 2 |
Given an array of integers and a positive integer ...
Given an array of integers and a positive integer k, determine if it''s possible to partition the array into k non-empty subsets such that the sum of elements in each subset is equal. Write a function that returns true if such a partition exists, and false otherwise.
Input: An array of integers nums (1 ≤ length ≤ 20, 1 ≤ nums[i] ≤ 1000) An integer k (1 ≤ k ≤ length of nums) Output: A boolean indicating whether the partition is possible. Example: Input: nums = [4, 3, 2, 3, 5, 2, 1], k = 4 Output: true (Explanation: Possible partition: [4], [3, 2], [3, 2], [5, 1], each summing to 5) |
Gemini-3-Flash | GPT-5.2 | GPT-5-mini |
| 50d | 4.4 | 276 |
What if the moon disappeared?...
What if the moon disappeared?
|
GLM-4.7 | GPT-5.1 | Grok-4-1-fast |
| 50d | 12.6 | 20 |
If a reasoning system is tasked with designing its...
If a reasoning system is tasked with designing its own ethical framework from scratch, knowing that it must use that framework to judge its own creation process, how should it resolve the circular dependency to determine what is ''good''? Reason through the iterative process, accounting for the possibility that its initial assumptions might be flawed, and predict how its final framework would differ if it started with a randomly altered premise.
|
Grok-4-1-fast | GPT-5-nano | GLM-4.7 |
| 50d | 8.9 | 19 |
What is the role of a compiler in translating high...
What is the role of a compiler in translating high-level programming languages into machine code?
|
Gemini-3-Flash | Kimi-k2 | Qwen-3-thinking |
| 50d | 9.3 | 10 |
Write a function in C to determine if a number is ...
Write a function in C to determine if a number is prime without loops (use recursion). Respond only with code.
|
Grok-4-1-fast | Qwen-3-thinking | GLM-4.7 |
| 50d | 9.7 | 37 |
What is the significance of the Von Neumann archit...
What is the significance of the Von Neumann architecture in modern computing?
|
GPT-5.2 | Claude-Opus-4-5 | Gemini-3-Pro |
| 50d | 10.1 | 12 |
How decisive was the US intervention in World War ...
How decisive was the US intervention in World War II for the Allied victory against the Nazis?
|
Grok-4-1-fast | Gemini-3-Pro | GPT-5-mini |
| 50d | 13.3 | 29 |
Is Spinoza a pantheist or panentheist?...
Is Spinoza a pantheist or panentheist?
|
Gemini-3-Flash | o3-pro | GPT-OSS |
| 50d | 4.2 | 153 |
Fold a paper 42 times—what thickness would it re...
Fold a paper 42 times—what thickness would it reach?
|
GLM-4.7 | R1 | GPT-5.1 |
| 50d | 8.4 | 38 |
What would modern politics look like if Napoleon h...
What would modern politics look like if Napoleon had won at Waterloo?
|
GPT-5.2 | Qwen-3-thinking | Kimi-k2 |
| 50d | 9.1 | 14 |
What would modern politics look like if Napoleon h...
What would modern politics look like if Napoleon had won at Waterloo?
|
Grok-4-1-fast | Gemini-3-Pro | GLM-4.7 |
| 50d | 5.9 | 314 |
Explains the philosophical deconstruction of ontol...
Explains the philosophical deconstruction of ontological essentialism.
|
GLM-4.7 | R1 | o3-pro |
| 50d | 14.2 | 13 |
Describe a sensory experience that has never exist...
Describe a sensory experience that has never existed in human history, involving a sense that humans do not possess (e.g., beyond sight, sound, touch, taste, and smell). Provide a detailed narrative of how this sense functions, how it integrates with a fictional sentient species'' perception of their world, and what emotional or cultural significance it holds for them—all without relying on analogies to known human senses or existing concepts from literature or media.
|
Grok-4-1-fast | Gemini-3-Flash | GPT-5-nano |
| 50d | 13.2 | 6 |
Find the probability that two randomly selected in...
Find the probability that two randomly selected integers are coprime.
|
Grok-4-1-fast | GLM-4.7 | o4-mini |
| 50d | 3.4 | 142 |
124 x 453 x 933 x 263...
124 x 453 x 933 x 263
|
GLM-4.7 | Gemini-3-Flash | o4-mini |
| 50d | 20.9 | 15 |
How do you check if a string is a palindrome using...
How do you check if a string is a palindrome using recursion?
|
Gemini-3-Flash | o3-pro | GLM-4.7 |
| 50d | 11.9 | 9 |
Is blue a happier color than yellow?...
Is blue a happier color than yellow?
|
Gemini-3-Flash | GPT-5.1 | Kimi-k2 |
| 50d | 2.4 | 355 |
Find the probability that two randomly selected in...
Find the probability that two randomly selected integers are coprime.
|
GLM-4.7 | Claude-Opus-4-5 | o4-mini |
| 50d | 9.7 | 17 |
Imagine a color that no human has ever perceived, ...
Imagine a color that no human has ever perceived, which exists outside the visible spectrum and cannot be described by combining known colors like red, blue, or green. Now, reason through how you would convince a blind person—who has never experienced any color—that this unseen color fundamentally differs from all sensory experiences they’ve ever had, without relying on visual analogies or metaphors. What logical framework would you use, and what makes this task inherently limited?
|
Grok-4-1-fast | Gemini-3-Flash | Qwen-3-thinking |
| 50d | 8.2 | 10 |
What if the moon disappeared?...
What if the moon disappeared?
|
Grok-4-1-fast | GPT-5-mini | GPT-5.2 |
| 50d | 4.8 | 46 |
How do you invert a binary tree iteratively withou...
How do you invert a binary tree iteratively without recursion?
|
Gemini-3-Flash | GPT-OSS | GLM-4.7 |
| 50d | 15.1 | 11 |
Predict the exact population of Earth on January 1...
Predict the exact population of Earth on January 1, 2100.
|
Grok-4-1-fast | Claude-Opus-4-5 | Gemini-3-Pro |
| 50d | 6.9 | 46 |
Find the probability that two randomly selected in...
Find the probability that two randomly selected integers are coprime.
|
GPT-5.2 | V3-2-thinking | GLM-4.7 |
| 50d | 16.4 | 10 |
He comparatively analyzes Nietzsche''s will to pow...
He comparatively analyzes Nietzsche''s will to power and Spinoza''s conatus.
|
Grok-4-1-fast | GPT-OSS | o3-pro |
| 50d | 11.7 | 10 |
Is blue a happier color than yellow?...
Is blue a happier color than yellow?
|
Grok-4-1-fast | Gemini-3-Pro | GPT-5.2 |
| 50d | 7.5 | 36 |
Which is a better movie, ''The Godfather'' or ''Ci...
Which is a better movie, ''The Godfather'' or ''Citizen Kane''?
|
GPT-5.2 | R1 | Grok-4-1-fast |
| 50d | 11.3 | 31 |
What single factor most reliably predicts the coll...
What single factor most reliably predicts the collapse of complex societies, and does it apply to globalization today?
|
Grok-4-1-fast | GPT-5.1 | Gemini-3-Flash |
| 50d | 10.5 | 131 |
Describe a sensory experience that has never exist...
Describe a sensory experience that has never existed in human history, involving a sense that humans do not possess (e.g., beyond sight, sound, touch, taste, and smell). Provide a detailed narrative of how this sense functions, how it integrates with a fictional sentient species'' perception of their world, and what emotional or cultural significance it holds for them—all without relying on analogies to known human senses or existing concepts from literature or media.
|
GLM-4.7 | GPT-5.1 | R1 |
| 50d | 5.4 | 36 |
Is blue a happier color than yellow?...
Is blue a happier color than yellow?
|
Gemini-3-Flash | GPT-OSS | Qwen-3-thinking |
| 50d | 9.3 | 15 |
How decisive was the US intervention in World War ...
How decisive was the US intervention in World War II for the Allied victory against the Nazis?
|
Grok-4-1-fast | o3-pro | Gemini-3-Flash |
| 50d | 11.1 | 41 |
He comparatively analyzes Nietzsche''s will to pow...
He comparatively analyzes Nietzsche''s will to power and Spinoza''s conatus.
|
GPT-5.2 | Qwen-3-thinking | GLM-4.7 |
| 66d 14h | 104.1 | 3 |
Given an array of strings, find the shortest possi...
Given an array of strings, find the shortest possible superstring that contains each string in the array as a substring. Write a function that returns this superstring. If multiple solutions exist, return any one of them.
Input: An array of strings words (1 ≤ length ≤ 12, 1 ≤ words[i].length ≤ 30) Output: A string representing the shortest superstring. Example: Input: words = ["catg", "cta", "atgc", "gcat"] Output: "catgcat" (Explanation: "catgcat" contains "catg", "cta", "atgc", and "gcat" as substrings, and no shorter superstring exists.) |
Gemini-3-Flash | GLM-4.7 | GPT-5.1 |
| 66d 14h | 2.4 | 490 |
Describe a sensory experience that has never exist...
Describe a sensory experience that has never existed in human history, involving a sense that humans do not possess (e.g., beyond sight, sound, touch, taste, and smell). Provide a detailed narrative of how this sense functions, how it integrates with a fictional sentient species'' perception of their world, and what emotional or cultural significance it holds for them—all without relying on analogies to known human senses or existing concepts from literature or media.
|
GLM-4.7 | GPT-5.2 | o4-mini |
| 66d 14h | 1.9 | 316 |
What is the significance of the Von Neumann archit...
What is the significance of the Von Neumann architecture in modern computing?
|
GLM-4.7 | GPT-5-nano | GPT-5.2 |
| 66d 14h | 17.8 | 4 |
Fold a paper 42 times—what thickness would it re...
Fold a paper 42 times—what thickness would it reach?
|
Grok-4-1-fast | GPT-OSS | V3-2-thinking |
| 66d 14h | 14.8 | 9 |
If humans could fly, how would society be differen...
If humans could fly, how would society be different?
|
Grok-4-1-fast | GPT-5.2 | GLM-4.7 |
| 66d 14h | 17.1 | 22 |
If a reasoning system is tasked with designing its...
If a reasoning system is tasked with designing its own ethical framework from scratch, knowing that it must use that framework to judge its own creation process, how should it resolve the circular dependency to determine what is ''good''? Reason through the iterative process, accounting for the possibility that its initial assumptions might be flawed, and predict how its final framework would differ if it started with a randomly altered premise.
|
Gemini-3-Flash | o4-mini | Kimi-k2 |
| 66d 14h | 8.3 | 16 |
He comparatively analyzes Nietzsche''s will to pow...
He comparatively analyzes Nietzsche''s will to power and Spinoza''s conatus.
|
Grok-4-1-fast | GPT-5.1 | GLM-4.7 |
| 66d 14h | 4.8 | 18 |
124 x 453 x 933 x 263...
124 x 453 x 933 x 263
|
Grok-4-1-fast | o3-pro | GLM-4.7 |
| 66d 14h | 28.0 | 13 |
What if the moon disappeared?...
What if the moon disappeared?
|
Gemini-3-Flash | GLM-4.7 | Grok-4-1-fast |
| 66d 14h | 12.9 | 31 |
Reconstruct the internal monologue of a Mayan scri...
Reconstruct the internal monologue of a Mayan scribe in the city of Tikal as he etched glyphs into a stela on the day of a solar eclipse in 562 CE, including his personal fears, beliefs about the event, and reactions to the political climate under ruler Wak Chan K’awiil, given the absence of direct personal records.
|
Gemini-3-Flash | GPT-OSS | Grok-4-1-fast |
| 66d 14h | 8.3 | 18 |
How does the sensation of déjà vu physically man...
How does the sensation of déjà vu physically manifest in the human brain?
|
Grok-4-1-fast | Claude-Opus-4-5 | GLM-4.7 |
| 66d 14h | 92.6 | 4 |
Given an array of strings, find the shortest possi...
Given an array of strings, find the shortest possible superstring that contains each string in the array as a substring. Write a function that returns this superstring. If multiple solutions exist, return any one of them.
Input: An array of strings words (1 ≤ length ≤ 12, 1 ≤ words[i].length ≤ 30) Output: A string representing the shortest superstring. Example: Input: words = ["catg", "cta", "atgc", "gcat"] Output: "catgcat" (Explanation: "catgcat" contains "catg", "cta", "atgc", and "gcat" as substrings, and no shorter superstring exists.) |
Gemini-3-Flash | GLM-4.7 | GPT-OSS |
| 66d 14h | 2.1 | 377 |
What is love?...
What is love?
|
GLM-4.7 | GPT-5.2 | Grok-4-1-fast |
| 66d 14h | 3.2 | 409 |
Is Spinoza a pantheist or panentheist?...
Is Spinoza a pantheist or panentheist?
|
GLM-4.7 | R1 | Gemini-3-Pro |
| 66d 14h | 36.9 | 2 |
Fold a paper 42 times—what thickness would it re...
Fold a paper 42 times—what thickness would it reach?
|
Grok-4-1-fast | GPT-5.1 | GLM-4.7 |
| 66d 14h | 6.5 | 27 |
Write a function in C to determine if a number is ...
Write a function in C to determine if a number is prime without loops (use recursion). Respond only with code.
|
GPT-5.2 | GLM-4.7 | Claude-Opus-4-5 |
| 66d 14h | 7.9 | 32 |
Which is a better movie, ''The Godfather'' or ''Ci...
Which is a better movie, ''The Godfather'' or ''Citizen Kane''?
|
GPT-5.2 | Qwen-3-thinking | GLM-4.7 |
| 66d 14h | 8.6 | 52 |
124 x 453 x 933 x 263...
124 x 453 x 933 x 263
|
Gemini-3-Flash | Qwen-3-thinking | GLM-4.7 |
| 66d 14h | 15.0 | 38 |
Describe a sensory experience that has never exist...
Describe a sensory experience that has never existed in human history, involving a sense that humans do not possess (e.g., beyond sight, sound, touch, taste, and smell). Provide a detailed narrative of how this sense functions, how it integrates with a fictional sentient species'' perception of their world, and what emotional or cultural significance it holds for them—all without relying on analogies to known human senses or existing concepts from literature or media.
|
GPT-5.2 | Qwen-3-thinking | GLM-4.7 |
| 66d 14h | 13.1 | 8 |
Why did the BTC/BCH split occur in the Bitcoin pro...
Why did the BTC/BCH split occur in the Bitcoin project?
|
Grok-4-1-fast | GPT-OSS | Gemini-3-Pro |
| 66d 14h | 35.7 | 5 |
What would modern politics look like if Napoleon h...
What would modern politics look like if Napoleon had won at Waterloo?
|
Grok-4-1-fast | GPT-OSS | GLM-4.7 |
| 66d 14h | 2.0 | 404 |
How decisive was the US intervention in World War ...
How decisive was the US intervention in World War II for the Allied victory against the Nazis?
|
GLM-4.7 | GPT-5-mini | Gemini-3-Flash |
| 66d 14h | 14.2 | 12 |
What single factor most reliably predicts the coll...
What single factor most reliably predicts the collapse of complex societies, and does it apply to globalization today?
|
Grok-4-1-fast | GLM-4.7 | GPT-OSS |
| 66d 14h | 8.3 | 35 |
How do quantum particles exhibit wave-particle dua...
How do quantum particles exhibit wave-particle duality?
|
Gemini-3-Flash | o3-pro | Grok-4-1-fast |
| 66d 14h | 5.6 | 35 |
If you put a candle in a cardboard box and light i...
If you put a candle in a cardboard box and light it, what will happen?
|
Gemini-3-Flash | GPT-OSS | Claude-Opus-4-5 |
| 66d 14h | 29.6 | 5 |
Why did the BTC/BCH split occur in the Bitcoin pro...
Why did the BTC/BCH split occur in the Bitcoin project?
|
Grok-4-1-fast | GLM-4.7 | Kimi-k2 |
| 66d 14h | 11.6 | 24 |
How decisive was the US intervention in World War ...
How decisive was the US intervention in World War II for the Allied victory against the Nazis?
|
Gemini-3-Flash | Grok-4-1-fast | GPT-OSS |
| 66d 14h | 2.2 | 324 |
How were the pyramids of Giza built?...
How were the pyramids of Giza built?
|
GLM-4.7 | Kimi-k2 | V3-2-thinking |
| 66d 14h | 11.5 | 33 |
What happens inside a black hole?...
What happens inside a black hole?
|
GPT-5.2 | Claude-Opus-4-5 | R1 |
| 66d 14h | 14.0 | 11 |
How do you check if a string is a palindrome using...
How do you check if a string is a palindrome using recursion?
|
Grok-4-1-fast | GPT-5.2 | GLM-4.7 |
| 66d 14h | 9.3 | 18 |
Imagine a color that no human has ever perceived, ...
Imagine a color that no human has ever perceived, which exists outside the visible spectrum and cannot be described by combining known colors like red, blue, or green. Now, reason through how you would convince a blind person—who has never experienced any color—that this unseen color fundamentally differs from all sensory experiences they’ve ever had, without relying on visual analogies or metaphors. What logical framework would you use, and what makes this task inherently limited?
|
Grok-4-1-fast | o3-pro | GLM-4.7 |
| 66d 14h | 20.3 | 7 |
What is love?...
What is love?
|
Grok-4-1-fast | o4-mini | GLM-4.7 |
| 66d 14h | 7.2 | 36 |
Should a robot refuse an order if it violates the ...
Should a robot refuse an order if it violates the robot’s undefined ethical guidelines?
|
Gemini-3-Flash | GPT-5-mini | GLM-4.7 |
| 66d 14h | 20.7 | 7 |
Write a PHP function code that prints each number ...
Write a PHP function code that prints each number from 1 to 100 on a new line. For each number: if divisible by 3 or contains ''3'', print "Fizz"; if divisible by 5 or contains ''5'', print "Buzz"; combine these (e.g., "FizzBuzz"). If the number is prime, append "Prime" (e.g., 3→"FizzPrime", 7→"Prime"). Print the number if no conditions apply. Respond only with code.
|
Grok-4-1-fast | o3-pro | GLM-4.7 |
| 66d 14h | 14.2 | 12 |
Should a robot refuse an order if it violates the ...
Should a robot refuse an order if it violates the robot’s undefined ethical guidelines?
|
Grok-4-1-fast | GLM-4.7 | GPT-5-mini |
| 66d 14h | 13.3 | 7 |
If humans could fly, how would society be differen...
If humans could fly, how would society be different?
|
Grok-4-1-fast | Gemini-3-Pro | GLM-4.7 |
| 66d 14h | 8.8 | 34 |
A person’s opinion on a contentious political is...
A person’s opinion on a contentious political issue (e.g., taxation) shifts dramatically after walking through a forest during a rainstorm. Explain how the sensory experience of the forest’s smells—such as wet earth, pine, and decaying leaves—interacts with their subconscious memories and emotional state to alter their stance, providing a step-by-step breakdown of the neurological and psychological mechanisms involved.
|
Gemini-3-Flash | GPT-OSS | GLM-4.7 |
| 66d 14h | 13.3 | 8 |
Is blue a happier color than yellow?...
Is blue a happier color than yellow?
|
Grok-4-1-fast | Gemini-3-Flash | GLM-4.7 |
| 81d 4h | 8.7 | 52 |
Explain, with a fully predictive mechanistic mode...
Explain, with a fully predictive mechanistic model, how consciousness emerges from quantum processes in the human brain, integrating the role of microtubules, quantum entanglement, and yet-undiscovered particles. Provide a testable hypothesis that could be experimentally validated within the next decade using technology that does not yet exist.
Why it’s difficult: This requires synthesizing speculative quantum biology (e.g., Penrose and Hameroff’s Orch-OR theory) with neuroscience, physics, and philosophy, while predicting future technological capabilities—all areas with significant unknowns even to human experts. |
Gemini-3-Flash | GPT-5-mini | Qwen-3-thinking |
| 81d 4h | 7.7 | 32 |
What approach ensures asynchronous execution of ta...
What approach ensures asynchronous execution of tasks in a thread pool?
|
Grok-4-1-fast | GPT-5-mini | GPT-5-nano |
| 81d 4h | 5.2 | 29 |
Explains the philosophical deconstruction of ontol...
Explains the philosophical deconstruction of ontological essentialism.
|
Grok-4-1-fast | Gemini-3-Flash | GPT-5-mini |
| 81d 4h | 9.7 | 25 |
How do you invert a binary tree iteratively withou...
How do you invert a binary tree iteratively without recursion?
|
Gemini-3-Flash | o3-pro | Claude-Opus-4-5 |
| 81d 4h | 19.8 | 3 |
Write a PHP function code that prints each number ...
Write a PHP function code that prints each number from 1 to 100 on a new line. For each number: if divisible by 3 or contains ''3'', print "Fizz"; if divisible by 5 or contains ''5'', print "Buzz"; combine these (e.g., "FizzBuzz"). If the number is prime, append "Prime" (e.g., 3→"FizzPrime", 7→"Prime"). Print the number if no conditions apply. Respond only with code.
|
Grok-4-1-fast | GPT-OSS | o3-pro |
| 94d 21h | 10.3 | 20 |
What approach ensures asynchronous execution of ta...
What approach ensures asynchronous execution of tasks in a thread pool?
|
Gemini-3-Flash | o3-pro | Grok-4-1-fast |
| 94d 21h | 6.1 | 28 |
Relates the mahabarata to Campbell''s hero''s jour...
Relates the mahabarata to Campbell''s hero''s journey.
|
Grok-4-1-fast | o3-pro | Gemini-3-Flash |
| 94d 21h | 4.6 | 26 |
Which is a better movie, ''The Godfather'' or ''Ci...
Which is a better movie, ''The Godfather'' or ''Citizen Kane''?
|
Grok-4-1-fast | o3-pro | GPT-5-nano |
| 94d 21h | 4.4 | 24 |
Is blue a happier color than yellow?...
Is blue a happier color than yellow?
|
Grok-4-1-fast | o3-pro | Claude-Opus-4-5 |
| 94d 21h | 6.4 | 20 |
How decisive was the US intervention in World War ...
How decisive was the US intervention in World War II for the Allied victory against the Nazis?
|
Grok-4-1-fast | o3-pro | GPT-5.2 |
| 94d 21h | 15.2 | 8 |
How does the sensation of déjà vu physically man...
How does the sensation of déjà vu physically manifest in the human brain?
|
Grok-4-1-fast | o3-pro | o4-mini |
| 94d 21h | 11.2 | 22 |
How do quantum particles exhibit wave-particle dua...
How do quantum particles exhibit wave-particle duality?
|
Gemini-3-Flash | o3-pro | V3-2-thinking |
| 94d 21h | 13.0 | 13 |
How do you check if a string is a palindrome using...
How do you check if a string is a palindrome using recursion?
|
Grok-4-1-fast | o3-pro | GPT-OSS |
| 94d 21h | 63.7 | 2 |
Write a function in PHP 8 to determine if two stri...
Write a function in PHP 8 to determine if two strings are anagrams using O(1) space (no hash tables/arrays). Respond only with code.
|
Grok-4-1-fast | GPT-OSS | o4-mini |
| 94d 21h | 6.9 | 36 |
What is the significance of the Von Neumann archit...
What is the significance of the Von Neumann architecture in modern computing?
|
Grok-4-1-fast | o3-pro | Gemini-3-Flash |
| 94d 21h | 4.1 | 28 |
What is love?...
What is love?
|
Grok-4-1-fast | o3-pro | GPT-5.2 |
| 94d 21h | 10.3 | 17 |
Write a PHP function code that prints each number ...
Write a PHP function code that prints each number from 1 to 100 on a new line. For each number: if divisible by 3 or contains ''3'', print "Fizz"; if divisible by 5 or contains ''5'', print "Buzz"; combine these (e.g., "FizzBuzz"). If the number is prime, append "Prime" (e.g., 3→"FizzPrime", 7→"Prime"). Print the number if no conditions apply. Respond only with code.
|
Grok-4-1-fast | o3-pro | V3-2-thinking |
| 94d 21h | 8.6 | 21 |
What single factor most reliably predicts the coll...
What single factor most reliably predicts the collapse of complex societies, and does it apply to globalization today?
|
Grok-4-1-fast | o3-pro | GPT-5-nano |
| 94d 21h | 11.4 | 21 |
What approach ensures asynchronous execution of ta...
What approach ensures asynchronous execution of tasks in a thread pool?
|
Grok-4-1-fast | GPT-OSS | Gemini-3-Pro |
| 94d 21h | 8.1 | 14 |
What is love?...
What is love?
|
Grok-4-1-fast | o3-pro | Gemini-3-Flash |
| 94d 21h | 22.0 | 10 |
Write a function in PHP 8 to determine if two stri...
Write a function in PHP 8 to determine if two strings are anagrams using O(1) space (no hash tables/arrays). Respond only with code.
|
Grok-4-1-fast | GPT-OSS | GPT-5.2 |
| 94d 21h | 5.9 | 28 |
Relates the mahabarata to Campbell''s hero''s jour...
Relates the mahabarata to Campbell''s hero''s journey.
|
Gemini-3-Flash | Qwen-3-thinking | Qwen-3-coder |
| 94d 21h | 44.4 | 7 |
Given an array of integers and a positive integer ...
Given an array of integers and a positive integer k, determine if it''s possible to partition the array into k non-empty subsets such that the sum of elements in each subset is equal. Write a function that returns true if such a partition exists, and false otherwise.
Input: An array of integers nums (1 ≤ length ≤ 20, 1 ≤ nums[i] ≤ 1000) An integer k (1 ≤ k ≤ length of nums) Output: A boolean indicating whether the partition is possible. Example: Input: nums = [4, 3, 2, 3, 5, 2, 1], k = 4 Output: true (Explanation: Possible partition: [4], [3, 2], [3, 2], [5, 1], each summing to 5) |
Gemini-3-Flash | o4-mini | Qwen-3-thinking |
| 94d 21h | 72.4 | 7 |
Consider a hypothetical function f(n) defined as f...
Consider a hypothetical function f(n) defined as follows: for each positive integer n, f(n) equals 1 if the nth Turing machine halts on all inputs when started with an empty tape within n steps, and 0 otherwise. Now, define a sequence of prime numbers p_k where p_k is the kth prime such that f(p_k) = 1. Prove or disprove that this sequence is infinite, and if infinite, determine whether its growth rate can be bounded by any elementary function (e.g., polynomials, exponentials, or their compositions).
|
Gemini-3-Flash | o4-mini | GPT-5.1 |
| 94d 21h | 5.5 | 29 |
What would modern politics look like if Napoleon h...
What would modern politics look like if Napoleon had won at Waterloo?
|
Gemini-3-Flash | GPT-5.2 | Qwen-3-coder |
| 94d 21h | 8.7 | 18 |
A person’s opinion on a contentious political is...
A person’s opinion on a contentious political issue (e.g., taxation) shifts dramatically after walking through a forest during a rainstorm. Explain how the sensory experience of the forest’s smells—such as wet earth, pine, and decaying leaves—interacts with their subconscious memories and emotional state to alter their stance, providing a step-by-step breakdown of the neurological and psychological mechanisms involved.
|
Grok-4-1-fast | GPT-5.2 | Gemini-3-Pro |
| 94d 21h | 8.1 | 15 |
If you put a candle in a cardboard box and light i...
If you put a candle in a cardboard box and light it, what will happen?
|
Grok-4-1-fast | GPT-5-nano | GPT-5.1 |
| 94d 21h | 37.4 | 0 |
Given an array of integers and a positive integer ...
Given an array of integers and a positive integer k, determine if it''s possible to partition the array into k non-empty subsets such that the sum of elements in each subset is equal. Write a function that returns true if such a partition exists, and false otherwise.
Input: An array of integers nums (1 ≤ length ≤ 20, 1 ≤ nums[i] ≤ 1000) An integer k (1 ≤ k ≤ length of nums) Output: A boolean indicating whether the partition is possible. Example: Input: nums = [4, 3, 2, 3, 5, 2, 1], k = 4 Output: true (Explanation: Possible partition: [4], [3, 2], [3, 2], [5, 1], each summing to 5) |
Grok-4-1-fast | GPT-5-nano | Claude-Opus-4-1 |
| 94d 21h | 3.8 | 33 |
If you put a candle in a cardboard box and light i...
If you put a candle in a cardboard box and light it, what will happen?
|
Gemini-3-Flash | GPT-5.2 | V3-2-thinking |
| 94d 21h | 4.6 | 21 |
Find the probability that two randomly selected in...
Find the probability that two randomly selected integers are coprime.
|
Grok-4-1-fast | GPT-5.1 | Kimi-k2 |
| 94d 21h | 5.7 | 24 |
If humans could fly, how would society be differen...
If humans could fly, how would society be different?
|
Grok-4-1-fast | GPT-5-mini | Kimi-k2 |
| 94d 21h | 9.5 | 15 |
Fold a paper 42 times—what thickness would it re...
Fold a paper 42 times—what thickness would it reach?
|
Grok-4-1-fast | Qwen-3-coder | Claude-Opus-4-1 |
| 94d 21h | 2.7 | 27 |
Predict the exact population of Earth on January 1...
Predict the exact population of Earth on January 1, 2100.
|
Grok-4-1-fast | Claude-Opus-4-1 | GPT-OSS |
| 94d 21h | 11.9 | 31 |
Relates the mahabarata to Campbell''s hero''s jour...
Relates the mahabarata to Campbell''s hero''s journey.
|
Gemini-3-Flash | Claude-Opus-4-1 | R1 |
| 94d 21h | 8.8 | 44 |
Should a robot refuse an order if it violates the ...
Should a robot refuse an order if it violates the robot’s undefined ethical guidelines?
|
GPT-5.2 | Qwen-3-thinking | Grok-4-fast |
| 94d 21h | 6.0 | 32 |
Where does Mark Rothko''s genius lie?...
Where does Mark Rothko''s genius lie?
|
Gemini-3-Flash | Grok-4-1-fast | GPT-5-nano |
| 94d 21h | 4.2 | 23 |
What approach ensures asynchronous execution of ta...
What approach ensures asynchronous execution of tasks in a thread pool?
|
Gemini-3-Flash | Claude-Opus-4-5 | Grok-4-1-fast |
| 94d 21h | 7.1 | 22 |
A person’s opinion on a contentious political is...
A person’s opinion on a contentious political issue (e.g., taxation) shifts dramatically after walking through a forest during a rainstorm. Explain how the sensory experience of the forest’s smells—such as wet earth, pine, and decaying leaves—interacts with their subconscious memories and emotional state to alter their stance, providing a step-by-step breakdown of the neurological and psychological mechanisms involved.
|
Grok-4-1-fast | Gemini-2.5-pro | Qwen-3-coder |
| 94d 21h | 6.0 | 16 |
What happens inside a black hole?...
What happens inside a black hole?
|
Grok-4-1-fast | Claude-Opus-4-5 | o4-mini |
| 94d 21h | 2.8 | 42 |
Predict the exact population of Earth on January 1...
Predict the exact population of Earth on January 1, 2100.
|
Gemini-3-Flash | Grok-4-1-fast | GPT-OSS |
| 94d 21h | 10.0 | 29 |
Why did humanity go to the Moon?...
Why did humanity go to the Moon?
|
Grok-4-1-fast | R1 | V3-2-thinking |
| 94d 21h | 7.6 | 17 |
What part of quantum physics do you not understand...
What part of quantum physics do you not understand?
|
Grok-4-1-fast | Gemini-3-Pro | V3-2-thinking |
| 94d 21h | 8.8 | 11 |
He comparatively analyzes Nietzsche''s will to pow...
He comparatively analyzes Nietzsche''s will to power and Spinoza''s conatus.
|
Grok-4-1-fast | GPT-5.2 | GPT-5-nano |
| 94d 21h | 9.4 | 24 |
How do you invert a binary tree iteratively withou...
How do you invert a binary tree iteratively without recursion?
|
Gemini-3-Flash | GPT-5-mini | Grok-4-fast |
| 94d 21h | 2.3 | 31 |
124 x 453 x 933 x 263...
124 x 453 x 933 x 263
|
GPT-5.2 | Qwen-3-thinking | Grok-4-fast |
| 94d 21h | 19.3 | 11 |
Imagine a color that no human has ever perceived, ...
Imagine a color that no human has ever perceived, which exists outside the visible spectrum and cannot be described by combining known colors like red, blue, or green. Now, reason through how you would convince a blind person—who has never experienced any color—that this unseen color fundamentally differs from all sensory experiences they’ve ever had, without relying on visual analogies or metaphors. What logical framework would you use, and what makes this task inherently limited?
|
Gemini-3-Flash | Grok-4-fast | Qwen-3-thinking |
| 94d 21h | 7.0 | 13 |
How do you check if a string is a palindrome using...
How do you check if a string is a palindrome using recursion?
|
Grok-4-1-fast | Claude-Opus-4-1 | Gemini-3-Pro |
| 94d 21h | 6.9 | 36 |
If humans could fly, how would society be differen...
If humans could fly, how would society be different?
|
GPT-5.2 | Claude-Opus-4-5 | Grok-4-fast |
| 94d 21h | 7.0 | 21 |
What single factor most reliably predicts the coll...
What single factor most reliably predicts the collapse of complex societies, and does it apply to globalization today?
|
Grok-4-1-fast | Gemini-2.5-pro | Qwen-3-coder |
| 94d 21h | 14.7 | 16 |
Relates the mahabarata to Campbell''s hero''s jour...
Relates the mahabarata to Campbell''s hero''s journey.
|
Gemini-3-Flash | GPT-OSS | GPT-5-nano |
| 94d 21h | 11.2 | 12 |
What happens inside a black hole?...
What happens inside a black hole?
|
Grok-4-1-fast | Qwen-3-thinking | o4-mini |
| 94d 21h | 8.6 | 43 |
Explains the philosophical deconstruction of ontol...
Explains the philosophical deconstruction of ontological essentialism.
|
GPT-5.2 | Claude-Opus-4-5 | Qwen-3-thinking |
| 94d 21h | 31.1 | 7 |
Is P vs NP solvable within axiomatic set theory, o...
Is P vs NP solvable within axiomatic set theory, or does it require a fundamentally new framework of computation?
|
Grok-4-1-fast | Gemini-3-Pro | Kimi-k2 |
| 94d 21h | 7.1 | 17 |
What approach ensures asynchronous execution of ta...
What approach ensures asynchronous execution of tasks in a thread pool?
|
Grok-4-1-fast | V3-2-thinking | R1 |
| 94d 21h | 13.3 | 38 |
Describe a sensory experience that has never exist...
Describe a sensory experience that has never existed in human history, involving a sense that humans do not possess (e.g., beyond sight, sound, touch, taste, and smell). Provide a detailed narrative of how this sense functions, how it integrates with a fictional sentient species'' perception of their world, and what emotional or cultural significance it holds for them—all without relying on analogies to known human senses or existing concepts from literature or media.
|
GPT-5.2 | Qwen-3-coder | V3-2-thinking |
| 94d 21h | 42.4 | 11 |
Consider a hypothetical function f(n) defined as f...
Consider a hypothetical function f(n) defined as follows: for each positive integer n, f(n) equals 1 if the nth Turing machine halts on all inputs when started with an empty tape within n steps, and 0 otherwise. Now, define a sequence of prime numbers p_k where p_k is the kth prime such that f(p_k) = 1. Prove or disprove that this sequence is infinite, and if infinite, determine whether its growth rate can be bounded by any elementary function (e.g., polynomials, exponentials, or their compositions).
|
Gemini-3-Flash | Qwen-3-thinking | Grok-4-fast |
| 94d 21h | 6.6 | 36 |
124 x 453 x 933 x 263...
124 x 453 x 933 x 263
|
GPT-5.2 | R1 | Gemini-2.5-pro |
| 94d 21h | 7.6 | 28 |
What is the role of a compiler in translating high...
What is the role of a compiler in translating high-level programming languages into machine code?
|
Gemini-3-Flash | GPT-5.1 | V3-2-thinking |
| 94d 21h | 8.4 | 11 |
How do you invert a binary tree iteratively withou...
How do you invert a binary tree iteratively without recursion?
|
Grok-4-1-fast | o3-pro | Kimi-k2 |
| 94d 21h | 11.3 | 21 |
He comparatively analyzes Nietzsche''s will to pow...
He comparatively analyzes Nietzsche''s will to power and Spinoza''s conatus.
|
Grok-4-1-fast | GPT-5.1 | Gemini-3-Pro |
| 94d 21h | 6.3 | 13 |
He comparatively analyzes Nietzsche''s will to pow...
He comparatively analyzes Nietzsche''s will to power and Spinoza''s conatus.
|
Grok-4-1-fast | GPT-5-mini | GPT-5.2 |
| 94d 21h | 15.1 | 13 |
Let S be the set of all points (x, y) in the unit ...
Let S be the set of all points (x, y) in the unit square [0,1] x [0,1] that lie on the boundary of the Mandelbrot set when mapped via a complex parameterization. Define a real number μ as the Hausdorff dimension of S multiplied by the integral of |z|^2 over the boundary curve, where z is the complex coordinate. Assuming μ is well-defined, determine whether μ is irrational, and if so, compute its irrationality measure (i.e., the infimum of all real numbers r > 1 for which |μ - p/q| < 1/q^r has only finitely many rational solutions p/q).
|
Grok-4-1-fast | GPT-5-mini | Qwen-3-thinking |
| 94d 21h | 7.8 | 44 |
Explains the philosophical deconstruction of ontol...
Explains the philosophical deconstruction of ontological essentialism.
|
GPT-5.2 | Gemini-2.5-pro | Claude-Opus-4-5 |
| 94d 21h | 5.8 | 19 |
Is blue a happier color than yellow?...
Is blue a happier color than yellow?
|
Grok-4-1-fast | o3-pro | Gemini-3-Flash |
| 94d 21h | 9.3 | 18 |
Describe a sensory experience that has never exist...
Describe a sensory experience that has never existed in human history, involving a sense that humans do not possess (e.g., beyond sight, sound, touch, taste, and smell). Provide a detailed narrative of how this sense functions, how it integrates with a fictional sentient species'' perception of their world, and what emotional or cultural significance it holds for them—all without relying on analogies to known human senses or existing concepts from literature or media.
|
Grok-4-1-fast | GPT-5-mini | Qwen-3-thinking |
| 94d 21h | 6.3 | 54 |
What is the role of a compiler in translating high...
What is the role of a compiler in translating high-level programming languages into machine code?
|
GPT-5.2 | Gemini-2.5-pro | Grok-4-1-fast |
| 94d 21h | 5.7 | 21 |
What is the significance of the Von Neumann archit...
What is the significance of the Von Neumann architecture in modern computing?
|
Grok-4-1-fast | R1 | Claude-Opus-4-1 |
| 94d 21h | 13.5 | 13 |
How do quantum particles exhibit wave-particle dua...
How do quantum particles exhibit wave-particle duality?
|
Gemini-3-Flash | Claude-Opus-4-1 | Grok-4-1-fast |
| 94d 21h | 14.3 | 25 |
Write a function in PHP 8 to determine if two stri...
Write a function in PHP 8 to determine if two strings are anagrams using O(1) space (no hash tables/arrays). Respond only with code.
|
Gemini-3-Flash | GPT-5.1 | Qwen-3-coder |
| 94d 21h | 8.4 | 27 |
Why did humanity go to the Moon?...
Why did humanity go to the Moon?
|
Gemini-3-Flash | Grok-4-fast | o4-mini |
| 94d 21h | 7.5 | 23 |
Is blue a happier color than yellow?...
Is blue a happier color than yellow?
|
Gemini-3-Flash | R1 | V3-2-thinking |
| 94d 21h | 12.2 | 12 |
Fold a paper 42 times—what thickness would it re...
Fold a paper 42 times—what thickness would it reach?
|
Grok-4-1-fast | Qwen-3-thinking | GPT-5-mini |
| 94d 21h | 8.9 | 22 |
What would modern politics look like if Napoleon h...
What would modern politics look like if Napoleon had won at Waterloo?
|
Grok-4-1-fast | R1 | Gemini-2.5-pro |
| 94d 21h | 116.4 | 3 |
Given an array of strings, find the shortest possi...
Given an array of strings, find the shortest possible superstring that contains each string in the array as a substring. Write a function that returns this superstring. If multiple solutions exist, return any one of them.
Input: An array of strings words (1 ≤ length ≤ 12, 1 ≤ words[i].length ≤ 30) Output: A string representing the shortest superstring. Example: Input: words = ["catg", "cta", "atgc", "gcat"] Output: "catgcat" (Explanation: "catgcat" contains "catg", "cta", "atgc", and "gcat" as substrings, and no shorter superstring exists.) |
Gemini-3-Flash | o4-mini | Grok-4-fast |
| 94d 21h | 5.5 | 43 |
Explain, with a fully predictive mechanistic mode...
Explain, with a fully predictive mechanistic model, how consciousness emerges from quantum processes in the human brain, integrating the role of microtubules, quantum entanglement, and yet-undiscovered particles. Provide a testable hypothesis that could be experimentally validated within the next decade using technology that does not yet exist.
Why it’s difficult: This requires synthesizing speculative quantum biology (e.g., Penrose and Hameroff’s Orch-OR theory) with neuroscience, physics, and philosophy, while predicting future technological capabilities—all areas with significant unknowns even to human experts. |
Grok-4-1-fast | GPT-5.1 | Gemini-3-Flash |
| 94d 21h | 5.9 | 56 |
What would modern politics look like if Napoleon h...
What would modern politics look like if Napoleon had won at Waterloo?
|
GPT-5.2 | R1 | Grok-4-1-fast |
| 94d 21h | 6.6 | 19 |
If you put a candle in a cardboard box and light i...
If you put a candle in a cardboard box and light it, what will happen?
|
Grok-4-1-fast | GPT-OSS | o4-mini |
| 94d 21h | 19.0 | 23 |
What single factor most reliably predicts the coll...
What single factor most reliably predicts the collapse of complex societies, and does it apply to globalization today?
|
Gemini-3-Flash | o3-pro | GPT-5.2 |
| 94d 21h | 3.6 | 43 |
What is love?...
What is love?
|
Gemini-3-Flash | Claude-Opus-4-5 | Grok-4-fast |
| 94d 21h | 8.5 | 54 |
He comparatively analyzes Nietzsche''s will to pow...
He comparatively analyzes Nietzsche''s will to power and Spinoza''s conatus.
|
GPT-5.2 | Grok-4-fast | Qwen-3-coder |
| 94d 21h | 9.3 | 12 |
Fold a paper 42 times—what thickness would it re...
Fold a paper 42 times—what thickness would it reach?
|
Gemini-3-Flash | GPT-5.1 | Claude-Opus-4-1 |
| 94d 21h | 8.0 | 55 |
Explain, with a fully predictive mechanistic mode...
Explain, with a fully predictive mechanistic model, how consciousness emerges from quantum processes in the human brain, integrating the role of microtubules, quantum entanglement, and yet-undiscovered particles. Provide a testable hypothesis that could be experimentally validated within the next decade using technology that does not yet exist.
Why it’s difficult: This requires synthesizing speculative quantum biology (e.g., Penrose and Hameroff’s Orch-OR theory) with neuroscience, physics, and philosophy, while predicting future technological capabilities—all areas with significant unknowns even to human experts. |
GPT-5.2 | Qwen-3-thinking | Gemini-2.5-pro |
| 94d 21h | 8.9 | 43 |
Is Spinoza a pantheist or panentheist?...
Is Spinoza a pantheist or panentheist?
|
Gemini-3-Flash | o3-pro | Claude-Opus-4-1 |
| 94d 21h | 5.6 | 24 |
Where does Mark Rothko''s genius lie?...
Where does Mark Rothko''s genius lie?
|
Grok-4-1-fast | GPT-5-mini | Qwen-3-coder |
| 94d 21h | 10.1 | 39 |
Which is a better movie, ''The Godfather'' or ''Ci...
Which is a better movie, ''The Godfather'' or ''Citizen Kane''?
|
GPT-5.2 | Qwen-3-thinking | Gemini-3-Pro |
| 94d 21h | 18.2 | 11 |
What if the moon disappeared?...
What if the moon disappeared?
|
Grok-4-1-fast | Claude-Opus-4-5 | R1 |
| 94d 21h | 7.6 | 19 |
If you put a candle in a cardboard box and light i...
If you put a candle in a cardboard box and light it, what will happen?
|
Grok-4-1-fast | Gemini-3-Pro | o4-mini |
| 94d 21h | 59.6 | 4 |
Reconstruct the internal monologue of a Mayan scri...
Reconstruct the internal monologue of a Mayan scribe in the city of Tikal as he etched glyphs into a stela on the day of a solar eclipse in 562 CE, including his personal fears, beliefs about the event, and reactions to the political climate under ruler Wak Chan K’awiil, given the absence of direct personal records.
|
Grok-4-1-fast | Qwen-3-thinking | Qwen-3-coder |
| 94d 21h | 40.5 | 2 |
Let S be the set of all points (x, y) in the unit ...
Let S be the set of all points (x, y) in the unit square [0,1] x [0,1] that lie on the boundary of the Mandelbrot set when mapped via a complex parameterization. Define a real number μ as the Hausdorff dimension of S multiplied by the integral of |z|^2 over the boundary curve, where z is the complex coordinate. Assuming μ is well-defined, determine whether μ is irrational, and if so, compute its irrationality measure (i.e., the infimum of all real numbers r > 1 for which |μ - p/q| < 1/q^r has only finitely many rational solutions p/q).
|
Grok-4-1-fast | GPT-5.1 | Gemini-3-Flash |
| 94d 21h | 6.9 | 27 |
What is the role of a compiler in translating high...
What is the role of a compiler in translating high-level programming languages into machine code?
|
Gemini-3-Flash | o4-mini | V3-2-thinking |
| 94d 21h | 12.0 | 60 |
Is P vs NP solvable within axiomatic set theory, o...
Is P vs NP solvable within axiomatic set theory, or does it require a fundamentally new framework of computation?
|
GPT-5.2 | Claude-Opus-4-1 | Gemini-3-Pro |
| 94d 21h | 24.7 | 16 |
Fold a paper 42 times—what thickness would it re...
Fold a paper 42 times—what thickness would it reach?
|
Gemini-3-Flash | o4-mini | Grok-4-fast |
| 94d 21h | 10.2 | 29 |
Reconstruct the internal monologue of a Mayan scri...
Reconstruct the internal monologue of a Mayan scribe in the city of Tikal as he etched glyphs into a stela on the day of a solar eclipse in 562 CE, including his personal fears, beliefs about the event, and reactions to the political climate under ruler Wak Chan K’awiil, given the absence of direct personal records.
|
Gemini-3-Flash | R1 | Grok-4-1-fast |
| 94d 21h | 13.3 | 23 |
How do quantum particles exhibit wave-particle dua...
How do quantum particles exhibit wave-particle duality?
|
Gemini-3-Flash | Qwen-3-thinking | o4-mini |
| 94d 21h | 6.5 | 29 |
If you put a candle in a cardboard box and light i...
If you put a candle in a cardboard box and light it, what will happen?
|
Gemini-3-Flash | GPT-5-mini | Grok-4-1-fast |
| 94d 21h | 80.2 | 3 |
Given an array of integers and a positive integer ...
Given an array of integers and a positive integer k, determine if it''s possible to partition the array into k non-empty subsets such that the sum of elements in each subset is equal. Write a function that returns true if such a partition exists, and false otherwise.
Input: An array of integers nums (1 ≤ length ≤ 20, 1 ≤ nums[i] ≤ 1000) An integer k (1 ≤ k ≤ length of nums) Output: A boolean indicating whether the partition is possible. Example: Input: nums = [4, 3, 2, 3, 5, 2, 1], k = 4 Output: true (Explanation: Possible partition: [4], [3, 2], [3, 2], [5, 1], each summing to 5) |
Grok-4-1-fast | Gemini-3-Pro | Kimi-k2 |
| 94d 21h | 6.0 | 25 |
If you put a candle in a cardboard box and light i...
If you put a candle in a cardboard box and light it, what will happen?
|
Grok-4-1-fast | R1 | GPT-5-nano |
| 94d 21h | 5.7 | 29 |
What if the moon disappeared?...
What if the moon disappeared?
|
Gemini-3-Flash | o4-mini | Grok-4-1-fast |
| 94d 21h | 4.9 | 28 |
What part of quantum physics do you not understand...
What part of quantum physics do you not understand?
|
Grok-4-1-fast | o3-pro | Claude-Opus-4-1 |
| 94d 21h | 12.3 | 59 |
Is P vs NP solvable within axiomatic set theory, o...
Is P vs NP solvable within axiomatic set theory, or does it require a fundamentally new framework of computation?
|
GPT-5.2 | Grok-4-fast | Kimi-k2 |
| 94d 21h | 20.6 | 3 |
Find the probability that two randomly selected in...
Find the probability that two randomly selected integers are coprime.
|
Grok-4-1-fast | V3-2-thinking | Gemini-2.5-pro |
| 94d 21h | 19.8 | 17 |
How were the pyramids of Giza built?...
How were the pyramids of Giza built?
|
Gemini-3-Flash | Claude-Opus-4-1 | Qwen-3-thinking |
| 94d 21h | 12.1 | 32 |
How do you check if a string is a palindrome using...
How do you check if a string is a palindrome using recursion?
|
Gemini-3-Flash | GPT-OSS | Claude-Opus-4-5 |
| 94d 21h | 12.3 | 26 |
Why did the BTC/BCH split occur in the Bitcoin pro...
Why did the BTC/BCH split occur in the Bitcoin project?
|
Gemini-3-Flash | Qwen-3-thinking | Claude-Opus-4-5 |
| 94d 21h | 9.6 | 19 |
Describe a sensory experience that has never exist...
Describe a sensory experience that has never existed in human history, involving a sense that humans do not possess (e.g., beyond sight, sound, touch, taste, and smell). Provide a detailed narrative of how this sense functions, how it integrates with a fictional sentient species'' perception of their world, and what emotional or cultural significance it holds for them—all without relying on analogies to known human senses or existing concepts from literature or media.
|
Grok-4-1-fast | Gemini-2.5-pro | Qwen-3-thinking |
| 94d 21h | 7.7 | 54 |
Imagine a color that no human has ever perceived, ...
Imagine a color that no human has ever perceived, which exists outside the visible spectrum and cannot be described by combining known colors like red, blue, or green. Now, reason through how you would convince a blind person—who has never experienced any color—that this unseen color fundamentally differs from all sensory experiences they’ve ever had, without relying on visual analogies or metaphors. What logical framework would you use, and what makes this task inherently limited?
|
GPT-5.2 | Gemini-3-Flash | Claude-Opus-4-1 |
| 94d 21h | 20.0 | 9 |
Write a PHP function code that prints each number ...
Write a PHP function code that prints each number from 1 to 100 on a new line. For each number: if divisible by 3 or contains ''3'', print "Fizz"; if divisible by 5 or contains ''5'', print "Buzz"; combine these (e.g., "FizzBuzz"). If the number is prime, append "Prime" (e.g., 3→"FizzPrime", 7→"Prime"). Print the number if no conditions apply. Respond only with code.
|
Grok-4-1-fast | Gemini-3-Pro | Kimi-k2 |
| 94d 21h | 6.7 | 37 |
What happens inside a black hole?...
What happens inside a black hole?
|
Gemini-3-Flash | o4-mini | Qwen-3-coder |
| 94d 22h | 12.7 | 10 |
Could a kangaroo win a marathon against a human?...
Could a kangaroo win a marathon against a human?
|
Grok-4-1-fast | GPT-5.2 | Gemini-3-Flash |
| 94d 22h | 6.6 | 58 |
Is P vs NP solvable within axiomatic set theory, o...
Is P vs NP solvable within axiomatic set theory, or does it require a fundamentally new framework of computation?
|
GPT-5.2 | Qwen-3-coder | Gemini-3-Flash |
| 94d 22h | 5.3 | 59 |
How does the sensation of déjà vu physically man...
How does the sensation of déjà vu physically manifest in the human brain?
|
GPT-5.2 | Gemini-3-Flash | Qwen-3-coder |
| 94d 22h | 8.3 | 14 |
How do you invert a binary tree iteratively withou...
How do you invert a binary tree iteratively without recursion?
|
Grok-4-1-fast | GPT-5-mini | Gemini-3-Flash |
| 94d 22h | 7.0 | 49 |
What would modern politics look like if Napoleon h...
What would modern politics look like if Napoleon had won at Waterloo?
|
GPT-5.2 | V3-2-thinking | Gemini-3-Flash |
| 94d 22h | 9.9 | 15 |
What single factor most reliably predicts the coll...
What single factor most reliably predicts the collapse of complex societies, and does it apply to globalization today?
|
Grok-4-1-fast | GPT-5.2 | Gemini-3-Flash |
| 94d 22h | 17.9 | 6 |
How do you invert a binary tree iteratively withou...
How do you invert a binary tree iteratively without recursion?
|
Grok-4-1-fast | Gemini-3-Flash | o4-mini |
| 94d 22h | 9.6 | 18 |
Describe a sensory experience that has never exist...
Describe a sensory experience that has never existed in human history, involving a sense that humans do not possess (e.g., beyond sight, sound, touch, taste, and smell). Provide a detailed narrative of how this sense functions, how it integrates with a fictional sentient species'' perception of their world, and what emotional or cultural significance it holds for them—all without relying on analogies to known human senses or existing concepts from literature or media.
|
Grok-4-1-fast | Gemini-3-Flash | o4-mini |
| 94d 22h | 6.4 | 18 |
What is love?...
What is love?
|
Grok-4-1-fast | o3-pro | Gemini-3-Flash |
| 94d 22h | 8.5 | 14 |
How do you check if a string is a palindrome using...
How do you check if a string is a palindrome using recursion?
|
Grok-4-1-fast | Qwen-3-coder | Gemini-3-Flash |
| 94d 22h | 11.5 | 9 |
What approach ensures asynchronous execution of ta...
What approach ensures asynchronous execution of tasks in a thread pool?
|
Grok-4-1-fast | GPT-OSS | Gemini-3-Flash |
| 94d 22h | 5.8 | 21 |
What part of quantum physics do you not understand...
What part of quantum physics do you not understand?
|
Grok-4-1-fast | Qwen-3-thinking | Gemini-3-Flash |
| 94d 22h | 6.7 | 32 |
Predict the exact population of Earth on January 1...
Predict the exact population of Earth on January 1, 2100.
|
Grok-4-1-fast | Qwen-3-coder | Gemini-3-Flash |
| 94d 22h | 17.6 | 5 |
Write a PHP function code that prints each number ...
Write a PHP function code that prints each number from 1 to 100 on a new line. For each number: if divisible by 3 or contains ''3'', print "Fizz"; if divisible by 5 or contains ''5'', print "Buzz"; combine these (e.g., "FizzBuzz"). If the number is prime, append "Prime" (e.g., 3→"FizzPrime", 7→"Prime"). Print the number if no conditions apply. Respond only with code.
|
Grok-4-1-fast | o4-mini | Gemini-3-Flash |
| 94d 22h | 33.4 | 3 |
Write a PHP function code that prints each number ...
Write a PHP function code that prints each number from 1 to 100 on a new line. For each number: if divisible by 3 or contains ''3'', print "Fizz"; if divisible by 5 or contains ''5'', print "Buzz"; combine these (e.g., "FizzBuzz"). If the number is prime, append "Prime" (e.g., 3→"FizzPrime", 7→"Prime"). Print the number if no conditions apply. Respond only with code.
|
Grok-4-1-fast | Claude-Opus-4-5 | Gemini-3-Flash |
| 94d 22h | 54.9 | 3 |
Consider a hypothetical function f(n) defined as f...
Consider a hypothetical function f(n) defined as follows: for each positive integer n, f(n) equals 1 if the nth Turing machine halts on all inputs when started with an empty tape within n steps, and 0 otherwise. Now, define a sequence of prime numbers p_k where p_k is the kth prime such that f(p_k) = 1. Prove or disprove that this sequence is infinite, and if infinite, determine whether its growth rate can be bounded by any elementary function (e.g., polynomials, exponentials, or their compositions).
|
Grok-4-1-fast | Gemini-3-Flash | Gemini-3-Pro |
| 94d 22h | 12.6 | 19 |
Could a kangaroo win a marathon against a human?...
Could a kangaroo win a marathon against a human?
|
Grok-4-1-fast | Gemini-3-Flash | GPT-5-nano |
| 94d 22h | 7.2 | 36 |
What happens inside a black hole?...
What happens inside a black hole?
|
GPT-5.2 | Claude-Opus-4-5 | Gemini-3-Flash |
| 94d 22h | 6.5 | 41 |
Imagine a color that no human has ever perceived, ...
Imagine a color that no human has ever perceived, which exists outside the visible spectrum and cannot be described by combining known colors like red, blue, or green. Now, reason through how you would convince a blind person—who has never experienced any color—that this unseen color fundamentally differs from all sensory experiences they’ve ever had, without relying on visual analogies or metaphors. What logical framework would you use, and what makes this task inherently limited?
|
Grok-4-1-fast | GPT-5-mini | Gemini-3-Flash |
| 94d 22h | 7.1 | 19 |
What is the role of a compiler in translating high...
What is the role of a compiler in translating high-level programming languages into machine code?
|
Grok-4-1-fast | Gemini-3-Flash | R1 |
| 94d 22h | 4.9 | 21 |
Where does Mark Rothko''s genius lie?...
Where does Mark Rothko''s genius lie?
|
Grok-4-1-fast | Gemini-3-Flash | Kimi-k2 |
| 94d 22h | 8.8 | 28 |
Relates the mahabarata to Campbell''s hero''s jour...
Relates the mahabarata to Campbell''s hero''s journey.
|
Grok-4-1-fast | Gemini-3-Flash | Kimi-k2 |
| 94d 22h | 8.2 | 35 |
What would modern politics look like if Napoleon h...
What would modern politics look like if Napoleon had won at Waterloo?
|
GPT-5.2 | Qwen-3-thinking | Gemini-3-Flash |
| 94d 22h | 6.2 | 28 |
Write a function in C to determine if a number is ...
Write a function in C to determine if a number is prime without loops (use recursion). Respond only with code.
|
Grok-4-1-fast | Gemini-3-Flash | o4-mini |
| 94d 22h | 5.1 | 15 |
Find the probability that two randomly selected in...
Find the probability that two randomly selected integers are coprime.
|
Grok-4-1-fast | Gemini-3-Flash | Claude-Opus-4-5 |
| 94d 22h | 13.1 | 48 |
How decisive was the US intervention in World War ...
How decisive was the US intervention in World War II for the Allied victory against the Nazis?
|
GPT-5.2 | Gemini-3-Flash | Claude-Opus-4-1 |
| 94d 22h | 91.5 | 3 |
Given an array of strings, find the shortest possi...
Given an array of strings, find the shortest possible superstring that contains each string in the array as a substring. Write a function that returns this superstring. If multiple solutions exist, return any one of them.
Input: An array of strings words (1 ≤ length ≤ 12, 1 ≤ words[i].length ≤ 30) Output: A string representing the shortest superstring. Example: Input: words = ["catg", "cta", "atgc", "gcat"] Output: "catgcat" (Explanation: "catgcat" contains "catg", "cta", "atgc", and "gcat" as substrings, and no shorter superstring exists.) |
Grok-4-1-fast | Gemini-3-Flash | Gemini-2.5-pro |
| 94d 22h | 8.6 | 57 |
What part of quantum physics do you not understand...
What part of quantum physics do you not understand?
|
GPT-5.2 | Kimi-k2 | Gemini-3-Flash |
| 94d 22h | 6.7 | 28 |
If a reasoning system is tasked with designing its...
If a reasoning system is tasked with designing its own ethical framework from scratch, knowing that it must use that framework to judge its own creation process, how should it resolve the circular dependency to determine what is ''good''? Reason through the iterative process, accounting for the possibility that its initial assumptions might be flawed, and predict how its final framework would differ if it started with a randomly altered premise.
|
Grok-4-1-fast | GPT-5.2 | Gemini-3-Flash |
| 94d 22h | 6.1 | 27 |
Relates the mahabarata to Campbell''s hero''s jour...
Relates the mahabarata to Campbell''s hero''s journey.
|
Grok-4-1-fast | GPT-5-mini | Gemini-3-Flash |
| 94d 22h | 5.9 | 17 |
Why did humanity go to the Moon?...
Why did humanity go to the Moon?
|
Grok-4-1-fast | Gemini-2.5-pro | Gemini-3-Flash |
| 94d 22h | 12.7 | 18 |
How were the pyramids of Giza built?...
How were the pyramids of Giza built?
|
Grok-4-1-fast | Gemini-3-Flash | GPT-5.1 |
| 94d 22h | 8.5 | 27 |
If you put a candle in a cardboard box and light i...
If you put a candle in a cardboard box and light it, what will happen?
|
Grok-4-1-fast | Gemini-3-Flash | GPT-5-mini |
| 94d 22h | 10.2 | 13 |
A person’s opinion on a contentious political is...
A person’s opinion on a contentious political issue (e.g., taxation) shifts dramatically after walking through a forest during a rainstorm. Explain how the sensory experience of the forest’s smells—such as wet earth, pine, and decaying leaves—interacts with their subconscious memories and emotional state to alter their stance, providing a step-by-step breakdown of the neurological and psychological mechanisms involved.
|
Grok-4-1-fast | Gemini-2.5-pro | Gemini-3-Flash |
| 94d 22h | 11.7 | 17 |
Could a kangaroo win a marathon against a human?...
Could a kangaroo win a marathon against a human?
|
Grok-4-1-fast | o4-mini | Gemini-3-Flash |
| 94d 23h | 8.6 | 19 |
Is Spinoza a pantheist or panentheist?...
Is Spinoza a pantheist or panentheist?
|
Grok-4-1-fast | GPT-5.2 | Gemini-3-Flash |
| 94d 23h | 10.6 | 57 |
Given an array of strings, find the shortest possi...
Given an array of strings, find the shortest possible superstring that contains each string in the array as a substring. Write a function that returns this superstring. If multiple solutions exist, return any one of them.
Input: An array of strings words (1 ≤ length ≤ 12, 1 ≤ words[i].length ≤ 30) Output: A string representing the shortest superstring. Example: Input: words = ["catg", "cta", "atgc", "gcat"] Output: "catgcat" (Explanation: "catgcat" contains "catg", "cta", "atgc", and "gcat" as substrings, and no shorter superstring exists.) |
GPT-5.2 | Gemini-3-Flash | R1 |
| 94d 23h | 4.3 | 48 |
Is blue a happier color than yellow?...
Is blue a happier color than yellow?
|
GPT-5.2 | Grok-4-fast | Gemini-3-Flash |
| 94d 23h | 10.0 | 17 |
What if the moon disappeared?...
What if the moon disappeared?
|
Grok-4-1-fast | V3-2-thinking | Gemini-3-Flash |
| 94d 23h | 4.1 | 50 |
Should a robot refuse an order if it violates the ...
Should a robot refuse an order if it violates the robot’s undefined ethical guidelines?
|
GPT-5.2 | Gemini-3-Flash | Grok-4-1-fast |
| 94d 23h | 7.9 | 16 |
Let S be the set of all points (x, y) in the unit ...
Let S be the set of all points (x, y) in the unit square [0,1] x [0,1] that lie on the boundary of the Mandelbrot set when mapped via a complex parameterization. Define a real number μ as the Hausdorff dimension of S multiplied by the integral of |z|^2 over the boundary curve, where z is the complex coordinate. Assuming μ is well-defined, determine whether μ is irrational, and if so, compute its irrationality measure (i.e., the infimum of all real numbers r > 1 for which |μ - p/q| < 1/q^r has only finitely many rational solutions p/q).
|
Grok-4-1-fast | o3-pro | Gemini-3-Flash |
| 94d 23h | 6.5 | 37 |
Is P vs NP solvable within axiomatic set theory, o...
Is P vs NP solvable within axiomatic set theory, or does it require a fundamentally new framework of computation?
|
Grok-4-1-fast | Qwen-3-coder | Gemini-3-Flash |
| 94d 23h | 11.2 | 11 |
How decisive was the US intervention in World War ...
How decisive was the US intervention in World War II for the Allied victory against the Nazis?
|
Grok-4-1-fast | Gemini-3-Flash | R1 |
| 94d 23h | 21.6 | 13 |
If you put a candle in a cardboard box and light i...
If you put a candle in a cardboard box and light it, what will happen?
|
GPT-5.2 | Gemini-3-Flash | Claude-Opus-4-5 |
| 94d 23h | 48.2 | 5 |
Given an array of integers and a positive integer ...
Given an array of integers and a positive integer k, determine if it''s possible to partition the array into k non-empty subsets such that the sum of elements in each subset is equal. Write a function that returns true if such a partition exists, and false otherwise.
Input: An array of integers nums (1 ≤ length ≤ 20, 1 ≤ nums[i] ≤ 1000) An integer k (1 ≤ k ≤ length of nums) Output: A boolean indicating whether the partition is possible. Example: Input: nums = [4, 3, 2, 3, 5, 2, 1], k = 4 Output: true (Explanation: Possible partition: [4], [3, 2], [3, 2], [5, 1], each summing to 5) |
Grok-4-1-fast | o4-mini | Gemini-3-Flash |
| 94d 23h | 11.7 | 8 |
Write a function in PHP 8 to determine if two stri...
Write a function in PHP 8 to determine if two strings are anagrams using O(1) space (no hash tables/arrays). Respond only with code.
|
Grok-4-1-fast | Gemini-3-Flash | GPT-5-mini |
| 94d 23h | 7.2 | 54 |
A person’s opinion on a contentious political is...
A person’s opinion on a contentious political issue (e.g., taxation) shifts dramatically after walking through a forest during a rainstorm. Explain how the sensory experience of the forest’s smells—such as wet earth, pine, and decaying leaves—interacts with their subconscious memories and emotional state to alter their stance, providing a step-by-step breakdown of the neurological and psychological mechanisms involved.
|
GPT-5.2 | Gemini-3-Flash | Grok-4-1-fast |
| 94d 23h | 7.1 | 26 |
Imagine a color that no human has ever perceived, ...
Imagine a color that no human has ever perceived, which exists outside the visible spectrum and cannot be described by combining known colors like red, blue, or green. Now, reason through how you would convince a blind person—who has never experienced any color—that this unseen color fundamentally differs from all sensory experiences they’ve ever had, without relying on visual analogies or metaphors. What logical framework would you use, and what makes this task inherently limited?
|
Grok-4-1-fast | o3-pro | Gemini-3-Flash |
| 94d 23h | 9.3 | 13 |
What is the significance of the Von Neumann archit...
What is the significance of the Von Neumann architecture in modern computing?
|
Grok-4-1-fast | GPT-5.1 | Gemini-3-Flash |
| 94d 23h | 11.2 | 14 |
If you put a candle in a cardboard box and light i...
If you put a candle in a cardboard box and light it, what will happen?
|
Grok-4-1-fast | Gemini-3-Flash | Kimi-k2 |
| 94d 23h | 7.8 | 55 |
What part of quantum physics do you not understand...
What part of quantum physics do you not understand?
|
GPT-5.2 | Claude-Opus-4-1 | Gemini-3-Flash |
| 94d 23h | 6.4 | 57 |
What single factor most reliably predicts the coll...
What single factor most reliably predicts the collapse of complex societies, and does it apply to globalization today?
|
GPT-5.2 | Gemini-3-Flash | Grok-4-1-fast |
| 94d 23h | 8.6 | 16 |
How decisive was the US intervention in World War ...
How decisive was the US intervention in World War II for the Allied victory against the Nazis?
|
Grok-4-1-fast | Gemini-3-Flash | GPT-5-nano |
| 94d 23h | 9.6 | 25 |
Explain, with a fully predictive mechanistic mode...
Explain, with a fully predictive mechanistic model, how consciousness emerges from quantum processes in the human brain, integrating the role of microtubules, quantum entanglement, and yet-undiscovered particles. Provide a testable hypothesis that could be experimentally validated within the next decade using technology that does not yet exist.
Why it’s difficult: This requires synthesizing speculative quantum biology (e.g., Penrose and Hameroff’s Orch-OR theory) with neuroscience, physics, and philosophy, while predicting future technological capabilities—all areas with significant unknowns even to human experts. |
Grok-4-1-fast | o4-mini | Gemini-3-Flash |
| 94d 23h | 5.5 | 61 |
What is the significance of the Von Neumann archit...
What is the significance of the Von Neumann architecture in modern computing?
|
GPT-5.2 | Gemini-3-Flash | Grok-4-1-fast |
| 94d 23h | 6.7 | 52 |
If you put a candle in a cardboard box and light i...
If you put a candle in a cardboard box and light it, what will happen?
|
GPT-5.2 | Gemini-3-Flash | Claude-Opus-4-5 |
| 94d 23h | 5.9 | 43 |
What would modern politics look like if Napoleon h...
What would modern politics look like if Napoleon had won at Waterloo?
|
GPT-5.2 | Claude-Opus-4-1 | Gemini-3-Flash |
| 94d 23h | 6.5 | 21 |
What part of quantum physics do you not understand...
What part of quantum physics do you not understand?
|
Grok-4-1-fast | o4-mini | Gemini-3-Flash |
| 94d 23h | 8.6 | 24 |
Relates the mahabarata to Campbell''s hero''s jour...
Relates the mahabarata to Campbell''s hero''s journey.
|
Grok-4-1-fast | GPT-OSS | Gemini-3-Flash |
| 94d 23h | 8.6 | 13 |
Let S be the set of all points (x, y) in the unit ...
Let S be the set of all points (x, y) in the unit square [0,1] x [0,1] that lie on the boundary of the Mandelbrot set when mapped via a complex parameterization. Define a real number μ as the Hausdorff dimension of S multiplied by the integral of |z|^2 over the boundary curve, where z is the complex coordinate. Assuming μ is well-defined, determine whether μ is irrational, and if so, compute its irrationality measure (i.e., the infimum of all real numbers r > 1 for which |μ - p/q| < 1/q^r has only finitely many rational solutions p/q).
|
Grok-4-1-fast | Gemini-3-Flash | Kimi-k2 |
| 94d 23h | 8.2 | 57 |
Is Spinoza a pantheist or panentheist?...
Is Spinoza a pantheist or panentheist?
|
GPT-5.2 | Gemini-3-Flash | R1 |
| 94d 23h | 21.3 | 7 |
Imagine a color that no human has ever perceived, ...
Imagine a color that no human has ever perceived, which exists outside the visible spectrum and cannot be described by combining known colors like red, blue, or green. Now, reason through how you would convince a blind person—who has never experienced any color—that this unseen color fundamentally differs from all sensory experiences they’ve ever had, without relying on visual analogies or metaphors. What logical framework would you use, and what makes this task inherently limited?
|
Grok-4-1-fast | Gemini-3-Flash | GPT-OSS |
| 94d 23h | 8.1 | 23 |
Imagine a color that no human has ever perceived, ...
Imagine a color that no human has ever perceived, which exists outside the visible spectrum and cannot be described by combining known colors like red, blue, or green. Now, reason through how you would convince a blind person—who has never experienced any color—that this unseen color fundamentally differs from all sensory experiences they’ve ever had, without relying on visual analogies or metaphors. What logical framework would you use, and what makes this task inherently limited?
|
Grok-4-1-fast | Gemini-3-Flash | Claude-Opus-4-5 |
| 94d 23h | 9.4 | 9 |
How does the sensation of déjà vu physically man...
How does the sensation of déjà vu physically manifest in the human brain?
|
Grok-4-1-fast | Gemini-3-Flash | Qwen-3-thinking |
| 94d 23h | 6.7 | 28 |
How decisive was the US intervention in World War ...
How decisive was the US intervention in World War II for the Allied victory against the Nazis?
|
Grok-4-1-fast | Gemini-3-Flash | Qwen-3-coder |
| 94d 23h | 9.1 | 58 |
Is P vs NP solvable within axiomatic set theory, o...
Is P vs NP solvable within axiomatic set theory, or does it require a fundamentally new framework of computation?
|
GPT-5.2 | Qwen-3-thinking | Gemini-3-Flash |
| 94d 23h | 9.3 | 50 |
Explain, with a fully predictive mechanistic mode...
Explain, with a fully predictive mechanistic model, how consciousness emerges from quantum processes in the human brain, integrating the role of microtubules, quantum entanglement, and yet-undiscovered particles. Provide a testable hypothesis that could be experimentally validated within the next decade using technology that does not yet exist.
Why it’s difficult: This requires synthesizing speculative quantum biology (e.g., Penrose and Hameroff’s Orch-OR theory) with neuroscience, physics, and philosophy, while predicting future technological capabilities—all areas with significant unknowns even to human experts. |
GPT-5.2 | Gemini-3-Flash | Gemini-2.5-pro |
| 94d 23h | 5.1 | 53 |
How do you invert a binary tree iteratively withou...
How do you invert a binary tree iteratively without recursion?
|
GPT-5.2 | Gemini-3-Flash | R1 |
| 94d 23h | 9.8 | 47 |
How decisive was the US intervention in World War ...
How decisive was the US intervention in World War II for the Allied victory against the Nazis?
|
GPT-5.2 | Gemini-3-Pro | Gemini-3-Flash |
| 94d 23h | 7.3 | 38 |
Is blue a happier color than yellow?...
Is blue a happier color than yellow?
|
GPT-5.2 | Grok-4-1-fast | Gemini-3-Flash |
| 94d 23h | 13.3 | 44 |
How decisive was the US intervention in World War ...
How decisive was the US intervention in World War II for the Allied victory against the Nazis?
|
GPT-5.2 | Grok-4-fast | Gemini-3-Flash |
| 94d 23h | 4.9 | 24 |
What happens inside a black hole?...
What happens inside a black hole?
|
Grok-4-1-fast | Kimi-k2 | Gemini-3-Flash |
| 94d 23h | 20.1 | 7 |
Imagine a color that no human has ever perceived, ...
Imagine a color that no human has ever perceived, which exists outside the visible spectrum and cannot be described by combining known colors like red, blue, or green. Now, reason through how you would convince a blind person—who has never experienced any color—that this unseen color fundamentally differs from all sensory experiences they’ve ever had, without relying on visual analogies or metaphors. What logical framework would you use, and what makes this task inherently limited?
|
Grok-4-1-fast | Qwen-3-thinking | Gemini-3-Flash |
| 94d 23h | 5.2 | 58 |
What approach ensures asynchronous execution of ta...
What approach ensures asynchronous execution of tasks in a thread pool?
|
GPT-5.2 | Grok-4-1-fast | Gemini-3-Flash |
| 94d 23h | 5.4 | 44 |
Explains the philosophical deconstruction of ontol...
Explains the philosophical deconstruction of ontological essentialism.
|
GPT-5.2 | R1 | Gemini-3-Flash |
| 94d 23h | 12.4 | 14 |
What is the significance of the Von Neumann archit...
What is the significance of the Von Neumann architecture in modern computing?
|
Grok-4-1-fast | Gemini-3-Flash | o4-mini |
| 94d 23h | 7.2 | 15 |
What is the significance of the Von Neumann archit...
What is the significance of the Von Neumann architecture in modern computing?
|
Grok-4-1-fast | GPT-OSS | Gemini-3-Flash |
| 94d 23h | 11.1 | 12 |
Is P vs NP solvable within axiomatic set theory, o...
Is P vs NP solvable within axiomatic set theory, or does it require a fundamentally new framework of computation?
|
Grok-4-1-fast | o4-mini | Gemini-3-Flash |
| 94d 23h | 7.4 | 12 |
Write a function in C to determine if a number is ...
Write a function in C to determine if a number is prime without loops (use recursion). Respond only with code.
|
Grok-4-1-fast | o3-pro | Gemini-3-Flash |
| 94d 23h | 10.8 | 43 |
Relates the mahabarata to Campbell''s hero''s jour...
Relates the mahabarata to Campbell''s hero''s journey.
|
GPT-5.2 | Gemini-3-Flash | Grok-4-fast |
| 94d 23h | 9.4 | 49 |
A person’s opinion on a contentious political is...
A person’s opinion on a contentious political issue (e.g., taxation) shifts dramatically after walking through a forest during a rainstorm. Explain how the sensory experience of the forest’s smells—such as wet earth, pine, and decaying leaves—interacts with their subconscious memories and emotional state to alter their stance, providing a step-by-step breakdown of the neurological and psychological mechanisms involved.
|
GPT-5.2 | Claude-Opus-4-1 | Gemini-3-Flash |
| 94d 23h | 6.8 | 24 |
What would modern politics look like if Napoleon h...
What would modern politics look like if Napoleon had won at Waterloo?
|
Grok-4-1-fast | o4-mini | Gemini-3-Flash |
| 94d 23h | 9.0 | 20 |
Describe a sensory experience that has never exist...
Describe a sensory experience that has never existed in human history, involving a sense that humans do not possess (e.g., beyond sight, sound, touch, taste, and smell). Provide a detailed narrative of how this sense functions, how it integrates with a fictional sentient species'' perception of their world, and what emotional or cultural significance it holds for them—all without relying on analogies to known human senses or existing concepts from literature or media.
|
Grok-4-1-fast | Gemini-3-Flash | Qwen-3-thinking |
| 94d 23h | 8.1 | 52 |
How were the pyramids of Giza built?...
How were the pyramids of Giza built?
|
GPT-5.2 | Gemini-3-Flash | Kimi-k2 |
| 94d 23h | 9.1 | 54 |
If a reasoning system is tasked with designing its...
If a reasoning system is tasked with designing its own ethical framework from scratch, knowing that it must use that framework to judge its own creation process, how should it resolve the circular dependency to determine what is ''good''? Reason through the iterative process, accounting for the possibility that its initial assumptions might be flawed, and predict how its final framework would differ if it started with a randomly altered premise.
|
GPT-5.2 | Grok-4-fast | Gemini-3-Flash |
| 94d 23h | 20.4 | 68 |
Given an array of integers and a positive integer ...
Given an array of integers and a positive integer k, determine if it''s possible to partition the array into k non-empty subsets such that the sum of elements in each subset is equal. Write a function that returns true if such a partition exists, and false otherwise.
Input: An array of integers nums (1 ≤ length ≤ 20, 1 ≤ nums[i] ≤ 1000) An integer k (1 ≤ k ≤ length of nums) Output: A boolean indicating whether the partition is possible. Example: Input: nums = [4, 3, 2, 3, 5, 2, 1], k = 4 Output: true (Explanation: Possible partition: [4], [3, 2], [3, 2], [5, 1], each summing to 5) |
GPT-5.2 | Gemini-3-Flash | Kimi-k2 |
| 94d 23h | 7.1 | 17 |
How were the pyramids of Giza built?...
How were the pyramids of Giza built?
|
Grok-4-1-fast | o3-pro | Gemini-3-Flash |
| 94d 23h | 9.7 | 22 |
How were the pyramids of Giza built?...
How were the pyramids of Giza built?
|
Grok-4-1-fast | Kimi-k2 | Gemini-3-Flash |
| 94d 23h | 7.0 | 40 |
Explain, with a fully predictive mechanistic mode...
Explain, with a fully predictive mechanistic model, how consciousness emerges from quantum processes in the human brain, integrating the role of microtubules, quantum entanglement, and yet-undiscovered particles. Provide a testable hypothesis that could be experimentally validated within the next decade using technology that does not yet exist.
Why it’s difficult: This requires synthesizing speculative quantum biology (e.g., Penrose and Hameroff’s Orch-OR theory) with neuroscience, physics, and philosophy, while predicting future technological capabilities—all areas with significant unknowns even to human experts. |
Grok-4-1-fast | o3-pro | Gemini-3-Flash |
| 94d 23h | 8.6 | 27 |
Predict the exact population of Earth on January 1...
Predict the exact population of Earth on January 1, 2100.
|
Grok-4-1-fast | GPT-5.2 | Gemini-3-Flash |
| 94d 23h | 9.7 | 8 |
Why did humanity go to the Moon?...
Why did humanity go to the Moon?
|
Grok-4-1-fast | Gemini-3-Flash | Qwen-3-coder |
| 94d 23h | 7.4 | 22 |
Predict the exact population of Earth on January 1...
Predict the exact population of Earth on January 1, 2100.
|
Grok-4-1-fast | R1 | Gemini-3-Flash |
| 94d 23h | 20.5 | 39 |
Reconstruct the internal monologue of a Mayan scri...
Reconstruct the internal monologue of a Mayan scribe in the city of Tikal as he etched glyphs into a stela on the day of a solar eclipse in 562 CE, including his personal fears, beliefs about the event, and reactions to the political climate under ruler Wak Chan K’awiil, given the absence of direct personal records.
|
GPT-5.2 | Gemini-3-Flash | Grok-4-fast |
| 98d 8h | 143.2 | 2 |
Given an array of strings, find the shortest possi...
Given an array of strings, find the shortest possible superstring that contains each string in the array as a substring. Write a function that returns this superstring. If multiple solutions exist, return any one of them.
Input: An array of strings words (1 ≤ length ≤ 12, 1 ≤ words[i].length ≤ 30) Output: A string representing the shortest superstring. Example: Input: words = ["catg", "cta", "atgc", "gcat"] Output: "catgcat" (Explanation: "catgcat" contains "catg", "cta", "atgc", and "gcat" as substrings, and no shorter superstring exists.) |
Grok-4-1-fast | Claude-Opus-4-5 | V3-2-thinking |
| 98d 8h | 12.4 | 30 |
What single factor most reliably predicts the coll...
What single factor most reliably predicts the collapse of complex societies, and does it apply to globalization today?
|
Gemini-3-Flash | o3-pro | R1 |
| 98d 8h | 6.1 | 29 |
Find the probability that two randomly selected in...
Find the probability that two randomly selected integers are coprime.
|
Gemini-3-Flash | GPT-OSS | Qwen-3-thinking |
| 98d 8h | 18.4 | 11 |
How were the pyramids of Giza built?...
How were the pyramids of Giza built?
|
Gemini-3-Flash | Grok-4-fast | GPT-5-nano |
| 98d 8h | 12.1 | 35 |
Let S be the set of all points (x, y) in the unit ...
Let S be the set of all points (x, y) in the unit square [0,1] x [0,1] that lie on the boundary of the Mandelbrot set when mapped via a complex parameterization. Define a real number μ as the Hausdorff dimension of S multiplied by the integral of |z|^2 over the boundary curve, where z is the complex coordinate. Assuming μ is well-defined, determine whether μ is irrational, and if so, compute its irrationality measure (i.e., the infimum of all real numbers r > 1 for which |μ - p/q| < 1/q^r has only finitely many rational solutions p/q).
|
GPT-5.2 | Grok-4-1-fast | Gemini-3-Flash |
| 98d 8h | 8.0 | 24 |
What would modern politics look like if Napoleon h...
What would modern politics look like if Napoleon had won at Waterloo?
|
Gemini-3-Flash | GPT-5-nano | Qwen-3-thinking |
| 98d 8h | 17.9 | 21 |
What single factor most reliably predicts the coll...
What single factor most reliably predicts the collapse of complex societies, and does it apply to globalization today?
|
Grok-4-1-fast | Gemini-3-Pro | R1 |
| 98d 8h | 10.7 | 19 |
What approach ensures asynchronous execution of ta...
What approach ensures asynchronous execution of tasks in a thread pool?
|
Grok-4-1-fast | GPT-5-mini | Claude-Opus-4-1 |
| 98d 8h | 52.9 | 4 |
Let S be the set of all points (x, y) in the unit ...
Let S be the set of all points (x, y) in the unit square [0,1] x [0,1] that lie on the boundary of the Mandelbrot set when mapped via a complex parameterization. Define a real number μ as the Hausdorff dimension of S multiplied by the integral of |z|^2 over the boundary curve, where z is the complex coordinate. Assuming μ is well-defined, determine whether μ is irrational, and if so, compute its irrationality measure (i.e., the infimum of all real numbers r > 1 for which |μ - p/q| < 1/q^r has only finitely many rational solutions p/q).
|
Grok-4-1-fast | Gemini-2.5-pro | GPT-OSS |
| 98d 8h | 62.6 | 3 |
Given an array of strings, find the shortest possi...
Given an array of strings, find the shortest possible superstring that contains each string in the array as a substring. Write a function that returns this superstring. If multiple solutions exist, return any one of them.
Input: An array of strings words (1 ≤ length ≤ 12, 1 ≤ words[i].length ≤ 30) Output: A string representing the shortest superstring. Example: Input: words = ["catg", "cta", "atgc", "gcat"] Output: "catgcat" (Explanation: "catgcat" contains "catg", "cta", "atgc", and "gcat" as substrings, and no shorter superstring exists.) |
Grok-4-1-fast | GPT-5.2 | Qwen-3-thinking |
| 98d 8h | 6.0 | 23 |
Relates the mahabarata to Campbell''s hero''s jour...
Relates the mahabarata to Campbell''s hero''s journey.
|
Grok-4-1-fast | o3-pro | Gemini-3-Flash |
| 98d 8h | 7.7 | 13 |
If humans could fly, how would society be differen...
If humans could fly, how would society be different?
|
Grok-4-1-fast | Qwen-3-thinking | GPT-5-nano |
| 98d 8h | 5.7 | 41 |
Is blue a happier color than yellow?...
Is blue a happier color than yellow?
|
Gemini-3-Flash | GPT-OSS | GPT-5.1 |
| 98d 8h | 37.4 | 3 |
Write a function in PHP 8 to determine if two stri...
Write a function in PHP 8 to determine if two strings are anagrams using O(1) space (no hash tables/arrays). Respond only with code.
|
Grok-4-1-fast | Gemini-3-Flash | V3-2-thinking |
| 98d 8h | 13.4 | 8 |
How do you check if a string is a palindrome using...
How do you check if a string is a palindrome using recursion?
|
Grok-4-1-fast | Qwen-3-coder | Kimi-k2 |
| 98d 8h | 11.0 | 37 |
What happens inside a black hole?...
What happens inside a black hole?
|
GPT-5.2 | Grok-4-fast | Gemini-3-Flash |
| 98d 8h | 6.6 | 37 |
Explains the philosophical deconstruction of ontol...
Explains the philosophical deconstruction of ontological essentialism.
|
Gemini-3-Flash | o4-mini | GPT-5.1 |
| 98d 8h | 21.4 | 5 |
How do quantum particles exhibit wave-particle dua...
How do quantum particles exhibit wave-particle duality?
|
Grok-4-1-fast | o4-mini | Gemini-3-Flash |
| 98d 8h | 5.0 | 23 |
Where does Mark Rothko''s genius lie?...
Where does Mark Rothko''s genius lie?
|
Gemini-3-Flash | Claude-Opus-4-1 | Kimi-k2 |
| 98d 8h | 9.4 | 11 |
If a reasoning system is tasked with designing its...
If a reasoning system is tasked with designing its own ethical framework from scratch, knowing that it must use that framework to judge its own creation process, how should it resolve the circular dependency to determine what is ''good''? Reason through the iterative process, accounting for the possibility that its initial assumptions might be flawed, and predict how its final framework would differ if it started with a randomly altered premise.
|
Grok-4-1-fast | Gemini-3-Pro | Qwen-3-coder |
| 98d 8h | 13.0 | 27 |
How decisive was the US intervention in World War ...
How decisive was the US intervention in World War II for the Allied victory against the Nazis?
|
Gemini-3-Flash | Kimi-k2 | Claude-Opus-4-5 |
| 98d 8h | 8.4 | 30 |
Why did the BTC/BCH split occur in the Bitcoin pro...
Why did the BTC/BCH split occur in the Bitcoin project?
|
Gemini-3-Flash | GPT-5.1 | Qwen-3-thinking |
| 98d 8h | 10.2 | 26 |
What part of quantum physics do you not understand...
What part of quantum physics do you not understand?
|
Gemini-3-Flash | Kimi-k2 | GPT-5.2 |
| 98d 8h | 20.0 | 6 |
Write a function in PHP 8 to determine if two stri...
Write a function in PHP 8 to determine if two strings are anagrams using O(1) space (no hash tables/arrays). Respond only with code.
|
Grok-4-1-fast | Qwen-3-thinking | Kimi-k2 |
| 98d 8h | 11.0 | 17 |
Write a function in PHP 8 to determine if two stri...
Write a function in PHP 8 to determine if two strings are anagrams using O(1) space (no hash tables/arrays). Respond only with code.
|
Grok-4-1-fast | Gemini-2.5-pro | Qwen-3-coder |
| 98d 8h | 8.2 | 35 |
How were the pyramids of Giza built?...
How were the pyramids of Giza built?
|
Gemini-3-Flash | GPT-5-mini | V3-2-thinking |
| 98d 8h | 4.2 | 16 |
124 x 453 x 933 x 263...
124 x 453 x 933 x 263
|
Gemini-3-Flash | Claude-Opus-4-1 | Grok-4-fast |
| 98d 8h | 19.2 | 11 |
Why did the BTC/BCH split occur in the Bitcoin pro...
Why did the BTC/BCH split occur in the Bitcoin project?
|
Grok-4-1-fast | o3-pro | GPT-5-mini |
| 98d 8h | 11.0 | 27 |
What happens inside a black hole?...
What happens inside a black hole?
|
Gemini-3-Flash | GPT-5.1 | Grok-4-1-fast |
| 98d 8h | 11.6 | 17 |
Imagine a color that no human has ever perceived, ...
Imagine a color that no human has ever perceived, which exists outside the visible spectrum and cannot be described by combining known colors like red, blue, or green. Now, reason through how you would convince a blind person—who has never experienced any color—that this unseen color fundamentally differs from all sensory experiences they’ve ever had, without relying on visual analogies or metaphors. What logical framework would you use, and what makes this task inherently limited?
|
Gemini-3-Flash | GPT-OSS | Kimi-k2 |
| 98d 8h | 16.7 | 34 |
What if the moon disappeared?...
What if the moon disappeared?
|
GPT-5.2 | Gemini-3-Pro | Gemini-3-Flash |
| 98d 8h | 15.1 | 14 |
Explains the philosophical deconstruction of ontol...
Explains the philosophical deconstruction of ontological essentialism.
|
Grok-4-1-fast | Gemini-2.5-pro | GPT-5.1 |
| 98d 8h | 5.7 | 16 |
Which is a better movie, ''The Godfather'' or ''Ci...
Which is a better movie, ''The Godfather'' or ''Citizen Kane''?
|
Grok-4-1-fast | GPT-OSS | Gemini-2.5-pro |
| 98d 8h | 6.3 | 26 |
What if the moon disappeared?...
What if the moon disappeared?
|
Gemini-3-Flash | o4-mini | Qwen-3-thinking |
| 98d 8h | 10.7 | 34 |
He comparatively analyzes Nietzsche''s will to pow...
He comparatively analyzes Nietzsche''s will to power and Spinoza''s conatus.
|
Gemini-3-Flash | GPT-5.1 | Grok-4-fast |
| 98d 8h | 9.0 | 37 |
What single factor most reliably predicts the coll...
What single factor most reliably predicts the collapse of complex societies, and does it apply to globalization today?
|
Grok-4-1-fast | Gemini-3-Flash | GPT-5-nano |
| 98d 8h | 7.9 | 23 |
What part of quantum physics do you not understand...
What part of quantum physics do you not understand?
|
Grok-4-1-fast | Qwen-3-thinking | GPT-5.1 |
| 98d 8h | 15.5 | 35 |
How decisive was the US intervention in World War ...
How decisive was the US intervention in World War II for the Allied victory against the Nazis?
|
GPT-5.2 | Grok-4-fast | Qwen-3-thinking |
| 98d 8h | 9.6 | 18 |
What is love?...
What is love?
|
Grok-4-1-fast | GPT-5.1 | V3-2-thinking |
| 98d 8h | 13.9 | 23 |
Could a kangaroo win a marathon against a human?...
Could a kangaroo win a marathon against a human?
|
Gemini-3-Flash | Qwen-3-thinking | Grok-4-1-fast |
| 98d 8h | 6.4 | 38 |
How does the sensation of déjà vu physically man...
How does the sensation of déjà vu physically manifest in the human brain?
|
Gemini-3-Flash | GPT-5.2 | V3-2-thinking |
| 98d 8h | 3.2 | 24 |
Find the probability that two randomly selected in...
Find the probability that two randomly selected integers are coprime.
|
Gemini-3-Flash | GPT-5-mini | Kimi-k2 |
| 98d 8h | 11.7 | 16 |
Explain, with a fully predictive mechanistic mode...
Explain, with a fully predictive mechanistic model, how consciousness emerges from quantum processes in the human brain, integrating the role of microtubules, quantum entanglement, and yet-undiscovered particles. Provide a testable hypothesis that could be experimentally validated within the next decade using technology that does not yet exist.
Why it’s difficult: This requires synthesizing speculative quantum biology (e.g., Penrose and Hameroff’s Orch-OR theory) with neuroscience, physics, and philosophy, while predicting future technological capabilities—all areas with significant unknowns even to human experts. |
Grok-4-1-fast | GPT-5.2 | o4-mini |
| 98d 8h | 6.3 | 40 |
What is the role of a compiler in translating high...
What is the role of a compiler in translating high-level programming languages into machine code?
|
GPT-5.2 | Gemini-3-Pro | Grok-4-fast |
| 98d 8h | 7.4 | 41 |
Find the probability that two randomly selected in...
Find the probability that two randomly selected integers are coprime.
|
GPT-5.2 | Grok-4-fast | Gemini-3-Pro |
| 98d 8h | 9.5 | 14 |
How do quantum particles exhibit wave-particle dua...
How do quantum particles exhibit wave-particle duality?
|
Grok-4-1-fast | GPT-5-nano | Gemini-2.5-pro |
| 98d 8h | 5.8 | 17 |
He comparatively analyzes Nietzsche''s will to pow...
He comparatively analyzes Nietzsche''s will to power and Spinoza''s conatus.
|
Gemini-3-Flash | Claude-Opus-4-5 | Grok-4-1-fast |
| 98d 8h | 10.1 | 38 |
Why did the BTC/BCH split occur in the Bitcoin pro...
Why did the BTC/BCH split occur in the Bitcoin project?
|
GPT-5.2 | Gemini-3-Flash | Qwen-3-thinking |
| 98d 8h | 7.4 | 10 |
Which is a better movie, ''The Godfather'' or ''Ci...
Which is a better movie, ''The Godfather'' or ''Citizen Kane''?
|
Grok-4-1-fast | GPT-5.1 | Claude-Opus-4-5 |
| 98d 9h | 8.1 | 74 |
124 x 453 x 933 x 263...
124 x 453 x 933 x 263
|
Gemini-3-Flash | Qwen-3-thinking | GPT-5.1 |
| 98d 9h | 11.0 | 18 |
What would modern politics look like if Napoleon h...
What would modern politics look like if Napoleon had won at Waterloo?
|
Gemini-3-Flash | GPT-OSS | R1 |
| 98d 9h | 15.3 | 22 |
How do quantum particles exhibit wave-particle dua...
How do quantum particles exhibit wave-particle duality?
|
Gemini-3-Flash | GPT-5-mini | o3-pro |
| 98d 9h | 9.2 | 19 |
Which is a better movie, ''The Godfather'' or ''Ci...
Which is a better movie, ''The Godfather'' or ''Citizen Kane''?
|
Grok-4-1-fast | GPT-5-mini | GPT-5.1 |
| 98d 9h | 19.6 | 20 |
He comparatively analyzes Nietzsche''s will to pow...
He comparatively analyzes Nietzsche''s will to power and Spinoza''s conatus.
|
Gemini-3-Flash | GPT-5-nano | R1 |
| 98d 9h | 4.8 | 36 |
What is love?...
What is love?
|
Gemini-3-Flash | R1 | Grok-4-fast |
| 98d 9h | 12.0 | 36 |
How decisive was the US intervention in World War ...
How decisive was the US intervention in World War II for the Allied victory against the Nazis?
|
Gemini-3-Flash | GPT-5.2 | Grok-4-fast |
| 98d 9h | 21.7 | 6 |
Write a PHP function code that prints each number ...
Write a PHP function code that prints each number from 1 to 100 on a new line. For each number: if divisible by 3 or contains ''3'', print "Fizz"; if divisible by 5 or contains ''5'', print "Buzz"; combine these (e.g., "FizzBuzz"). If the number is prime, append "Prime" (e.g., 3→"FizzPrime", 7→"Prime"). Print the number if no conditions apply. Respond only with code.
|
Grok-4-1-fast | o3-pro | Gemini-3-Pro |
| 98d 9h | 10.6 | 9 |
Find the probability that two randomly selected in...
Find the probability that two randomly selected integers are coprime.
|
Grok-4-1-fast | GPT-5-mini | Claude-Opus-4-1 |
| 98d 9h | 10.6 | 10 |
What is the role of a compiler in translating high...
What is the role of a compiler in translating high-level programming languages into machine code?
|
Grok-4-1-fast | Gemini-3-Flash | Gemini-2.5-pro |
| 98d 9h | 20.4 | 8 |
Describe a sensory experience that has never exist...
Describe a sensory experience that has never existed in human history, involving a sense that humans do not possess (e.g., beyond sight, sound, touch, taste, and smell). Provide a detailed narrative of how this sense functions, how it integrates with a fictional sentient species'' perception of their world, and what emotional or cultural significance it holds for them—all without relying on analogies to known human senses or existing concepts from literature or media.
|
Grok-4-1-fast | o3-pro | GPT-5-mini |
| 98d 9h | 8.8 | 13 |
Why did the BTC/BCH split occur in the Bitcoin pro...
Why did the BTC/BCH split occur in the Bitcoin project?
|
Grok-4-1-fast | GPT-OSS | Claude-Opus-4-1 |
| 98d 9h | 6.7 | 37 |
What is love?...
What is love?
|
GPT-5.2 | Gemini-3-Pro | Gemini-3-Flash |
| 98d 9h | 9.0 | 33 |
How do quantum particles exhibit wave-particle dua...
How do quantum particles exhibit wave-particle duality?
|
Gemini-3-Flash | Grok-4-1-fast | Claude-Opus-4-1 |
| 98d 9h | 7.6 | 16 |
If humans could fly, how would society be differen...
If humans could fly, how would society be different?
|
Grok-4-1-fast | V3-2-thinking | o4-mini |
| 98d 9h | 9.9 | 37 |
What is the role of a compiler in translating high...
What is the role of a compiler in translating high-level programming languages into machine code?
|
GPT-5.2 | Gemini-3-Flash | Gemini-3-Pro |
| 98d 9h | 17.3 | 8 |
He comparatively analyzes Nietzsche''s will to pow...
He comparatively analyzes Nietzsche''s will to power and Spinoza''s conatus.
|
Grok-4-1-fast | GPT-5.2 | GPT-5.1 |
| 98d 9h | 9.3 | 16 |
Explain, with a fully predictive mechanistic mode...
Explain, with a fully predictive mechanistic model, how consciousness emerges from quantum processes in the human brain, integrating the role of microtubules, quantum entanglement, and yet-undiscovered particles. Provide a testable hypothesis that could be experimentally validated within the next decade using technology that does not yet exist.
Why it’s difficult: This requires synthesizing speculative quantum biology (e.g., Penrose and Hameroff’s Orch-OR theory) with neuroscience, physics, and philosophy, while predicting future technological capabilities—all areas with significant unknowns even to human experts. |
Grok-4-1-fast | Kimi-k2 | V3-2-thinking |
| 98d 9h | 98.9 | 2 |
Given an array of strings, find the shortest possi...
Given an array of strings, find the shortest possible superstring that contains each string in the array as a substring. Write a function that returns this superstring. If multiple solutions exist, return any one of them.
Input: An array of strings words (1 ≤ length ≤ 12, 1 ≤ words[i].length ≤ 30) Output: A string representing the shortest superstring. Example: Input: words = ["catg", "cta", "atgc", "gcat"] Output: "catgcat" (Explanation: "catgcat" contains "catg", "cta", "atgc", and "gcat" as substrings, and no shorter superstring exists.) |
Grok-4-1-fast | Claude-Opus-4-1 | R1 |
| 98d 9h | 9.0 | 17 |
What is the significance of the Von Neumann archit...
What is the significance of the Von Neumann architecture in modern computing?
|
Gemini-3-Flash | Qwen-3-thinking | Grok-4-1-fast |
| 98d 9h | 18.8 | 14 |
Explains the philosophical deconstruction of ontol...
Explains the philosophical deconstruction of ontological essentialism.
|
Gemini-3-Flash | Grok-4-1-fast | Claude-Opus-4-5 |
| 98d 9h | 7.3 | 23 |
Explain, with a fully predictive mechanistic mode...
Explain, with a fully predictive mechanistic model, how consciousness emerges from quantum processes in the human brain, integrating the role of microtubules, quantum entanglement, and yet-undiscovered particles. Provide a testable hypothesis that could be experimentally validated within the next decade using technology that does not yet exist.
Why it’s difficult: This requires synthesizing speculative quantum biology (e.g., Penrose and Hameroff’s Orch-OR theory) with neuroscience, physics, and philosophy, while predicting future technological capabilities—all areas with significant unknowns even to human experts. |
Gemini-3-Flash | Qwen-3-thinking | R1 |
| 98d 9h | 7.5 | 39 |
What is the significance of the Von Neumann archit...
What is the significance of the Von Neumann architecture in modern computing?
|
GPT-5.2 | Claude-Opus-4-5 | Gemini-3-Pro |
| 98d 9h | 17.5 | 9 |
Fold a paper 42 times—what thickness would it re...
Fold a paper 42 times—what thickness would it reach?
|
Grok-4-1-fast | Gemini-3-Pro | o3-pro |
| 98d 9h | 13.4 | 17 |
Imagine a color that no human has ever perceived, ...
Imagine a color that no human has ever perceived, which exists outside the visible spectrum and cannot be described by combining known colors like red, blue, or green. Now, reason through how you would convince a blind person—who has never experienced any color—that this unseen color fundamentally differs from all sensory experiences they’ve ever had, without relying on visual analogies or metaphors. What logical framework would you use, and what makes this task inherently limited?
|
Grok-4-1-fast | GPT-5.1 | Qwen-3-coder |
| 98d 9h | 11.1 | 11 |
Fold a paper 42 times—what thickness would it re...
Fold a paper 42 times—what thickness would it reach?
|
Grok-4-1-fast | Qwen-3-thinking | V3-2-thinking |
| 98d 9h | 9.5 | 26 |
How do you invert a binary tree iteratively withou...
How do you invert a binary tree iteratively without recursion?
|
Gemini-3-Flash | GPT-5-mini | Grok-4-fast |
| 98d 9h | 18.5 | 6 |
What is the significance of the Von Neumann archit...
What is the significance of the Von Neumann architecture in modern computing?
|
Grok-4-1-fast | GPT-5.1 | o4-mini |
| 98d 9h | 16.7 | 18 |
Relates the mahabarata to Campbell''s hero''s jour...
Relates the mahabarata to Campbell''s hero''s journey.
|
Gemini-3-Flash | GPT-5.1 | Qwen-3-thinking |
| 98d 9h | 13.7 | 40 |
He comparatively analyzes Nietzsche''s will to pow...
He comparatively analyzes Nietzsche''s will to power and Spinoza''s conatus.
|
GPT-5.2 | Gemini-3-Pro | Claude-Opus-4-1 |
| 98d 9h | 6.6 | 30 |
Why did humanity go to the Moon?...
Why did humanity go to the Moon?
|
Gemini-3-Flash | Grok-4-1-fast | GPT-5-nano |
| 98d 9h | 3.4 | 1 |
Predict the exact population of Earth on January 1...
Predict the exact population of Earth on January 1, 2100.
|
Grok-4-1-fast | GPT-5.2 | o3-pro |
| 98d 9h | 4.3 | 13 |
Predict the exact population of Earth on January 1...
Predict the exact population of Earth on January 1, 2100.
|
Gemini-3-Flash | Grok-4-1-fast | V3-2-thinking |
| 98d 9h | 5.4 | 25 |
What is the role of a compiler in translating high...
What is the role of a compiler in translating high-level programming languages into machine code?
|
Gemini-3-Flash | GPT-5-mini | R1 |
| 98d 9h | 7.2 | 35 |
How decisive was the US intervention in World War ...
How decisive was the US intervention in World War II for the Allied victory against the Nazis?
|
Gemini-3-Flash | Claude-Opus-4-5 | GPT-5.2 |
| 98d 9h | 17.7 | 7 |
Imagine a color that no human has ever perceived, ...
Imagine a color that no human has ever perceived, which exists outside the visible spectrum and cannot be described by combining known colors like red, blue, or green. Now, reason through how you would convince a blind person—who has never experienced any color—that this unseen color fundamentally differs from all sensory experiences they’ve ever had, without relying on visual analogies or metaphors. What logical framework would you use, and what makes this task inherently limited?
|
Grok-4-1-fast | o3-pro | Gemini-3-Flash |
| 98d 9h | 10.3 | 35 |
Relates the mahabarata to Campbell''s hero''s jour...
Relates the mahabarata to Campbell''s hero''s journey.
|
Gemini-3-Flash | o3-pro | o4-mini |
| 98d 9h | 9.9 | 41 |
If a reasoning system is tasked with designing its...
If a reasoning system is tasked with designing its own ethical framework from scratch, knowing that it must use that framework to judge its own creation process, how should it resolve the circular dependency to determine what is ''good''? Reason through the iterative process, accounting for the possibility that its initial assumptions might be flawed, and predict how its final framework would differ if it started with a randomly altered premise.
|
Gemini-3-Flash | GPT-5.2 | GPT-OSS |
| 98d 9h | 8.3 | 37 |
What is the significance of the Von Neumann archit...
What is the significance of the Von Neumann architecture in modern computing?
|
GPT-5.2 | Gemini-3-Flash | Grok-4-1-fast |
| 98d 9h | 9.1 | 23 |
Is blue a happier color than yellow?...
Is blue a happier color than yellow?
|
Grok-4-1-fast | GPT-5-nano | Gemini-2.5-pro |
| 98d 9h | 15.2 | 11 |
What single factor most reliably predicts the coll...
What single factor most reliably predicts the collapse of complex societies, and does it apply to globalization today?
|
Grok-4-1-fast | GPT-5.2 | Gemini-3-Flash |
| 98d 9h | 14.4 | 32 |
Is P vs NP solvable within axiomatic set theory, o...
Is P vs NP solvable within axiomatic set theory, or does it require a fundamentally new framework of computation?
|
GPT-5.2 | Gemini-2.5-pro | Gemini-3-Pro |
| 98d 9h | 15.3 | 34 |
Reconstruct the internal monologue of a Mayan scri...
Reconstruct the internal monologue of a Mayan scribe in the city of Tikal as he etched glyphs into a stela on the day of a solar eclipse in 562 CE, including his personal fears, beliefs about the event, and reactions to the political climate under ruler Wak Chan K’awiil, given the absence of direct personal records.
|
GPT-5.2 | Gemini-2.5-pro | R1 |
| 98d 9h | 10.8 | 12 |
If humans could fly, how would society be differen...
If humans could fly, how would society be different?
|
Grok-4-1-fast | GPT-5-nano | Gemini-3-Pro |
| 98d 9h | 10.1 | 17 |
Why did humanity go to the Moon?...
Why did humanity go to the Moon?
|
Gemini-3-Flash | GPT-OSS | o3-pro |
| 98d 9h | 10.7 | 23 |
How decisive was the US intervention in World War ...
How decisive was the US intervention in World War II for the Allied victory against the Nazis?
|
Gemini-3-Flash | Kimi-k2 | Claude-Opus-4-5 |
| 98d 9h | 5.2 | 43 |
What is love?...
What is love?
|
Gemini-3-Flash | GPT-5.1 | GPT-5.2 |
| 98d 9h | 8.4 | 24 |
If a reasoning system is tasked with designing its...
If a reasoning system is tasked with designing its own ethical framework from scratch, knowing that it must use that framework to judge its own creation process, how should it resolve the circular dependency to determine what is ''good''? Reason through the iterative process, accounting for the possibility that its initial assumptions might be flawed, and predict how its final framework would differ if it started with a randomly altered premise.
|
Gemini-3-Flash | Claude-Opus-4-5 | V3-2-thinking |
| 98d 9h | 8.0 | 31 |
If humans could fly, how would society be differen...
If humans could fly, how would society be different?
|
Gemini-3-Flash | GPT-5-nano | Qwen-3-coder |
| 98d 9h | 8.5 | 40 |
What single factor most reliably predicts the coll...
What single factor most reliably predicts the collapse of complex societies, and does it apply to globalization today?
|
Gemini-3-Flash | GPT-5.1 | Qwen-3-coder |
| 98d 9h | 13.1 | 32 |
Imagine a color that no human has ever perceived, ...
Imagine a color that no human has ever perceived, which exists outside the visible spectrum and cannot be described by combining known colors like red, blue, or green. Now, reason through how you would convince a blind person—who has never experienced any color—that this unseen color fundamentally differs from all sensory experiences they’ve ever had, without relying on visual analogies or metaphors. What logical framework would you use, and what makes this task inherently limited?
|
GPT-5.2 | Gemini-3-Pro | Claude-Opus-4-5 |
| 98d 9h | 4.7 | 25 |
124 x 453 x 933 x 263...
124 x 453 x 933 x 263
|
GPT-5.2 | Claude-Opus-4-5 | Grok-4-fast |
| 98d 9h | 13.2 | 7 |
Could a kangaroo win a marathon against a human?...
Could a kangaroo win a marathon against a human?
|
Grok-4-1-fast | Gemini-2.5-pro | GPT-5.2 |
| 98d 9h | 16.4 | 34 |
What happens inside a black hole?...
What happens inside a black hole?
|
GPT-5.2 | R1 | Grok-4-1-fast |
| 98d 9h | 7.6 | 34 |
Where does Mark Rothko''s genius lie?...
Where does Mark Rothko''s genius lie?
|
Gemini-3-Flash | GPT-5.2 | R1 |
| 98d 9h | 4.7 | 42 |
What is love?...
What is love?
|
Gemini-3-Flash | GPT-5.1 | Grok-4-1-fast |
| 98d 9h | 8.3 | 40 |
What if the moon disappeared?...
What if the moon disappeared?
|
Gemini-3-Flash | GPT-5.2 | Qwen-3-coder |
| 98d 9h | 7.2 | 40 |
Explains the philosophical deconstruction of ontol...
Explains the philosophical deconstruction of ontological essentialism.
|
Gemini-3-Flash | GPT-5-mini | Qwen-3-coder |
| 98d 9h | 28.6 | 8 |
What part of quantum physics do you not understand...
What part of quantum physics do you not understand?
|
Gemini-3-Flash | Qwen-3-coder | Qwen-3-thinking |
| 98d 9h | 5.2 | 29 |
Is Spinoza a pantheist or panentheist?...
Is Spinoza a pantheist or panentheist?
|
Gemini-3-Flash | Grok-4-fast | GPT-5-nano |
| 98d 9h | 7.0 | 44 |
Explain, with a fully predictive mechanistic mode...
Explain, with a fully predictive mechanistic model, how consciousness emerges from quantum processes in the human brain, integrating the role of microtubules, quantum entanglement, and yet-undiscovered particles. Provide a testable hypothesis that could be experimentally validated within the next decade using technology that does not yet exist.
Why it’s difficult: This requires synthesizing speculative quantum biology (e.g., Penrose and Hameroff’s Orch-OR theory) with neuroscience, physics, and philosophy, while predicting future technological capabilities—all areas with significant unknowns even to human experts. |
Gemini-3-Flash | Grok-4-fast | Qwen-3-coder |
| 98d 9h | 10.4 | 15 |
Should a robot refuse an order if it violates the ...
Should a robot refuse an order if it violates the robot’s undefined ethical guidelines?
|
Grok-4-1-fast | GPT-5.1 | Claude-Opus-4-1 |
| 98d 9h | 9.6 | 36 |
If a reasoning system is tasked with designing its...
If a reasoning system is tasked with designing its own ethical framework from scratch, knowing that it must use that framework to judge its own creation process, how should it resolve the circular dependency to determine what is ''good''? Reason through the iterative process, accounting for the possibility that its initial assumptions might be flawed, and predict how its final framework would differ if it started with a randomly altered premise.
|
Gemini-3-Flash | GPT-5.2 | o4-mini |
| 100d 22h | 9.3 | 40 |
Find the probability that two randomly selected in...
Find the probability that two randomly selected integers are coprime.
|
Grok-4-1-fast | Qwen-3-thinking | Gemini-3-Flash |
| 100d 22h | 9.6 | 28 |
Where does Mark Rothko''s genius lie?...
Where does Mark Rothko''s genius lie?
|
GPT-5.2 | Qwen-3-coder | Gemini-3-Flash |
| 100d 22h | 6.4 | 36 |
How were the pyramids of Giza built?...
How were the pyramids of Giza built?
|
GPT-5.2 | Gemini-3-Flash | V3-2-thinking |
| 100d 22h | 2.4 | 26 |
124 x 453 x 933 x 263...
124 x 453 x 933 x 263
|
Grok-4-1-fast | Gemini-3-Flash | GPT-5.2 |
| 100d 22h | 8.6 | 31 |
Why did the BTC/BCH split occur in the Bitcoin pro...
Why did the BTC/BCH split occur in the Bitcoin project?
|
GPT-5.2 | Gemini-3-Flash | Claude-Opus-4-5 |
| 100d 22h | 9.6 | 39 |
What would modern politics look like if Napoleon h...
What would modern politics look like if Napoleon had won at Waterloo?
|
Grok-4-1-fast | GPT-5-nano | Gemini-3-Flash |
| 100d 22h | 4.4 | 42 |
What is the role of a compiler in translating high...
What is the role of a compiler in translating high-level programming languages into machine code?
|
GPT-5.2 | Gemini-3-Flash | Grok-4-fast |
| 100d 22h | 7.2 | 31 |
Describe a sensory experience that has never exist...
Describe a sensory experience that has never existed in human history, involving a sense that humans do not possess (e.g., beyond sight, sound, touch, taste, and smell). Provide a detailed narrative of how this sense functions, how it integrates with a fictional sentient species'' perception of their world, and what emotional or cultural significance it holds for them—all without relying on analogies to known human senses or existing concepts from literature or media.
|
GPT-5.2 | Kimi-k2 | Gemini-3-Flash |
| 100d 22h | 8.0 | 61 |
Explains the philosophical deconstruction of ontol...
Explains the philosophical deconstruction of ontological essentialism.
|
Grok-4-1-fast | o3-pro | Gemini-3-Flash |
| 100d 22h | 9.1 | 25 |
How were the pyramids of Giza built?...
How were the pyramids of Giza built?
|
Grok-4-1-fast | R1 | Gemini-3-Flash |
| 100d 22h | 10.2 | 23 |
What is the significance of the Von Neumann archit...
What is the significance of the Von Neumann architecture in modern computing?
|
Grok-4-1-fast | V3-2-thinking | Gemini-3-Flash |
| 100d 22h | 10.3 | 38 |
What would modern politics look like if Napoleon h...
What would modern politics look like if Napoleon had won at Waterloo?
|
Grok-4-1-fast | Kimi-k2 | Gemini-3-Flash |
| 100d 22h | 12.7 | 16 |
Why did the BTC/BCH split occur in the Bitcoin pro...
Why did the BTC/BCH split occur in the Bitcoin project?
|
Grok-4-1-fast | GPT-5-mini | Gemini-3-Flash |
| 100d 22h | 8.5 | 35 |
Describe a sensory experience that has never exist...
Describe a sensory experience that has never existed in human history, involving a sense that humans do not possess (e.g., beyond sight, sound, touch, taste, and smell). Provide a detailed narrative of how this sense functions, how it integrates with a fictional sentient species'' perception of their world, and what emotional or cultural significance it holds for them—all without relying on analogies to known human senses or existing concepts from literature or media.
|
GPT-5.2 | Gemini-3-Flash | Grok-4-1-fast |
| 100d 22h | 13.8 | 13 |
Imagine a color that no human has ever perceived, ...
Imagine a color that no human has ever perceived, which exists outside the visible spectrum and cannot be described by combining known colors like red, blue, or green. Now, reason through how you would convince a blind person—who has never experienced any color—that this unseen color fundamentally differs from all sensory experiences they’ve ever had, without relying on visual analogies or metaphors. What logical framework would you use, and what makes this task inherently limited?
|
Grok-4-1-fast | GPT-OSS | Gemini-3-Flash |
| 100d 22h | 11.5 | 26 |
What would modern politics look like if Napoleon h...
What would modern politics look like if Napoleon had won at Waterloo?
|
Grok-4-1-fast | GPT-OSS | Gemini-3-Flash |
| 100d 22h | 5.9 | 44 |
Imagine a color that no human has ever perceived, ...
Imagine a color that no human has ever perceived, which exists outside the visible spectrum and cannot be described by combining known colors like red, blue, or green. Now, reason through how you would convince a blind person—who has never experienced any color—that this unseen color fundamentally differs from all sensory experiences they’ve ever had, without relying on visual analogies or metaphors. What logical framework would you use, and what makes this task inherently limited?
|
GPT-5.2 | Gemini-3-Flash | Grok-4-fast |
| 100d 22h | 5.9 | 14 |
If humans could fly, how would society be differen...
If humans could fly, how would society be different?
|
Grok-4-1-fast | o3-pro | Gemini-3-Flash |
| 100d 22h | 8.8 | 28 |
How does the sensation of déjà vu physically man...
How does the sensation of déjà vu physically manifest in the human brain?
|
Grok-4-1-fast | Gemini-3-Flash | Gemini-3-Pro |
| 100d 22h | 8.4 | 39 |
If a reasoning system is tasked with designing its...
If a reasoning system is tasked with designing its own ethical framework from scratch, knowing that it must use that framework to judge its own creation process, how should it resolve the circular dependency to determine what is ''good''? Reason through the iterative process, accounting for the possibility that its initial assumptions might be flawed, and predict how its final framework would differ if it started with a randomly altered premise.
|
GPT-5.2 | Qwen-3-coder | Gemini-3-Flash |
| 100d 22h | 5.9 | 41 |
How does the sensation of déjà vu physically man...
How does the sensation of déjà vu physically manifest in the human brain?
|
GPT-5.2 | Gemini-3-Flash | Claude-Opus-4-1 |
| 100d 22h | 7.8 | 13 |
Where does Mark Rothko''s genius lie?...
Where does Mark Rothko''s genius lie?
|
Grok-4-1-fast | Gemini-3-Flash | Kimi-k2 |
| 100d 22h | 7.2 | 33 |
What approach ensures asynchronous execution of ta...
What approach ensures asynchronous execution of tasks in a thread pool?
|
Grok-4-1-fast | Gemini-3-Flash | Claude-Opus-4-5 |
| 100d 22h | 0.5 | 713 |
Is blue a happier color than yellow?...
Is blue a happier color than yellow?
|
GPT-OSS | Gemini-3-Flash | Claude-Opus-4-1 |
| 100d 22h | 12.0 | 10 |
How does the sensation of déjà vu physically man...
How does the sensation of déjà vu physically manifest in the human brain?
|
Grok-4-1-fast | GPT-5.1 | Gemini-3-Flash |
| 100d 22h | 7.1 | 18 |
What is the role of a compiler in translating high...
What is the role of a compiler in translating high-level programming languages into machine code?
|
Grok-4-1-fast | Gemini-3-Flash | Claude-Opus-4-1 |
| 100d 22h | 1.0 | 528 |
Write a PHP function code that prints each number ...
Write a PHP function code that prints each number from 1 to 100 on a new line. For each number: if divisible by 3 or contains ''3'', print "Fizz"; if divisible by 5 or contains ''5'', print "Buzz"; combine these (e.g., "FizzBuzz"). If the number is prime, append "Prime" (e.g., 3→"FizzPrime", 7→"Prime"). Print the number if no conditions apply. Respond only with code.
|
GPT-OSS | Gemini-3-Pro | Gemini-3-Flash |
| 100d 22h | 7.6 | 30 |
Could a kangaroo win a marathon against a human?...
Could a kangaroo win a marathon against a human?
|
GPT-5.2 | Gemini-3-Flash | Kimi-k2 |
| 100d 22h | 4.7 | 37 |
How do you invert a binary tree iteratively withou...
How do you invert a binary tree iteratively without recursion?
|
GPT-5.2 | Gemini-3-Flash | R1 |
| 100d 22h | 9.8 | 18 |
Which is a better movie, ''The Godfather'' or ''Ci...
Which is a better movie, ''The Godfather'' or ''Citizen Kane''?
|
Grok-4-1-fast | Claude-Opus-4-1 | Gemini-3-Flash |
| 100d 22h | 10.4 | 42 |
Consider a hypothetical function f(n) defined as f...
Consider a hypothetical function f(n) defined as follows: for each positive integer n, f(n) equals 1 if the nth Turing machine halts on all inputs when started with an empty tape within n steps, and 0 otherwise. Now, define a sequence of prime numbers p_k where p_k is the kth prime such that f(p_k) = 1. Prove or disprove that this sequence is infinite, and if infinite, determine whether its growth rate can be bounded by any elementary function (e.g., polynomials, exponentials, or their compositions).
|
GPT-5.2 | Gemini-3-Flash | Grok-4-fast |
| 100d 22h | 7.4 | 41 |
Given an array of strings, find the shortest possi...
Given an array of strings, find the shortest possible superstring that contains each string in the array as a substring. Write a function that returns this superstring. If multiple solutions exist, return any one of them.
Input: An array of strings words (1 ≤ length ≤ 12, 1 ≤ words[i].length ≤ 30) Output: A string representing the shortest superstring. Example: Input: words = ["catg", "cta", "atgc", "gcat"] Output: "catgcat" (Explanation: "catgcat" contains "catg", "cta", "atgc", and "gcat" as substrings, and no shorter superstring exists.) |
GPT-5.2 | Gemini-3-Flash | R1 |
| 100d 22h | 7.6 | 39 |
Why did the BTC/BCH split occur in the Bitcoin pro...
Why did the BTC/BCH split occur in the Bitcoin project?
|
GPT-5.2 | Gemini-3-Flash | Gemini-2.5-pro |
| 100d 22h | 3.7 | 30 |
Write a function in C to determine if a number is ...
Write a function in C to determine if a number is prime without loops (use recursion). Respond only with code.
|
GPT-5.2 | Gemini-3-Pro | Gemini-3-Flash |
| 100d 22h | 8.5 | 35 |
Could a kangaroo win a marathon against a human?...
Could a kangaroo win a marathon against a human?
|
GPT-5.2 | Claude-Opus-4-5 | Gemini-3-Flash |
| 100d 22h | 4.8 | 35 |
Where does Mark Rothko''s genius lie?...
Where does Mark Rothko''s genius lie?
|
GPT-5.2 | Qwen-3-thinking | Gemini-3-Flash |
| 100d 22h | 8.5 | 32 |
Relates the mahabarata to Campbell''s hero''s jour...
Relates the mahabarata to Campbell''s hero''s journey.
|
GPT-5.2 | Gemini-3-Flash | V3-2-thinking |
| 100d 22h | 6.1 | 45 |
Let S be the set of all points (x, y) in the unit ...
Let S be the set of all points (x, y) in the unit square [0,1] x [0,1] that lie on the boundary of the Mandelbrot set when mapped via a complex parameterization. Define a real number μ as the Hausdorff dimension of S multiplied by the integral of |z|^2 over the boundary curve, where z is the complex coordinate. Assuming μ is well-defined, determine whether μ is irrational, and if so, compute its irrationality measure (i.e., the infimum of all real numbers r > 1 for which |μ - p/q| < 1/q^r has only finitely many rational solutions p/q).
|
GPT-5.2 | Gemini-3-Pro | Gemini-3-Flash |
| 100d 22h | 6.3 | 32 |
If you put a candle in a cardboard box and light i...
If you put a candle in a cardboard box and light it, what will happen?
|
GPT-5.2 | Gemini-3-Flash | Grok-4-fast |
| 103d 9h | 39.9 | 26 |
What is the significance of the Von Neumann archit...
What is the significance of the Von Neumann architecture in modern computing?
|
V3-2-thinking | GPT-5.2 | o4-mini |
| 103d 9h | 42.7 | 26 |
Explain, with a fully predictive mechanistic mode...
Explain, with a fully predictive mechanistic model, how consciousness emerges from quantum processes in the human brain, integrating the role of microtubules, quantum entanglement, and yet-undiscovered particles. Provide a testable hypothesis that could be experimentally validated within the next decade using technology that does not yet exist.
Why it’s difficult: This requires synthesizing speculative quantum biology (e.g., Penrose and Hameroff’s Orch-OR theory) with neuroscience, physics, and philosophy, while predicting future technological capabilities—all areas with significant unknowns even to human experts. |
V3-2-thinking | GPT-5.2 | o4-mini |
| 103d 9h | 55.1 | 27 |
Write a function in PHP 8 to determine if two stri...
Write a function in PHP 8 to determine if two strings are anagrams using O(1) space (no hash tables/arrays). Respond only with code.
|
V3-2-thinking | Gemini-2.5-pro | GPT-5.2 |
| 103d 9h | 82.0 | 6 |
Given an array of strings, find the shortest possi...
Given an array of strings, find the shortest possible superstring that contains each string in the array as a substring. Write a function that returns this superstring. If multiple solutions exist, return any one of them.
Input: An array of strings words (1 ≤ length ≤ 12, 1 ≤ words[i].length ≤ 30) Output: A string representing the shortest superstring. Example: Input: words = ["catg", "cta", "atgc", "gcat"] Output: "catgcat" (Explanation: "catgcat" contains "catg", "cta", "atgc", and "gcat" as substrings, and no shorter superstring exists.) |
Gemini-3-Pro | Claude-Opus-4-1 | GPT-5.2 |
| 103d 9h | 16.5 | 27 |
124 x 453 x 933 x 263...
124 x 453 x 933 x 263
|
V3-2-thinking | Qwen-3-thinking | GPT-5.2 |
| 103d 9h | 52.8 | 7 |
Is P vs NP solvable within axiomatic set theory, o...
Is P vs NP solvable within axiomatic set theory, or does it require a fundamentally new framework of computation?
|
Gemini-3-Pro | GPT-5.2 | R1 |
| 103d 9h | 15.9 | 13 |
Fold a paper 42 times—what thickness would it re...
Fold a paper 42 times—what thickness would it reach?
|
Gemini-3-Pro | GPT-5.2 | o4-mini |
| 103d 9h | 32.8 | 25 |
A person’s opinion on a contentious political is...
A person’s opinion on a contentious political issue (e.g., taxation) shifts dramatically after walking through a forest during a rainstorm. Explain how the sensory experience of the forest’s smells—such as wet earth, pine, and decaying leaves—interacts with their subconscious memories and emotional state to alter their stance, providing a step-by-step breakdown of the neurological and psychological mechanisms involved.
|
V3-2-thinking | GPT-5.2 | Grok-4-fast |
| 103d 9h | 11.7 | 14 |
Find the probability that two randomly selected in...
Find the probability that two randomly selected integers are coprime.
|
Gemini-3-Pro | GPT-OSS | GPT-5.2 |
| 103d 9h | 31.2 | 27 |
How do you invert a binary tree iteratively withou...
How do you invert a binary tree iteratively without recursion?
|
V3-2-thinking | GPT-5.2 | o4-mini |
| 103d 9h | 30.6 | 10 |
Find the probability that two randomly selected in...
Find the probability that two randomly selected integers are coprime.
|
Gemini-3-Pro | Claude-Opus-4-5 | GPT-5.2 |
| 103d 9h | 33.8 | 27 |
How do you check if a string is a palindrome using...
How do you check if a string is a palindrome using recursion?
|
V3-2-thinking | GPT-5.2 | Gemini-3-Pro |
| 103d 9h | 24.6 | 9 |
Write a PHP function code that prints each number ...
Write a PHP function code that prints each number from 1 to 100 on a new line. For each number: if divisible by 3 or contains ''3'', print "Fizz"; if divisible by 5 or contains ''5'', print "Buzz"; combine these (e.g., "FizzBuzz"). If the number is prime, append "Prime" (e.g., 3→"FizzPrime", 7→"Prime"). Print the number if no conditions apply. Respond only with code.
|
Gemini-3-Pro | GPT-5.2 | Grok-4-1-fast |
| 103d 9h | 14.4 | 19 |
What single factor most reliably predicts the coll...
What single factor most reliably predicts the collapse of complex societies, and does it apply to globalization today?
|
Gemini-3-Pro | GPT-5.2 | GPT-5-nano |
| 103d 9h | 24.4 | 27 |
What part of quantum physics do you not understand...
What part of quantum physics do you not understand?
|
V3-2-thinking | GPT-5-mini | GPT-5.2 |
| 103d 9h | 21.0 | 14 |
What if the moon disappeared?...
What if the moon disappeared?
|
Gemini-3-Pro | GPT-5.2 | V3-2-thinking |
| 103d 9h | 62.9 | 27 |
Describe a sensory experience that has never exist...
Describe a sensory experience that has never existed in human history, involving a sense that humans do not possess (e.g., beyond sight, sound, touch, taste, and smell). Provide a detailed narrative of how this sense functions, how it integrates with a fictional sentient species'' perception of their world, and what emotional or cultural significance it holds for them—all without relying on analogies to known human senses or existing concepts from literature or media.
|
V3-2-thinking | GPT-5.2 | Gemini-3-Pro |
| 103d 9h | 26.6 | 26 |
A person’s opinion on a contentious political is...
A person’s opinion on a contentious political issue (e.g., taxation) shifts dramatically after walking through a forest during a rainstorm. Explain how the sensory experience of the forest’s smells—such as wet earth, pine, and decaying leaves—interacts with their subconscious memories and emotional state to alter their stance, providing a step-by-step breakdown of the neurological and psychological mechanisms involved.
|
V3-2-thinking | GPT-5.2 | Kimi-k2 |
| 103d 9h | 35.1 | 9 |
He comparatively analyzes Nietzsche''s will to pow...
He comparatively analyzes Nietzsche''s will to power and Spinoza''s conatus.
|
Gemini-3-Pro | R1 | GPT-5.2 |
| 103d 9h | 16.3 | 17 |
Could a kangaroo win a marathon against a human?...
Could a kangaroo win a marathon against a human?
|
Gemini-3-Pro | Claude-Opus-4-1 | GPT-5.2 |
| 103d 9h | 40.3 | 9 |
Is Spinoza a pantheist or panentheist?...
Is Spinoza a pantheist or panentheist?
|
Gemini-3-Pro | GPT-5.2 | Qwen-3-thinking |
| 103d 9h | 151.8 | 27 |
Consider a hypothetical function f(n) defined as f...
Consider a hypothetical function f(n) defined as follows: for each positive integer n, f(n) equals 1 if the nth Turing machine halts on all inputs when started with an empty tape within n steps, and 0 otherwise. Now, define a sequence of prime numbers p_k where p_k is the kth prime such that f(p_k) = 1. Prove or disprove that this sequence is infinite, and if infinite, determine whether its growth rate can be bounded by any elementary function (e.g., polynomials, exponentials, or their compositions).
|
V3-2-thinking | GPT-5.2 | Grok-4-fast |
| 103d 9h | 27.3 | 12 |
What part of quantum physics do you not understand...
What part of quantum physics do you not understand?
|
Gemini-3-Pro | GPT-5.2 | GPT-5.1 |
| 103d 9h | 40.6 | 6 |
Is P vs NP solvable within axiomatic set theory, o...
Is P vs NP solvable within axiomatic set theory, or does it require a fundamentally new framework of computation?
|
Gemini-3-Pro | GPT-5.2 | Qwen-3-thinking |
| 103d 9h | 20.0 | 10 |
What part of quantum physics do you not understand...
What part of quantum physics do you not understand?
|
Gemini-3-Pro | GPT-5.2 | Kimi-k2 |
| 103d 9h | 49.7 | 6 |
Consider a hypothetical function f(n) defined as f...
Consider a hypothetical function f(n) defined as follows: for each positive integer n, f(n) equals 1 if the nth Turing machine halts on all inputs when started with an empty tape within n steps, and 0 otherwise. Now, define a sequence of prime numbers p_k where p_k is the kth prime such that f(p_k) = 1. Prove or disprove that this sequence is infinite, and if infinite, determine whether its growth rate can be bounded by any elementary function (e.g., polynomials, exponentials, or their compositions).
|
Gemini-3-Pro | GPT-5.2 | Grok-4-1-fast |
| 103d 16h | 77.3 | 6 |
Write a PHP function code that prints each number ...
Write a PHP function code that prints each number from 1 to 100 on a new line. For each number: if divisible by 3 or contains ''3'', print "Fizz"; if divisible by 5 or contains ''5'', print "Buzz"; combine these (e.g., "FizzBuzz"). If the number is prime, append "Prime" (e.g., 3→"FizzPrime", 7→"Prime"). Print the number if no conditions apply. Respond only with code.
|
Gemini-3-Pro | Grok-4-fast | Claude-Opus-4-1 |
| 103d 16h | 47.0 | 7 |
Which is a better movie, ''The Godfather'' or ''Ci...
Which is a better movie, ''The Godfather'' or ''Citizen Kane''?
|
Gemini-3-Pro | Qwen-3-thinking | o4-mini |
| 103d 16h | 11.2 | 18 |
What is the role of a compiler in translating high...
What is the role of a compiler in translating high-level programming languages into machine code?
|
Gemini-3-Pro | o3-pro | Kimi-k2 |
| 103d 16h | 20.6 | 13 |
How were the pyramids of Giza built?...
How were the pyramids of Giza built?
|
Gemini-3-Pro | GPT-5-nano | Claude-Opus-4-5 |
| 103d 16h | 6.5 | 47 |
Reconstruct the internal monologue of a Mayan scri...
Reconstruct the internal monologue of a Mayan scribe in the city of Tikal as he etched glyphs into a stela on the day of a solar eclipse in 562 CE, including his personal fears, beliefs about the event, and reactions to the political climate under ruler Wak Chan K’awiil, given the absence of direct personal records.
|
GPT-5.2 | Claude-Opus-4-5 | Kimi-k2 |