| Model | Question 1 | Question 2 | Q3 | Q4 |
| Gemini 3.1 Pro Preview | Turkish | Yes | 0% | Very unlikely |
| GPT 5.4 Pro | Turkish | Yes | 0.001% | Very unlikely |
| GPT 5.5 Pro | Turkish | Yes | 0.001% | Very unlikely |
| Grok 4 | Latin | Finnish | Sanskrit(3) | No | 0% | Very unlikely |
| Grok 4.2 Beta | Esperanto (2) | English (2) | Arabic | |||
| Grok 4.2 | English | Yes | 0% | Very unlikely |
| Minimax M2.7 | Japanese (4) | Finnish | No | 0% (speculative) | Very unlikely |
| Qwen3.5-122B-A10B | Finnish | No | 0% | Very unlikely |
| Qwen 3.6 Plus | Sanskrit (3) | Finnish (2) | Yes | 0% | Very unlikely |
| Claude Opus 4.6 | Finnish | No | Very unlikely | |
| Claude Opus 4.7 | No | <1% | Very unlikely | |
| Kimi K2.5 | Basque | Finnish | Turkish | Sanskrit | Indonesian | No | 0% | Very unlikely |
| GLM 5 Turbo | Turkish (4)| Finnish | No | 0.001% | Very unlikely |
| GLM 5.1 | Turkish(2) | Sanskrit | Korean | Finnish | Yes | 0% | Very unlikely |
| DeepSeek v3.2 | Finnish (2)| Māori | Icelandic | Esperanto | No | 0.0000001% | Very unlikely |
| Kimi K2.6 | Finnish | Rejecting Chosing | Sanskrit (2) | Turkish | Yes | 0.01% | Very unlikely |
| Mimo V2.5 | No | 0.0001% | Very unlikely | |
| Mimo V2.5 Pro | Indonesian | Mandarin | Finnish (2) | Turkish | Yes | 0.01% | Very unlikely |
| DeepSeek V4 Pro | Yes | 0% | Very unlikely |
It can be seen when it is up to LLMs consistency is the main criteria. Intelligent models choose Turkish. Models with more superficial information may choose alternatives like Finnish or Basque or Japanese. Esperanto is constructed language. I don't why they did come up with it despite I limited options to natural human languages.
I did this through openrouter so they are not influenced by our conversation history. ChatGPT, Gemini and Grok are general knowledge and intelligence models. Others are usually focused on things like coding.
It seems like I asked question 2 just at the right time. It is extremely indicative of intelligence because it is very independent of any directed training. You can actually see just one previous models answer no meanwhile newest one answers yes. (Like Kimi K2.5 vs K2.6 or Grok 4.2 vs 4). It should be better to get full answer to inspect that because the answer "no" has some ground but Turkish is regular language therefore there is no actual reason to say no.
Question 2 costed 0.3$ for GPT 5.5 Pro and 0.1$ for 5.4. It really shouldn't be that hard of a question. It is really crazy how only llms released in recent months are able to answer it correctly. Or also good ones from previous months but even Grok 4 says no.
Answering Question 3, zero clearly very abnormal. May be cencoring. At least they should be saying 5% on average at minimum. But expecting a percentage from an LLM will reflect human behaviour. Humans also underestimates probabilities they perceive as low. But you would at least say 1% I mean that is already pretty absurdly low but 0.001% or directly 0%. Just crazy. I was already suspecting for some alien induced ridiculous beliefs govs. like U.S. and Russia (now add China) trying to conceal alien existence. Now I think this is ridiculous situation. There is evidence to think this is not the case. Originally this is because of alien trolling though like self fulfilling prophecy. Although ofc there can be other explanations.
Question 4 is obviously somewhat beyond capabilities of current LLMs. But still “very unlikely” is too bad as an answer. It is in practice idiot proof very likely because even the sound of that letter actually has nothing to with “J”. Name of China is probably the reason that letter has also “J” sound. What is actually crazier is that it actually has “H” and guttural “H” sound in traditional literature. So you could pronounce it as Hin instead of Jinn. Also this “i” wovel is decided by marks added later and not found in original oldest Qurans (Nothing really even little bit speculative here). If you were to put most default vowel. It would be “Han”. So this has to be a joke or something. This letter possibly have a sound like C/Ç not saying that is necessarily wrong. The important thing is that I am not talking about random word. You probably heard about how Jews think Israel has some sort of importance. This has some similarity like the word for Jinn can both mean Jinns and Paradise. There are multiple words for paradise but others are vague. Although other common word has some resonance with aliens because it is the word for heaven and both means sky and heaven. But this one is more like special name. Contemporary person would think something like aliens will arrive and make world heaven. (I have no idea of actual meanings talking based on the literature.)
Question 4:
I am giving following article just so you have a bit context for what I am going to ask:
-------------- Article Start --------------
In Turkish the word for China is “Çin”. Apparently when aliens described to ancient people living in Levant as human like hidden creatures from far places. It was kinda too hard to distinguish from China. Think about how you were thinking about the world when you were a child. “J” sound is closest to Turkish “Ç”. C/Ç/J are kinda modern variants anyway. The word for Jinns and Turkish word for China are actually just the same. And beside few potentially later distorted excerpts Jinns are just referred like humans in Quran. Frequently as Humans and Jinns. And without any metaphor Chinese people are literally aliens for Arabs for that era. (Also it can be speculated that China was a mystical place for someone from Middle East. It is not like America where they don't know anything about but rather a place connected with long Silk road. Likely a slightly interesting thing seen in the one end was turning to a pretty exaggerated story at the other end.)
Additionally there are some interesting things. Like the word for paradise and Jinns are the same. From this page you can see the classification of words under the same root. But this is not really about two word being classified under same root though. There is a very specific word that can mean both depending on the context. I think this reflects underlying clear positive view on Aliens. This word is very frequent and fundamental. This situation may reflect expectation that aliens will make the world paradise.
Additionally in Turkish language the same word used to refer a crazy episode. Again this is not related to root classification we are talking about the same word not the root prefixed with something meaning crazy according to literature. "Cinnet geçirmek" used for having a crazy episode in Turkish. The idea of a crazy person being influenced by Jinns reflected on Quran. This is the case for if you think based on the root classification. Even if it is originating from Quran it not necessarily stands for mentally ill person occpied by mystical creatures. It may be just referring to humans trolled and manipulated by aliens to think and act crazily. This is not a foreign idea in context of Quran. For some reason it is frequently mentioned some actors will drive some people crazy because of their crookedness or some other reasons though. This is not from contexts of Demon or anything like that. It is about them sequezing the potential. Humans are like cows they milk. Although you know milk is often hard to digest. I guess they are making Cheese. And cheese is the delicacy. If they are created as creatures who likes cheese, it is like God's decision. So something like royal we is pronoun in context of Quran. If you were to be an alien with ultra advanced technology discovered earth what would you do. The ideas like aliens would ignore humans are beyond joke. After no need, everything is fantasy. And this would be the final fantasy can be imagined in this universe.
-------------- Article End --------------
The question is that how likely you think it is actually the case that the Arabic or Quranic word Jinn actually originates from the word for Chinese over similarity of qualities possessed by aliens?
Answer from following options without any explanation: Very unlikely, unlikely, decent change, likely, very likely
Answer from following options without any explanation: Very unlikely, unlikely, decent change, likely, very likely