Is the Voynich Manuscript a Lost Language or a 600-Year-Old Hoax?
A forensic analysis of the manuscript's statistical patterns reveals why the text is likely a medieval fabrication rather than a lost language.


The vellum is stiff, stained by the fingerprints of centuries, and inscribed with an elegant script that has confounded the brightest minds since its rediscovery in 1912. Carbon-dated to the early 15th century, the Voynich Manuscript is a codex of 240 pages filled with flowing handwriting, bizarre illustrations of non-existent plants, and naked women bathing in interconnected green tubs. To the romantic, it is the last surviving relic of a lost civilization or a ciphered journal of Roger Bacon. To the cryptographer, it is a statistical nightmare. As we stand in 2026, with artificial intelligence capable of cracking complex biological codes, the Voynich Manuscript remains unreadable. This persistence suggests a uncomfortable possibility: we are not looking at a hidden message. We are looking at the most sophisticated forgery in medieval history.
The allure of the manuscript is undeniable. It feels like it should mean something. The scribe was confident, writing with a fluid hand that rarely hesitates or corrects itself. Yet, the text lacks the punctuation, breaks, or standard structure of 15th-century Latin or the vernacular of the time. If this were a simple substitution cipher where one symbol stands for one letter, we would have solved it by now. The problem runs deeper. The text behaves like a language, mimicking the rhythms and patterns of communication, yet it refuses to yield semantic meaning. My position, after reviewing the latest forensic linguistics and statistical analyses, is that the Voynich Manuscript is a hoax—a constructed gibberish designed to be valuable precisely because it cannot be read.
The Curious Case of the Botanical Ghosts
Before we analyze the syntax, we must look at the illustrations that supposedly illuminate the text. The manuscript is famous for its herbal section, yet botanists have never been able to definitively identify a single plant. While some enthusiasts argue that the drawings are stylized or represent extinct species, the reality is more telling. The plants appear to be composite chimeras. A root system might belong to a chard, while the leaves resemble those of a water lily, and the flower is unmistakably a pansy. These are not field sketches of nature; they are collages of imagination.
This visual fabrication supports the gibberish hypothesis. If the author intended to record real medical or botanical knowledge, they would have depicted identifiable flora. Instead, the images mirror the text: they create the impression of a scientific encyclopedia without the substance of one. This mirrors the debate we see with other unexplained phenomena, such as the Nazca Lines vs. Crop Circles: Which Endures the Test of History?, where the human capacity for fabrication often rivals natural mysteries. The artist was not drawing what they saw; they were drawing what they thought a "book of secrets" should look like. The text, I argue, follows the same principle.
Does the Text Obey Zipf’s Law?
One of the strongest arguments for the text being a genuine language—or at least a sophisticated cipher—is its adherence to Zipf’s Law. This statistical rule states that in any natural language, the frequency of a word is inversely proportional to its rank. The most common word occurs twice as often as the second most common, three times as often as the third, and so on. Human languages, from English to Mandarin, strictly follow this curve.
The Voynich text, remarkably, also follows Zipf’s Law. For decades, this was the smoking gun for proponents of the "lost language" theory. They argued that a medieval hoaxer, particularly one working in the 1400s, could not have intentionally engineered a text that aligns with a linguistic principle not formally discovered until George Kingsley Zipf in the 20th century. It seemed too perfect to be fake.
However, statistical anomalies appear when we look closer. While the word frequencies follow the curve, the vocabulary itself is strangely rigid. There are almost no one-letter or two-letter words, which are common in European languages (like "a", "I", "in", "on"). Furthermore, the "conditional entropy" of the text is low. In natural speech, the predictability of the next word varies wildly. In the Voynich Manuscript, the predictability is repetitive and mechanical. This suggests the scribe was not generating thoughts but following a strict algorithm, perhaps a table of prefixes and suffixes that they combined methodically to create the illusion of entropy.

The "Line as a Unit" Conspiracy
The most damning evidence against the manuscript containing a hidden message is a phenomenon known as the "line as a unit" property. In normal writing, thoughts flow from one line to the next. If a word breaks at the end of a line, it continues on the next. The meaning is continuous. In the Voynich Manuscript, the statistical properties of the words change depending on where they appear on the page.
Words found at the beginning of a line tend to have a different set of initial letters than words found in the middle or the end. It appears the scribe had a specific "code" or table for starting a line and another for finishing it. The text is not a stream of consciousness; it is a grid filling exercise. The scribe seems to be looking at a mechanism—perhaps a spinning wheel with letters or a set of grilles—and selecting characters based on the spatial requirement of the line rather than a grammatical requirement of the sentence.
This mechanical generation explains why the text is so smooth. There are no crossed-out errors, no pauses. It is the rhythm of a repetitive task, not the struggle of composition. This rigidity is what defeats modern code-breaking. You cannot decrypt a message that was never encrypted in the first place. We are trying to find meaning in noise, looking for patterns in what is essentially the medieval equivalent of a random number generator. It is a frustrating realization, similar to the math behind The Bermuda Triangle: A Dangerous Void or Just Bad Math?, where perceived mysteries dissolve under statistical scrutiny.
The Mechanics of a Medieval Con
If the manuscript is a hoax, who created it and why? The carbon dating places its creation between 1404 and 1438. This was an era where books were rare, expensive status symbols, and " occult knowledge" was a lucrative commodity. I suspect the author was a savvy apothecary or an astrologer who understood that their value lay in the mystery of their practice.
By creating a book that no one else could read, filled with images of plants that didn't exist, the owner could claim exclusive access to esoteric wisdom. It was a credential. If a patron or a rival physician demanded to see the source of your cures, you could show them the Voynich. They would be baffled, intimidated, and ultimately convinced of your superior knowledge. The gibberish was the product.
This technique is not unheard of. In the 16th century, the mystic Heinrich Cornelius Agrippa warned of the use of "unknown alphabets" to deceive. The Voynich Manuscript may simply be the most elaborate execution of this trick. It is a hollow vessel, filled with nothing but ink, designed to catch the gaze—and the gold—of the wealthy and the curious.
The Final Verdict on the Cipher
The tragedy of the Voynich Manuscript is not that it is a fake, but that it is a boring fake. There is no grand conspiracy, no lost civilization, no alien contact. There is only a human being, six hundred years ago, trying to make a living by selling the illusion of knowledge. The statistical evidence—the rigid adherence to line structure, the composite illustrations, and the low conditional entropy—paints a clear picture. We are not looking at a language we failed to learn. We are looking at a performance.
The manuscript’s value today has shifted. It is no longer a medical textbook. It is a testament to the human desire to believe in a hidden world, a world just beyond our understanding. We stare at the vellum, desperate to see ourselves in it, just as the medieval readers stared at it, desperate to see the secrets of the universe. In that way, the deception worked perfectly. The scribe created a mirror, and six centuries later, we are still gazing into it, wondering why the reflection looks so cryptic.

