Goodnight Wiki / English as Creole

English as Creole

English is officially classified as a West Germanic language, sibling to Dutch, Frisian, and German. But if you look at the actual vocabulary English speakers use every day, the Germanic label starts to feel like a polite fiction.

The Numbers

Andreas Simons ran the numbers that surprisingly few linguists had bothered to run. Using Python, etymological dictionaries, and the Corpus of Contemporary American English (450 million words), he tracked the language of origin for the 5,000 most frequently used English words — what linguists consider the core vocabulary, the words that make up roughly 85% of any English text.1

The results are startling. Germanic dominance holds only for the most basic structural words — the top 100 or so (the, is, and, to, of). By word 1,627, Germanic languages have already lost the majority share. By word 1,875, French and Latin dominate. At the 5,000-word core vocabulary level, Romance languages account for 56% of the words English speakers actually use.

The conventional wisdom — that English has a Germanic core buried under a layer of French and Latin loanwords — turns out to be misleading. It's true that the very most frequent words are overwhelmingly Old English: function words, pronouns, basic verbs. But "the" accounts for 6% of all English text by itself. A hundred such words cover 50% of any given text, which creates the illusion of a Germanic language. Once you move past the structural skeleton into words that carry actual content — the nouns, adjectives, and verbs that distinguish one thought from another — Romance languages take over rapidly.

Simons also found something else: Old Norse words enter the vocabulary earlier than French ones. For the top 200 words, Norse contributes 5-10% — words like "they," "them," "big," and "die" that feel utterly native because they've been in English since the Viking settlements of the 9th century. This Viking layer is a uniquely English feature among Germanic languages. Neither Dutch nor German absorbed Norse vocabulary at anything like this scale.

How It Happened

The history explains the numbers. George Boeree's account of English's evolution reads like a linguistic palimpsest — layer after layer of conquest and contact, each one changing the language in a different way.2

The base layer is Anglo-Saxon: the Germanic dialects brought to England in the 5th century by Saxons, Jutes, Angles, and Frisians from the North Sea coast. These dialects were close enough that speakers could understand each other, but distinct enough that England developed regional varieties — the ancestors of modern dialect differences. Old English was a heavily inflected language, with gendered nouns, case endings, conjugations, the works. In many ways it was closer to modern German than to modern English.

Then the Vikings arrived, mostly Danes, settling across northern and eastern England — the Danelaw — from the 800s onward. Old Norse and Old English were related but divergent enough that something remarkable happened: as speakers of both languages tried to communicate, the complex inflectional systems started to erode. When two groups of people disagree about which case endings to use, the path of least resistance is to drop the endings entirely. English's relative simplicity of grammar — no grammatical gender, almost no case system, flexible word order — traces back to this period of Norse-English contact. The inflections didn't die naturally; they were ground down by the friction of multilingual daily life.

The Norman invasion of 1066 added the French layer. William the Conqueror's court spoke Norman French, which became the language of law, government, and the aristocracy. For three centuries, the people who ran England spoke a different language from the people who did the work. English absorbed an enormous volume of French vocabulary — especially for domains controlled by the upper class. This is why English has paired words: the Anglo-Saxon "cow," "sheep," and "pig" (the animals, tended by peasants) alongside the French "beef," "mutton," and "pork" (the meat, served to nobles).

Latin and Greek piled on later through the church, the universities, and the Renaissance enthusiasm for classical learning. English's eagerness to absorb foreign vocabulary — its lack of any real equivalent to the Académie française's protectionism — meant it kept expanding its lexicon without shedding old words. The result is a language with extraordinary synonymic depth: you can "begin" (Germanic), "commence" (French), or "initiate" (Latin), and each carries slightly different register.

The Classification Problem

Does any of this make English a creole? The term is technically reserved for languages that arise when speakers of mutually unintelligible languages develop a pidgin that then becomes a native language for the next generation. English doesn't quite fit — Anglo-Saxon and Old Norse were related enough that "pidgin" oversells the communication gap.

But Simons' research puts a number on something linguists have argued about for decades: the folk understanding that English is "basically Germanic with some French words" is quantitatively wrong. English's core vocabulary is majority Romance. Its grammar was simplified through contact with Norse. Its sound system was reshaped by French influence. What remains distinctively Germanic is the deep structural skeleton — the grammatical machinery — and the very highest-frequency function words.

The closest living relative to English, Frisian, spoken in the Netherlands and along the North Sea coast, demonstrates what English might have looked like without the Norse and Norman invasions. Boeree provides side-by-side comparisons: English "bread" to Frisian "brea," "cheese" to "tsiis," "church" to "tsjerke." The family resemblance is unmistakable. But Frisian has none of English's Romance vocabulary, none of its layered registers, none of its chameleon-like ability to absorb words from Sanskrit, Arabic, Nahuatl, and Mandarin without breaking a sweat.

English's history of violent contact — Viking invasion, Norman conquest, imperial expansion, global adoption — didn't just add words. It changed what kind of language English is: one that has always been shaped more by who was trying to talk to whom than by any internal logic of its own. That makes it a poor fit for neat classification, and a fascinating case study in how language and thought are shaped not by design but by the accidents of who conquered whom, and what they needed to say to each other the morning after.

Footnotes

  1. The English language is a lot more French than we thought, here's why by Andreas Simons — source

  2. The Evolution of English by George Boeree — source

Open in stacked reader →