The Genetic Architecture of the Indian Subcontinent: A Deep Dive Into 50,000 Years of Human History

By Veteran Brigadier Azad Sameer

Until modern times, we theorised and formulated ancient history through archeology and historical linguistics. Archeologists dug up ancient sites and drew their inferences from the artifacts and other evidence found there. In historical linguistics, scholars treat related languages as archeological sites, digging through layers of vocabulary and grammar to uncover the past. This multidisciplinary effort has now been joined by population genetics and paleogenomics, which help solve the jigsaw puzzle with far more clarity.

Back in school, half a century ago, we learnt about the Aryan and Dravidian ancient history of India. How has science changed this old narrative, if at all? The Aryan-Dravidian narrative has shifted from a story of conquest and pure races to a much more complex story of deep-time mixing. The modern Indian population is often described as a subcontinent-sized mystery. However, through the lens of paleogenomics, the study of ancient DNA, we can now reconstruct the history of India not just through ruins and texts, but through the very cells of its people. Modern population genetics has replaced the old labels with three primary ancestral building blocks, or pillars, that exist in almost every Indian today, regardless of whether they speak a Dravidian or Indo-Aryan language, where in the subcontinent they come from, or their religion or caste. So let us delve into each of these three building blocks, since almost every Indian has inherited genes from all three to a lesser or greater extent.

The Foundation: The Out of Africa Pioneer Group

The story begins with the First Indians. Around 50,000 to 65,000 years ago, a small band of Homo sapiens migrated out of Africa, likely crossing the Bab-el-Mandeb strait and the Arabian Peninsula and following the Southern Coastal Route into India. The migration was relatively small and perhaps took a thousand years or more. Genetic modeling by Narasimhan et al. (2019)[i] suggests a significant bottleneck: while thousands may have left Africa, only a few hundred to a few thousand successfully founded the lineage that would populate the whole of South Asia. These pioneers are referred to as Ancient Ancestral South Indians (AASI) and should be considered the indigenous people of India, as they are the first Homo sapiens known to have arrived in the subcontinent, more than 50,000 years ago. It is not conclusively known whether other modern humans lived in the subcontinent before the AASI, but if they did, they have left no genetic signature in today’s population. Currently, no pure AASI population exists on the mainland; they are a ghost population whose DNA is found mixed into nearly every person in India. The closest living relatives of this ancient lineage are the Andamanese hunter-gatherers (Onge and Jarawa).

It is also very likely that archaic humans (other than Homo sapiens) lived in the subcontinent before the arrival of this group out of Africa and possibly coexisted with them for many centuries. Indians today carry about 2% DNA from archaic humans (Neanderthal and Denisovan), a level broadly in line with other populations outside Africa.

The Neolithic Revolution and the Indus Valley Civilisation (IVC)

Between 10,000 and 7,000 years ago, a new genetic stream entered from the west: people related to the early farmers of the Zagros Mountains (Iran). These Iranian-related migrants did not replace the AASI; they merged with them. This Indus Periphery mixture created the genetic basis for the Indus Valley Civilisation (IVC). A pivotal study by Shinde et al. (2019)[ii] sequenced the DNA of a 4,500-year-old female skeleton from Rakhigarhi. The results were revolutionary: she had no detectable Steppe ancestry. This indicates that the great cities of Harappa and Mohenjo-Daro were built by a population that was a mix of AASI and Iranian-related ancestry, before the arrival of Central Asian groups. Many scholars think it likely that the Indus Valley people spoke a proto-Dravidian language, although their script is sadly still undeciphered.

The Steppe Migration and the ANI/ASI Split

Around 2000 BCE to 1500 BCE, as the IVC declined (likely due to climate change and drying rivers), a third major group arrived: Steppe pastoralists from the Eurasian grasslands (modern-day Russia and Kazakhstan), descended from groups related to the Yamnaya culture. These migrants brought Indo-European languages and early Vedic culture. Their arrival triggered a massive demographic shift:

The North: Steppe migrants mixed with the IVC people in the North, creating the Ancestral North Indians (ANI). This is the foundation on which the Vedic civilization was built. The early Vedas, composed in Sanskrit, contain quite a few Dravidian loan words. This possibly reflects the affinity of the Vedic people with, or their descent from, the IVC people, who some scholars believe spoke some kind of proto-Dravidian language.

The South: IVC people who moved south and mixed further with local AASI hunter-gatherers formed the Ancestral South Indians (ASI). They formed the base for the Dravidian culture and civilization.

The Melting Pot

These two were not watertight, separate and pure Aryan/Dravidian entities. For the next two millennia or so (from about 2000 BCE to the start of the Common Era), they mixed freely, with gene flow taking place throughout the subcontinent. It was clearly a no-caste-bar situation across the length and breadth of the subcontinent. Reich et al. (2009), summarised in Reich (2018)[iii], demonstrated that most modern Indians lie on a cline (a sliding scale) between these two groups. A person in Kashmir might have 50-60% ANI ancestry, while a person in Tamil Nadu might have 20-30% ANI ancestry, but almost everyone has both North and South Indian ancestry. Population genetics has thus shown that the whole of India has, by and large, a common genetic heritage. Pure Aryan or Dravidian heritage in India is something of a myth. There is a certain oneness in the common ancient heritage of every Indian.

The Endogamy Freeze (2,000 Years Ago)

Perhaps the most Indian aspect of this genetic story is not the mixing, but when the mixing stopped. Moorjani et al. (2013)[iv] found that around 1,900 years ago the genetic data show a sudden freezing of the gene pool. This roughly coincides with the period to which the Manusmriti is usually dated, when the social structures of the caste system became rigid. From this point on, people began marrying only within their specific sub-castes (Jatis). This created thousands of distinct genetic groups. India is therefore not one large population; it is a collection of thousands of small, endogamous populations living side by side and, interestingly, all sharing a common heritage. This rigid endogamy froze the genetic proportions in place, which is why, genetically and otherwise, we still see distinct regional and caste differences today despite the shared ancient roots. This freeze in gene flow, which persists to this day, has had several deleterious effects, socially, economically and biologically. That calls for a separate discussion altogether.

Religion, Gene Flow, and Regional Nuance

Modern socio-political identities often suggest deep divides, but the DNA tells a story of shared heritage. Study after study on religious groups, including Reich (2018), has shown that Indian Muslims and Christians are genetically indistinguishable from the Hindu castes of their specific region. For example, a Kerala Syrian Christian shares the same ancestral proportions as a Kerala Namboothiri or Nair. The closest genetic kin of a Kashmiri Muslim is the Kashmiri Pandit; their heritage is almost identical. Many of us may find it somewhat disconcerting that a Punjabi Hindu or Sikh is genetically closer to a Punjabi Pakistani Muslim than to a Hindu from, say, Tamil Nadu. Gene flow does not recognise borders, and in genetic terms the concept of Akhand Bharat is very much a valid one.

There is an ethnic group called the Brahui, who now live predominantly in the mountains of Balochistan, with much smaller populations in Iran, Afghanistan, Gujarat and Rajasthan. Genetically they are predominantly of ANI stock, but the Brahui language they speak today is essentially a Dravidian language, strongly suggesting an IVC lineage.

Gene Flow from the East

In Northeast India and among Munda-speaking tribes in Central India, there is a significant fourth stream of DNA besides the three discussed above. This comes from Austroasiatic (AAA) and Tibeto-Burman migrations from Southeast Asia and East Asia roughly 3,000 to 4,000 years ago.

Summary Table: The Layers of Indian Ancestry

Layer | Group | Arrival | Formation/Impact
1 | AASI (First Indians) | 50,000+ years ago | Foundational DNA of all Indians
2 | Iranian-related farmers | 10,000-7,000 years ago | Mixed with AASI to build the Indus Valley Civilisation
3 | Steppe pastoralists | 2000-1500 BCE | Introduced Indo-European languages; formed the ANI
4 | East Asian/Austroasiatic | 2000-1000 BCE | Fourth stream, significant in Northeast India and among Munda-speaking tribes of Central India

Out of India Theory (OIT)

Also known as Indigenous Aryanism, the Out of India Theory is the hypothesis that the Indo-European language family and its speakers originated within the Indian subcontinent and migrated outward to Europe and Central Asia, rather than entering India from outside. This hypothesis stands in direct opposition to the mainstream Aryan Migration Theory. In the academic world of genetics, archeology and linguistics, the OIT is largely considered discredited. However, it remains a popular and culturally significant idea. In one limited sense the theory has a point: Indian civilisation shows immense indigenous continuity from well before the arrival of the Steppe pastoralists.

Old School Narrative and Modern Genetic Reality

The old schoolbooks often portrayed the Aryans and Dravidians as two separate, non-overlapping groups. Genetics shows they are more like a spectrum. According to the old school narrative, the Dravidians were indigenous people pushed south by invaders. According to modern genetics, this group (often called Ancestral South Indians or ASI) formed when the Indus Valley people migrated south and mixed further with local hunter-gatherers (the AASI, the original out-of-Africa indigenous people). The old school narrative describes the Aryans as a foreign race that conquered the North. According to modern genetics, this group (called Ancestral North Indians or ANI) is a mixture of Steppe pastoralists and the existing Indus Valley population; it is genetically closer to West Eurasians but still contains significant indigenous Indian DNA.

Key Changes in Knowledge

The Invasion is now a Migration: The violent Aryan Invasion Theory has been largely debunked. Genetic evidence suggests a slow, multi-century migration of Steppe people who intermarried with local populations.

Everyone is mixed. There is no pure Aryan or Dravidian. Nearly every Indian—from a Kashmiri Pandit to a Tamil Brahmin to a tribal member in Kerala—carries a combination of these three ancestral lines. The difference is only in the proportions (e.g., higher Steppe ancestry in the North/Upper Castes; higher AASI ancestry in the South/Tribal groups).

So, in a nutshell, the current Indian population is not a monolith but a mosaic of varying shades, a picture largely accepted by the Centre for Cellular and Molecular Biology (CCMB), Hyderabad, a premier research institution that has carried out groundbreaking research in this field.


[i] Narasimhan, V. M., et al. (2019). The formation of human populations in South and Central Asia. Science. (Detailed the three-way mix of AASI, Iranian-farmers, and Steppe).

[ii] Shinde, V., et al. (2019). An Ancient Harappan Genome Lacks Ancestry from Steppe Pastoralists or Iranian Farmers. Cell. (The definitive study on the Indus Valley genome).

[iii] Reich, David (2018). Who We Are and How We Got Here. (A comprehensive overview of ancient DNA findings in India and globally).

[iv] Moorjani, P., et al. (2013). Genetic Evidence for Recent Population Mixture in India. American Journal of Human Genetics. (Identified the shift to endogamy 1,900 years ago).

Guardians of the Coastal Road

The air was thick with the scent of salt and damp earth as we departed Vakkom. At seventy, my mission was simple: escort my 91-year-old mother, my sister Jabeena, and my wife Arifa safely home after a heart-warming visit with Fathima Kutteema, our mother’s cousin, a couple of years her junior.

As dusk settled, a light drizzle began to blur the windshield. We were on a desolate stretch of road, far from the comfort of village lights, when the silence was shattered. A needlessly high speed bump caught us off guard.

Clang

The sickening sound of metal grinding against asphalt echoed through the cabin. I cursed my judgment. I should have slowed down. My military mind, honed by years of habit, went straight into assessing the situation: the silencer clamp had snapped. We were stranded in the rain on a Sunday evening, with a nonagenarian in the backseat and not even the remotest chance of a mechanic in sight.

The Shadows in the Rain

As the drizzle turned into a steady downpour, I stood by the roadside, praying for a miracle, or at least a passing taxi. The road remained stubbornly empty. The plan was simple: leave the car where it was and somehow find a taxi to get home.

Then, out of the darkness, three figures appeared. Three young men, strangers to us, emerged from the gloom. They didn’t just ask what was wrong; they took charge. Without a second thought for their clothes or the mud, two of them slid under the car into the cold slush. The third held a mobile flashlight steady against the rain while the others diagnosed the wound.

“The clamp is gone,” one said, wiping grit from his forehead. “But don’t worry. We will fix it.”

One youth vanished into the night, returning minutes later with a coil of metal wire, a makeshift lifeline. For twenty minutes, they worked in the mud. I watched, humbled, as these three strangers labored in the dark to ensure that a great-grandmother they didn’t know could get home to her bed.

My mind went into assessment mode again. These youngsters might be trying to make a killing out of an opportunity. They would possibly demand a bomb as compensation once they had pulled off a makeshift repair. I told myself that even so they were angels. It was all right, and I could afford it.

A Lesson in Virtue

When they emerged, drenched and covered in grime, they gave me a simple thumbs-up. “It will hold for twenty kilometres,” they promised. “Go now, before the storm breaks.”

I felt a wave of sheepishness. I reached for my phone, explaining that I had little cash but wanted their phone number so that I could make a digital payment as a token of my immense gratitude.

One of the boys stopped me. He didn’t look at my phone; he looked at me, folded his hands in a respectful gesture, and smiled.

“We are not looking for money,” he said softly. “We just wanted to help.”

They urged us to leave quickly, waving us off into the blinding rain as if they hadn’t just performed a small miracle. As I drove, the screeching of metal was replaced by a profound silence in my heart.

We often complain about the world moving too fast or losing its way. But that night, on a dark road out of Vakkom, I realised that human virtue isn’t rare; it is all-pervasive. We just have to wait for the rain to see it shine.