Title: Origin%20of%20Man,%20Language%20and%20Languages
1Origin of Man, Language and Languages
- Central Asia
- A common inquiry in genetics, linguistics and
anthropology - Granted by
- European Science foundation
- CNRS
2Eurasia
3Central Asia
TURKIC - SPEAKING PEOPLE
KAZAKS
KARAKALPAKS
UZBEKS
TURKMENS
KIRGHIZES
UZBEKS
TAJIKS
KIRGHIZES
UZBEKS
TAJIKS
IRANIAN-SPEAKING PEOPLE
4Expeditions
- 2001 Karakalpakie (Karakalpak, Uzbek, Kazakh)
- 2002 Karakalpakie (On Tort Uruw, Turkmen)
- 2003 Kirghizie (North and South)
- 2004 Tajik from Uzbekistan (Ferghana and
Samarkand area) - 2005 Boukhara area Kazakh, Ouzbek and Tajik
- 2005 Tajik and Uzbek from Tajikistan (Gharm and
Penjinkent area)
5(No Transcript)
6Ethnological questionnaire
- Location, langage spoken, tribe if applicable of
individuals and 4 grand-parents - Information on married children
7Linguistic data
8(No Transcript)
9(No Transcript)
10Blood, DNA
- 5 ml for each individual
- Informed consent
- On the field blood is process ? white cells
- In the laboratory DNA extraction
11Main goals
- Trace back population history
- Describe genetic diversity in Central Asia
- Compare genetic and linguistic distances
12- History of Eurasia Past demographic expansion
- By Raphaelle Chaix (Former PhD Student)
13Age of expansion
Mismatch distribution
Pop 1
Pop 2
All 2 by 2 comparison
seq1
seq2
Fq
Estimation of ? 2Tu
0 2 4 6 8
Number of differences between sequences
14Age of expansion (?)
ADNmt N133
15ADNmt
?
r0.7 p0
longitude
16Age of expansion (?)
Chr Y N77
17ChrY
?
r0.3 p0.01
longitude
18Age of expansion (T)
ADNmt / Chr Y (KY)
17 5.6
27 7.4
30 7.2
25 6
31 6.6
24 7
mtDNA dating depends on the mutation
rate 30000yrs BP China to 17000yrs BP in Europe
(-27000 in CA) 62000yrs BP in China to 35000yrs
BP in Europe (-54000 in CA)
19Center of expansion ?
intermatch distribution (Harpending et al
2000 Excoffier et al 2004)
Pop 1
Pop 2
All 2 by 2 comparison
Fq
Fq
0 2 4 6 8
0 2 4 6 8
Number of differences between sequences
Same center of expansion for the 2 populations
20Cultural expansion
Fq
0 2 4 6 8
Europe
Extrême-Orient
Asie-Centrale
21Demic expansion
Fq
0 2 4 6 8
Europe
Extrême-Orient
Asie-Centrale
22Cultural expansion with high migration rate
Demic expansion
Fq
0 2 4 6 8
23Past expansion in Eurasia
- Mitochondrial DNA East to West in Paleolithic
(from China to Central Asia and then to Europe
from Middle East to Europe) ? no cultural
expansion - Y chromosome expansion during Neolithic. Two
centers of expansion (China and ME, Pakistan CA)
perhaps 3 (Europe). - For central Asia same timing for expansion as
Middle East, a little bit earlier but not
statistically significant. - Differences explained by lower Ne for male
24Central Asia diversity
- 463 individuals typed for two uniparental markers
- Y chromosome ? Micro satellites SNP
- Mitochondrial DNA ? HVS-I RFLP
- 400 to be typed
25Y Chromosome diversity
26Genetic distances among populations chromosome Y
r gen-geo 0,85
27- Comparison of genetic and ethnological data
28From oral tradition
Common tribes ancestor
Common clans ancestor
Common lineages ancestor
29If a recent common ancestor
If no recent common ancestor
Common ancestor
Strong genetical kinship
Low genetical kinship
Patrilinear filiation ? Y chromosome study
302 Patrilinear genetic kinship
1 Ethnological questionnaire
250 men genealogical information
- 12 microsatellites of Y chromosome
31Kinship coefficient
Mean genetic kinship coefficient for each
ethnological class of the five populations
examined in this study. KZ Kazakhs TK Turkmen
UZ Uzbeks QN Qongrat OTU On Tort Uruw.
32Kinship coefficient
33Kinship coefficient
34Kinship coefficient
35Datation
Ancestor of the clan or of the lineage
T ASD / ?
ASD mean square number of differences between
the ancestor and the individuals
T age
- mutation rate
- 2.1x 10-3 per generation
36Tribe mythical ancestor
Common clans ancestor
50 generations
App. 1500 years
Common lineages ancestor
37Conclusion
Genetical data can help decipher social
organisation Lineages and clans people share a
recent common ancestor Tribes a conglomerate
of clans who subsequently invented a mythical
ancestor to strengthen group unity
38Y chromosome
- Low diversity of some populations is explained by
social organistaion - Distances among populations related to
geographical distances
39Mitochondrial DNA
40MDS based on mt DNA 12 populations (Kimura 2P
a 0.26)
KZ
KK UZ OTU
KRG KRM KRA
TK
TJK TJR
Tja Tju
r gen-geo O
41Central Asian Populations
42MDS de lADN mito
MDS based on mt DNA 28 populations (Kimura 2P
a 0.26)
Tajiks
No gen-geo correlation
43Ouighours
Kazakhs
Kazakhs
Karakalpaks_(Qongirat)
Ouzbeks
Karakalpaks
Karakalpaks_(OTU)
Uighurs
Kirghizes
Kazakhs
Kirghizes (Comas)
Ouzbeks (Korezhm)
Turkmènes
Kirghizes A
Kirghizes GM
Turkmènes
Ouzbeks
Tajiks R
Kirghizes
Tajiks A U
Tajiks
Turkmènes
Kurds_du_Turkmenistan
Ouzbeks
Shugnan_(Pamir_Tajikistan)
44Mitochondrial DNA
Mean distance (Fst) between populations Diagonal
show intra group distances
Exogamous populations
Endogamous populations
45(No Transcript)
46Mitochondrial DNA
- Distances among populations not related to
linguistic or geographical distances - Exchange among populations differ between
Turko-Mongol (exogamous) populations and
Indo-Iranian (endogamous) populations
47Conclusion
- Past history clear movement from east to west in
paleolithic strong population growth in
neolithic. - Exchange between populations clearly different
for male and female - Linguistic distances ?
48Computational linguistic
Design of the sampling Swadesh list 2/3
speakers for each sampling location (interspeaker
variation) Analyses We are not interested in
historical linguistics.
49Central Asia about 1000 CE language groups
Turkic Oguz, Kipchak, Karluk
Kipchak
Oguz
Ossetic
Karluk ?
?
Khorasmian
Sogdian
Pamirian
Persian-Tadjik
Dardic
Indo-iranian
50- We want to statistically compare genetic and
linguistic data - More linguistic differences among Iranian
populations than among Turkish populations ?
51 we selected distance-based approaches
We have two major linguistic groups Indo-Iranian
and Turk We will focus on them separately since
they both constitute a DIALECT-CHAIN Borrowing,
if it exists is less of a problem since it
reflects CONTACT (migrations), a kind of
information that is embedded in genetic data.
More than historical linguistics we look for a
POPULATION LINGUISTICS
52Dialectometrical Computation of
distances (Kondrak 2004, Heeringa 2004)
- Phonetic alignment
- An alignement algorithm (string mapping)
- A metric for measuring distances between phonetic
segments - Distance Matrices
- Correlate linguistic and geographic distances
- Correlate linguistic and genetic distances (mt
DNA)
53From Ph Mennecier
54What remains to be done in genetic analysis
- Phylogeography of Y and mtDNA ? geographic
patterns of genetic variation may reveal
migrations synchronic to linguistic phenomena
(replacement, borrowing,..) - Autosomal markers
- Samples from Tajikistan
- Thanks all the people who participated to this
study
55- In France
- Dr. François Jacquesson, linguist, CNRS
- Pr. Evelyne Heyer, geneticist, MNHN, CNRS
- Dr. Lluis Quintana, geneticist, CNRS, Inst.
Pasteur - Dr. Philippe Mennecier, linguist, MNHN
- Dr. Frederic Austerlitz, geneticist, CNRS
- Dr. Svetlana Jacquesson, anthropologist, IFEAC
- Dr. Franz Manni, geneticist, MNHN
- Dr. R Chaix (former PhD student, in Oxford)
- Dr. P Balaresque (former PhD student, in
Leicester) - In Central Asia
- Dr. Tatiana Hegai, geneticist, Tashkent
- Pr. Ruslan Ruzibakiev, geneticist, Tashkent
- Dr. Aldashev, geneticist , Bishkek
- Pr. Vadim Yagodin, archaeologist, Nukus
- Dr. Bakyt Amanbaeva, archaeologist, Bishkek
56(No Transcript)
57Alignement
I N D U S T R Y
0
I 0
N 0
T 1
E 2
R 3 4 5
E 6
S 6
T 6 7 8
industry Subst i/i 0
industry Subst. n/n 0
intdustry Insert t 1
intedustry Insert e 1
interdustry Insert r 1
interustry Delete d 1
Interstry Delete u 1
interestry Insert e 1
Interestry Subst s/s 0
Interestry Subst t/t 0
Interesty Delete r 1
Interest Delete y 1
Total cost 8
The indels are weighted as 1 instead of 2 in a
newer version of the algorithm
58Y Chromosome diversity
59Genetic distances among populations chromosome
Y