A program to test the impact of language-family boundaries on population differentiation and to evaluate the homogeneity of the genetic processes along these boundaries
| Introduction | Input files | Running | Output files | References | Download: authorized only |
This program permits to assess the impact of language-family boundaries on population differentiation and to evaluate the homogeneity of the genetic processes along these boundaries as described in Dupanloup de Ceuninck et al. (2000). The first estimator (delta-a) of the impact of the boundary is based on an isolation by distance (IBD) model and measures the added genetic distance between populations located on different sides of the boundary. This estimator is compared to another estimator of group differentiation (FCT) computed under an analysis of variance framework that does not assume any particular spatial structure of the populations (Excoffier et al., 1992).
Though, this method was originally developped to evaluate the genetic processes at work along and accross linguistic boundaries, it can be used to assess the impact on population differentiation of any other cultural boundaries or ecological barriers, at any geographical scale.
Boundaries takes two input files. The first one (*.geo) must contain the geographic coordinates of the sampling localities of your populations. The second one (*.arp) is in fact an Arlequin input file (called Arlequin project file) containing the genetic data in your populations. The Arlequin file must have the same name as the geographical file with the extension (*.arp). The order of the populations in the two input files MUST BE THE SAME !!!
The file containing the geographic coordinates of the sampling localities of your populations
must have the *.geo extension. Important notice: Boundaries does not work if two
sampling localities have the same geographical coordinates.
The geographical input file must be structured the following way.
Each line corresponds to a population.
Each line must contain five fields separated by a tab character:
In the Arlequin project file, the order of the populations, which means the order in which the genetic data in your samples is defined, must be the same than in the file containing the geographic coordinates of your sampling points. For more informations on Arlequin project files, you can download Arlequin program (Schneider et al., 2000) and Arlequin help file through Arlequin web site.
Example of Arlequin project file (for the same populations listed in
the geographical input file) inputdata.arp :
#AMOVA analysis
[Profile]
Title="A New Sample File Designed To Compute AMOVA"
NbSamples=4
LocusListing=TABLE
GenotypicData=0
LocusSeparator=WHITESPACE
DataType=FREQUENCY
Frequency=REL
[Data]
[[Samples]]
SampleName="Egyptiens"
SampleSize=250
SampleData= {
1 0.64
2 0.36
}
SampleName="Tunisiens"
SampleSize=474
SampleData= {
1 0.80
2 0.20
}
SampleName="Albanais"
SampleSize=117
SampleData= {
1 0.86
2 0.14
}
SampleName="Lithuaniens"
SampleSize=202
SampleData= {
1 0.45
2 0.55
}
When Boundaries is launched, it asks you successively:
The most important output file of Boundaries is the file called results.txt.
It contains the results of the computations done by boundaries (test of IBD in the linguistic groups,
values and p-values of the delta-a and FCT statistics).
A set of additional output files are created by Boundaries and listed above in
alphabetical order:
Isabelle Dupanloup, Dipartimento di Biologia, University of Ferrara