Sib-pair Command: dis


ClassAnalysis and data manipulation command
Namedis
Arguments [<marker locus 1> [<marker locus 2> [...<marker locus N> [<trait locus>]]]]
(r2|dpr)

Estimates frequencies of haplotypes based on individuals with two informative parents or no informative parents. For pairs of multiallelic markers (6 or more alleles), only the former group is used. In the informative parent-offspring triads, the loci are assumed to be tightly linked, so that four parental haplotypes may be inferred, but this can be turned off (set hrec).

If no markers are specified, the sequence of pairs of markers in the list of loci is produced (ie marker1 with marker2, marker2 with marker3...). If one marker is specified, then the pairing of all other markers with this index marker is analysed.

For pairs of markers with few alleles, information from both phased and unphased genotypes is combined using an EM-fitted log-linear model, which can also deal with X-linked data.

If three or more markers are specified, or two or more markers along with a trait, the genotypes are all treated as if unphased (and X-linked data is not handled). The same log-linear modelling framework is used.

When a trait is specified, the haplotype frequencies within each trait category (two if a binary trait, as many levels as observed for a quantitative trait) are calculated, along with a chi-square testing equality of marker haplotype frequencies across categories.

If exactly two numbers are given, then these are assumed to be the number of alleles at two loci, and genotype counts may be entered at the command line.

Two likelihood ratio tests are provided for the two locus case. The first tests the adequacy of the intragametic association model ie absence of population stratification type effects. The other tests if there is significant intragametic association (ie D or r ≠ 0).

If the r2 is given, the pairwise r2 matrix for all pairs of active markers is printed. The dpr gives the matrix of Hedrick weighted D' for all pairs of active markers.

Example:

>> dis

---------------------------------------------------
Inter-marker allelic association analysis
---------------------------------------------------

Marker 1   Marker 2      N mean D'    r2 Chi-sq  df  asy P
---------- ---------- ---- ------- ----- ------ --- ------
E366K      rs1122629  3564   0.872 0.016   85.8   1 0.0000 LD  ***
rs1122629  rs1303     3540   0.252 0.023   95.0   1 0.0000 LD  ***

>> set ple 2
>> dis E366K rs1122629

---------------------------------------------------
Inter-marker allelic association analysis
---------------------------------------------------

Assoc for locus "E366K     " c. locus "rs1122629 "
---------------------------------------------------

Modelling  10 phased genotypes (N=  628) and   9 unphased genotypes (N= 2936).

Unphased Genotypes  Observed  Expected  Deviance
  1/1     1/1             0.      0.0     -0.0
  1/1     1/2             0.      0.2     -0.3
  1/1     2/2             2.      1.1      0.8
  1/2     1/1             6.      4.0      1.0
  1/2     1/2            58.     61.2     -0.4
  1/2     2/2            59.     54.2      0.7
  2/2     1/1           753.    745.4      0.3
  2/2     1/2          1397.   1406.5     -0.2
  2/2     2/2           661.    663.5     -0.1
Phased Genotypes    Observed  Expected  Deviance
  1  1;  1  1             0.      0.0     -0.0
  1  1;  2  1             0.      0.0     -0.1
  1  1;  2  2             0.      0.2     -0.4
  2  1;  1  1             0.      0.8     -1.1
  2  1;  1  2            11.     12.3     -0.3
  2  2;  1  1           152.    159.4     -0.6
  2  1;  2  1             0.      0.8     -1.0
  2  1;  2  2            10.     11.6     -0.4
  2  2;  2  1           313.    300.8      0.7
  2  2;  2  2           142.    141.9      0.0

  Haplotype  Prop      95% CL           D        D'
  ----------------------------------------------------
 A   C     0.0013   0.0007--0.0025   -0.0092   -0.8724
 A   T     0.0194   0.0165--0.0229    0.0092    0.8724
 G   C     0.5039   0.4893--0.5188    0.0092    0.8724
 G   T     0.4754   0.4612--0.4900   -0.0092   -0.8724

  Number of genotypes used =      3564
    LD Model LR Chi-square =        7.57 (df=  14, P=0.9103)
       LR Chi-square (D=0) =       85.79 (df=   1, P=0.0000)
  Hedrick weighted mean D' =        0.8724
                 r-squared =        0.0165


>> dis 2 2
  9 genotype counts> 91 147 85 32  78 75 5  17  7
Modelling   9 unphased genotypes (N=  537).

  Haplotype  Prop      95% CL           D        D'
  ----------------------------------------------------
   1   1   0.3822   0.3503--0.4169    0.0234    0.2231
   1   2   0.3916   0.3594--0.4266   -0.0234   -0.2231
   2   1   0.0815   0.0664--0.1001   -0.0234   -0.2231
   2   2   0.1448   0.1244--0.1684    0.0234    0.2231

  Number of genotypes used =       537
    LD Model LR Chi-square =       12.81 (df=   5, P=0.0252)
       LR Chi-square (D=0) =        8.01 (df=   1, P=0.0046)
  Hedrick weighted mean D' =        0.2231
                 r-squared =        0.0126


See also:

set hreAssume zero recombination for phased LD model
write haploviewwrite Haploview pedigree file
write locus haploviewwrite Haploview info file
haplotypes haplotypes for pairs of SNPs


<< (mds)Up to index>> (neff)