Sib-pair Command: summary


ClassGlobal
Namesummary
Arguments[<N highest results> | plot [qq] [<EPS file>] | table] | dump [<output file>]| get <variable_name>].

List the N (defaulting to 5) most significant P-values from the last multilocus analysis.

If the plot keyword is issued, a Postscript plot of the -log10 P-values versus map position of the active marker loci tested for association or linkage in the last analysis, unless the qq modifier keyword is present. In that case, a quantile-quantile plot of the -log10 P-values is produced. This is saved to a file (default name "sib-pair.eps").

For the table modifier keyword, a table of binned P-values or test statistics is produced; usual bin sizes are powers of ten.

The dump modifier prints all the P-values to a file.

The get <varname> modifier extracts numerical values from locus annotations taking the form varname=value, or from the i'th column. Reading VCF files usually transfers INFO variables to the annotations as such a pair. As a side effect, the variable is now available to select loci using "keep/drop where test".

Example:

>> include cavanaughex.in
>> apm ibd
>> summary

Test: "apm ibd".

Total number of tests =      8

Locus          Position  P-value   -log10(P)
-------------- --------- -------  ----------
D16S411            62.23  5e-12       11.286  48294236 (chr 16)
D16S409            61.00  0.0027       2.571  47333369 (chr 16)
D16S753            60.12  0.0027       2.562  31181005 (chr 16)
D16S3145           55.01  0.0036       2.446  26784604 (chr 16)
D16S419            67.69  0.0082       2.087  51510863 (chr 16)

>> summary 10

Locus          Position   P-value   -log10(P)
-------------- ---------- -------  ----------
D16S411             62.23  5e-12       11.286  48294236 (chr 16)
D16S409             61.00  0.0027       2.571  47333369 (chr 16)
D16S753             60.12  0.0027       2.562  31181005 (chr 16)
D16S3145            55.01  0.0036       2.446  26784604 (chr 16)
D16S419             67.69  0.0082       2.087  51510863 (chr 16)
D16S415             69.15  0.0483       1.316  52228236 (chr 16)
D16S771             70.85  0.1120       0.951  53066505 (chr 16)
D16S769             53.71  0.1297       0.887  26066263 (chr 16)


>> summary tab

Bin     Count  Percent
------- -----  -------
10^-11      1    12.5
10^-8       0     0.0
10^-7       0     0.0
10^-6       0     0.0
10^-5       0     0.0
10^-4       0     0.0
10^-3       0     0.0
10^-2       4    50.0
10^-1       1    12.5
10^0        2    25.0


>> summary plot
>> $ gv sib-pair.eps &

Resulting plot:

>> read loc vcf chr22.dose.vcf.gz
[...]

VCF file           = chr22.dose.vcf.gz
Number of subjects = 1370 (with genotype data)
Number of markers  = 365643
Number passing QC  = 365531 (1.000)
Number indels      = 18541
Number SVs         = 0
Total genotypes    = 500930910
Chromosome         = 22   
Map range (bp)     = 16050408 -- 51242613

Declaring 365643 loci.

>> summary get R2
>> sum tab

Test: "R2".

 Intvl Midpt    Count   Histogram
 ---------------------------------------------------------
       0.0500  252665 + **************************************************
       0.1501   37201 | **********
       0.2501   21433   *****
       0.3501   14206   ***
       0.4502   10285   **
       0.5502    7858   **
       0.6502    6571   *
       0.7503    5289   *
       0.8503    4365   *
       0.9504    5352   *
Median (IQR)         =        0.0224 (       0.0024 --        0.1484)
Symmetry test J(.02) =        2.7801 (P=0.000)

>> keep where test >= 0.8
Selecting markers where test statistic >=   0.80000000000000004     
Keeping 9736 active loci.


<< (mztwin)Up to index>> (homoz)