Tutorials
Tutorials
These tutorials explain steps that are used to analyze real-world sequencing data. We will try to explain the steps in detail but command output are usually ignored for brevity and confidentiality reasons.
1. Quick Start Guide
This is a 10-minute quick start guide using data from the 1000 genomes project. We encourage you to walk through this minimal real-world example to get a feel for the software and assess its helpfulness.
2. Presentations
3. Introductory tutorials
- A recent presentation about variant tools, with output of commands
- A tutorial session in the ACM BCB 2014 meeting
4. Variants screening
- Analyzing 44 whole genome cases and 200 exome controls, with detailed performance measures
- Analyzing 5 whole-genome samples from Illumina
- Compare variants from the same samples called by Complete Genomics and Illumina
- Select variants belonging to specified genes
- Detailed analysis of one variant
5. Import and export
- Import all genotype data from the 1000 genomes project
- Generate and import annovar annotations for variants already in vtools
- Import variants from a list of variants in an unsupported format
- Handling genotypes imported from multiple files (e.g. genotypes of samples are imported chromosome by chromosome)
6. Reference genomes and annotation databases
- Working with mouse and other genomes
7. Annotation
- Annotating variants using multiple annotation databases
8. Association analysis
- Quality control using sequencing data
- Association analysis using sequencing data
9. Run association analysis on cluster
- Submit a pbs script to run vtools association command on cluster using PBS
- Submit a pbs script to run vtools association command on cluster using LSF
Parallelization
- Use subprojects to manage large projects. This strategy can be used to parallelize variant processing using multiple processors or a cluster