Gene Coverage

Gene Coverage refers to how well a gene was covered by high throughput sequencing reads. This is useful to know how confident you can be about a lack of variant calls in a region.

Having gene coverage associated with a VCF sample allows you to be warned in an analysis when a gene in a gene list is below a threshold (default: 20x) and you may be missing some variants. The node will flash yellow, and the “genes” tab will be highlighted yellow so you can view which genes have low coverage.

Where gene coverage has been uploaded (eg on diagnostic systems where QC is automatically uploaded) box-plots of sample coverage for a gene will be shown on the gene symbol page

Canonical Transcripts

Many genes have multiple transcripts, but people want only one value for each gene.

This is achieved by choosing a single (representative or canonical) transcript, and use that transcripts value for the gene.

A CanonicalTranscriptCollection is a list of gene:transcript mappings imported into the system. The administrator can import different collections, linking them to EnrichmentKits and setting a system default.

Sample QC metrics

You can upload gene coverage files (.txt files) which use the system default canonical transcripts. You can then associate them with a sample from a VCF

Sample QC coverage loaded via sequencing features - and automatically choose transcripts based on EnrichmentKit

GeneGrid EnrichmentKit coverage

The per-gene QC metrics for an EnrichmentKit on the GeneGrid page are from Gold Standard Runs, using the canonical transcripts for that EnrichmentKit.