Skip to main content

Advanced Filtering of Variants

To enter the Advanced Variant Filtering mode, click on the button at the top of the table. You will see the Query Builder:

note

If you are on the page of SNV Viewer with somatic mutations, then the somatic mutation filtering condition ("Origin in Somatic") will be added by default:

In such a case, remove this condition or add a new one, as described here.
info

On "Profile Settings" page, you can adjust which filtering mode you will see when you first open sample SNV Viewer page: basic or advanced.

Filtering Columns#

The columns are divided into groups:

  1. Common
  2. Per-sample data (if there is data on sequencing samples)
  3. Gene (if you have access to OMIM database data inside Genomenal, which is provided if the user has a license to access the database)
  4. gnomAD 3
  5. gnomAD 4
  6. ClinVar
  7. Conservation
  8. ExAC
  9. Protein function effect
  10. Protein function effect (additional)
  11. Other frequencies
  12. Other
  13. Custom annotation sources (if at least one custom annotation was added to the system before the variants in the sample were annotated)

You can interact with column groups by using and to expand and collapse the column list of the group, respectively.

Use the search box to find the filtering column. The search is case-insensitive. Some columns can be found by search synonyms. For example, the following columns can be found by the word "change":

Common#

  1. Sample Status (basic filter "Sample Status") - filter variants by run sample interpretation status (completed or uncompleted). Single value filter. Only available in run SNV Viewer.
  2. Interpretation - filter variants by interpretation (comment) added by the user on the variant details page or panel. Any value filter.
  3. HPO Phenotype - filter variants by terms from Human Phenotype Ontology (HPO). To open the phenotype search window, click on the filter value field . How to find and select the phenotypes in this form is described here.
  4. Gene panel (basic filter "Gene panel") - filter variants by localization in certain genes from the panel. Single value filter. You can add a gene panel on on "Gene Panels" page or import a panel from the Library of the most common clinical panels (such option is available for basic filter "Gene panel" in basic filtering mode. If you don't have any gene panels added, the list of filter values will be empty.
  5. Region (basic filter "Region") - filter variants by position on the genome (chromosome, start and end positions). To filter by region, click on the input window and enter the position of the needed genomic interval, i.e. chromosome, start and end positions. Possible options of writing the interval position:
  • chr1:161,190,824-161,193,921
  • chr1:161190824-161193921
  • chr1:161190824_161193921
  • 1:161190824-161193921

You can also filter by more than one interval at once. For example, by chr1:161190824-161193921, chrX:151110769-151113866.
To complete adding the condition, press Enter or click outside edit value container.
The filtering result will be variants whose positions intersect with the entered interval.

  1. Gene (column "Locus", basic filter "Gene") - filter variants by localization in gene transcript. The filter values depend on the selected filter operator:
  • any value if the operators "starts with", "contains", "not contains" are selected. For example, this is useful if you want to filter variants by genes, the name of which contains a certain letter sequence (e.g., "HLA-").
  • multiselect if the operators โ€œinโ€ and โ€œnot inโ€ are selected. Used to filter variants by localization in certain genes. In the value options, the gene transcript is given in brackets next to the official gene name.
  • no value if the operators "not empty", "empty" are selected.
  1. Impact - filter by the predicted effect of the variant on the protein: High, Moderate, Low, Modifier. Column values are described here. Multiselect filter.
  2. Genome position - filter by variant position in the genome: in exon (Exonic), in intron (Intronic), in the intergenic region (Intergenic). Multiselect filter.
  3. Consequence (basic filter "Consequence") - filter by the effect of the variant on genes. Column values are described here. Multiselect filter.
  4. Pathogenicity (column "Pathogenicity", basic filter "Pathogenicity") - multiselect filter by variant pathogenicity, which have been determined either manually, by ACMG criteria, or according to the pathogenicity base.
  5. Label (column "Label", basic filter "Label") - multiselect filter by variant significance defined by the user: High significance, Moderate significance, Low significance, Sequencing error, Undefined.
  6. Phasing group (column "PG") - multiselect filter by phasing group status:
  • Phasing group - merging of homozygous variants located within a specific genomic interval;
  • Part of group consequence equal - part of phasing group with concordant amino acid substitutions;
  • Part of group consequence unequal - part of phasing group with discordant amino acid substitutions.
  1. Compound heterozygote (Genotype) - filter variants by whether they are compound heterozygotes (i.e. heterozygosity of two or more variants of the same gene; all cases are described in detail here). Unlike the compound heterozygote filtering mode, this filter is applied simultaneously with other filtering conditions. As a result, the variants included in the compound heterozygote will be found, but the remaining variants from this compound heterozygote may be filtered out by other conditions.

In family analysis SNV Viewer and in SNV Viewer with one sequencing sample, to filter variants by compound heterozygote, select the filter in the list of filtering columns:

In germline cohort analysis SNV Viewer, in SNV Viewer with several sequencing samples and in run SNV Viewer (if there is data on sequencing samples), to filter variants by compound heterozygote, select a sample in which all cases will be found where at least two heterozygotes (0/1, 1/0, 0|1, etc.) are located in the same gene:

  1. Trio GT (Genotype) - multiselect filter by variant inheritance: De novo variants, Newly formed homozygotes (homozygotes in proband that are heterozygous in parents) or Single parent-inherited. Only available in family group analysis for Trio case (Proband, Father and Mother).
  2. Chromosome (column "Position")- filter variants by localization in chromosomes: chr1-chr22, chrX, chrY, chrM, chromosomes like chrN_random (contigs located in the N chromosome but could not be reliably ordered within the sequence, or sequences for alternative haplotypes), chromosomes like chrUn_ (contigs that cannot be confidently placed on a specific chromosome), chrEBV. Multiselect filter.
  3. rsId (column "External links") - filter variants by ID from dbSNP database. Filter with ID input. You can use as a value either the full ID (e.g. rs1126497) or the ID digits only (e.g. 1126497).
  4. ClinVar ID (column "External links") - filter variants by Variation ID from ClinVar database (e.g. 183700). Filter with ID input.
  5. Origin (basic filter "Origin") - filter variants by mutation type: Somatic, Germline. Multiselect filter.
  6. Start (column "Position") - filter by the start position of the variant on the chromosome. Numeric filter (natural number).
  7. End - filter by the end position of the variant on the chromosome. Numeric filter (natural number).
  8. Reference allele (column "Alteration") - filter by reference allele of the variant. Any value filter.
  9. Alternative allele (column "Alteration") - filter by alternative allele of the variant. Any value filter.
  10. HGVSc - filter variants by nucleotide substitution using the HGVS notation: โ€œc.โ€ (coding; for a substitution in the coding sequence) or โ€œn.โ€ (non-coding; for a substitution in the non-coding sequence) prefix + genomic position of the substituted nucleotide + reference allele > alternative allele. Any value filter.
  11. HGVSp (short) (column "HGVSp") - filter variants by amino acid substitution using the HGVS notation in the single-letter amino acid code (โ€œp.โ€ prefix (protein) + reference amino acid + amino acid position in protein + new amino acid resulting from the substitution). Any value filter. e.g.: p.P72R.
  12. HGVSp (long) (column "HGVSp") - filter variants by amino acid substitution using the HGVS notation in the three-letter amino acid code. Written with transcript: Ensembl transcript ID + โ€œp.โ€ prefix (protein) + reference amino acid + amino acid position in protein + new amino acid resulting from the substitution. Any value filter. e.g.: ENSP00000269305.4:p.Pro72Arg.
  13. Exon - filter by exon number in the transcript that the variant affected. Numeric filter (natural number).
  14. Onco Relevance - filter by the variant significance for any oncological disease: Relevant, Irrelevant. Single value filter.
  15. Caller Filters - multiselect filter by variant quality (GATK filters).
  16. Occurrence in Run - filter variants by occurrence in run samples. Numeric filter (integer number). Only available in run SNV Viewer and run sample SNV Viewer.
  17. RefSeq Transcript - filter variants by transcript ID in RefSeq database (e.g. NM_002354.3). Any value filter.
  18. OMIM ID - filter variants by gene ID in OMIM database (e.g. 185535). Filter with ID input.
  19. NCBI ID - filter variants by gene ID in NCBI Gene database (e.g. 4072). Filter with ID input.
  20. Swiss-Prot ID - filter variants by gene ID in UniProtKB Swiss-Prot database (e.g. P16422). Filter with ID input.
  21. TrEMBL ID - filter variants by gene ID in UniProtKB TrEMBL database (e.g. B5MCA4). Filter with ID input.
  22. Ensembl ID - filter variants by gene ID in Ensembl database (e.g. ENSG00000119888). Filter with ID input.
  23. Cosmic ID - filter by variant ID in COSMIC database (e.g. COSV52665487). Filter with ID input. Only available if COSMIC database has been uploaded as a custom annotation before the sample variant annotation is complete.
  24. RUSeq AF - filter variants by alternative allele frequency from RUSeq database. Numeric filter (any number). Only available if RUSeq database has been uploaded as a custom annotation before the sample variant annotation is complete.

Per-sample data#

  • The filtering column group is only present if there is sequencing sample data.
  1. Genotype (column "GT") - filter variants by genotype, i.e. reference and alternative allele values for the sample. Genotypes are filtered ignoring phasing information. To add a condition for filtering variants by genotype, do the following:
    1. Select a sample to filter:
  • "Some": any sample has the required genotype;
  • "Every": each sample has the required genotype;
  • Sample name.

  1. Select a filter operator:

  1. Select the required filter value by clicking on the value row:
  • "REF": only reference allele at the site, (e.g., 0/0, 0|0, 0/0/0);
  • "ALT": there is at least one alternative allele at the site (e.g., 1/1, 1|1, 0/1, 1/0, 0|1, 1|0, 1/., 1|., ./1, .|1, 1, 0/0/1, 0/2, 1/2);
  • "HOM": homozygous site, can be as in reference or alternative (e.g., 1|1, 1/1, 1/1/1, 0/0, 0|0);
  • "HET": heterozygous site, containing at least two different alleles (e.g., 0/1, 1/0, 0|1, 1|0, 2/0, 0/1/2);
  • "HOM_ALT": homozygous site in alternative (e.g., 1/1, 1|1|1);
  • "MIS": information is missing for the site (e.g., ./.).
  1. Read depth (column "Depth (Alt/Ref)") - filter variants by sequencing depth, i.e. the total number of reads of the sequence overlapping the variant position for this sample.
  2. Ref count - filter variants by the number of times the reference nucleotide was read in the sequence for this sample.
  3. Alt count - filter variants by the number of times the alternative nucleotide was read in the sequence for this sample.
  4. AF - filter variants by alternative allele frequency for this sample.

To add a condition for filtering variants by depth, ref, alt or AF, do the following:

  1. Select a filter function:
  • "Max": maximum value;
  • "Min": minimum value;
  • "Avg": mean;
  • "Sum": total value.

  1. Select a filter operator:

  1. If a mathematical operator was selected as the filter operator, click on input box and enter the filter threshold (natural number for filters by depth, ref and alt count, and positive rational number for AF). To complete adding the condition, press Enter or click outside edit value container.

Gene#

gnomAD 3#

Filter columns with the following data from gnomAD v3 database:

  1. AF (column "gnomAD 3 AF") - filter variants by the total allele frequency (AF). Numeric filter (positive rational number).
  2. XY AF - filter variants by allele frequency (AF) in the XY subpopulation. Numeric filter (positive rational number).
  3. XX AF - filter variants by allele frequency (AF) in the XX subpopulation. Numeric filter (positive rational number).
  4. Number of Homozygotes - filter variants by the number of homozygotes. Numeric filter (natural number).
  5. [Population] AF - filter variants by total allele frequency (AF) in different populations, where [Population] is Ashkenazi Jewish, Amish, African and Afr. Am., Finnish, South Asian, Latino, Non-finnish european, East Asian, Middle Eastern, Other. Numeric filter (positive rational number).
  6. [Population] XY/XX samples AF - filter variants by allele frequencies (AF) in XY/XX subpopulations in different populations, where [Population] is Ashkenazi Jewish, Amish, African and African American, Finnish, South Asian, Latino, Non-finnish european, East Asian, Middle Eastern, Other. Numeric filter (positive rational number).
  7. [Population] Number of Homozygotes - filter variants by the number of homozygotes in different populations, where [Population] is Ashkenazi Jewish, Amish, African and African American, Finnish, South Asian, Latino, Non-finnish european, East Asian, Middle Eastern, Other. Numeric filter (positive rational number).
  8. Coverage - filter variants by average coverage depth by bases: <10, 10-100, โฉพ100. Any value filter.

gnomAD 4#

Filter columns with the following data from gnomAD v4 database:

  1. AF - filter variants by the total alternative allele frequency (AF) in high quality genotypes in exomes and genomes. Numeric filter (positive rational number).
  2. AC - filter variants by the total alternative allele count in exomes and genomes. Numeric filter (natural number).
  3. AN - filter variants by the total number of called high quality genotypes in exomes and genomes. Numeric filter (natural number).
  4. Number of Homozygotes - filter variants by the total number of individuals homozygous for alternative allele in exomes and genomes. Numeric filter (natural number).
  5. XY Number of Homozygotes - filter variants by the total number of individuals homozygous for alternative allele in the XY subpopulation in exomes and genomes. Numeric filter (natural number).
  6. XX Number of Homozygotes - filter variants by the total number of individuals homozygous for alternative allele in the XX subpopulation in exomes and genomes. Numeric filter (natural number).
  7. XY AF - filter variants by the total alternative allele frequency in the XY subpopulation in exomes and genomes. Numeric filter (positive rational number).
  8. XY AC - filter variants by the total alternative allele count in high quality genotypes in the XY subpopulation in exomes and genomes. Numeric filter (natural number).
  9. XY AN - filter variants by the total number of called high quality genotypes in the XY subpopulation in exomes and genomes. Numeric filter (natural number).
  10. XX AF - filter variants by the total alternative allele frequency in the XX subpopulation in exomes and genomes. Numeric filter (positive rational number).
  11. XX AC - filter variants by the total alternative allele count in high quality genotypes in the XX subpopulation in exomes and genomes. Numeric filter (natural number).
  12. XX AN - filter variants by the total number of called high quality genotypes in the XX subpopulation in exomes and genomes. Numeric filter (natural number).
  13. Exomes AF - filter variants by the total alternative allele frequency (AF) in high quality genotypes in exomes. Numeric filter (positive rational number).
  14. Exomes AC - filter variants by the total alternative allele count in exomes. Numeric filter (natural number).
  15. Exomes AN - filter variants by the total number of called high quality genotypes in exomes. Numeric filter (natural number).
  16. Exomes Number of Homozygotes - filter variants by the total number of individuals homozygous for alternative allele in exomes. Numeric filter (natural number).
  17. Exomes XY Number of Homozygotes - filter variants by the total number of individuals homozygous for alternative allele in the XY subpopulation in exomes. Numeric filter (natural number).
  18. Exomes XX Number of Homozygotes - filter variants by the total number of individuals homozygous for alternative allele in the XX subpopulation in exomes. Numeric filter (natural number).
  19. Exomes XY AF - filter variants by the total alternative allele frequency in the XY subpopulation in exomes. Numeric filter (positive rational number).
  20. Exomes XY AC - filter variants by the total alternative allele count in high quality genotypes in the XY subpopulation in exomes. Numeric filter (natural number).
  21. Exomes XY AN - filter variants by the total number of called high quality genotypes in the XY subpopulation in exomes. Numeric filter (natural number).
  22. Exomes XX AF - filter variants by the total alternative allele frequency in the XX subpopulation in exomes. Numeric filter (positive rational number).
  23. Exomes XX AC - filter variants by the total alternative allele count in high quality genotypes in the XX subpopulation in exomes. Numeric filter (natural number).
  24. Exomes XX AN - filter variants by the total number of called high quality genotypes in the XX subpopulation in exomes. Numeric filter (natural number).
  25. Genomes AF - filter variants by the total alternative allele frequency (AF) in high quality genotypes in genomes. Numeric filter (positive rational number).
  26. Genomes AC - filter variants by the total alternative allele count in genomes. Numeric filter (natural number).
  27. Genomes AN - filter variants by the total number of called high quality genotypes in genomes. Numeric filter (natural number).
  28. Genomes Number of Homozygotes - filter variants by the total number of individuals homozygous for alternative allele in genomes. Numeric filter (natural number).
  29. Genomes XY Number of Homozygotes - filter variants by the total number of individuals homozygous for alternative allele in the XY subpopulation in genomes. Numeric filter (natural number).
  30. Genomes XX Number of Homozygotes - filter variants by the total number of individuals homozygous for alternative allele in the XX subpopulation in genomes. Numeric filter (natural number).
  31. Genomes XY AF - filter variants by the total alternative allele count in high quality genotypes in the XY subpopulation in genomes. Numeric filter (natural number).
  32. Genomes XY AC - filter variants by the total alternative allele count in high quality genotypes in the XY subpopulation in genomes. Numeric filter (natural number).
  33. Genomes XY AN - filter variants by the total number of called high quality genotypes in the XY subpopulation in genomes. Numeric filter (natural number).
  34. Genomes XX AF - filter variants by the total alternative allele count in high quality genotypes in the XX subpopulation in genomes. Numeric filter (natural number).
  35. Genomes XX AC - filter variants by the total alternative allele count in high quality genotypes in the XX subpopulation in genomes. Numeric filter (natural number).
  36. Genomes XX AN - filter variants by the total number of called high quality genotypes in the XX subpopulation in genomes. Numeric filter (natural number).

ClinVar#

Filter columns with the following data from ClinVar database:

  1. Clinical significance (column "ClinVar", basic filter "ClinVar") - filter variants by the clinical significance (you can find the description of values here). Multiselect filter.
  2. Allele ID - filter variants by allelic substitution ID (e.g. 405185). Filter with ID input.
  3. Phenotypes - filter variants by the phenotypic significance. Any value filter.
  4. Phenotypes cross references - filter variants by phenotype references in various databases (Orphanet, MedGen, MONDO, OMIM, etc.) Any value filter.
  5. Haplotype phenotypes - filter by the phenotypic significance of the variant that is part of the haplotype. Any value filter.
  6. Haplotype phenotypes cross references - filter by references to the phenotype of a variant that is part of the haplotype in various databases (Orphanet, MedGen, MONDO, OMIM, etc.) Any value filter.
  7. Review status - filter variants by the summary score of the data source, in which the clinical significance of the phenotype is declared. Any value filter. Possible value options and their descriptions can be found here.
  8. Cross references - filter variants by references in various databases (ClinGen, UniProtKB, OMIM, GTR, etc.) Any value filter.
  9. Allele origin - filter variants by allele origin. Multiselect filter.
  10. Variant Haplotype ID - filter by ID of the variant (Variation ID) that is part of the haplotype. Filter with ID input.

Conservation#

  1. Ancestral allele - filter variants by ancestral allele predicted by EPO. Any value filter.
  2. Altai Neanderthal - filter variants by Altai Neanderthal genotype. Any value filter.
  3. Denisova - filter variants by Denisova genotype. Any value filter.
  4. Vindija Neanderthal - filter variants by Vindija Neanderthal genotype. Any value filter.
  5. [Program] [Score] - filter variants by conservation scores obtained using prediction by various programs, where [Program] is a prediction program: FitCons, GERP++, PhyloP, PhastCons, SiPhy, BStatistic, and [Score] is a conservation score: rankscore, score, neutral rate. Numeric filter (positive rational number).
  6. [Score] confidence value - filter variants by confidence value of various conservation scores: Highly significant; Significant; Informative; Other, where [Score] is a conservation score: Integrated, GM12878, H1-hESC, HUVEC. Multiselect filter.
  7. [Distribution] - filter variants by stationary allele distribution in the site, estimated using SiPhy algorithm, where [Distribution] is a distribution of alleles A, G, C or T: pA, pG, pC, pT. Numeric filter (positive rational number).

ExAC#

Filter columns with the following data from ExAC database:

  1. AF - filter variants by the total allele frequency (AF). Numeric filter (positive rational number).
  2. Allele count - filter variants by alternative allele count. Numeric filter (natural number).
  3. Adjusted AF - filter variants by allele frequency (AF) in a population with genotype quality โ‰ฅ20 and depth โ‰ฅ10. Numeric filter (positive rational number).
  4. Adjusted allele count - filter variants by alternative allele count in a population with genotype quality โ‰ฅ20 and depth โ‰ฅ10. Numeric filter (natural number).
  5. [Population] AF - filter variants by total allele frequency (AF) in different populations, where [Population] is African & African American, American, East Asian, Finnish, Non-Finnish European, South Asian. Numeric filter (positive rational number).
  6. [Population] allele count - filter variants by alternative allele count in different populations, where [Population] is African & African American, American, East Asian, Finnish, Non-Finnish European, South Asian. Numeric filter (positive rational number).
  7. [Cohort] AF - filter variants by total allele frequency (AF) in different cohorts, where [Cohort] is Non-TCGA, Non-Psych. Numeric filter (positive rational number).
  8. [Cohort] Allele count - filter variants by alternative allele count in different cohorts, where [Cohort] is Non-TCGA, Non-Psych. Numeric filter (natural number).
  9. [Cohort] adjusted AF - filter variants by allele frequency (AF) in a population with genotype quality โ‰ฅ20 and depth โ‰ฅ10 in different cohorts in ExAC database, where [Cohort] is Non-TCGA, Non-Psych. Numeric filter (positive rational number).
  10. [Cohort] adjusted allele count - filter variants by alternative allele count in a population with genotype quality โ‰ฅ20 and depth โ‰ฅ10 in different cohorts, where [Cohort] is Non-TCGA, Non-Psych. Numeric filter (natural number).
  11. [Cohort] [Population] AF - filter variants by total allele frequency (AF) in different populations in different cohorts, where [Cohort] is Non-TCGA, Non-Psych, and [Population] is African & African American, American, East Asian, Finnish, Non-Finnish European, South Asian. Numeric filter (positive rational number).
  12. [Cohort] [Population] allele count - filter variants by alternative allele count in different populations in different cohorts,where [Cohort] is Non-TCGA, Non-Psych, and [Population] is African & African American, American, East Asian, Finnish, Non-Finnish European, South Asian. Numeric filter (natural number).

Protein function effect#

  1. [Algorithm] [Score] - filter variants by prediction scores of the effect of amino acid substitution on protein function, obtained using various algorithms and programs, where [Algorithm] is a prediction algorithm: SIFT, Polyphen, SIFT4G, LRT, FAtHMM, Mutation Accessor, and [Score] is a prediction score: rankscore, score, omega, p-value. Numeric filter (positive rational number).
  2. [Algorithm] prediction - filter variants by the results of predicting the effect of amino acid substitution on protein function, obtained using various algorithms and programs, where [Algorithm] is a prediction algorithm: Polyphen, SIFT4G, LRT, FAtHMM, Mutation Accessor. Multiselect filter.
  3. SpliceAI Delta score - filter by the probability of the variant being splice-altering at any position within a given interval (+/-50 bp between the variant and gained/lost splice site by default), predicted SpliceAI. Numeric filter (any number).
  4. SpliceAI Delta score (acceptor gain) - filter by the probability of the variant being splice acceptor at any position within a given interval (+/-50 bp between the variant and gained/lost splice site by default), predicted SpliceAI. Numeric filter (any number).
  5. SpliceAI Delta score (acceptor loss) - filter by the probability of the variant not to be splice acceptor at any position within a given interval (+/-50 bp between the variant and gained/lost splice site by default), predicted SpliceAI. Numeric filter (any number).
  6. SpliceAI Delta score (donor gain) - filter by the probability of the variant being splice donor at any position within a given interval (+/-50 bp between the variant and gained/lost splice site by default), predicted SpliceAI. Numeric filter (any number).
  7. SpliceAI Delta score (donor loss) - filter by the probability of the variant not to be splice donor at any position within a given interval (+/-50 bp between the variant and gained/lost splice site by default), predicted SpliceAI. Numeric filter (any number).
  8. SpliceAI Delta position - filter by the location (in bp) where splicing changes relative to the variant position (positive values are downstream of the variant, negative values are upstream). Numeric filter (integer number).
  9. SpliceAI Delta position (acceptor gain) - filter by the location (in bp) of splice acceptor relative to the variant position (positive values are downstream of the variant, negative values are upstream). Numeric filter (integer number).
  10. SpliceAI Delta position (acceptor loss) - filter by the location (in bp) which is not a splice acceptor relative to the variant position (positive values are downstream of the variant, negative values are upstream). Numeric filter (integer number).
  11. SpliceAI Delta position (donor gain) - filter by the location (in bp) of splice donor relative to the variant position (positive values are downstream of the variant, negative values are upstream). Numeric filter (integer number).
  12. SpliceAI Delta position (donor loss) - filter by the location (in bp) which is not a splice donor relative to the variant position (positive values are downstream of the variant, negative values are upstream). Numeric filter (integer number).
  13. CADD raw - filter by the extent to which the annotation profile for a variant suggests that the variant is likely to be "observed" (negative values) vs "simulated" (positive values), calculated by CADD. Numeric filter (any number).
  14. CADD phred - filter by Phred-scaled score of the deleteriousness of a variant, calculated by CADD. Numeric filter (any number).
  15. MaxEntScan Alt score - filter by score for alternative allele predicting the loss of a native splice site for a variant, calculated by MaxEntScan. Numeric filter (any number).
  16. MaxEntScan Ref score - filter by score for reference allele predicting the loss of a native splice site for a variant, calculated by MaxEntScan. Numeric filter (any number).
  17. MaxEntScan diff - filter by the difference between the scores for alternative and reference alleles predicting the loss of a native splice site for a variant, calculated by MaxEntScan. Numeric filter (any number).

Protein function effect (additional)#

  1. [Algorithm] [Score] - filter variants by prediction scores of the effect of amino acid substitution on protein function, obtained using various algorithms and programs, where [Algorithm] is a prediction algorithm: BayesDel addAF, BayesDel noAF, Provean, Meta SVM, Meta LR, M-CAP, MutPred, MVP, MPC, Primate AI, DEOGEN2, DANN, FAtHMM-MKL, FAtHMM-XF, Eigen, and [Score] is a prediction score: rankscore, score, p-value. Numeric filter (positive rational number).
  2. [Algorithm] prediction - filter variants by the results of predicting the effect of amino acid substitution on protein function, obtained using various algorithms and programs, where [Algorithm] is a prediction algorithm: BayesDel addAF, BayesDel noAF, Provean, Meta SVM, Meta LR, M-CAP, Primate AI, FAtHMM-MK, FAtHMM-XF. Multiselect filter.
  3. Transcript ID: filter variants by transcript ID in UniProtKB database (e.g. P16422) for which the effect of amino acid substitution on protein function has been predicted by MutPred. Any value filter.
  4. Amino acid change - filter variants by amino acid substitution whose effect on protein function was predicted by MutPred. Any value filter. e.g.: G35r.
  5. Top 5 features - filter variants by the top five substitution effects on protein function predicted by MutPred. Any value filter.

Other frequencies#

  1. 1000G Allele frequency - filter variants by total allele frequency in the 1000 Genomes Project. Numeric filter (positive rational number).
  2. 1000G [Population] AF - filter variants by total allele frequency (AF) in different populations in the 1000 Genomes Project, where [Population] is East Asian, European, African, American, South Asian, African American, European American. Numeric filter (positive rational number).
  3. UK10K allele frequency - filter variants by total allele frequency in combined genotypes in the UK10K cohort from the UK10K project. Numeric filter (positive rational number).
  4. UK10K allele count - filter variants by alternative allele count in combined genotypes in the UK10K cohort from the UK10K project. Numeric filter (natural number).
  5. UK10K [Cohort] AF - filter variants by total allele frequency (AF) in combined genotypes in different cohorts from the UK10K project, where [Cohort] is TwinsUK, ALSPAC. Numeric filter (positive rational number).
  6. UK10K [Cohort] allele count - filter variants by alternative allele count in combined genotypes in different cohorts from the UK10K project, where [Cohort] is TwinsUK, ALSPAC. Numeric filter (natural number).
  7. ESP [Population] AF - filter variants by total allele frequency (AF) in different populations in NHLBI GO Exome Sequencing Project (ESP), where [Population] is African American, European American. Numeric filter (positive rational number).
  8. [Population] allele count - filter variants by alternative allele count in different populations from NHLBI GO Exome Sequencing Project (ESP), where [Population] is African American, European American. Numeric filter (natural number).

Other#

  1. Coding sequence position - filter variants by coding sequence position. Numeric filter (natural number).
  2. Protein position - filter variants by protein position. Numeric filter (natural number).
  3. Amino acids - filter variants by reference amino acid. Any value filter.
  4. Codons - filter variants by nucleotide substitution written as codons. Any value filter. e.g.: GAA/GAG.
  5. Amino acid position - filter variants by amino acid position. Numeric filter (natural number).
  6. APPRIS annotation - filter variants by human splicing isoforms (APPRIS annotation): Principal 1-5; Alternative 1-2. Multiselect filter.
  7. Geuvadis EQTL target gene - filter variants by gene from eQTL data analysis in GEUVADIS. Any value filter.
  8. ENIGMA clinical significance - filter by clinical significance of the variant located in the known or suspected breast and/or ovarian predisposition genes (such as BRCA1, BRCA2) determined by ENIGMA consortium. Multiselect filter: Pathogenic, Likely pathogenic, Benign, Likely benign, Not provided.
  9. ENIGMA clinical significance comment - filter by comment on clinical significance of the variant located in the known or suspected breast and/or ovarian predisposition genes (such as BRCA1, BRCA2) determined by ENIGMA consortium. Any value filter.

Custom annotation sources#

A filtering column group is present if at least one custom annotation was added to the system before the sample variants were annotated. Filtering column names look like this: [custom annotation name]: [annotation column name]. To add a condition for filtering variants by column from a custom annotation, do the following:

  1. Select a filter operator. If the annotation column type is text, then the operators will be the following:

If the annotation column type is numeric, then the operators will be the following:

  1. If operators other than "not empty" and "empty" have been selected, click on the input box and enter the filter column value. To complete adding the condition, press Enter or click outside edit value container.

Compound Heterozygote Filtering Mode#

A compound heterozygote is a genetic condition in which two or more alleles of the same gene carry different mutations in a patient. This condition is observed if the patient inherits a mutant allele from each parent or inherits one mutation from one of the parents, and the second occurs de novo. Compound heterozygotes are common in autosomal recessive diseases.

If there is data about sequencing samples, a button for filtering by compound heterozygote appears in the query builder:

Filtering by compound heterozygote depends on SNV Viewer type:

SNV Viewer of family trio analysis (Proband, Father, Mother): the proband has at least two heterozygotes in one gene (0/1, 1/0, 0|1, etc.) with the following inheritance options:

  • at least one of the heterozygotes is inherited from the father, and at least one is inherited from the mother:

  • at least one of the heterozygotes occurred de novo (i.e the father and mother have homozygous reference 0/0), and the second is inherited from the father or from the mother:

To apply a filter by compound heterozygote, click the button . As a result, there will be found all cases where at least two heterozygotes are located in the same gene in the proband, and in the parents they are either heterozygous, or homozygous reference, or there is no information about the site (./.). To turn off the filter in the query builder, click the button .

SNV Viewer of family duo analysis (Father and Mother): cases when in one gene at least one heterozygote (0/1, 1/0, 0|1, etc.) is in the mother and at least one is in the father:

To apply a filter by compound heterozygote, click the button , and to turn off the filter in the query builder, click the button .

SNV Viewer of family duo analysis (Proband, Parent (Father or Mother)): the proband has at least two heterozygotes in one gene (0/1, 1/0, 0|1, etc.) with the following inheritance options:

  • at least one heterozygote is inherited, and the second is not inherited from the parent (i.e. the parent has homozygous reference 0/0):

  • both are not inherited from the parent (i.e. the parent has homozygous reference 0/0):

To apply a filter by compound heterozygote, click the button , and to turn off the filter in the query builder, click the button .

SNV Viewer with one sequencing sample (e.g. for a single sample of tumor or healthy tissue): there will be found all cases where at least two heterozygotes (0/1, 1/0, 0|1, etc.) are located in the same gene. To apply a filter by compound heterozygote, click the button , and to turn off the filter in the query builder, click the button .

SNV Viewer of germinal cohort analysis or SNV Viewer with several sequencing samples (e.g. for a tumor/control sample pair), or run SNV Viewer (if there is data on sequencing samples): by clicking on the compound heterozygote filter button, you will see a drop-down sample list:

When one of the samples is selected, the variants will be filtered in such a way that in this sample there will be at least two heterozygotes in one gene (0/1, 1/0, 0|1, etc.) To turn off such a filter in the query builder, you need to click on the compound heterozygote filter button and select "Turn off" option.

Applying of the compound heterozygote filtering mode with other filtering conditions:

When applying the compound heterozygote filtering mode together with other filtering conditions, the condition for compound heterozygote is already applied to those variants that fit the other conditions:

This fundamentally differs the filtering mode from "Compound heterozygote (Genotype)" filter, which is applied simultaneously with other filtering conditions. As a result of applying the filter, there will be found variants that are included in the compound heterozygote, but the remaining variants from this compound heterozygote can be filtered out by other conditions:

Add the First Filter Condition#

To add a new condition, click on the highlighted window:

You will see a list of annotation columns available for filtering variants:

The way of adding a filter condition depends on the filter type:

1. Multiselect Filters#

  • Ability to select multiple column values at once.
  • The way of adding a filter condition:
  1. Select a filter column from the list of all columns;
  2. Select a filter operator:
    • "in";
    • "not in";
    • "not empty": the selected column has a non-empty value in the annotation for the searched variants (there is no such option for the "Chromosome" and "Origin" columns);
    • "empty": the selected column has no value in the annotation for the searched variants (there is no such option for the "Chromosome" and "Origin" columns).

  1. If the "in" or "not in" operators have been selected, select the required filter column values by clicking on the value rows:

  1. To complete adding the condition, click outside edit value container.

2. Single Value Filters#

  • Ability to select only one filter value.
  • The way of adding a filter condition:
  1. Select filter from the list of all filters. The filter operator "is" is automatically selected for filters of this type;
  2. Select the required filter value by clicking on the value row:

3. Filters with ID input#

  • Ability to enter the required column value.
  • The way of adding a filter condition:
  1. Select a filter column from the list of all columns;
  2. Select a filter operator:

  1. If the operators "is" or "is not" were selected, click on the input box and enter filter column value:

You can enter several IDs at once in a list separated by a comma (e.g. rs1234, rs2799066, rs4648562), a semicolon or a space (rs1234 rs2799066 rs4648562).

  1. To complete adding the condition, press Enter or click outside edit value container.

4. Numeric Filters#

  • Ability to enter the required column value.
  • The way of adding a filter condition:
  1. Select a filter column from the list of all columns;
  2. Select a filter operator:
    • ">";
    • "<";
    • "โ‰ฅ";
    • "โ‰ค";
    • "=";
    • "โ‰ ";
    • "not empty": the selected column has a non-empty value in the annotation for the searched variants (there is no such option for the "Start" and "End" columns);
    • "empty": the selected column has no value in the annotation for the searched variants (there is no such option for the "Start" and "End" columns).

  1. If a mathematical operator was selected as the filter operator, click on the input box and enter filter column value:

  1. To complete adding the condition, press Enter or click outside edit value container.

5. Any Value Filters#

  • Ability to enter the required column value.
  • The way of adding a filter condition:
  1. Select a filter column from the list of all columns;
  2. Select a filter operator:
    • "starts with" (there is no such option for the "HGVSc" column);
    • "contains";
    • "not contains" (there is no such option for the "HGVSc" column);
    • "not empty": the selected column has a non-empty value in the annotation for the searched variants (there is no such option for the "Reference allele" column);
    • "empty": the selected column has no value in the annotation for the searched variants (there is no such option for the "Reference allele" column).

  1. If the operators "starts with", "contains" or "not contains" were selected, click on the input
    box and enter filter column value:

  1. To complete adding the condition, press Enter or click outside edit value container.

Edit a Condition#

You can change the operator and/or value in an already created condition. To do this, you need to click on the corresponding field:

and choose another option:

Turn off a Condition#

Turning off a condition is useful if you donโ€™t want to lose the added condition, but at the moment you donโ€™t need to filter by it. To turn off a condition, hover over it and click on :

Remove a Condition#

To remove a condition, hover over it and click on :

tip

A condition that has not yet been filled in can also be removed (if it's not the very first condition in the Query Builder):

Add the Following Condition#

If you want to add another condition to the query, first decide whether you want the variant you are looking for to strictly match both conditions (the one already added and the new one), or whether the variant can match one of the conditions.

The searched variant matches both conditions#

  1. After adding the first condition, click on .
  2. Complete the condition as described in the corresponding section. To get started, click on the highlighted window:

The searched variant matches one of the conditions#

  1. After adding the first condition, hover over it and click on .
  2. Complete the condition as described in the corresponding section. To get started, click on the highlighted window:

Apply a Query#

After you have completed all the conditions for the sought variants, click on . The Query Builder closes and the query is applied to the variants. ะฃou can see which query has been applied at the top of the table:

You can turn the condition off by clicking on , or you can delete it by clicking on :

To remove all conditions from the query, click on .

To change or save your query or create a new one, open the Query Builder by clicking on .

To return to the SNV Viewer main page, click on .

Sometimes there are a lot of conditions for the searched variants and the query can be very long:

In this case, you can collapse the query conditions by clicking on the button to the right of the Remove all conditions button. Then only the opening conditions of the query will be shown:

To expand the query conditions back, click on .

info

On Profile Settings page, you can adjust whether the query conditions are expanded or collapsed immediately after applying a filter query.

Save a Query#

If you want to reuse the created query in the future, save it. To do this, open the Query Builder by clicking on , and then click on . Enter the query name:

and click on .

The saved query will appear in the Saved queries list, on the left in the Query Builder. To apply a saved query, click on its line:

and then click on .

In addition, the saved query will appear in the options of the basic filter "Saved queries".

If you want to delete a saved query, click on the query line and then on .