Reference Gene Catalogue and Nomenclature Recommendations
The scope of this section is 1) to offer a unified repository of functionally characterized genes and described gene families with reference to gene names, synonyms and publications, and 2) remind the grapevine community of the nomenclature guidelines recommended to name genes and assign symbols.
Grapevine Gene Nomenclature
The recommendations are published in: Grimplet et al, 2014.
Erratum: In the manuscript, the link to the VIV catalog for species abbreviations is incorrect. Follow this one instead.
Highlights:
1- Check if your gene has been previously assigned a Full Name or Symbol in a previous work describing the whole gene family to which it belongs. You can still give it a name according to the phenotype or process in which it is involved and that you have demonstrated experimentally, but it’s recommended to search for these Synonyms.
2- The definition of a proper nomenclature needs to consider the level of confidence of the function as assigned to the full name (e.g. experimental validation in same or other species, role proposed by phylogenetic analysis, hypothetical function based in similarity to other species’ sequences, etc). See Figure 2 in Grimplet et al, 2014.
3- Use the convention for functional names and symbols proposed by the sNCGGA. See Figure 3 in Grimplet et al, 2014. Briefly:
- Gene name based on complementation assays in other plant mutant species is allowed, however, please bear in mind that this doesn’t necessarily mean the role is going to be exactly the same in grapevine (maybe there are more roles or additional ‘evolved’ functions).
- Gene name based on overexpression phenotypes in other plant species. Same recommendations as in the previous point.
- Gene name based in overexpression, knock-out or silencing phenotype in grape: OK
- Gene name based in QTL/mapped trait: OK
- Gene naming based on phylogenetic trees: we recommend performing phylogenetic analyses using the complete gene family identified in grape and Arabidopsis (Maximum Likelihood is preferred over Neighbour Joining). A grape gene can be named as the Arabidopsis homologue if this relation is the closest compared to any other homologue/paralogue (orthologs one2one should be considered: the vitis gene is the closest among all the vitis genes for an Arabidopsis gene and vice-versa). Please consider methods (search for best fit model, consider only bootstrap values >70 or Bayesian probabilities >0.8). Letters can be added to the name (name1a, name1b…) if several grape genes show closest homology to one Arabidopsis gene.
- Ordering according to the chromosome position should be avoided. It presents the disadvantage of being invalidated each time changes occur at the level of the genome assembly or when new members of the family are discovered.
- Prefix Vvi should be used for Vitis vinifera (Vv prefix was originally created for the bacteria Vibrio vulnificus). For other grape species use this list.
Locus ID (V1) | Locus ID (V.Cost) | Full name | Curation | Prefix | Symbol | Synonyms | |
---|---|---|---|---|---|---|---|
Example 1 | VIT_04s0008g05210 | Vitvi04g00464 | (Vitis vinifera) ELONGATED HYPOCOTYL5 | validated (Loyola et al., 2016) | Vvi | HY5 | bZIP10 (Liu et al., 2014) |
Example 2 | VIT_03s0180g00200 | Vitvi03g00557 | (Vitis vinifera) Resveratrol Glycosyltransferase, putative | validated in V. labrusca (Hall & Luca, 2007) | Vvi | RsGT1 | GT9 (Bönisch et al. 2014) |
Description | Genome localization in V1 annotation | Genome localization in V3 annotation | Relative descriptive function | Level of curation | Vvi | Concise (3-10 characters), descriptive of function when possible | Any known synonyms |
Grapevine Reference Gene Catalogue
The INTEGRAPE Cost Action has unified several independent efforts for building a catalogue of characterized genes and surveyed gene families. Their gene symbols were included in the latest 12X.2 assembly V.Cost gff annotation file (to download VCost.v3_INTEGRAPEv2.gff3) and will also be present in the latest PN40024 40X assembly annotation (under construction).
Note on Catalogue’s Structure. Each row in the catalogue corresponds to a unique V.Cost.v3 Id. In cases where there is recent evidence of a V.Cost.v3 splitting in two genes, a composite gene symbol is provided.
Access to the full catalogue
(version2.3, Last update: 29th November 2021; version 3 will be released in February 2022):
(Excel)⇒ catalogue.INTEGRAPEv2.xlsx
Gene Cards. This App serves as a hub for gene functional associations, represented as information cards with references and all the information from the most recent version of the catalogue. It also includes an interactive organ expression viewer thorough all SRA runs available.
Developed by David Navarro-Payá & José Tomás Matus (Coordinators of the Grape Gene Reference Catalogue Initiative).
Validation Levels
Validation | Level of experimental validation |
---|---|
Hypothetical: only based on similarity to other proteins (e.g. BLAST, Hidden Markov Models /PFAM search). | 1 |
Putative: complete family identification (using phylogenetic trees and including other species) and/or expression data/validation (RNA-seq, qPCR, others). | 2 |
Proposed:correlation experiments such as gene co-expression networks, correlation to metabolite profiles. | 3 |
Candidate: QTL mapping. | 4 |
Validated_other_sp.: If transcription factor: overexpression or negative dominant (in transient, stable experiments) in another species. | 5 |
Validated: knock-out/loss-of-function, silencing, overexpression or negative dominant (transient, stable) in the same species. If enzyme: in vitro/recombinant enzyme characterization (e.g. E. coli/yeast) or overexpression of enzyme (transient, stable) in another species. | 6 |
New Gene Submission Form
MPORTANT NOTICE: If you have recently characterized a gene or described a gene family (already published or accepted for publication) you can incorporate it in our catalogue.
Please fill out the following form.
Make sure to consider the following levels of evidence/validation for gene functions
Validation | Level of experimental validation |
---|---|
Hypothetical: only based on similarity to other proteins (e.g. BLAST, Hidden Markov Models /PFAM search). | 1 |
Putative: complete family identification (using phylogenetic trees and including other species) and/or expression data/validation (RNA-seq, qPCR, others). | 2 |
Proposed:correlation experiments such as gene co-expression networks, correlation to metabolite profiles. | 3 |
Candidate: QTL mapping. | 4 |
Validated_other_sp.: If transcription factor: overexpression or negative dominant (in transient, stable experiments) in another species. | 5 |
Validated: knock-out/loss-of-function, silencing, overexpression or negative dominant (transient, stable) in the same species. If enzyme: in vitro/recombinant enzyme characterization (e.g. E. coli/yeast) or overexpression of enzyme (transient, stable) in another species. | 6 |
Provider
José Tomás Matus / David Navarro-Payá
Primary contact | tomas.matus (at) uv.es / davidnp7 (at) gmail.com.