Team:Valencia UPV/Software

DATA BASE


Often, gene related information is available on the Internet, but people don’t always know where to look or how to search efficiently, and almost every time it is so difficult to interpret because it is not clear the function of these genes. Editing them usually leads to non desired modification. Plant proteins usually are involved in many metabolic processes simultaneously, so modifications in a particular protein could lead in a non viable plant. In other cases, those modifications lead to a fruitless variety, which many times has not an agricultural benefit. So that, many times is so difficult to know which gene a plant breeder must edit despite he knows what phenotypic trait he wants to obtain.

In HYPE-IT, we have worked hard looking for genes which knockout leads to an interesting phenotypic trait. Scientific articles web pages are full of papers where gene functions are identified, but not always are identified by knocking them out. We have only selected those articles where they have done the gene silencing, either by RNAi mediated silencing or by CRISPR/Cas9 editing, and they obtained a viable fruited plants. This allow us to avoid transgenesis because we don’t need to insert exogenous genes to obtain an enhanced plant variety.
Our database gather more than 20 different genes, which knockout leads to many different improved characteristics. All these genes are well referenced, so has been demonstrated plant viability in all the cases. However, there are many proteins that are homologous between them, so we can admit that they have the same function in other plant species. After doing Blastp exams, we finally obtained more than 200 targeting genes. This is a huge Database with a high interest for plant breeders and seedbeds.

Some gene examples of our Database are:

#
Ga20 oxidase, that leads to a smaller phenotype in maize or rice. All the energy that the modified plants would have used growing up, they would harness it increasing their grain properties and production.
TFL (terminal flower), is a key regulator of delaying flowering and regulating plant growth. Its knockdown leads to more flowering varieties.
ACS4 is a protein involved in the route of ethylene synthesis. It catalyzes the synthesis of 1­-aminocyclopropane­-1-­carboxylic acid (ACC) from S­-Adenosyl methionine. Its knockdown leads to andromonoecy varieties.

HYPE-IT Database also include important gene related information, such as the name of the gene and the protein targeted, the paper we have based on or the NCBI accession number. It allows us to organise information in a structured way, in addition to have it interconnected with other kind of databases. Mixing our Database and Software, we are able to obtain the optimal gRNA to a specific HYPE-IT Database gene using the scoring system of our Software. The objective is to reduce steps that plant breeders should do if they wanted to enhance a plant variety giving them all sequences they should order to synthesize.

We are very glad to the result of our Database. If you want to access to it, you should sign in our external webpage, located inside HYPE-IT sofware application.

http://hypeit.cloudno.de/


We also include here our database in .xls for anyone who wants to check it:


HYPE-IT Database.xlsx


As a sample of how our Database is:

Database example

Common nameSpeciesPhenotypic traitGene NameNCBI CDS Accession numberProtein
AppleMalus domesticadelayed ripeningMdACS3AB2430601-aminocyclopropane-1-carboxylate synthase
TomatoSolanum lycopersicumdelayed ripeningSolanum lycopersicum 1-aminocyclopropane-1-carboxylate synthase (ACS6), mRNANM_001247235.21-aminocyclopropane-1-carboxylate synthase
StrawberryFragaria × ananassaFlavonoid biosynthesisFragaria chiloensis transcription factor (MYB1) mRNA, complete cds GQ867222.1A R2R3 MYB transcription factor
Tomato Solanum lycopersicum Increase of carotenoid and flavonoid levelsSolanum lycopersicum deetiolated1 homolog (Det1), mRNA NM_001247219.2light-mediated development protein DET1
Orange treeCitrus sinensisinduced floweringCitrus sinensis terminal flower (TFL), mRNANM_001288919.1terminal flower (TFL)
MaizeZea mayssemi-dwarf; more grain yieldZea mays (LOC107521947), mRNANM_001321686.1GA3 oxidase
RiceOryza sativaDrought TolerancePREDICTED: Oryza sativa Japonica Group E3 ubiquitin-protein ligase SINAT5 (LOC4344172), mRNA. XM_015789296E3 ubiquitin-protein ligase SINAT5
coffeeCoffea arabicadecaffeinated plantsCoffea arabica CaMXMT1 mRNA for 7-methylxanthine N-methyltransferase, complete cds.AB048794RecName: Full=Monomethylxanthine methyltransferase 1; Short=CaMXMT1; AltName: Full=Theobromine synthase 1
cottonGossypium hirsutum increased stearic acid contentG.hinsutum mRNA for stearoyl-acyl-carrier protein desaturase X95988delta 9 stearoyl-(acyl-carrier protein) desaturase (Gossypium hirsutum)
cottonGossypium hirsutumincreased oleic acid contentGossypium hirsutum delta(12)-fatty-acid desaturase FAD2-like (LOC107934594), mRNA.NM_001327381delta(12)-fatty-acid desaturase FAD2-like (Gossypium hirsutum)
cornZea mayshigher levels of amyloseZea mays amylose extender 1 (ae1), mRNA.NM_0011118461,4-α-glucan-branching enzyme 2, chloroplastic/amyloplastic precursor (Zea mays)
onion Allium royleireduced levels of tear-inducing lachrymatory factorAllium roylei lachrymatory factor synthase (LFS) gene, partial cds. HQ738919 lachrymatory factor synthase, partial (Allium roylei)
tomato Solanum lycopersicumParthenocarpicSolanum lycopersicum chalcone synthase (CHS2) mRNA, complete cds. HQ008773chalcone synthase (Solanum lycopersicum)


Sponsors