TY - GEN AB - Designing synthetic genes for heterologous expression is a keystone of synthetic biology. In protein sequences - as there are 61 sense codons but only 20 standard amino acids - most amino acids are encoded by more than one codon. Although such synonymous codons do not alter the encoded amino acid sequence, they are not redundant. By using certain codons over others, gene expression can be improved by up to 1000 times. Industry-standard codon optimization techniques based on biological indexes replace synonymous codons with the most abundant codon found in the host organism's genome. However, this technique may result in an imbalanced tRNA pool, metabolic stress, and translational error which lead to greater cell toxicity and reduced protein expression. In this research, recurrent neural networks are used to accurately capture sequential and contextual patterns. By predicting synonymous codons based on the sequential information of the host organism, protein expression can be increased while preventing translational error and plasmid toxicity. AD - Westview High School AU - Jain, Rishab DA - 2021 DO - 10.6083/fq977v625 DO - DOI ID - 9217 KW - Gene Expression KW - Codon Usage KW - Deep Learning KW - Synthetic Biology KW - Codon KW - Machine Learning KW - Artificial Intelligence KW - codon optimization KW - protein expression KW - heterologous expression KW - neutral network L1 - https://digitalcollections.ohsu.edu/record/9217/files/Jain-Rishab-OHSU-ResearchWeek-2021.pdf L2 - https://digitalcollections.ohsu.edu/record/9217/files/Jain-Rishab-OHSU-ResearchWeek-2021.pdf L4 - https://digitalcollections.ohsu.edu/record/9217/files/Jain-Rishab-OHSU-ResearchWeek-2021.pdf LA - eng LK - https://digitalcollections.ohsu.edu/record/9217/files/Jain-Rishab-OHSU-ResearchWeek-2021.pdf N2 - Designing synthetic genes for heterologous expression is a keystone of synthetic biology. In protein sequences - as there are 61 sense codons but only 20 standard amino acids - most amino acids are encoded by more than one codon. Although such synonymous codons do not alter the encoded amino acid sequence, they are not redundant. By using certain codons over others, gene expression can be improved by up to 1000 times. Industry-standard codon optimization techniques based on biological indexes replace synonymous codons with the most abundant codon found in the host organism's genome. However, this technique may result in an imbalanced tRNA pool, metabolic stress, and translational error which lead to greater cell toxicity and reduced protein expression. In this research, recurrent neural networks are used to accurately capture sequential and contextual patterns. By predicting synonymous codons based on the sequential information of the host organism, protein expression can be increased while preventing translational error and plasmid toxicity. PB - Oregon Health and Science University PY - 2021 T1 - Codonify: a recurrent-neural-network based codon optimization tool to improve protein expression TI - Codonify: a recurrent-neural-network based codon optimization tool to improve protein expression UR - https://digitalcollections.ohsu.edu/record/9217/files/Jain-Rishab-OHSU-ResearchWeek-2021.pdf Y1 - 2021 ER -