GenoCAD

GenoCAD
Initial release 30 August 2007 (2007-08-30)
Stable release 2.3.1 / 11 January 2014 (2014-01-11)
Development status Active
Written in PHP JavaScript C++ MySQL
Type Computer-Aided Design Bioinformatics
License Apache v2.0
Website genocad.com

GenoCAD is one of the earliest computer assisted design tools for synthetic biology.[1] The software is a bioinformatics tool developed and maintained by GenoFAB. GenoCAD facilitates the design of protein expression vectors, artificial gene networks and other genetic constructs for genetic engineering and is based on the theory of formal languages.[2] GenoCAD can be used online at www.genocad.com.

History

GenoCAD originated as an offshoot of an attempt to formalize functional constraints of genetic constructs using the theory of formal languages. In 2007, the website genocad.org (now retired) was set up as a proof of concept by researchers at Virginia Bioinformatics Institute, Virginia Tech. Using the website, users could design genes by repeatedly replacing high-level genetic constructs with lower level genetic constructs, and eventually with actual DNA sequences.[2]

On August 31, 2009, the National Science Foundation granted a three-year $1,421,725 grant to Dr. Jean Peccoud, an associate professor at the Virginia Bioinformatics Institute at Virginia Tech, for the development of GenoCAD.[3] GenoCAD was and continues to be developed by GenoFAB, a company founded by Peccoud (currently CSO and acting CEO), who was also one of the authors of the originating study.[2]

Source code for GenoCAD was originally released on Sourceforge in December 2009.[4]

GenoCAD version 2.0 was released in November 2011 and included the ability to simulate the behavior of the designed genetic code. This feature was a result of a collaboration with the team behind COPASI.[5]

In April, 2015, Peccoud and colleagues published a library of biological parts, called GenoLIB, that can be incorporated into the GenoCAD platform.[6]

Goals

The four aims of the project are to develop a:[7]

  1. computer language to represent the structure of synthetic DNA molecules used in E.coli, yeast, mice, and Arabidopsis thaliana cells
  2. compiler capable of translating DNA sequences into mathematical models in order to predict the encoded phenotype
  3. collaborative workflow environment which allow to share parts, designs, fabrication resource
  4. means to forward the results to the user community through an external advisory board, an annual user conference, and outreach to industry

Features

The main features of GenoCAD can be organized into three main categories. [8]

Workflow of GenoCAD

Theoretical foundation

GenoCAD is rooted in the theory of formal languages; in particular, the design rules describing how to combine different kinds of parts and form context-free grammars. [2]

A context free grammar can be defined by its terminals, variables, start variable and substitution rules.[10] In GenoCAD, the terminals of the grammar are sequences of DNA that perform a particular biological purpose (e.g. a promoter). The variables are less homogeneous: they can represent longer sequences that have multiple functions or can represent a section of DNA that can contain one of multiple different sequences of DNA but perform the same function (e.g. a variable represents the set of promoters). GenoCAD includes built in substitution rules to ensure that the DNA sequence is biologically viable. Users can also define their own sets of rules for other purposes.

Designing a sequence of DNA in GenoCAD is much like creating a derivation in a context free grammar. The user starts with the start variable and repeatedly selects a variable and a substitution for it until only terminals are left.[2]

Alternatives

The most common alternatives to GenoCAD are Proto, GEC and EuGene[11]

Tool Advantages Disadvantages
GEC
  • Designer only needs to know basic part types and determine constraints [11]
EuGene
  • Interfacing with other simulation and assembly tools[11]
Proto
  • Choice of molecules and sequences can be made by other programs[11]
  • Integration capability with some other languages[11]
  • Relatively hard to learn [11]
  • Results are less efficient [1]

References

  1. 1 2 Beal, Jacob; Phillips, Andrew; Densmore, Douglas; Cai, Yizhi (2011). "High-Level Programming Languages for Biomolecular Systems". In Koeppl, Heinz; Densmore, Douglas; Setti, Gianluca; di Bernardo, Mario. Design and Analysis of Biomolecular Circuits. New York Dordrecht Heidelberg London: Springer. p. 241. doi:10.1007/978-1-4419-6766-4. ISBN 978-1-4419-6765-7.
  2. 1 2 3 4 5 Cai Y, Hartnett B, Gustafsson C, Peccoud J. (2007). "A syntactic model to design and verify synthetic genetic constructs derived from standard biological parts.". Bioinformatics 23 (20): 2760–7. doi:10.1093/bioinformatics/btm446. PMID 17804435.
  3. Jodi Lewis (September 14, 2009). "National Science Foundation awards $1.4 million for GenoCAD development". Retrieved October 7, 2013.
  4. "GenoCAD Code". Sourceforge. Retrieved 8 October 2013.
  5. Wilson, Mandy. "GenoCAD Release Notes". Peccoud Lab. Retrieved 8 October 2013.
  6. Adames N, Wilson M, Fang G, Lux M, Glick B, Peccoud J (2015). "GenoLIB: a database of biological parts derived from a library of common plasmid features.". Nucleic Acids Research 43: 4823–32. doi:10.1093/nar/gkv272. PMID 25925571.
  7. Jean Peccoud (June 21, 2013). "GenoCAD: Computer Assisted Design of Synthetic DNA". Retrieved October 7, 2013.
  8. Wilson ML, Hertzberg R, Adam L, Peccoud J. (2011). "A step-by-step introduction to rule-based design of synthetic genetic constructs using GenoCAD.". Methods Enzymol. 498: 173–88. doi:10.1016/B978-0-12-385120-8.00008-5. PMID 21601678.
  9. Cai, Y.; Lux, M. W.; Adam, L.; Peccoud, J. (2009). Sauro, Herbert M, ed. "Modeling Structure-Function Relationships in Synthetic DNA Sequences using Attribute Grammars". PLoS Computational Biology 5 (10): e1000529. doi:10.1371/journal.pcbi.1000529. PMC 2748682. PMID 19816554.
  10. Sipser, Michael (2013). Introduction to the Theory of Computation, Third edition. Boston, MA, USA: Cengage Learning. p. 104. ISBN 978-1-133-18779-0.
  11. 1 2 3 4 5 6 7 8 Habibi, N., Mohd Hashim, S. Z., Rodriguez, C. A., & Samian, M. R. (2013). A Review of CADs, Languages and Data Models for Synthetic Biology. Jurnal Teknologi, 63(1).
  12. Pedersen, M. (2010). Modular languages for systems and synthetic biology.

External links

This article is issued from Wikipedia - version of the Tuesday, February 16, 2016. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.