PhagesDB

From Wikipedia, the free encyclopedia

The Actinobacteriophage database, more commonly known as PhagesDB, is a database-backed website that gathers and shares information related to the discovery, characterization and genomics of viruses that prefer to infect Actinobacterial hosts. It is a bioinformatics tool that is used worldwide to compare multiple phages and their genomic annotations. Up to recent dates, there have been more than 8,000 bacteriophages, including over 1,600 with already sequenced genomes, have been entered into the database.[1] It is an addition to the wide range of priorly existing bioinformatic tools, like NCBI. It provides results of already sequenced phage genomes and aims to allow access to drafted phage genomes to provide a larger spectrum of information.

Background[]

PhagesDB provides the Actinobacteriophage research community with an outlet to post their findings and share it with the members of their community who can then further analyze the data and use it to annotate newly discovered phage genomes through comparison. It was designed to keep up with the speed of discovery so new genes can be uploaded in real time. It is supposed to help avoid the "time-lag between sequencing and availability of annotated genomes in GenBank.[2] It links students from all over the world who are performing authentic research via the SEA-PHAGES program so they can share their results with the rest of the research community.[3]" There are more than 6400 registered PhagesDB users with xxx.edu email addresses, reflecting usage by student researchers.[2]

In 1993, the sequencing of L5 led to the start of the first decade of Actinobacteriophage genomics,[4] and concluded in the publication of an analysis that compared 14 different mycobacteriophage genomes in 2003.[5] Using local spreadsheets and GenBank, it was possible to manage the resulting data for approximately a year, provided that it was at the pace of approximately one genome every year. However, the following two developments made this approach indefensible. First of all, the creation of the Phage Hunters Integrating Research and Education (PHIRE) program [6][7] created a path for beginner high school and college scientists to purify, isolate, and characterize their own new phages, leading to a sharp increase in the amount of phage isolates that can be accessed for sequencing.[8] Secondly, the revelation of Next-Generation Sequencing technologies caused a faster and cheaper sequencing of those genomes of phages. As a result the number of phage genomes sequenced in the next decade exponentially increased.[8] The foundation of the Science Education Alliance-Phage Hunters Advancing Genomics and Evolutionary Science (SEA-PHAGES) program in 2008 [9] increased the pipeline more for isolation and sequencing of Actinobacteriophage, and coping with the data created by these programs brought a challenge.

PhagesDB was created to be a single, concentrated archive of phage information where anyone interested in or involved with phage study could access and enter data. A web-accessible database sample allowed the storage and retrieval of data in a methodical and pliable way, as well as giving easy access to anyone with an connection to internet. In April 2010, PhagesDB launched, and was initially only for Mycobacteriophages (phages of mycobacterial hosts). In 2015, it became the Actinobacteriophage Database to contain all phages infecting hosts in the Phylum Actinobacteria.[2]

Design and Features[]

The creation of PhagesDB was carried out using Django[10] and was hosted on a WebFaction server. Django is a Python-based web-development framework, and it was chosen specifically for its high versatility, accessibility to non-professional programmers, clarity of documentation, and several other out-of-the-box features including a fully functional administrative site. It was important for it to have accessibility to non-professional programmers as that allows for a more diverse range of results. Rather than host PhagesDB locally, WebFaction was chosen for its easy integration with Django, its high-level data security and its low downtime. The database website opens up with a mostly green and black lobby page and on the top left, a search bar is present. Phage names can be, sequenced and/or draft, typed in the search bar and results immediately pop up.[2]

PhagesDB has an individual phage page for every single phage out of the more than 8000 phages that have been entered into the database. These pages contain detailed information regarding the phages. This information includes discovery details (GPS coordinates, year found, isolation temperature, host bacterium, etc.), sequencing details (genome length, G + C content, type of genome termini, etc.), characterization details (morphotype, cluster/subcluster, gene list, etc.) and further useful files (fasta sequence file, plaque picture, restriction digest picture, micrograph, etc.). If applicable, there are links to the GenBank entry for the phage, as well as the paper it was published in. Along with all these, there is a separate GeneMark page for each phage which allows one to cross reference the position of genomes within the draft phages to ensure that there is indeed a genome present at a certain spot. PhagesDB can be used on its own but is found to be more accurate when used in collaboration with another bioinformatics website like NCBI Blast.[2] The figure below[11] indicates the different types and numbers of phages sequenced:

Phage Types Sequenced Number Sequenced
Actinoplanes 1
Arthrobacter 240
Brevibacterium 2
Corynebacterium 12
Gordonia 296
Kocuria 4
Microbacterium 98
Mycobacterium 1590
Propionibacterium 55
Rhodococcus 53
Rothia 1
Streptomyces 167
Tetrasphaera 1
Tsukamurella 2

There are many different ways in PhagesDB where the user is able to view and contact with groups of phages. Phage lists can be generated and classified by host (genus, species, or strain), cluster, subcluster, institution, year found, genome length, G + C content and a few other criteria. The filter page[12] lets for a combination of criteria to aim at a group of phages with particular characteristics. Each phage cluster and subcluster has its own page with a catalog of member phages. Along with this each cluster and subcluster has days present about itself as well, for example number of members in the clusters, their average genome sizes, and the hosts their members infect or prefer to infect. There is an interactive map that shows all phages/clusters that are sequenced with known GPS coordinates. This gives information about the geographical spread of phage locations of isolation.[2] The compare page[13] lets users view all plaque pictures, limitation digest pictures, or micrographs for a given phage group. PhagesDB has amino-acid level details about its phage genomes that are sequenced by integration with Phamerator[14][15]

Access and Rights to Data[]

PhagesDB data can be viewed by anyone freely and anyone can register for the site (through Google, Facebook, Twitter, or PhagesDB itself), giving the ability to add new phages that have been found, and modify the phage data while learning more about the characteristics of phages.

Additionally, the site gives multiple ways of bringing back the fundamental data. The download page[16] of the data consists of links for downloading all sequences of phage genome, texts about each phage with a broad information, and photos with all plaque images, restriction digest gel pictures, or micrographs for any given cluster. Each Pham page provides a link to download the amino acid sequences of all members of that Pham for comparative proteomic motives.

An Application Programming Interface (API) was recently added to let users approach a lot of underlying data in a way more compatible for computers.[17] The PhagesDB API receives appeals for all phages, the ones by host, by cluster or subcluster, sequenced, etc., and reverts json objects with the wanted information. PhagesDB keeps some unpublished data that is not present in any medium, including genome sequences that have been done recently. The PhagesDB ‘Terms of Use[18] contains instructions on how and which data shall be used by third parties. For instance, users willing to take advantage of unpublished data for their private purposes need permission from the owners of data.[2]

References[]

  1. ^ Russell DA, Hatfull GF, [1] "PhagesDB: the actinobacteriophage database"
  2. ^ a b c d e f g Russell, Daniel A.; Hatfull, Graham F. (2017-03-01). "PhagesDB: the actinobacteriophage database". Bioinformatics. 33 (5): 784–786. doi:10.1093/bioinformatics/btw711. ISSN 1367-4803. PMC 5860397. PMID 28365761.
  3. ^ Hatfull, Graham [2]"Genomic Databases:DNA Annotation"
  4. ^ Hatfull G.F., Sarkis G.J. (1993). "DNA sequence, structure and gene expression of mycobacteriophage L5: a phage system for mycobacterial genetics". Mol. Microbiol. 7 (3): 395–405. doi:10.1111/j.1365-2958.1993.tb01131.x. PMID 8459766. S2CID 10188307.
  5. ^ Pedulla, M.L. (2003). "Origins of highly mosaic mycobacteriophage genomes". Cell. 113 (2): 171–182. doi:10.1016/S0092-8674(03)00233-2. PMID 12705866. S2CID 14055875.
  6. ^ Hanauer, D.I. (2006). "Inquiry learning. Teaching scientific inquiry". Science. 314 (5807): 1880–1881. doi:10.1126/science.1136796. PMID 17185586. S2CID 142760427.
  7. ^ Hatfull, G.F. (2006). "Exploring the mycobacteriophage metaproteome: phage genomics as an educational platform". PLOS Genet. 2 (6): e92. doi:10.1371/journal.pgen.0020092. PMC 1475703. PMID 16789831.
  8. ^ a b Hatfull, G.F. (2015). "Innovations in undergraduate science education: going viral". J. Virol. 89 (16): 8111–8113. doi:10.1128/JVI.03003-14. PMC 4524241. PMID 26018168.
  9. ^ Jordan, T.C. (2014). "A broadly implementable research course in phage discovery and genomics for first-year undergraduate students". mBio. 5 (1): e01051–e01013. doi:10.1128/mbio.01051-13. PMC 3950523. PMID 24496795.
  10. ^ "The Web framework for perfectionists with deadlines | Django". www.djangoproject.com. Retrieved 2018-04-11.
  11. ^ "The Actinobacteriophage Database | Home". phagesdb.org. Retrieved 2018-04-18.
  12. ^ "The Actinobacteriophage Database". phagesdb.org. Retrieved 2018-04-11.
  13. ^ "The Actinobacteriophage Database | Compare Phages". phagesdb.org. Retrieved 2018-04-11.
  14. ^ "Phamerator". phamerator.org. Retrieved 2018-04-18.
  15. ^ Cresawn, S.G. (2011). "Phamerator: a bioinformatic tool for comparative bacteriophage genomics". BMC Bioinform. 12: 395. doi:10.1186/1471-2105-12-395. PMC 3233612. PMID 21991981.
  16. ^ "The Actinobacteriophage Database | Home". phagesdb.org. Retrieved 2018-04-16.
  17. ^ "Swagger UI". phagesdb.org. Retrieved 2018-04-16.
  18. ^ "The Actinobacteriophage Database | Terms of Use". phagesdb.org. Retrieved 2018-04-16.

External links[]

Retrieved from ""