CrocoPhylogeny
Web-based phylogenetics of whole genomes and complex sequences in just three clicks
Phylogenetic relationships are usually inferred based on a few very specific and highly conserved genes termed phylogenetic markers, such as the 16S ribosomal RNA. However, the evolutionary history of any single gene may differ from that of other genes or of the whole organism, especially in species with ample horizontal gene transfer. To detect and quantify traces of common ancestrality, it is often useful to examine the relationships between longer and apparently unrelated stretches of the genome. CrocoPhylogeny is a web server for phylogenetic analyses of whole genomes and complex nucleotide sequences such as clusters of non-ortholog genes with severe rearrangement (full or partial inversions) and variable length. CrocoPhylogeny conducts BLASTN-like local alignment between all pairs of nucleotide sequences broken into pieces of a certain length and obtains an average sequence similarity score. CrocoPhylogeny runs as a CPU-supervised queue and file management system, while the BLASTN-like alignment core is implemented in CUDA C and runs on a high-performance GPU. Whereas its primary result is a distance matrix quantifying the similarity between all pairs of sequences, the results page of CrocoPhylogeny also provides pre-built phylogenetic trees, and links to immediatly visualize them on iTOL, a dedicated phylogenetic tree visualization server.
Full documentation about CrocoPhylogeny is available at our Wiki. Don't forget to check out the case studies.
If you found CrocoPhylogeny helpful, please cite this website.
Manuscript currently in preparation, please check again in a few weeks.
If you have any questions or comments, do not hesitate to contact us at 455306@mail.muni.cz.
Licence conditions in accordance with § 11 of Act No. 130/2002 Coll. The owner of the software is Masaryk University, a public university, ID: 00216224. Masaryk University allows other companies and individuals to use this software free of charge and without territorial restrictions in usual way, that does not depreciate its value. This permission is granted for the duration of property rights. This software is not subject to special information treatment according to Act No. 412/2005 Coll., as amended. In case that a person who will use the software under this licence offer violates the licence terms, the permission to use the software terminates.
Here we provide a few sample input files for the purpose of giving the users an idea of how the input files should look like. Basically, the input files should be in FASTA format, containing only non-ambiguous nucleotides (only A, G, T, and C). Each run should include at least 3 input files.
Campylobacter_jejuni_whole_genome
Helicobacter_pylori_whole_genome
Acinetobacter_baumannii_whole_genome
Pseudomonas_aeruginosa_whole_genome
Escherichia_coli_whole_genome
Salmonella_enterica_whole_genome
If you want more samples, don't forget to check out our case studies. They contain more sample files together with their CrocoPhylogeny results.
If you have any questions or comments, do not hesitate to contact us at 455306@mail.muni.cz.
To help in prioritizing the research and development of new and effective antibiotics treatments, the World Health Organization recently issued a global priority list of antibiotic-resistant bacteria considered to have the most substantial impact on global health at the current time. Whereas the phylogeny of prokaryotes is often judged based only on the 16S ribosomal RNA gene... [show more]
Plastids, commonly referred to as chloroplasts, are organelles that perform photosynthesis in algal and plant cells. Plastids retain their own DNA, distinct from that of the cell itself, because they are the descendants of cyanobacteria that evolved within the cytoplasm of heterotrophic single-celled eukaryotes. Plastid genomes vary strongly in length and composition across different species of plants and algae... [show more]
The Y chromosome is primarily responsible for sex determination but also carries other genes. However, the Y chromosome is an enigmatic part of the genome. For example, in humans, the Y chromosome is approximately 60 million bp in length but carries less than 100 protein-coding genes. For comparison, the X chromosome is only 2.5 times bigger (approximately 150 million bp in length) but... [show more]