NGPhylogeny.fr

Documentation

Overview

NGPhylogeny.fr is a webservice dedicated to phylogenetic analysis. It provides a complete set of phylogenetic tools and workflows adapted to various contexts and various levels of user expertise. It is built around the main steps of most phylogenetic analyses:

NGPhylogeny.fr integrates several tools for each steps of the workflow:

Different ways of using NGPhylogeny.fr are offered, depending on the user needs or expertise:

Oneclick workflows are already preconfigured with default options that should work on the majority of usecases. The only required input is the sequence data file in Fasta format. Input data type (dna or protein) is detected automatically;
Advanced workflows have basically the same structure as oneclick workflows, but can be parametrized. It means that the user should customize the options of each step of the workflows: alignment, curation, tree inference.
Workflow maker allows the user to choose the combination of tools that suits best his/her needs, and to customize the parameters.
Individual tools may be run if specific taks are required.

Moreover, NGPhylogeny.fr provides a user-friendly visualization layer specific to the different kinds of data usually manipulated in phylogenetics (i.e. alignments, trees).

Finally, Blast-Search module is placed upstream phylogenetic workflows and aims at searching for sequences that are similar to a given user input sequence. Blast-Search then analyses Blast results and builds a quick (and inaccurate) tree in which users can remove unwanted sequences. Remaining sequences may then be used as input of any ngphylogeny.Fr workflows.

Branch supports

In addition to their respective bootstraps, almost all tree inference tools are proposed with the following branch support computations:

Felsenstein Bootstrap Proportions (FBP);
Transfer Bootstrap Expectation (TBE).

For example, it is possible to compute FBP and TBE supports with FastTree.

Bootstrap options are accessible via "Advanced workflows" and "Workflow maker".

Computations

NGPhylogeny.fr works together with Institut Pasteur Galaxy instance to:

Manage tools and workflows;
Run tools and workflows on the underlying computing cluster;
Keep track of run histories.

Oneclick workflows

One click workflows are accessible via the "Phylogeny Analysis/One click workflow" link on the tool bar:

One click workflows

The 4 oneclick workflows implemented in NGPhylogeny.fr differ by the tree inference tool:

PhyML+SMS: This workflow uses PhyML+SMS to select the best evolutionary model and to infer the trees. However, it may not handle very large datasets, as the tree inference may take a very long time. SH-like aLRT branch supports are computed by default;
PhyML: This workflow uses PhyML to infer trees. Default options depend on data type (dna, protein). Like PhyML+SMS, large datasets may not be analyzed with this workflow;
FastME: This workflow infer trees using FastME. FastME provides distance algorithms to infer phylogenies and can work with large datasets;
FastTree: This workflow runs FastTree to infer trees. "FastTree infers approximately-maximum-likelihood phylogenetic trees from alignments of nucleotide or protein sequences" and works for very large datasets. Local branch supports (SH) are computed by default by FastTree.

Sections below describe these oneclick workflows and all the steps.

PhyML+SMS

PhyML+SMS workflow

Workflow outputs:

MAFFT:
- Alignment (FASTA)
- Guide Tree (TXT)
- Output logs (TXT
BMGE:
- Cleaned sequences Html (HTML)
- Cleaned sequences Nexus (NEXUS)
- Cleaned sequences Fasta (FASTA)
- Cleaned sequences Phylip (PHYLIP)
PhyML+SMS:
- Output logs (TXT)
- Output tree (NEWICK)
- SMS model comparison (TXT)
- SMS best model (TXT)
Newick Display
- Tree image (SVG)

PhyML

One click workflows

Workflow outputs:

MAFFT:
- Alignment (FASTA)
- Guide Tree (TXT)
- Output logs (TXT
BMGE:
- Cleaned sequences Html (HTML)
- Cleaned sequences Nexus (NEXUS)
- Cleaned sequences Fasta (FASTA)
- Cleaned sequences Phylip (PHYLIP)
PhyML:
- Output logs (TXT)
- PhyML statistics (TXT)
- Output tree (NEWICK)
Newick Display
- Tree image (SVG)

FastME

One click workflows

Workflow outputs:

MAFFT:
- Alignment (FASTA)
- Guide Tree (TXT)
- Output logs (TXT
BMGE:
- Cleaned sequences Html (HTML)
- Cleaned sequences Nexus (NEXUS)
- Cleaned sequences Fasta (FASTA)
- Cleaned sequences Phylip (PHYLIP)
FastME:
- Output logs (TXT)
- Distance Matrix (TXT)
- Output tree (NEWICK)
Newick Display
- Tree image (SVG)

FastTree

One click workflows

Workflow outputs:

MAFFT:
- Alignment (FASTA)
- Guide Tree (TXT)
- Output logs (TXT
BMGE:
- Cleaned sequences Html (HTML)
- Cleaned sequences Nexus (NEXUS)
- Cleaned sequences Fasta (FASTA)
- Cleaned sequences Phylip (PHYLIP)
FastTree:
- Output logs (TXT)
- Output tree (NEWICK)
Newick Display
- Tree image (SVG)