Fork me on GitHub

NGPhylogeny.fr

Overview

NGPhylogeny.fr is a webservice dedicated to phylogenetic analysis. It provides a complete set of phylogenetic tools and workflows adapted to various contexts and various levels of user expertise. It is built around the main steps of all phylogenetic analyses:

  1. Multiple alignment
  2. Alignment curation
  3. Tree Inference
  4. Tree rendering

Integrated tools:

Different ways of using NGPhylogeny.fr are offered, depending on the user needs or expertise:

  1. Oneclick workflows are already preconfigured with default options that should work on the majority of usecases. The only required input is the sequence data file in Fasta format, and the data type (dna or protein) is detected automatically;
  2. Advanced workflows have basically the same structure as oneclick workflows, but can be parametrized. It means that the user can customize the options of each step of the workflows: alignment, curation, tree inference.
  3. Workflow maker allows the user to choose the combination of tools that suits best his/her needs, and to customize the parameters.
  4. Individual tool may be run if specific taks are required.

Finally, NGPhylogeny.fr provides a user-friendly visualization layer specific to the different kinds of data usually manipulated in phylogenetics (i.e. alignments, trees).

Branch supports

In addition to their respective bootstraps, almost all tree inference tools are proposed with the following branch support computations:

  1. Felsenstein Bootstrap Proportions (FBP);
  2. Transfer Bootstrap Expectation (TBE).

For example, it is possible to compute FBP and TBE supports with FastTree.

Bootstrap options are accessible at "Advanced workflows".

Computations

NGPhylogeny.fr works together with Institut Pasteur Galaxy instance to:

  1. Manage tools and workflows;
  2. Run tools and workflows on the underlying computing cluster;
  3. Keep track of run histories.

Oneclick workflows

One click workflows are accessible via the "Phylogeny Analysis/One click workflow" link on the tool bar:

One click workflows

The 4 oneclick workflows implemented in NGPhylogeny.fr are:

  1. PhyML-SMS: This workflow uses PhyML-SMS to select the best evolutionary model and to infer the trees. However, it may not handle very large datasets, as the tree inference may take a very long time;
  2. PhyML: This workflow uses PhyML to infer trees. Default options depend on data type (dna, protein). Like PhyML-SMS, large datasets may not be analyzed with this workflow;
  3. FastME: This workflow infer trees using FastME. FastME provides distance algorithms to infer phylogenies and can work with large datasets;
  4. FastTree: This workflow runs FastTree to infer trees. "FastTree infers approximately-maximum-likelihood phylogenetic trees from alignments of nucleotide or protein sequences" and works very very large datasets.

Sections below describe these oneclick workflows and all the steps.

PhyML-SMS

PhyML-SMS workflow

Workflow outputs:

PhyML

One click workflows

Workflow outputs:

FastME

One click workflows

Workflow outputs:

FastTree

One click workflows

Workflow outputs:

Default options

MAFFT

Gblocks

PhyML-SMS

PhyML

FastME

FastTree

Advanced workflows

Advanced workflows are accessible via the "Phylogeny Analysis/Advanced workflows" link on the tool bar:

Advanced workflows

Advanced workflows have the same structure as oneclick workflows, but some options can be customized.

In particular, it is possible to perform specific bootstrap analyses:

Build custom workflows

The workflow maker is accessible via the "Phylogeny Analysis/Workflow maker" link on the tool bar:

Workflow Maker

In this mode, users can choose the tool they want to run at each step of the workflow:

  1. Multiple alignment
    • Clustal omega
    • MAFFT
    • MUSCLE
  2. Alignment curation
    • BMGE
    • Gblocks
    • Noisy
  3. Phylogeny
    • FastME
    • FastTree
    • MrBayes
    • PhyML-SMS
    • PhyML
    • TNT
  4. Tree rendering
    • Newick Display

In addition, parameters of each step can be specified.

Launch individual tools

Tools that make up the workflows can be configured and run independently. They are available via the "tools" link on the toolbar:

Workflow Maker

Typical analysis

Once the workflow is configured and launched, user is redirected to a waiting page giving informations about the run:

Workflow Maker

All workflows start by uploading input data to Institut Pasteur Galaxy server. Each step of the workflow is then put in pending status, waiting for available resources on the Galaxy server.

Workflow Maker

Once a step executed, corresponding result files are downloadable or viewable depending on the format. Images, trees, and alignments are viewable through specific viewers. In addition, trees may be uploaded to iTOL for further investigations.