Refining Genome Annotations with Apollo



PURL: gxy.io/GTN:S00066

1 / 15

Requirements

Before diving into this slide deck, we recommend that you have a look at:

2 / 15

Questions

  • How to visualize your genome after automated annotations have been performed?

  • How to manually annotate a genome after automated annotations have been performed?

  • How to evaluate and visualize annotated genomic features?

  • How do I collaborate when doing genome annotation?

3 / 15

Objectives

  • Load a genome into Galaxy

  • View annotations in JBrowse

  • Learn how to load JBrowse data into Apollo

  • Learn how to manually refine genome annotations within Apollo

  • Export refined genome annotations

4 / 15

Genome annotation

  • Classifies and describes parts of the genome sequence

  • Focusing on genes: their positions, names and functions

  • Automatic methods to annotate a newly sequenced and assembled genome

    • Prokaryotes: Prokka
    • Eukaryotes: Maker, Braker2, Augustus

JBrowse screenshot showing several genes aligned along a genome

5 / 15
  • Annotating a genome means positioning features along the sequence of a genome.
  • Here we focus on genes.
  • We are interested in their positions, and in functional annotation such as their name or their function.
  • Many software tools and pipelines exist to automatically perform the annotation of a new genome sequence.
  • For example, Prokka and Maker are widely used, and tutorials are available on the GTN to learn how to use them.

Automatic annotations are not perfect

  • Assembly errors can cause fragmented annotations

  • Limited and/or noisy evidence coverage makes precise identification difficult

Apollo screenshot showing how RNASeq reads align mostly within some exons limits, but not perfectly

6 / 15
  • Unfortunately, automatic annotations are rarely perfect.
  • Assembly errors can cause fragmented annotations.
  • Annotation tools have difficulty annotating genes when evidence such as RNASeq is lacking or too noisy.

Manual curation refines genome annotations

  • Make use of the researcher's expertise

  • Include additional analysis results

  • Integrate all underlying evidence

  • A human finds problems algorithms can't

Schema showing how automated annotation, experimental evidences (cDNAs, HMM domain searches, RNASeq, similarity with other species), and human analysis are used by Apollo to manually curate an annotation

Apollo screenshot showing how RNASeq reads align mostly within some exons limits, but not perfectly

7 / 15
  • Performing a manual curation step after automatic annotation improves annotation quality.
  • It is a way to make use of a researcher's expertise, and to integrate more knowledge to improve gene structures and functional annotation.

Apollo

"Google Docs for Genome Annotation"

Screenshot of Apollo in Galaxy. Apollo takes up most of the space in the center iframe, with the history shown on the right.

Collaborative web application to edit annotation

Both structural and functional annotation

Based on JBrowse: JBrowse is for visualising only, Apollo adds editing

8 / 15
  • Apollo is a web application specifically designed for the manual curation of annotations.
  • You can see Apollo as the "Google Docs" of annotation.
  • It is a multi-user, real time, collaborative environment.
  • Users can easily change the structure of genes, and add functional annotation like gene names or Gene Ontology terms.
  • It is based on JBrowse.
  • JBrowse is for visualising genomes, while Apollo adds the editing capabilities.

Apollo

  • Modify gene structures (based on evidence tracks)

  • Add functional annotation

Animation showing how the limit of an exon can be changed within Apollo by a simple drag-and-drop

Screenshot of the Apollo form allowing to edit functional annotation

9 / 15
  • Within Apollo, you can easily change the structure of genes, based on evidence tracks.
  • Apollo offers an interactive interface to help you add functional annotation.

Apollo

Multi-user environment: users can log in, and make/see changes in real time

Full history of changes, reverting is possible

Screenshot of the history panel, with a list of actions and possibility to revert

10 / 15
  • Apollo is a multi-user environment, which means multiple users can log in at the same time, make changes, and see changes of other users in real time.
  • Apollo keeps track of every action performed on each gene in a history. You can revert any change at any time.

Apollo in Galaxy

Integrate Apollo in your whole analysis project in Galaxy

  1. Run assembly
  2. Align evidence along the genome (RNASeq, CDS, proteins)
  3. Run automatic annotation pipeline
  4. Create a JBrowse instance
  5. Load the JBrowse instance into Apollo
  6. Use Apollo to manually curate annotation with collaborators
  7. Export the curated annotation back into Galaxy
  8. Continue analysing in Galaxy

Available on https://usegalaxy.eu

11 / 15
  • Apollo is now tightly integrated with Galaxy, which lets you integrate the manual curation process into a complete Galaxy-based analysis workflow.
  • Within Galaxy, you can start by assembling a new genome, then aligning as many evidence tracks as possible (like RNASeq data).
  • You can then continue with an automatic annotation step, with Prokka or Maker for example.
  • Then you can generate a JBrowse instance representing all the data you have generated, and load it into Apollo.
  • Apollo lets you perform the manual curation, in collaboration with any colleague you invite to help you.
  • Finally when you are satisfied with your changes, you can export them to your Galaxy history, and continue other analyses.
  • At any time later you can come back to Apollo to continue improving the annotation, based on new experiment results.
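
Once the curated annotation is exported back into your Galaxy history, a quick sanity check is to count the features it contains. The sketch below is only an illustration, not part of Apollo or Galaxy: it assumes the export is a standard GFF3 file (nine tab-separated columns, feature type in the third), and the function name `count_feature_types` and the example records are hypothetical.

```python
from collections import Counter


def count_feature_types(gff3_text):
    """Count features per type (third column) in GFF3-formatted text."""
    counts = Counter()
    for line in gff3_text.splitlines():
        line = line.strip()
        # An optional embedded sequence section starts at "##FASTA"; stop there.
        if line == "##FASTA":
            break
        # Skip blank lines, directives ("##...") and comments ("#...").
        if not line or line.startswith("#"):
            continue
        columns = line.split("\t")
        # A GFF3 feature line has exactly nine tab-separated columns.
        if len(columns) != 9:
            continue
        counts[columns[2]] += 1
    return counts


# Hypothetical records, made up for illustration only.
example = "\n".join([
    "##gff-version 3",
    "chr1\texample\tgene\t100\t900\t.\t+\t.\tID=gene1;Name=demo",
    "chr1\texample\tmRNA\t100\t900\t.\t+\t.\tID=mrna1;Parent=gene1",
    "chr1\texample\texon\t100\t400\t.\t+\t.\tID=exon1;Parent=mrna1",
    "chr1\texample\texon\t600\t900\t.\t+\t.\tID=exon2;Parent=mrna1",
])

for feature_type, n in sorted(count_feature_types(example).items()):
    print(feature_type, n)
```

Comparing these counts before and after a curation session gives a rough idea of how many genes or exons were added, merged or removed.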

Apollo in Galaxy: collaborating

Invite any other Galaxy user to your curation project

Screenshot of the dialog allowing to share the access to other users

12 / 15
  • By default, you are the only one with access to the genomes you load into Apollo.
  • A specific module of the Apollo-Galaxy bridge allows you to grant any other Galaxy user access to a genome.

Hands on!

Time to follow the Apollo tutorial!

13 / 15
  • You can now follow the Apollo tutorial to learn how to use it in real life.

Key points

  • Apollo is the Google Docs of the genome annotation world: real-time collaborative genome annotation.

  • Apollo allows a group to view and manually refine predicted genome annotations

  • Use Apollo to edit annotations within your group.

  • Export manual annotations as GFF3.

14 / 15

Thank You!

This material is the result of a collaborative work. Thanks to the Galaxy Training Network and all the contributors!


Tutorial Content is licensed under Creative Commons Attribution 4.0 International License.

15 / 15
