Galaxy Screencasts

If you're new: Six steps to get you started

1. Finding promoters containing TAF1 binding sites identified from a CHiP-seq experiment.



Suppose you performed a CHiP-seq experiment and identified a series of genomic regions that bind TAF1-protein. You now need to identify a list of genes that contain such sites. This can be easily done with Galaxy in just a few steps. To run this example yourself you will need a file containing genomic coordinates for TAF1- binding sites from the CHiP-seq experiment (an example file can be downloaded from here.).
Finding promoters containing TAF1 binding sites identified from a CHiP-seq experiment

2. Finding Exons with the highest number of nucleotide polymorphisms.



In this protocol we will work with two sets of genomic features: exon coordinates and single nucleotide polymorphism (SNP) coordinates. The objective will be to identify exons that contain the highest number of SNPs. Even if you are not particularly interested in exons and SNPs, we encourage you to follow along as the real goal of this protocol is to introduce you to joining, grouping, sorting, and filtering in Galaxy.
Finding Exons with the highest number of nucleotide polymorphisms

3. Saving your results and sharing data with others.



How can you ensure that the analyses you have just conducted are safely stored and that you are able to go back to them at anytime? You will need to create a free account within Galaxy. This is the only requirement to save your analyses. The protocol below explains this and also introduces sharing analyses with colleagues.
Saving your results and sharing data with others

4. Generating a workflow from a history.



Screencasts 1 and 2 within this series demonstrate interactive analysis in Galaxy, the result is a history which documents each step of an analysis. Galaxy also allows you to construct reusable multi-step analysis workflows. In this protocol, we will demonstrate the creation of a workflow from an existing analysis history.
Generating a workflow from a history

5. Generating workflows from scratch.



In addition to creating workflows from existing histories, Galaxy allows us to create a workflow from scratch. In this protocol we will construct a simple workflow that finds the 50 longest intervals from a dataset in a 6 column BED file.
Generating workflows from scratch

6. Extracting Sequences and alignments: A SNPs in exons example.



Here we demonstrate how Galaxy is used to extract genomic sequences and multiple species alignments corresponding to regions of interest. In this protocol, we start with the data that we generated in Screencast 2, where we found human coding exons with high SNP counts. We will extract genomic sequences for our regions in FASTA format as well as obtain corresponding alignment blocks from multiple species whole genome alignments in their native format and also create a pseudo-global alignment in FASTA format, which is suitable for use in existing analysis software.
Extracting Sequences and alignments: A SNPs in exons example

Metagenomics with Galaxy: From 454 reads to phylogeny

1. Processing and analysis of 454 data in metagenomic studies.



Here we show the utility of Galaxy in metagenomic applications. We start with a series of 454 reads, assess their quality, select high quality segments, conduct blast searches, and process taxonomic data. All in a single screencast.
Processing and analysis of 454 data in metagenomic studies

2. Building a complete metagenomic workflow.



This example converts the analysis we covered in Screencast 1 into a workflow that can be applied to any 454 dataset or modified to other types of next-gen platforms.
Building a complete metagenomic workflow

Introduction and Examples


A picture is worth a thousand words, and by the same token a screencast is worth a million. This page lists screencasts highlighting fundamental aspects of Galaxy's functionality. For even older screencasts click here.


Galaxy is developed by Galaxy Team and funded by NSF, Penn State and the Huck Institutes of the Life Sciences. It is free for all.