Mikel Egaña Aranguren
http://mikeleganaaranguren.com / mikel.egana.aranguren@gmail.com
Biological Informatics Group (http://wilkinsonlab.info), CBGP, UPM, Madrid
GitHub repo (slides, data, documentation, ...): https://github.com/mikel-egana-aranguren/Galaxy_tutorial
GitHub page (this slides): http://mikel-egana-aranguren.github.io/Galaxy_tutorial/galaxy.html
A web server that offers the usual bioinformatics tools in a central space with ...
... data storage
... history
... workflows
Very complex computational analyses
... easily
... with provenance
Reproducible science!
Rule 1: For Every Result, Keep Track of How It Was Produced
Rule 5: Record All Intermediate Results, When Possible in Standardized Formats
Rule 9: Connect Textual Statements to Underlying Results
Rule 10: Provide Public Access to Scripts, Runs, and Results
Galaxy main server (Limitations on data size, computational resources): https://usegalaxy.org/
Our own server at CBGP: http://biordf.org:8983/
Galaxy documentation: http://wiki.galaxyproject.org
http://usegalaxy.org/u/aun1/p/galaxy101
What genes of Arabidopsis thaliana are annotated against the "cell cycle" (GO:0007049) subtree of the Gene Ontology (GO)?
get data; upload file
data/gene_ontology.1_2.obo
(GO)data/gene_association.tair
(GAF)OBO ontology manipulation; Get the descendent terms of a given OBO term
to obtain the cell cycle subtree from GOUse the Workflow (import and run) or the history
You can make workflows and histories public with a URL, share with other users, and import/export them
You can also create a Galaxy page
Usually a good idea to install Galaxy locally, even just for yourself:
... provenance when you sit down and write the paper
... sensitive data
... install/develop other Galaxy tools
Some UNIX flavour (GNU/Linux, MAC OS X*)
Python
Mercurial (hg
)
hg clone https://bitbucket.org/galaxy/galaxy-dist/
cd galaxy-dist
hg update stable
Run:
./run.sh
nohup ./run.sh &
Open http://127.0.0.1:8080/ and do your thing
Stop (history, workflows etc. will be stored but execution will interrupt):
CTRL-C
kill -9 PID
hg incoming
hg pull -u
Add yourself as admin (add admin_users = your_email
to universe_wsgi.ini
)
Restart, log in and install through admin; tool sheds
or install manually* (Very basic XML and UNIX skills needed)
Feel free to use http://biordf.org:8983/ (No security/performance warranty though ;-)
If you need a tool and you don't find it in the tool shed, let us know and perhaps we can develop it (specially if there is a CLI version of the tool and/or the tool is interesting).
This presentation was produced using Reveal.js