Skip to content

Ansible playbooks for deploying BCF-managed Galaxy instances on UoM VMs

Notifications You must be signed in to change notification settings

pjbriggs/ansible-palfinder-galaxy

Repository files navigation

ansible-palfinder-galaxy

ansible playbook and roles for deploying BCF Galaxy instances on virtual machines at the University of Manchester:

  • pal_finder: a public instance for running Pal_finder
  • centaurus: a local instance for researchers

The roles are set up to target Galaxy version 22.05.

Roles

The following roles are defined:

  • galaxy-user: creates the Galaxy user and group
  • python3: builds and installs Python 3 from source
  • nginx: installs Nginx
  • postgresql: installs and configures PostgreSQL
  • postfix-null-client: installs and configures Postfix as a 'null client'
  • lets-encrypt-client: installs the Let's Encrypt client cert-bot
  • jsedrop: installs a local JSE-Drop service
  • galaxy: install and configure a Galaxy instance:
    • Install Galaxy dependencies
    • Install Galaxy-specific Python
    • Set up database
    • Clone and configure specified Galaxy version
    • Uploads welcome, terms and citation pages, (plus any additional static content)
    • Set up cron jobs to purge histories and datasets
    • Set up log rotation
    • Set up Nginx proxy
    • Installs customised tool_conf.xml
    • (Optionally) set up the JSE-drop job runner plugin
    • (Optionally) set up custom colour scheme via SCSS
    • (Optionally) sets up automatic SSL certificate renewal
  • galaxy-utils: installs utility scripts for Galaxy user creation, tool installation etc
  • galaxy-create-users: creates user accounts in the Galaxy instance, as specified in the galaxy_users variable
  • galaxy-install-tools: installs tools from the main toolshed, specified in the galaxy_tools variable
  • galaxy-add-library-data: uploads files to a data library (creating the library first if necessary), as specified in the galaxy_library_datasets variable
  • galaxy-set-default-quota: set the default quota for the Galaxy instance
  • galaxy-auto-delete-datasets: configures automatic deletion of old datasets
  • galaxy-audit-report: sets up weekly emailing of audit reports
  • export-galaxy-for-cluster: installs Galaxy into a Python virtualenv which is then exported for use when submitting jobs to the local cluster system

Variables

Key variables:

  • galaxy_name: name for the Galaxy instance (NB this is also used as the name for any instance-specific configuration files, and for naming processes etc)
  • galaxy_version: version of Galaxy to install
  • galaxy_install_dir: top-level directory to use; by default Galaxy will be installed under ${galaxy_install_dir}/${galaxy_name}

Webserver and proxying:

  • galaxy_server_name: URL for the Galaxy web service

  • galaxy_http_port: port to communicate with Galaxy via (default: 8080)

  • enable_https: if yes then serve Galaxy via HTTPS; this also requires: - ssl_certificate: points to the fullchain.pem certificate

    file, and

    • ssl_certificate_key: points to the privkey.pem file

Admin user:

  • galaxy_admin_user: admin account email (default: admin@galaxy.org)
  • galaxy_admin_passwd: password for admin account (default: galaxyadmin)

Database passwords:

  • galaxy_db_password: password for Postgresql database (default: same name as the database user)

Gunicorn settings:

  • galaxy_gunicorn_workers: (default: 4)
  • galaxy_gunicorn_socket: socket for Galaxy to use to communicate with Gunicorn (default: 4001)

Job runner configuration:

  • default_job_runner: the default job runner to use (default: local)
  • enable_jse_drop: if true then enables the use of the JSE-drop job runner mechanism, and creates a runner definition jse_drop in job_conf.xml (default: not enabled; see separate section for more details of using JSE-Drop)
  • galaxy_job_destinations: a list where each item should be a dictionary defining a job destination to be added to the destinations section of job_conf.xml (default: no job destinations are defined)
  • galaxy_tool_destinations: a list where each item should be a dictionary defining a tool destination to be added to the tools section of job_conf.xml (default: no tool destinations are defined)

Dependency resolvers:

  • galaxy_dependency_resolvers: a list where each item should be a dictionary defining a dependency resolver to to be added to dependency_resolvers.xml (default: no resolvers are defined)

Custom colour scheme:

Static status page:

  • galaxy_generate_status_page: if true then sets up a cron job to run the gx_monitor.py utility to generate a status.html file in Galaxy's static directory and update it every minute. This page then can be accessed to give a basic overview of jobs and disk usage (default: status page is not enabled).

Other configuration settings:

  • default_quota_gb: quota in Gb for registered users (default: 25Gb)
  • email_audit_reports_to: list of space-separated email to send weekly audit reports to (default: don't send reports to anyone)
  • galaxy_clean_up_cron_interval: sets the time interval (in days) before files, links and directories are removed from the job working directory (and JSE-Drop directory, if in use) (default: 28 days)

Tools:

  • galaxy_tools: list of tools to install from the main Galaxy tool shed, with each tool defined as a dictionary with the keys tool, owner and section (specifies the tool panel section to add the tool to; if this is an empty string then the tool will appear outside any sections) (default: don't install any tools from the tool shed)
  • local_galaxy_tools: list of tools to be added locally, with each tool defined as a dictionary with the keys name and tool_files (a list of files).

Tool data tables:

  • galaxy_tool_data_tables: list of entries to append to the standard tool_data_tables_conf.xml file, with each entry defined as a dictionary with the keys description, name, columns and file_path (default: don't append any entries to tool_data_tables_conf.xml)

Reference data (.loc file contents):

  • galaxy_loc_file_data: lines of reference data to add to .loc files; for each .loc file the entries are defined as a dictionary with the keys loc_file (target .loc file) and data (list of lines of data to be inserted into the file) (default: don't add any reference data entries to .loc files)

Variables for handling special cases:

  • galaxy_python_dir: location to install Galaxy-specific version of Python (this is required for example if the default installation of Python isn't accessible across compute cluster nodes) (default: install Galaxy-specific Python in a python/VERSION directory parallel to the Galaxy code cloned from GitHub)

Playbooks

  • palfinder.yml: playbook for setting up the Palfinder Galaxy instance
  • centaurus.yml: playbook for setting up the Centaurus Galaxy 'production' and 'devel' instance

Nb the playbooks include the passwords for the various accounts in the palfinder_passwds.yml file, which have been encrypted using ansible-vault - use:

ansible-vault edit palfinder_passwds.yml

to edit (use the view command just to see the contents).

Use the --ask-vault option to prompt for the encryption password when running the playbook.

In addition there is a playbook export_galaxy_for_cluster.yml which is used to install Galaxy into virtualenvs which can then be installed on the local cluster system for running Galaxy jobs in the production environment (see "Building Galaxy virtualenvs for the cluster system" below).

Inventory files

Inventory files for various deployment environments are included under the inventories subdirectory, for each of the Galaxy instances defined in this repository:

  • inventories/palfinder/: contains inventory files for the Palfinder service
  • inventories/centaurus/: contains inventory files for the Centaurus service

For Palfinder, each subdirectory has two inventory files:

  • production.yml: inventory for the production instance of the service
  • vagrant.yml: inventory for local testing of the service with Vagrant

For Centaurus, there are four inventory files:

  • production.yml: main production instance
  • devel.yml: test instance
  • vagrant-production: local Vagrant version of the production instance
  • vagrant-devel: local Vagrant version of the test instance

These inventories are intended to be used as an alternative to the central inventory file (typically /etc/ansible/hosts).

To explicitly specify which inventory to target for a playbook run, use the -i option e.g.:

ansible-playbook palfinder.yml -i inventories/palfinder/production.yml

will target the production Palfinder service instance.

Running the playbooks

You must pass in the hosts that the playbooks will be run on via the ansible-playbook command line, for example:

ansible-playbook palfinder.yml [ -b ] [ -u USER ] [ --ask-vault ] [ -i INVENTORY ]

Testing using Vagrant

The repo includes a Vagrantfile which can be used to create virtual machines for testing the deployment.

The following servers are defined in the Vagrantfile:

An additional VM is used to build Galaxy virtual environment for deployment on the compute cluster:

  • csf: CentOS 7.8 (http://192.168.60.8) - see below ("Building Galaxy virtualenvs for the cluster system")

To create and log into a Vagrant VM instance for testing Palfinder do e.g.:

vagrant up palfinder
vagrant ssh palfinder

Use the Vagrant-specific inventory file to test locally (note that these are not as fully-featured as the production versions), e.g.:

ansible-playbook palfinder.yml -i inventories/palfinder/vagrant.yml

Point your browser at the appropriate address to access the local test instance once it has been deployed.

Note

For centaurus the Vagrant VM is aliased as

centaurus.hosszu.lan

and this can be added to the /etc/hosts file on the host machine, so that the browser can be pointed to this address (instead of 192.168.60.3) for testing.

(See e.g. https://www.tecmint.com/setup-local-dns-using-etc-hosts-file-in-linux/ for details of how to modify /etc/hosts.)

Building Galaxy virtualenvs for the cluster system

For some production instances where jobs are submitted to the cluster system, there can be issues when the Galaxy VM OS is substantially different to that of the cluster.

In these cases a workaround is to build a Galaxy virtualenv that is installed on the cluster and which is used by the jobs submitted to it; the export_galaxy_for_cluster.yml playbook can be used to build Galaxy virtualenvs on a CentOS 7 Vagrant box for this purpose.

The inventory files in inventories/csf/ target specific production Galaxy instances; to generate a Galaxy virtualenv for the centaurus instance do e.g.:

ansible-playbook export_galaxy_for_cluster.yml -b -i inventories/csf/centaurus.yml

This will generate a .tgz archive in the assets directory, which will contain the Galaxy virtualenv to be unpacked and used on the target VM.

Note

If using the JSE-drop job submission mechanism then the galaxy_jse_drop_virtual_env also needs to be set in the playbooks to point to the unpacked virtual environment to be used.

Migrating Galaxy server to a new VM

These notes are for migrating a Galaxy server where the Galaxy source code and the database, shed tools and tool dependency directories, are all on shared drives on the old VM which can be remounted on the new VM with the same paths.

In this case the gx_dump_database.py utility can be used to get an SQL dump of the Postgres Galaxy database on the old VM, e.g.:

gx_dump_database.py -c /PATH/TO/galaxy.yml -o galaxy_db.sql

When the playbook for the server is executed for the first time targetting the new VM, then the Postgres Galaxy database can be initialised with the SQL dump from the old one by specifying the path to the .sql file via the galaxy_new_db_sql parameter.

Note

The SQL file should be on the remote machine (where Galaxy is installed), not the local one (where the playbooks are being run from).

conda can also be reinstalled while preserving any existing environments that were installed on the old VM, by setting the galaxy_reinstall_conda parameter to true.

If the new VM is a different OS to the old one then it's also recommended to force reinstallation of the Galaxy-specific Python and the Galaxy virtual environment, by specifiying:

galaxy_force_reinstall_python: yes
galaxy_force_reinstall_venv: yes

Finally, it may also be a good idea to refresh the compiled Mako templates (especially if upgrading to a new Galaxy release or Python version) - this can be done automatically by specifying:

galaxy_remove_mako_templates: yes

JSE-Drop job submission configuration

Deployments can make use of a novel job submission system called "JSE-drop", which has been developed and implemented at Manchester by the Research IT team.

JSE-Drop provides file-based communication with the SGE compute cluster and is intended to separate the Galaxy VMs (which are accessible via the web) from the cluster. Scripts are placed in a 'drop directory' and the JSE-Drop service then submits these to the cluster, monitors the resulting jobs, and writes back files with status and completion information.

To enable the plugin for JSE-Drop:

  • Set the enable_jsedrop parameter to yes
  • The 'drop directory' that JSE-drop will use is set via the galaxy_jse_drop_dir parameter.

In addition the following options can be set:

  • By default jobs will use the same Python virtual environment as the Galaxy installation; this can be changed by specifying the galaxy_jse_drop_virtual_env parameter (this is necessary at Manchester as the cluster uses a different OS to the Galaxy VMs)
  • An optional identifier can be inserted into job names by setting the galaxy_jse_drop_galaxy_id parameter.

For each JSE-drop job destination there are additional parameters:

  • Set the number of slots (i.e. cores) used for running by specifying the jse_drop_slots parameter (defaults to 1 slot if not specified).
  • Options to use with qsub when submitting jobs can be specified via the jse_drop_qsub_options parameter.

A reference implementation of a local JSE-drop service can be installed using the jsedrop role. This intended for testing purposes only and should not be deployed on a production server.

Using mamba instead of conda for dependency resolution

mamba is a drop-in replacement for conda (see https://mamba.readthedocs.io/en/latest/index.html). In the past mambas has been recommended an alternatiev as in some cases was able to resolve dependencies that conda failed on.

From Galaxy 22.05 the galaxy role has been updated to (re)install conda using Miniforge3; this includes mamba by default, and also both share the some resolver. So there seems to be less obvious benefits to using mamba.

However: to specify mamba for dependency resolution, set the galaxy_conda_use_mamba parameter to yes.

Notes on the deployment

  • Python is installed under /usr/local by default, this can be changed via the python_install_dir parameter. This Python installation is used by other system software.

    By default this is also the Python installation used by Galaxy, however it is possible to specify a separate Python installation for Galaxy via the galaxy_python_dir parameter (for example if this needs to be accessible from other systems such as a compute cluster).

  • To remove the Galaxy database and user from PostgreSQL, become the postgres user, start the psql console application and do:

    DROP DATABASE galaxy_palfinder;
    DROP ROLE galaxy;
    
  • The following ports need to be open for various services:

    • 80: HTTP access
    • 443: HTTPS access
    • 25: outgoing email
  • To enable TLS/SSL access (i.e. use HTTPS rather than HTTP) set the enable_https variable.

    Note that you will also need SSL certificate files. You can create a dummy certificate using /etc/ssl/certs/make-dummy-cert; if this is named after the server in the /etc/ssl/certs/ directory then it will used by default; set the ssl_certificate and ssl_certificate_key variables to specify the location of the certificate files explicitly.

Vagrant Boxes

The following Vagrant VirtualBox images are recommended for use with the playbooks:

To install a VirtualBox image for use with Vagrant, do:

vagrant box add --name NAME URL

For example:

::
vagrant box add --name centos/7 https://app.vagrantup.com/centos/boxes/7/versions/2004.01/providers/virtualbox.box

Known Issues

  • Tool installation can timeout or fail in which case it will need to be completed manually.

  • SSH keys can change when recreating a Vagrant VM for testing, in which case you should use e.g. ssh-keygen -R "192.168.60.5" (or the IP address of the appropriate instance, see above) to remove the old keys before running the playbooks.

  • Vagrant/VirtualBox may complain about the VM name being too long (see e.g. hashicorp/vagrant#9524), in this case uncomment the line:

    ::

    v.name = "galaxyvm"

    in the Vagrantfile.

About

Ansible playbooks for deploying BCF-managed Galaxy instances on UoM VMs

Resources

Stars

Watchers

Forks

Packages

No packages published