Software engineer and sysadmin at the OME since 2012
A consortium of universities, research labs, industry and developers producing open-source software and standards for microscopy data.
Don't build new databases from scratch, instead re-use and build upon existing work.
Data should be
Findable
Accessible
Interoperable
Reusable
No it's not
Yes it is!
Used by 100s of institutions around the world
↳ At the time University of Dundee server held around 12 TB data
Everything done manually
$ ssh idr.server
# yum install java-1.8.0-openjdk
# yum install python-{pip,devel,virtualenv,yaml,jinja2,tables}
# ...
It works... for a bit
Make sure all layers of your stack are reliable
OpenStack at EMBL-EBI with Ansible:
One command can provision new servers, configure networking and storage, and install the IDR, reproducibly: IDR/deployment
It works... for a bit
We have a scaling problem, OMERO just wasn't designed for the amount of data and frequency of access. These problems only occur on a big system like the IDR.
Curation: A critical factor in the success of the IDR (remember: FAIR)
Former
members