Software engineer and sysadmin at the OME since 2012
A consortium of universities, research labs, industry and developers producing open-source software and standards for microscopy data.
Don't build new databases from scratch, instead re-use and build upon existing work.
Data should be
Findable
Accessible
Interoperable
Reusable
No it's not
Yes it is!
Used by 100s of institutions around the world
↳ At the time University of Dundee server held around 12 TB data
Everything done manually
$ ssh idr.server
# yum install java-1.8.0-openjdk
# yum install python-{pip,devel,virtualenv,yaml,jinja2,tables}
# ...
It works... for a bit
Make sure all layers of your stack are reliable
OpenStack at EMBL-EBI with Ansible:
One command can provision new servers, configure networking and storage, and install the IDR, reproducibly: IDR/deployment
It works... for a bit
We have a scaling problem, OMERO just wasn't designed for the amount of data and frequency of access. These problems only occur on a big system like the IDR.
Sebastien Besson
Jean-Marie Burel
Mark Carroll
David Gault
Riad Gozim
Simon Li
Dominik Lindner
Melissa Linkert
Josh Moore
Will Moore
Petr Walczysko
Frances Wong
QA/Tester
Curation: A critical factor in the success of the IDR (remember: FAIR)
Jason Swedlow
Sebastien Besson
Jean-Marie Burel
Mark Carroll
David Gault
Riad Gozim
Simon Li
Dominik Lindner
Melissa Linkert
June Matthew
Josh Moore
Will Moore
Petr Walczysko
Frances Wong
Rafael Carazo-salas
Alvis Brazma
Ugis Sarkans
Simon Jupp
Tony Burdett
Aleksandra Tarkowska
Anatole Chessel
Richard Ferguson
Helen Flynn
Kenny Gillen
Roger Leigh
Simone Leo
Gabriella Rustici
Eleanor Williams
Former
members