We note with deep sadness the passing of Andreas Förster of DECTRIS, who was instrumental in the organization of the HDRMX effort and the Gold Standard workshops in particular. He will be missed.
IUCr XXV in Prague, CZ was to include a hybrid Workshop on MX raw image data formats, metadata and validation, but circumstances resulted in our converting it to a purely virtual e-meeting via Zoom on Saturday, 14 August 2021, from 9 am until 3 pm Prague time (GMT+2), which is 1 hour later than London time (8 am until 2 pm) and 6 hours later than New York City time (3 am until 9 am). This is the agenda.
The most important background information for this workshop is the Gold Standard paper, [Herbert J. Bernstein, Andreas Förster, Asmit Bhowmick, Aaron S. Brewster, Sandor Brockhauser, Luca Gelisio, David R. Hall, Filip Leonarski, Valeria Mariani, Gianluca Santoni, Clemens Vonrhein, Graeme Winter, "Gold Standard for macromolecular crystallography diffraction data," IUCrJ 7, no. 5 (2020)] https://doi.org/10.1107/S2052252520008672
The registered participants were:
name | affiliation | email |
---|---|---|
Oskar Aurelius | MAX IV Laboratory | oskar dot aurelius at maxiv dot lu dot se |
Frances C. Bernstein | Bernstein + Sons | fcb at bernstein-plus-sons dot com |
Herbert J. Bernstein | Ronin Institute for Independent Scholarship, c/o NSLS-II BNL | yayahjb at gmail dot com |
Aaron Brewster | LBL | asbrewster at lbl dot gov |
Max Burian | DECTRIS | max dot burian at dectris dot com |
Tom Caradoc-Davies | Australian Synchrotron -- ANSTO | thomasc at ansto dot gov dot au |
Daniel Eriksson | Australian Synchrotron -- ANSTO | daniel dot eriksson at ansto dot gov dot au |
Diego Gaemperle | DECTRIS | diego dot gaemperle at dectris dot com |
Richard Gildea | Diamond Light Source | richard dot gildea at diamond dot ac dot uk |
David Hall | Diamond Light Source | david dot hall at diamond dot ac dot uk |
John Helliwell | School of Chemistry, The University of Manchester, UK | john dot helliwell at manchester dot ac dot uk |
Mohamed Ibrahim | Humboldt University | mohamed dot ibrahim at hu-berlin dot de |
Natalie Johnson | CCDC (Cambridge Crystallographic Data Centre) | njohnson at ccdc dot cam dot ac dot uk |
Maria Ksiażek | University of Silesia in Katowice (Poland) | maria dot ksiazek at us dot edu dot pl |
Marco Klepoch | Institute of Biotechnology, AS CR, Centre of Molecular Structure | marco dot Klepoch at ibt dot cas dot cz |
Edwin Lazo | NSLS-II, BNL | elazo at bnl dot gov |
Filip Leonarski | PSI | filip dot leonarski at psi dot ch |
Brian McMahon | IUCr | bm at iucr dot org |
Jie Nan | MAX IV Laboratory | jie dot nan at maxiv dot lu dot se |
Ezequiel (Zac) Panepucci | Swiss Light Source | ezequiel dot panepucci at psi dot ch |
Martin Savko | Synchrotron SOLEIL | savko at synchrotron-soleil dot fr |
Jan Stransky | Institute of Biotechnology, AS CR, Centre of Molecular Structure | jan dot stransky at ibt dot cas dot cz |
Clemens Vonrhein | Global Phasing Ltd., Cambridge, UK | vonrhein at globalphasing dot com |
Nadia Zatsepin | La Trobe University (La Trobe Institute of Molecular Sciences) | n dot zatsepin at latrobe dot edu dot au |
Workshop on MX raw image data formats, metadata and validation
Organized by IUCr Committee on Data
Sponsored in part by Dectris Ltd
Saturday August 14 2021
Prague, Czech Republic
"Macromolecular crystallography (MX) is the dominant means of determining the three-dimensional structures of biological macromolecules. Over the last few decades, most MX data have been collected at synchrotron beamlines using a large number of different detectors produced by various manufacturers and taking advantage of various protocols and goniometry. These data came in their own formats, sometimes proprietary, sometimes open. The associated metadata rarely reached the degree of completeness required for data management according to Findability, Accessibility, Interoperability and Reusability (FAIR) principles. Efforts to reuse old data by other investigators or even by the original investigators some time later were often frustrated. In the culmination of an effort dating back more than two decades, a large portion of the research community concerned with High Data-Rate Macromolecular Crystallography (HDRMX) has now agreed to an updated specification of data and metadata for diffraction images produced at synchrotron light sources and X-ray free electron lasers (XFELs). This Gold Standard will facilitate processing of datasets independent of the facility at which they were collected and enable data archiving according to FAIR principles, with a particular focus on interoperability and reusability. This agreed standard builds on the NeXus/HDF5 NXmx application definition and the International Union of Crystallography (IUCr) imgCIF/CBF dictionary and is compatible with major data processing programs and pipelines. Just as with the IUCr CBF/imgCIF standard from which it arose and to which it is tied, the NeXus/HDF5 NXmx Gold Standard application definition is intended to be applicable to all detectors used for crystallography, and all hardware and software developers in the field are encouraged to adopt and contribute to the standard." [A. Förster, H. J. Bernstein, A. Bhowmick, A. S. Brewster, S. Brockhauser, L. Gelisio, D. R. Hall, F. Leonarski, V. Mariani, G. Santoni, C. Vonrhein, G. Winter, "A Gold Standard for Macromolecular Diffraction Data", in preparation]
This is a tutorial workshop presented by the developers of the Gold Standard to introduce the community to this important upgrade to the interoperability and reusability of macromolecular crystallographic data for both synchrotrons and XFELs, and to give the participants an opportunity to work with and comment on the Gold Standard. To share best practices for metadata recording widely, we encourage the participation of neutron and chemical crystallographers.
The workshop will function as a purely virtual meeting via Zoom from 9 am to 3 pm Prague time (3 am to 9 am New York time, 8 am to 2 pm London time) on 14 August 2021 with two breaks. In order to participate and receive the Zoom meeting room URL and agenda, contact Herbert J. Bernstein, hbernstein@bnl.gov, giving your name, institutional affiliation and email address by no later than 6 pm Prague time (noon New York time, 5 pm London time) on Friday 6 August 2021. Registration is limited to 100 persons. There will be no additional fee for the workshop virtual meeting registration.
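The time conversions quoted for the workshop can be checked with a short script. This is a minimal sketch, not part of the original announcement: it assumes fixed summer-time offsets for 14 August 2021 (Prague GMT+2 as stated in the text, London GMT+1, New York GMT-4), and the helper name `local_times` is purely illustrative.

```python
from datetime import datetime, timedelta, timezone

# Assumed fixed offsets for 14 August 2021 (summer time):
# Prague is GMT+2; London is 1 hour and New York 6 hours behind Prague.
PRAGUE = timezone(timedelta(hours=2), "Prague")
LONDON = timezone(timedelta(hours=1), "London")
NEW_YORK = timezone(timedelta(hours=-4), "New York")


def local_times(hour, minute=0):
    """Map a Prague clock time on the workshop day to HH:MM strings per city."""
    start = datetime(2021, 8, 14, hour, minute, tzinfo=PRAGUE)
    return {
        tz.tzname(None): start.astimezone(tz).strftime("%H:%M")
        for tz in (PRAGUE, LONDON, NEW_YORK)
    }


if __name__ == "__main__":
    # Print the workshop start and end in all three cities.
    for label, (h, m) in {"start": (9, 0), "end": (15, 0)}.items():
        print(label, local_times(h, m))
```

Fixed offsets keep the sketch independent of any installed time zone database; for other dates, a database-backed zone such as those in Python's `zoneinfo` module would be the safer choice.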
Reminder: The CommDat user forum is at https://forums.iucr.org
Saturday, 14 August 2021

Prague | London | New York | speaker | topic | min | conf |
---|---|---|---|---|---|---|
9:00 -- 9:15 | 8:00 -- 8:15 | 3:00 -- 3:15 | | Welcome, introductions and setup for Zoom, test connections | 15 | |
9:15 -- 9:45 | 8:15 -- 8:45 | 3:15 -- 3:45 | Aaron Brewster (LBL) | Using the Gold Standard for data archival at kilohertz speeds (pdf, m4a audio clip) | 30 | yes |
9:45 -- 10:15 | 8:45 -- 9:15 | 3:45 -- 4:15 | Herbert J. Bernstein (Ronin Institute) | MX raw data formats and the Gold Standard (pdf, m4a audio clip) | 30 | yes |
10:15 -- 10:45 | 9:15 -- 9:45 | 4:15 -- 4:45 | Max Burian and Diego Gaemperle (DECTRIS) | Stream2 and FileWriter2 (pdf, m4a audio clip) | 30 | yes |
10:45 -- 11:15 | 9:45 -- 10:15 | 4:45 -- 5:15 | | Coffee break (bring your own coffee, tea or other refreshments) | 30 | |
11:15 -- 11:45 | 10:15 -- 10:45 | 5:15 -- 5:45 | Filip Leonarski (PSI) | Jungfraujoch: A Data Acquisition and On-the-fly Analysis System for High Data-Rate Macromolecular Crystallography (pdf, m4a audio clip) | 30 | yes |
11:45 -- 12:15 | 10:45 -- 11:15 | 5:45 -- 6:15 | Natalie Johnson (CCDC) | Synchrotron Data in the CSD (pdf, m4a audio clip) | 30 | yes |
12:15 -- 12:30 | 11:15 -- 11:30 | 6:15 -- 6:30 | Daniel Eriksson (Australian Synchrotron) | Facility Report Australian Synchrotron (pdf, m4a audio clip) | 15 | yes |
12:30 -- 12:45 | 11:30 -- 11:45 | 6:30 -- 6:45 | HJB for Dale Kreitler (NSLS-II) | Facility Report NSLS-II (pdf, m4a audio clip) | 15 | yes |
12:45 -- 13:15 | 11:45 -- 12:15 | 6:45 -- 7:15 | | Lunch break (bring your own lunch or breakfast ..., as appropriate to your time zone) | 30 | |
13:15 -- 14:30 | 12:15 -- 13:30 | 7:15 -- 8:30 | | Facility Reports and Open discussion. Future Directions: What metadata are lacking at the moment? What problems are we envisioning? (mp4 video clip, m4a audio clip) | 75 | |
Despite the drastic changes in arrangements necessitated by the impact of the pandemic, including a decision by the United States to severely discourage physical travel to Prague, and difficulty for UK scientists and scientists from other countries in arranging travel to Prague, the move from a hybrid to a purely virtual format was successful.
The workshop had twenty-four participants in total, with approximately eighteen of them active at most times. Aaron Brewster of LBL presented truly impressive progress in the use of the Gold Standard NeXus/HDF5 data format in high-data-rate processing for XFEL experiments worldwide in his talk on "Using the Gold Standard for data archival at kilohertz speeds". Herbert J. Bernstein presented a brief tutorial on the Gold Standard in "MX raw data formats and the Gold Standard". Max Burian and Diego Gaemperle spoke of DECTRIS' efforts to adopt the Gold Standard and raised the wonderful possibility of going open source on their software in their presentation, "Stream2 and FileWriter2". Filip Leonarski presented an impressive talk on "Jungfraujoch: A Data Acquisition and On-the-fly Analysis System for High Data-Rate Macromolecular Crystallography", raised the question of whether the decision to use LZ4 needs to be revisited, and suggested consideration of Zstandard (https://github.com/facebook/zstd), the LZ77-family compression algorithm developed at Facebook. Natalie Johnson reprised her talk from last year on "Synchrotron Data in the CSD". There were three facility reports: one on NSLS-II from Dale Kreitler of BNL given by Herbert J. Bernstein, one on the Australian Synchrotron given by Daniel Eriksson, and one on MAX IV given by Oskar Aurelius.
There was then vigorous discussion for over an hour. The major points raised were: