Hi @macromeer, welcome to the forum, it’s a good question.
You hit the nail on the head with:
While not “impossible” in the strictest definition of the word, it’s extremely hard (like, even harder than it might appear) to do a rigorous, fair comparison between two different microscopes. Can’t overemphasize that. Moreover, there are multiple things that one might wish to standardize when making the comparison (SNR, speed, irradiance and/or bleaching), and that can affect interpretations. Also, even measuring the metrics of performance (resolution, contrast, SNR, bleaching, toxicity) is not trivial.
I like the concept of a benchmark or competition, but honestly my first thought is that doing this all correctly is so much more than simply setting some parameters in the software, that I’m not sure it would be practically all that useful to compare measurements made by different researchers at different institutions on different (physical) samples, etc… There’s just way too many ways that could lead to erroneous conclusions.
This leaves us in a place where the most useful thing that you can generally do is compare the practical utility of two or more specific microscopes (and operators) you make have direct access to (and resist the temptation to broadcast and generalize those findings as representative of an entire class or “type” of microscope technology or product). The paper you link can be seen as something like that. Those authors are happier with their (specific) LSFM than their (specific) spinning disc for the purpose of imaging whole cleared mouse brain. That’s great! But I would caution anyone from taking much more away from that paper. Note that they used different objective lenses, different cameras, different pixel sizes & sampling rates, different exposure times, different laser power, etc… It’s all fine if what one wants to say is “if I sit down at this exact microscope vs that exact microscope and work with what I’m given without changing too many parameters, which image generally looks qualitatively better?” but I’m not sure I’d go much farther than that.
As a specific example of things that require more investigation take their resolution measurements:
scope |
objective |
theoretical resolution @ 525nm |
actual measured FWHM |
light sheet |
2.5x/0.12 NA |
~2.7µm |
6.04 µm |
spinning disc |
4x/0.2 NA |
~1.6µm |
7.43 µm |
My first question would be: why was the measured resolution for the spinning disc so much worse than theory? (it was just a bead on a coverslip). It makes me suspect some sort of error. (note also that they used a 4µm bead that was well above the diffraction limit for both objective lenses to make these measurements)
In any case, making truly careful comparisons that are actually generalizeable is just incredibly hard