April 24, 2023
This is a minimally-researched briefer that I put in a Google Doc for an audience of biosecurity-focused policymakers. I added a few extra links when giving this a public link in 2026, but it hasn’t received a major update since 2023.
Flexible laboratory resources, available on demand. The “cloud” references cloud computing; cloud labs similarly allow users to perform work using hardware in a physically distant location, which is maintained by a different company or team. Regulation of cloud labs should address:
I’d distinguish companies with a small number of strong client relationships from cloud labs, even if they offer access to automation (e.g. Ginkgo Bioworks, Kebotix, some CROs). I’m not totally sure what’s going on with self-driving labs in biology.
Well... they mostly don’t! Very few academics or companies currently make use of cloud labs or biofoundries. Automated laboratory equipment is expensive to buy, program, and maintain; as a result, few scientists have tested their protocols on this kind of hardware, making it risky to use even equipment maintained by others. We’re also still missing basic infrastructure for laboratory automation, like versions of simple protocols that don’t ask for manual actions.
If we had better infrastructure, cloud labs would be appealing for the following reasons:
Given current trends in lab automation, I expect people will start using cloud labs more often; see more on this point under What will cloud labs look like in the future?
Cloud labs, at present, do not allow researchers to do bioengineering without knowledge of biological protocols. They do:
Example of dozens of samples being moved around a cloud lab by robot arms (source).
Cloud labs are very early in the transition between “small number of client relationships” to “high-throughput service providers”. Getting started with them still looks more like “establish a client relationship” than “open an account and make an order” (contrast IDT, Twist, AddGene). That said, they’re more like high-throughput service providers than they were 3 years ago, and we should expect this trend to continue and accelerate (see Bioautomation Challenge).
ECL has a full list of their instrumentation online. The old Strateos virtual lab tour might be of interest, but as of April 2023 they have announced a “strategic shift to focus on customer demand for on-site cloud labs” (i.e. building private labs with workcells that can be controlled over the internet).
I want to be clear that we may not have large commercial cloud labs in the future; it’s not clear that the model works from a business or science perspective.
I broadly agree with the takes in Automation is coming to life science [2026 ETA: and Heuristics for Lab Robotics] — automation enables new kinds of experiments, owning and maintaining your own robots sucks, future workflows will look like more and more parts of the design-build-test-analyze cycle in the life sciences going the way of DNA synthesis and sequencing.
Diagram from Automation is coming to life science by Erika Alden DeBenedictis.
My extremely seat-of-the-pants predictions for the next few years are something like:
I want to comment briefly on why we don’t already have highly-automated cloud labs. My main theory is that, until quite recently, no one had functional workcells that integrated multiple instruments. This was partly because device manufacturers were not designing for automation, and many instruments could only be controlled via GUIs on dedicated PCs. Some players in making integration more possible: Artificial Suite, BioSero GreenButtonGo, Strateos Lodestar, Automata LINQ, HighRes Biosolutions, The Morse Group.
If you’re worried about biology being misused to do harm, it’s unclear whether increased use of cloud labs is a positive or negative thing. A few considerations:
I think it’s pretty unlikely that we’ll have cloud lab facilities that are certified to handle pandemic pathogens in the next 5 years. We should be thinking about risk in terms of dual-use knowledge and data generation, not “turnkey lab” scenarios for viral engineering. If we turn from biology to chemistry, cloud labs are a more immediate concern. LLMs are already approaching the ability to convert “synthesize this toxin” to “here are the cloud lab API calls for that reaction series”.
One obvious thing for regulation to address is customer screening (e.g. to ensure that export-controlled equipment and software is not being used by people who aren’t authorized to use it), though my understanding is that, at least in the DNA synthesis world, customer screening is an expensive mess.
It’s possible that certain kinds of cloud lab protocols should also face some kind of security screening (similar to sequence-based screening in DNA synthesis). The most appropriate form of security screening will range from “none, please let people do science” to “require a justification for specific experiments of this kind”. It’s the “specific experiments” in the last sentence that gets tricky; you can’t really point to a general-purpose protocol and say that it’s inherently dangerous.
(blog posts or short papers, semi-prioritized list)
1 Though even if you don’t need to know which buttons to press on the mass spec, you need to understand appropriate experimental settings — should you set the mass spec’s IonSource to ESI or MALDI? I sure don’t know! ↑