Omic.ly Weekly 78

June 9, 2025

Hey There!

Thanks for spending part of your week with Omic.ly!


This Week's Headlines

1) Real-Time monitoring of hospital pathogens pays dividends

2) Structural variant based MRD testing is coming to an oncologist near you

3) A Pile of Pennies

4) Weekly Reading List


Hospital genomic surveillance of pathogens stops outbreaks, prevents deaths and saves a bunch of money.

Hospital acquired infections (HAI) are a big problem in healthcare.

Not only are they expensive to treat, but they often lead to unnecessary deaths.

The most common pathogens involved in HAI's are Clostridioides difficile, Methicillin-resistant Staphylococcus aureus (MRSA), Vancomycin-resistant Enterococci (VRE), and multidrug-resistant Gram-negatives like Klebsiella pneumoniae and Pseudomonas aeruginosa.

The challenge in detecting these deadly bugs is that HAI monitoring has traditionally been a reactive exercise where we're always playing catch up when it comes to an infection or an outbreak.

Most hospitals rely on passive surveillance to detect a rise in infections above baseline levels which then leads to an investigation after an outbreak is already in full swing.

But the tools in our molecular testing toolkits have become cheaper and higher throughput now and could allow us to be more proactive about pathogen and outbreak surveillance.

One of the options here is to perform routine environmental monitoring of a hospital and its patients using whole genome sequencing.

This allows for early detection of outbreaks and enables quicker intervention.

However, widespread adoption of WGS for these activities in the hospital setting has been hindered by cost, lack of infrastructure, and limited institutional incentives.

In response to these challenges, researchers developed the Enhanced Detection System for Healthcare-Associated Transmission (EDS-HAT).

This system integrates real-time WGS surveillance with algorithm-driven outbreak detection.

After a promising retrospective analysis of a Klebsiella outbreak in their endoscopy unit, the researchers then used the system in a two-year study that provided powerful evidence of its effectiveness:

1) They were able to detect outbreaks with as few as two positive cases

2) In 95.6% of outbreaks, interventions guided by WGS surveillance successfully halted transmission

3) Real-time genomic surveillance had a net savings of $695,706, with a potential upper-bound savings reaching $1.77 million and 126 prevented infections (See figure above)

But most importantly, they estimate that 62 infections and 4.8 deaths were prevented during the study period.

This study offers powerful evidence that real-time genomic surveillance is feasible and highly effective in identifying and mitigating outbreaks.

So it goes without saying that integration of WGS-based surveillance seems like a no-brainer when it comes to hospital infection prevention and control practices.

###

Sunderman AJ, et al. 2025. Real-Time Genomic Surveillance for Enhanced Healthcare Outbreak Detection and Control: Clinical and Economic Impact. DOI: 10.1093/cid/ciaf216


Structural Variants: What are they good for? MRD testing, actually.

Minimal Residual Disease (the “M” can also stand for “Molecular” or “Measurable”) is a term used to describe the trace amount of cancer left in a patient’s body after treatment.

This is typically detected by tracking cancer specific biomarkers in blood over time to determine if or when a cancer recurs.

These tests are usually referred to as liquid biopsies!

But, back in the day, MRD tests were based on markers known to be common to specific cancer types.

Unfortunately, these early tests had limited value because cancer is cancer and not all of them behave predictably.

They’re all a little different!

They all contain mutations that are unique to them, and so if you’re not developing a test that’s specific to them, there’s a chance you might miss them if they do start to come back!

Thankfully, there are a couple ways around this problem.

One is to do very deep sequencing of blood every couple of months to find variants that appear to be related to a cancer, but doing a lot of deep whole genome sequencing can get pretty expensive!

That’s why a good number of MRD tests use a “tumor informed” approach where they sequence the tumor to find all of the variants in a sample and then pick a handful of those to create a “fingerprint” to track over time.

Creating a fingerprint of the tumor reduces cost and complexity while increasing sensitivity because now you actually know what you’re looking for!

But cancers can have a lot of different types of variants so it’s important to pick ones that you think happened early in development (because they’re probably really important for cancer survival!) and pick variants you can detect with high confidence.

For the longest time, the variants that were used in MRD were single nucleotide variants (SNVs).

Mostly because high throughput sequencers are really good at detecting these, but we know that other types of variants, like structural variants (SVs - where big chunks of the genome have rearranged themselves), are just as important in cancer!

In the figure above, you can see that all of the major cancer types contain SVs and the nice thing about SVs is that there’s evidence that they’re some of the earliest mutations to develop during oncogenesis (when normal cells turn into cancer cells).

This means that they’re also probably going to stick around and not get lost as a tumor evolves.

But SVs also offer another important advantage:

They’re REALLY easy to detect using PCR!

And using PCR for MRD detection has distinct advantages over sequencing because it’s a much faster process but also, since it has a lower background, PCR can be 2-3 log more sensitive than sequencing.

What that means for patients is that we can catch recurrences sooner, reduce false negatives, and intervene before symptoms return or cancer spreads again!

###

Figure: Elliot MJ, et al. 2025. Clin Can Res. DOI: 10.1158/1078-0432.CCR-24-3472

Disclosure: I am the VP of Technology Development at SAGA Diagnostics - a company that uses SVs in MRD testing


The first 3D structure of DNA was published in 1938. It was generated by Florence Bell, a scientist you need to know.

One thing that we lose in the coverage of major scientific discoveries is all of the work that those advances were built on.

A perfect example of this is Watson and Crick's solving of the structure of DNA in 1953.

This wasn't just a synthesis of data they obtained from Maurice Wilkins via Rosalind Franklin (and others), but it also wouldn't have been possible without the early work of William Astbury and Florence Bell.

This is that story.

In the 1930's, William Astbury spent his days analyzing biological fibers using a technique called x-ray diffraction.

He was particularly interested in coiled fibers like those found in wool (keratin) and other textiles and was the first to characterize the structural changes in proteins as they are stretched.

He referred to these as the α-form and β-form, and laid the groundwork for Linus Pauling's discovery of the α-helix and β-strand in 1951!

Astbury quickly got a reputation for his expertise in x-ray diffraction, and was sent purified biological material to characterize from all over Europe.

In 1937, apparently elbows deep in things to characterize, he sent a letter to another renown crystallographer, Lawrence Bragg, asking if he knew of any rockstar crystallographers that were available to help with his studies.

Bragg recommended Florence Bell, a Cambridge grad who learned how to do crystallography from yet another x-ray luminary, JD Bernall.

A short correspondence later and Bell left her job working for Bragg to start her graduate work under the tutelage of Astbury.

While her research was focused on lots of different biological fibers, Bell's most important work was with DNA.

She showed that the key to obtaining the most informative images was the use of high molecular weight DNA.

The best source for this was calf thymus, and the diffractions she obtained from these fibers indicated that DNA had an ordered structure.

This finding was published in 1938 and was also presented at Cold Spring Harbor later that year by Astbury.

They described their structure of DNA as a 'pile of pennies' and a depiction of it can be seen in the figure above.

Their calculations also showed that the DNA nucleotides were 3.3 Å apart and they were stacked one on top of another.

While this might not seem like a historic achievement, it set the foundation for everything that would follow.

With a few tweaks and the discovery that DNA can exist in two forms depending on the humidity during drying, Rosalind Franklin and Raymond Gosling built on Bell's early work to generate photo 51 - the key diffraction of DNA used to solve the structure of DNA in 1953.

###

Astbury WT, Bell FO. 1938. Some recent developments in the x-ray study of proteins and related structures. CSHL Symp Quant Biol. DOI: 10.1101/SQB.1938.006.01.013


Weekly Reading List

Scientists worry about possible publishing ban
HHS secretary Kennedy says the US might launch alternatives to leading medical journals
Efficient mRNA delivery to resting T cells to reverse HIV latency - Nature Communications
Resting T cells are difficult to manipulate, and are a reservoir for latent HIV. Here, the authors develop a lipid nanoparticle formulation with the ability to transfect resting primary human T cells, enabling delivery of mRNAs that result in reactivation of latent HIV. This could help development of HIV cure strategies.
The pandemic generation: How Covid-19 lockdowns left a long-lasting mark on children
The stress and isolation of the pandemic have left social and emotional scars that are already being seen in children, but scientists also predict there could be huge economic costs.
Skyrocketing mpox outbreak in Sierra Leone raises fears of wider spread
Boom in infections in the West African nation is driven by the same viral strain that caused a global outbreak in 2022.
Race, ethnicity don’t match genetic ancestry, according to a large U.S. study
Data from the All of Us program confirm what many geneticists have long promoted
23andMe Sets New Auction With $305 Million New Bid From Ex-CEO
Bankrupt genetic analysis company 23andMe will hold a second auction for its cache of DNA data with an opening bid of $305 million from a group led by the company’s former chief executive officer, Anne Wojcicki.
How We Pioneered Next Generation DNA Sequencing At Solexa – VII
Innovation is Aerobic
How We Pioneered Next Generation DNA Sequencing At Solexa – IX
Postscript
Concurrent loss of the Y chromosome in cancer and T cells impacts outcome - Nature
Comprehensive pan-cancer analysis of loss of the Y chromosome (LOY) in benign and malignant cells establishes a new model linking LOY in circulating and tumour-infiltrating immune cells to LOY in malignant cells.
FDA’s AI tool, Elsa, is here. ‘The stupidest big fuss they ever made’
This is the web edition of STAT’s AI Prognosis, our subscriber-exclusive newsletter.
NIH indirect cost cuts will affect the economy and employment - Nature Human Behaviour
Nature Human Behaviour - NIH indirect cost cuts will affect the economy and employment
A National Lobotomy
Cutting federal support for NIH and NSF is a national lobotomy that will negatively affect our nation’s economy, education, and productivity.
Some signs of AI model collapse begin to reveal themselves
Opinion: Prediction: General-purpose AI could start getting worse
Genomic map of the functionally extinct northern white rhinoceros (Ceratotherium simum cottoni)
The northern white rhinoceros (NWR; Ceratotherium simum cottoni) is functionally extinct, with only two nonreproductive females remaining alive. Extraordinary measures are underway to rescue this species, including using a collection of NWR induced pluripotent stem cells (iPSCs) to generate gametes for assisted reproduction technologies. Because of the critical importance of genomic integrity in germ cells used for reproduction, these approaches require extensive genomic analyses to exclude aberrations that are acquired during culture of iPSCs.
Anti-ageing effects of popular supplement taurine challenged
Massive study finds limited connection between ageing and taurine levels in people, monkeys and mice.
At Columbia, dismay among chemical scientists
Some have already lost grants, while others fear for students and future funding
Alan Alda's Experiment: Helping Scientists Learn To Talk To The Rest Of Us
Alan Alda's father wanted him to become a doctor, but it wasn't meant to be. "I failed chemistry really disastrously ... " Alda says. "I really didn't want to be a doctor; I wanted to be a writer and an actor."
FDA Abandons Its Defense of the LDT Rule, But is It Signaling an Increase in RUO Scrutiny?
At midnight on Friday, May 30, 2025, the government’s deadline to notice an appeal from the U.S. District Court for the Eastern District of Texas’s decision vacating the LDT Rule lapsed without the…
A visual–omics foundation model to bridge histopathology with spatial transcriptomics - Nature Methods
OmiCLIP is a visual–omics foundation model that integrates histology and spatial transcriptomics. The associated Loki platform offers accurate and robust tools for alignment, annotation, cell-type decomposition and spatial gene expression prediction.
DNA Sequencers/Flowcells (On market)
Genomics of host–microbiome interactions in humans - Nature Reviews Genetics
In this Review, Ferretti et al. discuss advances in our understanding of interactions between the human genome and the microbiome, including the effects of the microbiome on host gene regulation.
Complete reference genome and pangenome improve genome-wide detection and interpretation of DNA methylation using sequencing and array data
The complete telomere-to-telomere human genome assembly (T2T-CHM13) and the draft human pangenome reference provide unique opportunities to refine DNA methylation (DNAm) studies. Here, we find that T2T-CHM13 calls 7.4% more CpGs genome wide compared to GRCh38 across four widely used short-read DNAm profiling methods and improves the evaluation of probe cross-reactivity and mismatch for Illumina DNAm arrays, yielding new and more reproducible sets of unambiguous probes. The pangenome reference further expands CpG calling by 4.5% in short-read sequencing data and identifies cross-population and population-specific unambiguous probes in DNAm arrays, owing to its improved representation of genetic diversity.
Do you remember the Human Genome Project? I’m not sure the Trump administration wants you to
The NIH archives of the Human Genome Project could fall victim to Trump administration cuts, writes a former archivist.
GitHub - Shashi-Sekar/GeneChat: Multi-Modal LLM for Gene Function Prediction
Multi-Modal LLM for Gene Function Prediction. Contribute to Shashi-Sekar/GeneChat development by creating an account on GitHub.
Performance of deep-learning-based approaches to improve polygenic scores - Nature Communications
Polygenic scores aim to capture genetic risk but may miss nonlinear genetic and environmental interactions. Here, the authors show that neural networks detect limited nonlinearity and do not outperform linear models, highlighting constraints in current deep-learning approaches.
Focus on single gene effects limits discovery and interpretation of complex trait-associated variants
Standard QTL mapping approaches consider variant effects on a single gene at a time, despite abundant evidence for allelic pleiotropy, where a single variant can affect multiple genes simultaneously. While allelic pleiotropy describes variant effects on both local and distal genes or a mixture of molecular effects on a single gene, here we specifically investigate allelic expression “proxitropy”: where a single variant influences the expression of multiple, neighboring genes. We introduce a multi-gene eQTL mapping framework - cis-principal component expression QTL (cis-pc eQTL or pcQTL) - to identify variants associated with shared axes of expression variation across a cluster of neighboring genes. We perform pcQTL mapping in 13 GTEx human tissues and discover novel loci undetected by single-gene approaches. In total, we identify an average of 1396 pcQTLs/tissue, 27% of which were not discovered by single-gene methods. These novel pcQTL colocalized with an additional 142 GWAS trait-associated variants and increased the number of colocalizations by 34% over single-gene QTL mapping. These findings highlight that moving beyond single-gene-at-a-time approaches toward multi-gene methods can offer a more comprehensive view of gene regulation and complex trait-associated variation. ### Competing Interest Statement S.B.M is on the scientific advisory board of MyOme, PhiTech and Valinor Therapeutics. NIH, R01MH12524, U01AG072573, U01HG012069, T32HG000044
SpotSweeper: spatially aware quality control for spatial transcriptomics - Nature Methods
SpotSweeper is a spatially aware method for quality control of spatially resolved transcriptomics data that corrects for spatial confounding missed by existing methods, including both local and regional artifacts, across diverse technologies.

Were you forwarded this newsletter?

LOVE IT.

If you liked what you read, consider signing up for your own subscription here:

Subscribe to Omic.ly