șÚÁÏÍű±ŹłÔčÏ

Subscribe to the OSS Weekly Newsletter!

Our Bodies Are Full of Dark Proteins. Can We Use Them Against Disease?

New molecules, encoded in our DNA, open the door to future diagnostic tests and treatments—as well as insights into evolution.

These proteins were not supposed to exist. Yet, there they were. As we started to learn more about them, researchers wondered if we hadn’t stumbled upon new targets for treating diseases like cancer.

If wellness fads abided by their own horoscope, we could say 2025 was the Year of the Protein. The macronutrient was shoved everywhere: inside of cereals, milk cartons, even pasta. Everyone wanted more protein.

But proteins are simply the building blocks of the human body. They make up much of the infrastructure of our cells’ cityscape. They make life possible.

Proteins are encoded in stretches of DNA we call genes, much like how an architectural plan dictates how to build a house. We have scanned our genome—and the genome of many, many other species—these past couple of decades, and you would think we had identified all of our protein-coding genes, roughlyÌę. We certainly knew what to look for.

Now, though? We’re discovering what looks like proteins coming out of DNA stretches that shouldn’t be producing them. They have been here all along yet were invisible to our tech-enhanced eyes—much like the imperceptible dark matter that seems to make up most of the weight in our universe.

They are one of the hottest things in molecular biology right now, the type of discovery one researcher tells me that comes maybe once or twice in a scientific career.

They are called dark proteins, and clinical trials are already underway to squeeze some good out of them.

Unveiling the dark proteome

ÌęProteins, which are chains of amino acids, tend to be long. But in the 2000s, researchers stumbled uponÌęÌęthat were unnaturally short. In fact,Ìę, found in the fruit fly, held a lot of power: when disrupted, significant defects in the fly’s anatomy would be observed. That a protein this small could have such a biological impact was the first major awakening that leads us to dark proteins.

The second one has to do with where tiny proteins can come from.

Imagine walking onto a construction site, sure of what they’re building, and finding out that the crew is also erecting walls that were not on the plans you saw. When asked, they would show you new paper plans, which you had never seen before. Worst of all, they would tell you this is routine. It had always been this way.

In 2009, Nicholas Ingolia, working in Jonathan Weissmann’s lab, managed to captureÌę. Ribosomes are the construction crew of our cells: tiny but mighty factories that read transcripts from our genes and translate them into proteins. That messenger RNA used in many of the COVID-19 vaccines? That’s a transcript, and it’s the ribosome that translates it into a protein.

And when he actually looked at what was coming out of ribosomes, he and his team saw molecules that looked like short proteins
 but that they had never encountered before.Ìę

Major publications in the last few years put this discovery firmly on the map, and the phrase that caught the eye of the scientific press was “dark proteins.” They’re not dark because they’re evil or because they necessarily cause diseases; the “dark” is analogous to dark matter which exists but can’t be seen. When we think of all of the dark proteins contained in the human body, we call it the dark proteome. (If you’re a structural biologist, yes, I know, the phrase carriesÌęÌęfor you. Sorry.)

“What I think the misunderstanding is is that this is something that a small number of people are doing without a larger buy-in. That’s not the case at all. I think this is super important.” I interviewed Dr. John Prensner for this, and this is what he wanted to make clear. Prensner is a physician-scientist and pediatric hematologist-oncologist currently at the University of Michigan and one of the leading researchers in the expanding field of dark proteins.

The buy-in he’s referring to is the roughly 25 academic sites all over the world working on dark proteins through theÌę, on the executive board of which Prensner sits. The consortium earlier this month published aÌęÌęinÌęNatureÌęoutlining the steps to studying these dark proteins, even giving them a proper scientific name: peptideins (roughly pronounced PEP-tuh-deenz). It’s a contraction of “peptide” and “protein.” (They originally considered “peptein” before learning it was the name of aÌęÌęin Thailand.)

“Peptidein” is a linguistic waystation, a temporary assignment to a scientific limbo. When they are first detected, these dark proteins are called peptideins because we don’t know what their role is. If they can be convincingly shown to behave like a traditional protein, then they will be called as such—or a “microprotein,” since these peptideins tend to be quite short. When a protein is made up of fewer than 100 amino acids, it isÌę. So, a dark protein is scientifically called a peptidein when its function is unknown, but if it can be proven to behave like a protein, it will be reclassified as a microprotein.

Figure 1: Microproteins and proteins are different based on how many amino acids they contain.

(Here’s an aside for the linguistically minded who are wondering why I am using the word “microprotein” and not “peptide.” Feel free to skip. Peptides are indeed short chains of amino acids, basically short proteins. But peptides don’t have a clear cut-off point in terms of their length. Many of us will say they have to be shorter than 100 amino acids to qualify, but there is no scientific consensus on this length. More importantly, peptides are typically born ofÌęcleavage.ÌęIf you think of a protein as a bit of twine and you cut it in two, the halves would be called peptides. Peptides are made from longer starting material that needs to be processed; microproteins, by comparison, are encoded as such, no cleavage needed. That’s the big difference.)Ìę

All of this discussion on what to call them obscures an important question: how were these dark proteins invisible to us for so long?

It boils down to rules.

We thought we knew what the rules were for a stretch of DNA to code for a protein. There was a particular sequence of letters that said “start here.” Well, it turns out that you don’t necessarily need this sequence to produce a protein. The instructions to make dark proteins are in parts of the DNAÌęthat shouldn’t code for proteins
 at least according to what we thought we knew.

You may remember the term “junk DNA,” which was popular in the media during the Human Genome Project. Our DNA has genes, which code for proteins, and much of the rest was, at the time, hit with the dismissive label of “junk DNA.” Some sort of evolutionary holdover, perhaps, or the equivalent of packing material. Some of this junk DNA codes for dark proteins, it turns out.

But other dark proteins are encoded very near actual genes, in regions that are involved in the regulation of these genes. It’s like buying one of those steel model kits—where you twist off tiny pieces of metal and painstakingly assemble them into the Eiffel Tower or the Millennium Falcon—and finding out that the instructions can also be folded into a tiny 3D model.

So, dark proteins exist—with over a hundred properly documented in humans and thousands more being explored. But what exactly do they do?

“A fundamentally different biology”

I spoke to Dr. Marie Brunet, who works at the genetics service of the Department of Pediatrics of the University of Sherbrooke and who carries research into dark proteins. “We’re moving from ‘what exists’ to ‘what does it do’,” she said, summarizing where the field had been and where it was headed.

Some peptideins have enough evidence for us to say that they are simply short proteins that were previously missed because of their size and because of how they are encoded in our DNA.

But a large swath of peptideins may turn out to be protein-likeÌęin structure only. They may, as has been hypothesized, only exist to gum up the ribosome. If you send phony architectural plans to a construction crew and tell them to make them a reality pronto, they will waste time doing this, and house construction will be delayed. For a cell, this might be desirable. Cells are not producing every protein all the time. Protein production is carefully regulated through a dozen different means: peptideins might be one more way of doing it, by clogging up the ribosome temporarily. In these cases, the peptidein itself is worthless and has no function; it’s the DNA coding for it that’s important.

I asked Prensner about this, and his hunch—and it is simply his own personal hypothesis—is that, indeed, a minority of peptideins will be shown to be traditional proteins, while theÌęmajorityÌęof peptideins will be shown to have “a fundamentally different biology.”

Brunet was keen to remind me that different peptideins will have different reasons for existing—much like, I would add, how different guests staying at the same hotel have their own reason for being there: work, vacation, an unexpected holdup from a delayed flight. One peptidein will be shown to behave like a protein. The next one will turn out to be a temporary artefact from a cancer cell whose biology is out of whack. The next one still will be an artefact of normal biology, where a bit of DNA is translated by the ribosome and then immediately degraded. Yet others will carry unknown functions for a while as scientists puzzle over them. It’s a spectrum.

Figure 2: What a newly discovered peptidein (also known as a dark protein) may be doing in the body.

But not knowing what they do will not stop scientists from trying to do something with them now.

What dark proteins mean for you

Reading the papers of the dark protein literature, I couldn’t help but notice how many authors are involved, in one way or another, with biotech companies focused on developing new ways to treat diseases. Both industry and academia are interested in quickly translating this knowledge of the dark proteome into the clinic. Two applications are already being considered.

First, diagnostics. If a dark protein can be shown to be tied to a specific cancer, for instance, and nothing else, then detecting it—as part of a robust diagnostic test with few false positives and false negatives—would mean you likely have this cancer.

Second, therapeutics. It may sound premature: after all, there is little we know about dark proteins’ roles in the body. Shouldn’t we study this more before talking about treatment potential? But if a dark protein is present, like a flag, at the surface of a cancer cell and is not present at the surface of a healthy cell, an immunological therapy can in theory be devised against it. We can train immune cells to recognize this flag as the enemy and to attack the cells bearing it inside the body. This is something that is already being done with traditional proteins. Our realization of the existence of dark proteins simply expands the repertoire of potential targets, and at leastÌęÌęis already testing this hypothesis.

Hope is important, but adequately calibrated hope is perhaps more valuable. Yes, dark proteins are promising, but there are significant challenges ahead, andÌęÌęout of Columbia University and published last year zeroes in on them. Because of their small size, they are difficult to detect by mass spectrometry. The vast majority of them appear to get degraded inside the cell very quickly after their synthesis, which reduces the chances the peptidein itself plays a role. Very few of them, so far, are supported by more than one study: there is a lack of substantial overlap, somethingÌęÌęin the field of microRNAs, which also held promise for diagnostics and therapeutics. And when we add a tag to a dark protein in order to see where it goes in the cell, the tag itself may alter its localization because of how small the dark protein is, thus giving us false answers.

There is another hurdle: many of the rigorous papers I read on dark proteins were for studies that had received NIH funding. The National Institutes of Health are the main health research body of the U.S. government. They have been gutted under the Trump regime.

Dark proteins represent yet another example of a basic-research finding that appears to be trivial. Why should we invest money into studying “noncanonical open reading frames,” areas in the DNA thatÌęshouldn’tÌęproduce proteins? Why should we pay to account for everything that comes out of our ribosomes? It can seem like nerdy knowledge gathering with no real value.

But nerdy knowledge gatheringÌęhasÌęvalue, in and of itself, and on top of that, you never know which part of it may end up transforming cancer care in the near future.

Evolution in action

I will end with a hypothesis Brunet slipped in toward the end of our interview and which gave me goosebumps. There is something uncanny about these dark proteins. They look like proteins but are very short; many of them get degraded before they have a real chance of doing something; and they come out of places in our genome that should not be coding for proteins. It’s like Bizarro World for molecular biology, some sort of weird laboratory our cells have set up inside of themselves. We could dismiss all this as artefactual nonsense—and many scientistsÌędidÌębefore enough evidence had accumulated that we had missed something genuine.

But Brunet asks a stunning question: “Are we witnessing evolution in real time?”

It’s known asÌęde novoÌęgene birth, a sort of immaculate conception for genes, an exotic and mostly theoretical idea that genes can be born from scratch. Those weird dark proteins we are now witnessing—or at leastÌęsomeÌęof them—might be one way in which novel genes are born, especially since many human dark proteins are not seen in other species. Dark proteins are “noncanonical,” meaning they are not recognized as being part of the canon of existing proteins. They currently exist outside of the consensus. “Some of these peptideins,” Brunet told me, “may be not-fully-finished copies of proteins. They may still be evolving. They may well be the canonical proteins of the future.”

Let’s keep an eye on them, now that we know they exist.

Take-home message:
- Dark proteins, recently renamed peptideins, are molecules present in our body that look like short proteins but that are made from stretches of DNA that were not thought to code for proteins.
- Researchers are still figuring out the roles that these peptideins play in the body: some have turned out to be genuine proteins, while others may have no role at all or may play roles in our biology that we don’t even know about yet.
- Some of these dark proteins represent new targets that could potentially help doctors diagnose people with certain diseases or even treat them by training immune cells to recognize a dark protein associated with a specific disease.


Back to top