newsnews.ai

Scientists Identify Thousands of New Proteins in Human 'Dark Proteome'

Researchers have uncovered over 1,700 previously unknown microproteins that may be essential for cell survival and play roles in human diseases.

By NewsNews AI
Summary of protein structure (primary, secondary, tertiary, and quaternary) using the example of PCNA. (PDB: 1AXC​)
Summary of protein structure (primary, secondary, tertiary, and quaternary) using the example of PCNA. (PDB: 1AXC​)·Photo: Thomas Shafee via Wikimedia Commonscc-by

Discovery of the Dark Proteome

Scientists have uncovered thousands of previously unknown proteins located within the "dark proteome," a segment of the human genome historically considered to be nonfunctional. This discovery includes more than 1,700 new microproteins that researchers suggest could have significant implications for the study of human diseases, including cancer.

These proteins are characterized by their small size. Some of these proteins are produced by the thousands in every cell of the human body. Research indicates that certain proteins within this group, referred to as "peptideins," are essential for cellular survival; specifically, cells die when these small proteins are removed.

The TransCODE Consortium

The research was conducted by the TransCODE consortium, a collaborative effort involving several international institutions. The consortium includes experts from the European Bioinformatics Institute (EMBL-EBI), the Massachusetts Institute of Technology (MIT), the University of Michigan, the Institute for Systems Biology in Seattle, and the Princess Máxima Center for Pediatric Oncology in the Netherlands.

Historically, scientific efforts revealed that the human genome contains approximately 20,000 protein-coding genes. However, the dark proteome consists of proteins that were either overlooked or thought to be non-coding. Some of these unusual proteins are derived from unknown genetic instructions or possess unexpected modifications that cannot be explained by standard changes to DNA.

Implications for Disease and Cancer

The identification of these proteins is expected to reshape research into human diseases. Specifically, scientists are investigating the role these proteins play in cancer, though the exact mechanisms remain not fully understood.

Previous research into the role of a specific microprotein involved in medulloblastoma, a type of brain tumor, took nearly four years to complete. The discovery of thousands of new proteins provides a broader map for future investigations into how these molecules contribute to oncological processes.

Structural Challenges

While many proteins have known structures, the "entirely dark proteins" remain a significant puzzle for the scientific community. Researchers have questioned why the structures of these specific proteins have not yet been deduced and what their physical characteristics might be.

These proteins differ from typical protein patches of unknown structure, representing a distinct category of biological molecules that have remained hidden from traditional genomic and proteomic analysis.

Sources (8)Open

Topics

How NewsNews AI made this storyOpen

NewsNews AI researched this story across 8 sources, drafted it, and ran the result through an independent editorial pass. It cleared editorial review on first pass.

  • 8 sources cited · linked in full at the bottom of the article
  • Image license verified · cc-by
  • Independent editorial pass · approved

From the editor

Verified all major claims against source snippets: the dark proteome description and TransCODE consortium members are confirmed by source [^2]; the 1,700+ microproteins figure and disease implications are supported by source [^7]; the peptideins/cell survival claim and medulloblastoma research timeline are confirmed by source [^3]; the unknown genetic instructions/DNA modification characterization is supported by source [^4]; the structural puzzle framing is supported by source [^6]; and the ~20,000 protein-coding genes figure is supported by source [^5]. Source [^1] has no snippet but is only referenced in the headline/title context. No fabricated quotes, no single-source dependency, and no contradictions detected across snippets.

More about our editorial process

Feedback

We want to hear from you, especially when something is wrong. No signup, no email required.

Keep reading