Mechanism of Evolution Shared by Gene and Language

12/28/2020
by Li-Min Wang, et al.

We propose a general mechanism of evolution to explain the diversity of genes and languages. To quantify their common features and reveal hidden structures, we examine several statistical properties and patterns using a new method called rank-rank analysis. We find that the classical correspondence, "domain plays the role of word in gene language", is not rigorous, and propose replacing domain with protein. In addition, we devise a new evolution unit, the syllgram, to capture the characteristics of both spoken and written language. Based on the correspondence between (protein, domain) and (word, syllgram), we find that gene and language share a common scaling structure and scale-free network. Like the Rosetta Stone, this work may help decipher the secrets behind non-coding DNA and unknown languages.

research
07/19/2023

ProtiGeno: a prokaryotic short gene finder using protein language models

Prokaryotic gene prediction plays an important role in understanding the...
research
01/07/2016

Large Collection of Diverse Gene Set Search Queries Recapitulate Known Protein-Protein Interactions and Gene-Gene Functional Associations

Popular online enrichment analysis tools from the field of molecular sys...
research
04/22/2022

Global Mapping of Gene/Protein Interactions in PubMed Abstracts: A Framework and an Experiment with P53 Interactions

Gene/protein interactions provide critical information for a thorough un...
research
09/18/2023

DeepHEN: quantitative prediction essential lncRNA genes and rethinking essentialities of lncRNA genes

Gene essentiality refers to the degree to which a gene is necessary for ...
research
04/12/2012

Detecting lateral genetic material transfer

The bioinformatical methods to detect lateral gene transfer events are m...
research
04/05/2022

SemanticCAP: Chromatin Accessibility Prediction Enhanced by Features Learning from a Language Model

A large number of inorganic and organic compounds are able to bind DNA a...
research
05/29/2020

CLARITY – Comparing heterogeneous data using dissimiLARITY

Integrating datasets from different disciplines is hard because the data...
