絞り込み

16640

広告

Training-free measures based on algorithmic probability identify high nucleosome occupancy in DNA sequences.

著者 Zenil H , Minary P
Nucleic Acids Res.2019 Sep 12 ; ():.
この記事をPubMed上で見るPubMedで表示
この記事をGoogle翻訳上で見る Google翻訳で開く

スターを付ける スターを付ける     (1view , 0users)

Full Text Sources

We introduce and study a set of training-free methods of an information-theoretic and algorithmic complexity nature that we apply to DNA sequences to identify their potential to identify nucleosomal binding sites. We test the measures on well-studied genomic sequences of different sizes drawn from different sources. The measures reveal the known in vivo versus in vitro predictive discrepancies and uncover their potential to pinpoint high and low nucleosome occupancy. We explore different possible signals within and beyond the nucleosome length and find that the complexity indices are informative of nucleosome occupancy. We found that, while it is clear that the gold standard Kaplan model is driven by GC content (by design) and by k-mer training; for high occupancy, entropy and complexity-based scores are also informative and can complement the Kaplan model.
PMID: 31511887 [PubMed - as supplied by publisher]
印刷用ページを開く Endnote用テキストダウンロード