{"id":222,"date":"2018-04-18T15:50:44","date_gmt":"2018-04-18T15:50:44","guid":{"rendered":"http:\/\/serranolab.crg.eu\/?p=222"},"modified":"2020-06-03T21:12:41","modified_gmt":"2020-06-03T21:12:41","slug":"example-news-our-last-paper-about-protein-dna-binding","status":"publish","type":"post","link":"http:\/\/serranolab.crg.eu\/index.php\/2018\/04\/18\/example-news-our-last-paper-about-protein-dna-binding\/","title":{"rendered":"PADA1 predicts the DNA-binding regions of resolved protein structures"},"content":{"rendered":"<p>Abstract:<\/p>\n<blockquote><p>The speed at which new genomes are being sequenced highlights the need for genome-wide methods capable of predicting protein\u2013DNA interactions. Here, we present PADA1, a generic algorithm that accurately models structural complexes and predicts the DNA-binding regions of resolved protein structures. PADA1 relies on a library of protein and double-stranded DNA fragment pairs obtained from a training set of 2103 DNA\u2013protein complexes. It includes a fast statistical force field computed from atom-atom distances, to evaluate and filter the 3D docking models. Using published benchmark validation sets and 212 DNA\u2013protein structures published after 2016 we predicted the DNA-binding regions with an RMSD of &lt;1.8 \u00c5 per residue in &gt;95% of the cases. We show that the quality of the docked templates is compatible with FoldX protein design tool suite to identify the crystallized DNA molecule sequence as the most energetically favorable in 80% of the cases. We highlighted the biological potential of PADA1 by reconstituting DNA and protein conformational changes upon protein mutagenesis of a meganuclease and its variants, and by predicting DNA-binding regions and nucleotide sequences in proteins crystallized without DNA. These results opens up new perspectives for the engineering of DNA\u2013protein interfaces.<\/p><\/blockquote>\n<p>Read the <a href=\"https:\/\/academic.oup.com\/nar\/advance-article\/doi\/10.1093\/nar\/gky228\/4955761\">full text<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Abstract: The speed at which new genomes are being sequenced highlights the need for genome-wide methods capable of predicting protein\u2013DNA interactions. Here, we present PADA1, a generic algorithm that accurately models structural complexes and predicts the DNA-binding regions of resolved protein structures. PADA1 relies on a library of protein and &hellip;<\/p>\n","protected":false},"author":3,"featured_media":223,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_mi_skip_tracking":false},"categories":[15],"tags":[],"_links":{"self":[{"href":"http:\/\/serranolab.crg.eu\/index.php\/wp-json\/wp\/v2\/posts\/222"}],"collection":[{"href":"http:\/\/serranolab.crg.eu\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/serranolab.crg.eu\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/serranolab.crg.eu\/index.php\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"http:\/\/serranolab.crg.eu\/index.php\/wp-json\/wp\/v2\/comments?post=222"}],"version-history":[{"count":3,"href":"http:\/\/serranolab.crg.eu\/index.php\/wp-json\/wp\/v2\/posts\/222\/revisions"}],"predecessor-version":[{"id":485,"href":"http:\/\/serranolab.crg.eu\/index.php\/wp-json\/wp\/v2\/posts\/222\/revisions\/485"}],"wp:featuredmedia":[{"embeddable":true,"href":"http:\/\/serranolab.crg.eu\/index.php\/wp-json\/wp\/v2\/media\/223"}],"wp:attachment":[{"href":"http:\/\/serranolab.crg.eu\/index.php\/wp-json\/wp\/v2\/media?parent=222"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/serranolab.crg.eu\/index.php\/wp-json\/wp\/v2\/categories?post=222"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/serranolab.crg.eu\/index.php\/wp-json\/wp\/v2\/tags?post=222"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}