Data from: Interacting networks of resistance, virulence and core machinery genes identified by genome-wide epistasis analysis
dc.contributor.affiliation | Vanderbilt University - Skwark, Marcin J. | |
dc.contributor.affiliation | Imperial College London - Croucher, Nicholas J. | |
dc.contributor.affiliation | Aalto University - Puranen, Santeri | |
dc.contributor.affiliation | University of Cambridge - Chewapreecha, Claire | |
dc.contributor.affiliation | Aalto University - Pesonen, Maiju | |
dc.contributor.affiliation | Aalto University - Xu, Ying Ying | |
dc.contributor.affiliation | University of Oxford - Turner, Paul | |
dc.contributor.affiliation | Wellcome Trust - Harris, Simon R. | |
dc.contributor.affiliation | Houston Methodist - Beres, Stephen B. | |
dc.contributor.affiliation | Houston Methodist - Musser, James M. | |
dc.contributor.affiliation | Wellcome Trust - Parkhill, Julian | |
dc.contributor.affiliation | Wellcome Trust - Bentley, Stephen D. | |
dc.contributor.affiliation | Aalto University - Aurell, Erik | |
dc.contributor.affiliation | University of Helsinki - Corander, Jukka | |
dc.contributor.author | Skwark, Marcin J. | |
dc.contributor.author | Croucher, Nicholas J. | |
dc.contributor.author | Puranen, Santeri | |
dc.contributor.author | Chewapreecha, Claire | |
dc.contributor.author | Pesonen, Maiju | |
dc.contributor.author | Xu, Ying Ying | |
dc.contributor.author | Turner, Paul | |
dc.contributor.author | Harris, Simon R. | |
dc.contributor.author | Beres, Stephen B. | |
dc.contributor.author | Musser, James M. | |
dc.contributor.author | Parkhill, Julian | |
dc.contributor.author | Bentley, Stephen D. | |
dc.contributor.author | Aurell, Erik | |
dc.contributor.author | Corander, Jukka | |
dc.date.accessioned | 2025-03-24T15:21:16Z | |
dc.date.issued | 2017-11-29 | |
dc.date.issued | 2017-11-29 | |
dc.description | Recent advances in the scale and diversity of population genomic datasets for bacteria now provide the potential for genome-wide patterns of co-evolution to be studied at the resolution of individual bases. Here we describe a new statistical method, genomeDCA, which uses recent advances in computational structural biology to identify the polymorphic loci under the strongest co-evolutionary pressures. We apply genomeDCA to two large population data sets representing the major human pathogens Streptococcus pneumoniae (pneumococcus) and Streptococcus pyogenes (group A Streptococcus). For pneumococcus we identified 5,199 putative epistatic interactions between 1,936 sites. Over three-quarters of the links were between sites within the pbp2x, pbp1a and pbp2b genes, the sequences of which are critical in determining non-susceptibility to beta-lactam antibiotics. A network-based analysis found these genes were also coupled to that encoding dihydrofolate reductase, changes to which underlie trimethoprim resistance. Distinct from these antibiotic resistance genes, a large network component of 384 protein coding sequences encompassed many genes critical in basic cellular functions, while another distinct component included genes associated with virulence. The group A Streptococcus (GAS) data set population represents a clonal population with relatively little genetic variation and a high level of linkage disequilibrium across the genome. Despite this, we were able to pinpoint two RNA pseudouridine synthases, which were each strongly linked to a separate set of loci across the chromosome, representing biologically plausible targets of co-selection. The population genomic analysis method applied here identifies statistically significantly co-evolving locus pairs, potentially arising from fitness selection interdependence reflecting underlying protein-protein interactions, or genes whose product activities contribute to the same phenotype. This discovery approach greatly enhances the future potential of epistasis analysis for systems biology, and can complement genome-wide association studies as a means of formulating hypotheses for targeted experimental work. | |
dc.identifier | https://doi.org/10.5061/dryad.gd14g | |
dc.identifier.uri | https://hydatakatalogi-test-24.it.helsinki.fi/handle/123456789/10808 | |
dc.rights | Open | |
dc.rights.license | cc-zero | |
dc.subject | epistasis | |
dc.subject | bacterial evolution | |
dc.title | Data from: Interacting networks of resistance, virulence and core machinery genes identified by genome-wide epistasis analysis | |
dc.type | dataset | |
dc.type | dataset |