Machine learning models, and training, validation and test datasets for: "Sequence determinants of human gene regulatory elements"

dc.contributor.affiliationApplied Tumor Genomics Research Program, Faculty of Medicine, University of Helsinki, Helsinki, Finland. Medicum, Faculty of Medicine, University of Helsinki, Helsinki, Finland. - Sahu, Biswajyoti
dc.contributor.affiliationApplied Tumor Genomics Research Program, Faculty of Medicine, University of Helsinki, Helsinki, Finland - Hartonen, Tuomo
dc.contributor.affiliationApplied Tumor Genomics Research Program, Faculty of Medicine, University of Helsinki, Helsinki, Finland - Pihlajamaa, Päivi
dc.contributor.affiliationDepartment of Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, Sweden - Wei, Bei
dc.contributor.affiliationDepartment of Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, Sweden - Dave, Kashyap
dc.contributor.affiliationDepartment of Biochemistry, University of Cambridge, Cambridge, United Kingdom - Zhu, Fangjie
dc.contributor.affiliationApplied Tumor Genomics Research Program, Faculty of Medicine, University of Helsinki, Helsinki, Finland. Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, Sweden. - Kaasinen, Eevi
dc.contributor.affiliationDepartment of Molecular Biology, Max Planck Institute for Biophysical Chemistry, Göttingen, Germany. Department of Biosciences and Nutrition, Karolinska Institutet, Stockholm, Sweden - Lidschreiber, Katja
dc.contributor.affiliationDepartment of Molecular Biology, Max Planck Institute for Biophysical Chemistry, Göttingen, Germany. Department of Biosciences and Nutrition, Karolinska Institutet, Stockholm, Sweden - Lidschreiber, Michael
dc.contributor.affiliationDepartment of Biosciences and Nutrition, Karolinska Institutet, Stockholm, Sweden. Science for Life Laboratory, Stockholm, Sweden - Daub, Carsten O
dc.contributor.affiliationDepartment of Molecular Biology, Max Planck Institute for Biophysical Chemistry, Göttingen, Germany. Department of Biosciences and Nutrition, Karolinska Institutet, Stockholm, Sweden - Cramer, Patrick
dc.contributor.affiliationApplied Tumor Genomics Research Program, Faculty of Medicine, University of Helsinki, Helsinki, Finland - Kivioja, Teemu
dc.contributor.affiliationApplied Tumor Genomics Research Program, Faculty of Medicine, University of Helsinki, Helsinki, Finland. Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, Sweden. Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom - Taipale, Jussi
dc.contributor.authorSahu, Biswajyoti
dc.contributor.authorHartonen, Tuomo
dc.contributor.authorPihlajamaa, Päivi
dc.contributor.authorWei, Bei
dc.contributor.authorDave, Kashyap
dc.contributor.authorZhu, Fangjie
dc.contributor.authorKaasinen, Eevi
dc.contributor.authorLidschreiber, Katja
dc.contributor.authorLidschreiber, Michael
dc.contributor.authorDaub, Carsten O
dc.contributor.authorCramer, Patrick
dc.contributor.authorKivioja, Teemu
dc.contributor.authorTaipale, Jussi
dc.date.accessioned2025-03-24T15:20:58Z
dc.date.issued2021-07-16
dc.descriptionThis record contains the training, test and validation datasets used to train and evaluate the machine learning models in the manuscript: Sahu, Biswajyoti, et al. "Sequence determinants of human gene regulatory elements." (2021). This record also contains the final hyperparameter-optimized models for each training dataset/task combination described in the manuscript. The README files provided with the record describe the datasets and models in more detail. The datasets deposited here are derived from the original raw data (GEO accession: GSE180158) as described in the Methods of the manuscript.
dc.identifierhttps://doi.org/10.5281/zenodo.5101420
dc.identifier.urihttps://hydatakatalogi-test-24.it.helsinki.fi/handle/123456789/10614
dc.rightsOpen
dc.rights.licensecc-by-4.0
dc.subjectGene regulation
dc.subjectSTARR-seq
dc.subjectDeep learning
dc.subjectConvolutional Neural Networks
dc.subjectMachine learning
dc.titleMachine learning models, and training, validation and test datasets for: "Sequence determinants of human gene regulatory elements"
dc.typedataset