Please use this identifier to cite or link to this item:
http://hdl.handle.net/123456789/1634
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Omprakash, Arpit. | - |
dc.contributor.author | Sandhu, K.S. | - |
dc.date.accessioned | 2021-12-09T19:50:32Z | - |
dc.date.available | 2021-12-09T19:50:32Z | - |
dc.date.issued | 2021-07-28 | - |
dc.identifier.uri | http://hdl.handle.net/123456789/1634 | - |
dc.description.abstract | GWAS (Genome Wide Association Studies) have been crucial to identifying genetic loci associated with diseased phenotype. The hypothesis-free nature of GWAS studies have been a success at predicting specific cancer markers. However, this hypothesis-free nature has also led to one of its main issues, i.e., the large number of distant SNPs discovered with no biological link to the known genetic pathways of the diseased phenotype. Recent advancements in chromatin interaction mapping techniques have identified long-ranged promoter-promoter interactions that regulate gene expression pathways in eukaryotes. The presence of regulatory enhancer-like activity in some promoters and differences in the epigenetic features associated with the promoters and enhancer-like promoters (ELPs) have also been described. It led us to hypothesize that studying such long-range promoter-promoter contacts using ELPs may provide insights into biological links between distant SNPs and genetic pathways of disease in a population. Here we explore possible histone markers and transcription factor bindings (epigenetic factors) that can distinguish between promoters and ELPs. We also build machine learning models that can predict the magnitude of enhancer-like activity (enhancer potential) of a promoter given its epigenetic factors. Regression models to predict the enhancer potential values were made, but the models’ accuracy was not up to the mark. Improvements have been suggested for the models, including better feature extraction methods using machine learning classifiers. In the case of HeLa cancer cells, biologically significant epigenetic factors are identified via the classifiers that distinguish between promoters and ELPs. However, the models did not exhibit sufficient accuracy to get relevant features in K562 cancer cells. In the later part of the thesis, spatial interactions between distant promoters have been characterised using Hi-C data. A mathematical framework incorporating the enhancer potentials and spatial interactions between promoters has been proposed to study the propagation of gene regulation in promoter-promoter networks. Initial results from the framework indicate that it can be used to identify distant upstream interacting promoters of a given promoter of interest and model time-course gene expression data to identify novel pathways of gene regulation. Keywords – GWAS, Hi-C, enhancer-like promoters (ELPs), gene regulation, machine learning | en_US |
dc.language.iso | en_US | en_US |
dc.publisher | IISERM | en_US |
dc.subject | Framework | en_US |
dc.subject | Gene | en_US |
dc.subject | Regulation | en_US |
dc.title | Developing a framework to study the effects of enhancer-like promoters on gene regulation | en_US |
dc.type | Thesis | en_US |
Appears in Collections: | MS-16 |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
MS16124.docx | 12.69 kB | Microsoft Word XML | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.