The goal of research in my lab has been to study interactions between cellular proteins and conserved G-rich sequences (GRSs) that can influence human gene expression. Altered gene expression can consequently lead to a myriad of health problems ranging from cancer, genetic disorders, developmental disorders to blood and heart diseases. Studying regulation of gene expression, therefore, is important for a better understanding of mechanisms underlying these health defects. We have previously shown that a conserved G rich sequence found in the polyadenylation regions of human genes can mediate efficient 3end processing of mammalian pre-mRNAs (Bagga et. al., 1995, 1998). We have also cloned a human gene the product of which, DSEF1/hnRNP H, binds to GRS in order to influence gene expression at this level. Our preliminary bioinformatics analysis suggests that GRS is present in the polyadenylation region of a significant number of human genes (Bagga et. al., 1995, Arhin et. al., 2002).
G-quadruplex Structure and RNA Processing:
More recent bioinformatics studies performed in my lab at Ramapo College with the help of undergraduate students have revealed the presence of GRS sequences near RNA processing sites of a significant number of human and mouse genes (conference presentations by students). According to these studies, it appears that a special type of GRS, that is capable of forming a highly stable G-quadruplex structure, is involved in differential RNA processing by helping the cell decide which site should be used for processing in a tissue and time specific manner. However, detailed bioinformatics and molecular biology investigations are required to test this hypothesis.
G-quadruplexes in Untranslated Regions:
Undergraduate research in my lab has also indicated the presence of potential G-qudruplexes in the untranslated regions (UTRs) of eukaryotic mRNAs. We believe that G-quadruplex motifs may help regulate gene expression at post-transcriptional levels in the cytoplasm. Specific interactions between RNA binding proteins and cis- acting elements in 5- and 3- UTRs are known to be responsible for regulating essential biological activities, such as mRNA localization, mRNA turnover, and translation efficiency. Detailed bioinformatics and molecular biology investigations are required to test our hypotheses.
What is a G-quadruplex?
A G-quadruplexThe quadruplex structures formed by guanine rich nucleic acid sequences have received significant attention recently because of increasing evidence for their role in important biological processes and as therapeutic targets.The G-quadruplex structure is formed by repeated folding of either the single polynucleotide molecule or by association of two or four molecules. The structure consists of stacked G-tetrads, which are square co-planar arrays of four guanine bases each. G-quadruplex is stabilized with cyclic Hoogsteen hydrogen bonding between the four guanines within each tetrad. G-quadruplex sequence motifs have been reported in telomeric, promoter and other regions of mammalian genomes. G-quadruplex DNA has been suggested to regulate DNA replication and may control cellular proliferation. G- rich sequences capable of forming G-quadruplexes in the RNA have been implicated in a variety of important biological activities, such as mRNA turnover , Fragile X Mental Retardation Protein (FMRP) binding, translation initiation as well as repression.
Computational Tools for Studying G-quadruplexes:
Defining the role of Quadruplex forming G Rich Sequences (QGRS) in differential RNA processing and identifying the proteins that can interact with them would aid in understanding the mechanism of regulation of gene expression, especially differential gene expression. A detailed investigation into the distribution of G-quadruplex sequences near RNA processing sites can provide valuable insights. Similarly, identifying cis- regulatory G-quadruplex motifs in 5- and 3-UTRs of mRNAs can also help explore the role of G-quadruplex structures in the regulation of gene expression at a post-transcriptional level. However, these efforts would require a systematic large-scale analysis of mammalian genes.My lab has adopted a computational approach to achieve our goals. We have developed computational methods (D'Antonio and Bagga, 2004) and computational tools for mapping putative QGRS (QGRS Mapper: Kikin et. al., 2006) in mammalian genes. We are also developing databases (GRSDB: Kostadinov et. al., 2006; GRSDB2 & GRS_UTRdb: Kikin et. al., 2007) for curation and further computation of the data. The computational suite is being used to conduct detailed bioinformatics studies on the distribution patterns of QGRS near biologically important sites of thousands of mammalian genes. In particular, we are investigating whether there is a correlation between the distribution pattern of QGRS and alternative processing. We are also studying the conservation of G-quadruplex motifs in untranslated regions and their association with human diseases. This project involves active participation by undergraduate students of the college, especially during the winter and summer breaks.
| Top | G-Q Resource | QGRS Mapper | GRSDB | GRS_UTRdb | Home |
The objective of these research studies has been to construct a structure-function map of DSEF-1, an auxiliary polyadenylation factor that we have cloned. Computer based structure predictions have identified three putative RNA Recognition Motifs (RRMs) and two glycine rich auxiliary domains in this protein. However, functional significance of these predicted structures in the DSEF-1 protein is still unknown. Aligning DSEF-1 protein sequence with those of closely related proteins has helped to identify other regions which could be functionally significant as well (Bagga et. al., 1998). DSEF-1 mediates efficient 3end processing of all the mammalian pre-mRNAs and is indicated to play a key role in regulating the gene expression at this step.
| Back to the Top | Go Back Home |
The aim of this research program has been to study RNA-protein interactions involved in the control of eukaryotic gene expression. For these studies we have chosen the interactions between a conserved sequence present in precursor as well as mature mRNAs and a human protein which specifically recognizes this downstream auxiliary sequence element.
Important findings:
- Identified a conserved 14 base G-Rich Sequence (GRS) in cellular RNAs following computer analysis of the GenBank database. (Bagga et. al., 1995, Chen et. al., 1996)
- Successfully demonstrated the ability of GRS to mediate efficient 3' end pre-mRNA processing through an auxiliary trans-acting factor, DSEF-1. (Bagga et. al., 1995).
- FPLC-Purified from human cells a putative auxiliary polyadenylation factor, DSEF-1, which interacts with GRS. (Bagga et. al., 1997, 1998).
- Isolated and characterized a cDNA clone with a PCR probe generated by using degenerate primers designed on the basis of partial peptide sequence of DSEF-1. (Bagga et. al., 1998).
- The cloned DSEF-1, expressed in E. coli, was found to stimulate polyadenylation of mammalian pre-mRNAs thus indicationg its capability to regulate 3 end processing of pre-mRNAs. (Bagga et. al., 1998, Arhin et. al., 2002).
- An investigation into mechanism of action of DSEF-1 indicated that this regulatory protein affects 3 end processing by stabilizing the association of core polyadenylation factors with the RNA substrate. (Bagga et. al., 1997, 1998).
| Back to the Top | Go Back Home |
The aim of this research project was to study the DNA repair mechanisms that are operative in human cells. These studies involve DNA repair deficient cell lines originating from patients with heritable conditions such as Xeroderma Pigmentosum (XP) (prone to skin cancer) and Cockayne Syndrome (CS) exhibiting growth retardation and neurological degeneration. Some important accomplishments are:
- Succeeded in recloning the bacteriophage T4 DNA repair gene, denV, into a retroviral vector and stably introducing it into human cells. (Bagga et. al., 1991, Bagga and Athwal, 1992)
- Demonstrated the ability of the denV gene to correct the excision repair defect in XP and CS cells. (Bagga and Athwal, 1992, Francis/Bagga et. al., 1997, 2001).
- Developed a PCR based assay to measure DNA repair in localized regions of the genome. (Bagga et. al., 1991, Bagga and Athwal, 1992)
- Successfully transferred single intact human chromosomes (from mouse/human monochromosomal hybrids) into XP and CS cells in order to identify the human chromosomes carrying respective DNA repair genes.
- Transformed and immortalized human cells using retroviral based vectors and recombinant adenovirus carrying SV40 T-antigen sequences.
| Back to the Top | Go Back Home |
The research work performed towards my doctoral dissertation involved the analysis of production, regulation and purification of cellulase enzyme complex in Aspergillus nidulans.
Important findings:
- The analysis for the localization of the cellulolytic enzymes and mechanism of their release into the medium revealed substrate and growth dependent patterns. (Bagga and Sandhu 1987).
- The cellulase enzymes were found to be regulated by proteases which in turn were found to be under the control of cAMP (Bagga et.al., 1989a, 1991).
- Mutants defective in the developmental pathway were isolated and analyzed. Studies on the development related changes in the expression and regulation of cellulases showed Endo III, a form of cellulase, to be specifically expressed only during fruiting body formation. (Bagga et.al., 1989b).
- The cellulase enzyme complex produced by Aspergillus nidulans was fractionated into individual components which were purified and analyzed for physico-chemical properties (Bagga et.al.,1990).
- The purified cellulase components were used to study the mechanism of cellulose fermentation.
| Back to the Top | Go Back Home |