Cheminformatics-Based Drug Design Approach for Identification of Inhibitors Targeting the Characteristic Residues of MMP-13 Hemopexin Domain

Written by Scott Christley et al. on August 31, 2010 – 7:00 am -

Background

MMP-13, a zinc dependent protease which catalyses the cleavage of type II collagen, is expressed in osteoarthritis (OA) and rheumatoid arthritis (RA) patients, but not in normal adult tissues. Therefore, the protease has been intensively studied as a target for the inhibition of progression of OA and RA. Recent reports suggest that selective inhibition of MMP-13 may be achieved by targeting the hemopexin (Hpx) domain of the protease, which is critical for substrate specificity. In this study, we applied a cheminformatics-based drug design approach for the identification and characterization of inhibitors targeting the amino acid residues characteristic to Hpx domain of MMP-13; these inhibitors may potentially be employed in the treatment of OA and RA.

Methodology/Principal Findings

Sequence-based mutual information analysis revealed five characteristic (completely conserved and unique), putative functional residues of the Hpx domain of MMP-13 (these residues hereafter are referred to as HCR-13pf). Binding of a ligand to as many of the HCR-13pf is postulated to result in an increased selective inhibition of the Hpx domain of MMP-13. Through the in silico structure-based high-throughput virtual screening (HTVS) method of Glide, against a large public library of 16908 molecules from Maybridge, PubChem and Binding, we identified 25 ligands that interact with at least one of the HCR-13pf. Assessment of cross-reactivity of the 25 ligands with MMP-1 and MMP-8, members of the collagenase family as MMP-13, returned seven lead molecules that did not bind to any one of the putative functional residues of Hpx domain of MMP-1 and any of the catalytic active site residues of MMP-1 and -8, suggesting that the ligands are not likely to interact with the functional or catalytic residues of other MMPs. Further, in silico analysis of physicochemical and pharmacokinetic parameters based on Lipinski's rule of five and ADMET (absorption, distribution, metabolism, excretion and toxicity) respectively, suggested potential utility of the compounds as drug leads.

Conclusions/Significance

We have identified seven distinct drug-like molecules binding to the HCR-13pf of MMP-13 with no observable cross-reactivity to MMP-1 and MMP-8. These molecules are potential selective inhibitors of MMP-13 that can be experimentally validated and their backbone structural scaffold could serve as building blocks in designing drug-like molecules for OA, RA and other inflammatory disorders. The systematic cheminformatics-based drug design approach applied herein can be used for rational search of other public/commercial combinatorial libraries for more potent molecules, capable of selectively inhibiting the collagenolytic activity of MMP-13.


Tags: , ,
Posted in Computatioanl biology | Comments Off

A Novel Preprocessing Method Using Hilbert Huang Transform for MALDI-TOF and SELDI-TOF Mass Spectrometry Data

Written by Scott Christley et al. on August 31, 2010 – 7:00 am -

Motivation

Mass spectrometry is a high throughput, fast, and accurate method of protein analysis. Using the peaks detected in spectra, we can compare a normal group with a disease group. However, the spectrum is complicated by scale shifting and is also full of noise. Such shifting makes the spectra non-stationary and need to align before comparison. Consequently, the preprocessing of the mass data plays an important role during the analysis process. Noises in mass spectrometry data come in lots of different aspects and frequencies. A powerful data preprocessing method is needed for removing large amount of noises in mass spectrometry data.

Results

Hilbert-Huang Transformation is a non-stationary transformation used in signal processing. We provide a novel algorithm for preprocessing that can deal with MALDI and SELDI spectra. We use the Hilbert-Huang Transformation to decompose the spectrum and filter-out the very high frequencies and very low frequencies signal. We think the noise in mass spectrometry comes from many sources and some of the noises can be removed by analysis of signal frequence domain. Since the protein in the spectrum is expected to be a unique peak, its frequence domain should be in the middle part of frequence domain and will not be removed. The results show that HHT, when used for preprocessing, is generally better than other preprocessing methods. The approach not only is able to detect peaks successfully, but HHT has the advantage of denoising spectra efficiently, especially when the data is complex. The drawback of HHT is that this approach takes much longer for the processing than the wavlet and traditional methods. However, the processing time is still manageable and is worth the wait to obtain high quality data.


Tags: , ,
Posted in Computatioanl biology | Comments Off

Identification of cis-Regulatory Elements in the Mammalian Genome: The cREMaG Database

Written by Scott Christley et al. on August 31, 2010 – 7:00 am -

Background

A growing number of gene expression-profiling datasets provides a reliable source of information about gene co-expression. In silico analyses of the properties shared among the promoters of co-expressed genes facilitates the identification of transcription factors (TFs) involved in the co-regulation of those genes. Our previous experience with microarray data led to the development of a database suitable for the examination of regulatory motifs in the promoters of co-expressed genes.

Methodology

We introduce the cREMaG (cis-Regulatory Elements in the Mammalian Genome) system designed for in silico studies of the promoter properties of co-regulated mammalian genes. The cREMaG system offers an analysis of data obtained from human, mouse, rat, bovine and canine gene expression-profiling studies. More than eight analysis parameters can be utilized in user-defined combinations. The selection of alternative transcription start sites and information about CpG islands are also available.

Conclusions

Using the cREMaG system, we successfully identified TFs mediating transcriptional responses in reference gene sets. The cREMaG system facilitates in silico studies of mammalian transcriptional gene regulation. The resource is freely available at http://www.cremag.org.


Tags: , ,
Posted in Computatioanl biology | Comments Off

GOPred: GO Molecular Function Prediction by Combined Classifiers

Written by Scott Christley et al. on August 31, 2010 – 7:00 am -

Functional protein annotation is an important matter for in vivo and in silico biology. Several computational methods have been proposed that make use of a wide range of features such as motifs, domains, homology, structure and physicochemical properties. There is no single method that performs best in all functional classification problems because information obtained using any of these features depends on the function to be assigned to the protein. In this study, we portray a novel approach that combines different methods to better represent protein function. First, we formulated the function annotation problem as a classification problem defined on 300 different Gene Ontology (GO) terms from molecular function aspect. We presented a method to form positive and negative training examples while taking into account the directed acyclic graph (DAG) structure and evidence codes of GO. We applied three different methods and their combinations. Results show that combining different methods improves prediction accuracy in most cases. The proposed method, GOPred, is available as an online computational annotation tool (http://kinaz.fen.bilkent.edu.tr/gopred).


Tags: , ,
Posted in Computatioanl biology | Comments Off

A Mechanistic Model of PCR for Accurate Quantification of Quantitative PCR Data

Written by Scott Christley et al. on August 30, 2010 – 7:00 am -

Background

Quantitative PCR (qPCR) is a workhorse laboratory technique for measuring the concentration of a target DNA sequence with high accuracy over a wide dynamic range. The gold standard method for estimating DNA concentrations via qPCR is quantification cycle () standard curve quantification, which requires the time- and labor-intensive construction of a standard curve. In theory, the shape of a qPCR data curve can be used to directly quantify DNA concentration by fitting a model to data; however, current empirical model-based quantification methods are not as reliable as standard curve quantification.

Principal Findings

We have developed a two-parameter mass action kinetic model of PCR (MAK2) that can be fitted to qPCR data in order to quantify target concentration from a single qPCR assay. To compare the accuracy of MAK2-fitting to other qPCR quantification methods, we have applied quantification methods to qPCR dilution series data generated in three independent laboratories using different target sequences. Quantification accuracy was assessed by analyzing the reliability of concentration predictions for targets at known concentrations. Our results indicate that quantification by MAK2-fitting is as reliable as standard curve quantification for a variety of DNA targets and a wide range of concentrations.

Significance

We anticipate that MAK2 quantification will have a profound effect on the way qPCR experiments are designed and analyzed. In particular, MAK2 enables accurate quantification of portable qPCR assays with limited sample throughput, where construction of a standard curve is impractical.


Tags: , ,
Posted in Computatioanl biology | Comments Off

PeakRegressor Identifies Composite Sequence Motifs Responsible for STAT1 Binding Sites and Their Potential rSNPs

Written by Scott Christley et al. on August 27, 2010 – 7:00 am -

How to identify true transcription factor binding sites on the basis of sequence motif information (e.g., motif pattern, location, combination, etc.) is an important question in bioinformatics. We present “PeakRegressor,” a system that identifies binding motifs by combining DNA-sequence data and ChIP-Seq data. PeakRegressor uses L1-norm log linear regression in order to predict peak values from binding motif candidates. Our approach successfully predicts the peak values of STAT1 and RNA Polymerase II with correlation coefficients as high as 0.65 and 0.66, respectively. Using PeakRegressor, we could identify composite motifs for STAT1, as well as potential regulatory SNPs (rSNPs) involved in the regulation of transcription levels of neighboring genes. In addition, we show that among five regression methods, L1-norm log linear regression achieves the best performance with respect to binding motif identification, biological interpretability and computational efficiency.


Tags: , ,
Posted in Computatioanl biology | Comments Off

Sensory Adaptation and Short Term Plasticity as Bayesian Correction for a Changing Brain

Written by Scott Christley et al. on August 26, 2010 – 7:00 am -

Neurons in the sensory system exhibit changes in excitability that unfold over many time scales. These fluctuations produce noise and could potentially lead to perceptual errors. However, to prevent such errors, postsynaptic neurons and synapses can adapt and counteract changes in the excitability of presynaptic neurons. Here we ask how neurons could optimally adapt to minimize the influence of changing presynaptic neural properties on their outputs. The resulting model, based on Bayesian inference, explains a range of physiological results from experiments which have measured the overall properties and detailed time-course of sensory tuning curve adaptation in the early visual cortex. We show how several experimentally measured short term plasticity phenomena can be understood as near-optimal solutions to this adaptation problem. This framework provides a link between high level computational problems, the properties of cortical neurons, and synaptic physiology.


Tags: , ,
Posted in Computatioanl biology | Comments Off

A Phenomenological Model for Predicting Melting Temperatures of DNA Sequences

Written by Scott Christley et al. on August 26, 2010 – 7:00 am -

We report here a novel method for predicting melting temperatures of DNA sequences based on a molecular-level hypothesis on the phenomena underlying the thermal denaturation of DNA. The model presented here attempts to quantify the energetic components stabilizing the structure of DNA such as base pairing, stacking, and ionic environment which are partially disrupted during the process of thermal denaturation. The model gives a Pearson product-moment correlation coefficient (r) of ~0.98 between experimental and predicted melting temperatures for over 300 sequences of varying lengths ranging from 15-mers to genomic level and at different salt concentrations. The approach is implemented as a web tool (www.scfbio-iitd.res.in/chemgenome/Tm_predictor.jsp) for the prediction of melting temperatures of DNA sequences.


Tags: , ,
Posted in Computatioanl biology | Comments Off

Coordinated Progression through Two Subtranscriptomes Underlies the Tachyzoite Cycle of Toxoplasma gondii

Written by Scott Christley et al. on August 26, 2010 – 7:00 am -

Background

Apicomplexan parasites replicate by varied and unusual processes where the typically eukaryotic expansion of cellular components and chromosome cycle are coordinated with the biosynthesis of parasite-specific structures essential for transmission.

Methodology/Principal Findings

Here we describe the global cell cycle transcriptome of the tachyzoite stage of Toxoplasma gondii. In dividing tachyzoites, more than a third of the mRNAs exhibit significant cyclical profiles whose timing correlates with biosynthetic events that unfold during daughter parasite formation. These 2,833 mRNAs have a bimodal organization with peak expression occurring in one of two transcriptional waves that are bounded by the transition into S phase and cell cycle exit following cytokinesis. The G1-subtranscriptome is enriched for genes required for basal biosynthetic and metabolic functions, similar to most eukaryotes, while the S/M-subtranscriptome is characterized by the uniquely apicomplexan requirements of parasite maturation, development of specialized organelles, and egress of infectious daughter cells. Two dozen AP2 transcription factors form a series through the tachyzoite cycle with successive sharp peaks of protein expression in the same timeframes as their mRNA patterns, indicating that the mechanisms responsible for the timing of protein delivery might be mediated by AP2 domains with different promoter recognition specificities.

Conclusion/Significance

Underlying each of the major events in apicomplexan cell cycles, and many more subordinate actions, are dynamic changes in parasite gene expression. The mechanisms responsible for cyclical gene expression timing are likely crucial to the efficiency of parasite replication and may provide new avenues for interfering with parasite growth.


Tags: , ,
Posted in Computatioanl biology | Comments Off

Trade-offs and Noise Tolerance in Signal Detection by Genetic Circuits

Written by Scott Christley et al. on August 26, 2010 – 7:00 am -

Genetic circuits can implement elaborated tasks of amplitude or frequency signal detection. What type of constraints could circuits experience in the performance of these tasks, and how are they affected by molecular noise? Here, we consider a simple detection process–a signal acting on a two-component module–to analyze these issues. We show that the presence of a feedback interaction in the detection module imposes a trade-off on amplitude and frequency detection, whose intensity depends on feedback strength. A direct interaction between the signal and the output species, in a type of feed-forward loop architecture, greatly modifies these trade-offs. Indeed, we observe that coherent feed-forward loops can act simultaneously as good frequency and amplitude noise-tolerant detectors. Alternatively, incoherent feed-forward loop structures can work as high-pass filters improving high frequency detection, and reaching noise tolerance by means of noise filtering. Analysis of experimental data from several specific coherent and incoherent feed-forward loops shows that these properties can be realized in a natural context. Overall, our results emphasize the limits imposed by circuit structure on its characteristic stimulus response, the functional plasticity of coherent feed-forward loops, and the seemingly paradoxical advantage of improving signal detection with noisy circuit components.


Tags: , ,
Posted in Computatioanl biology | Comments Off
RSS