Global transcriptional control by glucose and carbon regulator CcpA in Clostridium difficile.;Antunes A, Camiade E, Monot M, Courtois E, Barbut F, Sernova NV, Rodionov DA, Martin-Verstraete I, Dupuy B;Nucleic acids research 2012 Nov;
40(21):10701-18
[22989714]
DNA microarray analysis of the wild-type and ccpA mutant strains identified gene differentially expressed in the presence of glucose. Microarray results were confirmed by performing qRT-PCR with RNA of 15 representative genes. ChIP-on-chip experiments identified the direct target sites of CcpA in vivo. CcpA consensus sequence in the ChIP-chip peaks with enrichment factor > 5 was identified by MEME. The identified CcpA motif was then used to search for for similar CcpA binding sites for all genes identified by the ChIP-on-Chip experiment. EMSAs with 15 DNA fragments containing a putative CcpA binding sites validated the targets identified in by ChIP-chip experiments. Furthermore, the binding was abolished in DNA fragments in which the CcpA binding sites were deleted.
ChIP assay conditions
The chromatin immunoprecipitations (ChIP) were performed using three independent cultures for each strain. The C. difficile JIR8094 and its isogenic ccpA mutant (negative control) strains were grown in TYG medium until an optic density (OD)600 of 0.6. The cells were treated with formaldehyde (final concentration: 1%) and incubated for 10 min at 37°C. Cross-linking was quenched by addition of glycine (final concentration: 400 mM). Cells were then collected by centrifugation, washed twice with Tris-buffered saline (20 mM Tris–HCl pH 7.5, 150 mM NaCl) and stored at −80°C.
ChIP notes
Pellets were resuspended in 10 ml of buffer A [50 mM 4-(2-hydroxyethyl)-1-piperazineethanesulfonic acid (HEPES)–KOH pH 7.5, 150 mM NaCl, 1 mM ethylenediaminetetraacetic acid (EDTA), 1% Triton X-100, 0.1% sodium dodecyl sulphate (SDS), 0.1% sodium deoxycholate and Complete antiprotease] and sonicated in a water bath sonicator (Bioruptor, Diagenode) to shear DNA to an average size of 400–600 bp. After centrifugation, the supernatant was incubated for 3 h at 4°C in the presence of 40 µl of Dynabeads® Pan mouse IgG (Life Technologies™) on a rotating wheel. Dynabeads were then removed (non-specific immunoprecipitation), and supernatant was incubated at 4°C overnight with polyclonal anti-CcpA antibodies (1/500 dilution). The mixture was then incubated with 50 µl of Dynabeads® Pan mouse IgG (Life Technologies™) for 90 min at 4°C and washed twice with buffer A, once with buffer A plus 500 mM NaCl, once with buffer B (10 mM Tris–HCl pH 8.0, 1 mM EDTA, 0.1% Nonidet-P40,0.5% sodium deoxycholate) and once with 10 mM Tris–HCl pH 7.5, 1 mM EDTA. Antibodies bound to the magnetic beads were then eluted for 20 min at 65°C with 100 µl of elution buffer (50 mM Tris–HCl pH 7.5, 10 mM EDTA, 1% SDS). Samples were then treated with proteinase K (50 µg) for 2 h at 56°C and 6 h at 65°C in a 0.5× elution buffer. Immunoprecipitated DNA samples were finally purified using the QIAquick polymerase chain reaction (PCR) purification kit (QIAGEN). DNA obtained from immunoprecipitation was then labelled with Cy5 or Cy3 using the Bioprime plus array CGH indirect genomic labelling system (Invitrogen) as recommended by the manufacturer. Labelled DNA hybridization to microarrays and array scanning were done according to Saujet et al. The complete experimental data set were deposited in the GEO database with the accession number GSE34971.
Regulated genes for each binding site are displayed below. Gene regulation diagrams
show binding sites, positively-regulated genes,
negatively-regulated genes,
both positively and negatively regulated
genes, genes with unspecified type of regulation.
For each indvidual site, experimental techniques used to determine the site are also given.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
Quantitative Reverse-Transcription PCR is a modification of PCR in which RNA is first reverse transcribed into cDNA and this is amplified measuring the product (qPCR) in real time. It therefore allows one to analyze transcription by directly measuring the product (RNA) of a gene's transcription. If the gene is transcribed more, the starting product for PCR is larger and the corresponding volume of amplification is also larger.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
In motif discovery, we are given a set of sequences that we suspect harbor binding sites for a given transcription factor. A typical scenario is data coming from expression experiments, in which we wish to analyze the promoter region of a bunch of genes that are up- or down-regulated under some condition. The goal of motif discovery is to detect the transcription factor binding motif (i.e. the sequence “pattern” bound by the TF), by assuming that it will be overrepresented in our sample of sequences. There are different strategies to accomplish this, but the standard approach uses expectation maximization (EM) and in particular Gibbs sampling or greedy search. Popular algorithms for motif discovery are MEME, Gibbs Motif Sampler or CONSENSUS. More recently, motif discovery algorithms that make use of phylogenetic foot-printing (the idea that TF-binding site will be conserved in the promoter sequences for the same gene in different species) have become available. These are not usually applied to complement experimental work, but can be used to provide a starting point for it. Popular algorithms include FootPrinter and PhyloGibbs.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
In motif discovery, we are given a set of sequences that we suspect harbor binding sites for a given transcription factor. A typical scenario is data coming from expression experiments, in which we wish to analyze the promoter region of a bunch of genes that are up- or down-regulated under some condition. The goal of motif discovery is to detect the transcription factor binding motif (i.e. the sequence “pattern” bound by the TF), by assuming that it will be overrepresented in our sample of sequences. There are different strategies to accomplish this, but the standard approach uses expectation maximization (EM) and in particular Gibbs sampling or greedy search. Popular algorithms for motif discovery are MEME, Gibbs Motif Sampler or CONSENSUS. More recently, motif discovery algorithms that make use of phylogenetic foot-printing (the idea that TF-binding site will be conserved in the promoter sequences for the same gene in different species) have become available. These are not usually applied to complement experimental work, but can be used to provide a starting point for it. Popular algorithms include FootPrinter and PhyloGibbs.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Electro-mobility shift-assays (or gel retardation assays) are a standard way of assessing TF-binding. A fragment of DNA of interest is amplified and labeled with a fluorophore. The fragment is left to incubate in a solution containing abundant TF and non-specific DNA (e.g. randomly cleaved DNA from salmon sperm, of all things) and then a gel is run with the incubated sample and a control (sample that has not been in contact with the TF). If the TF has bound the sample, the complex will migrate more slowly than unbound DNA through the gel, and this retarded band can be used as evidence of binding. The unspecific DNA ensures that the binding is specific to the fragment of interest and that any non-specific DNA-binding proteins left-over in the TF purification will bind there, instead of on the fragment of interest. EMSAs are typically carried out in a bunch of fragments, shown as multiple double (control+experiment) lanes in a wide picture. Certain additional controls are run in at least one of the fragments to ascertain specificity. In the most basic of these, specific competitor (the fragment of interest or a known positive control, unlabelled) is added to the reaction. This should sequester the TF and hence make the retardation band disappear, proving that the binding is indeed specific
In motif discovery, we are given a set of sequences that we suspect harbor binding sites for a given transcription factor. A typical scenario is data coming from expression experiments, in which we wish to analyze the promoter region of a bunch of genes that are up- or down-regulated under some condition. The goal of motif discovery is to detect the transcription factor binding motif (i.e. the sequence “pattern” bound by the TF), by assuming that it will be overrepresented in our sample of sequences. There are different strategies to accomplish this, but the standard approach uses expectation maximization (EM) and in particular Gibbs sampling or greedy search. Popular algorithms for motif discovery are MEME, Gibbs Motif Sampler or CONSENSUS. More recently, motif discovery algorithms that make use of phylogenetic foot-printing (the idea that TF-binding site will be conserved in the promoter sequences for the same gene in different species) have become available. These are not usually applied to complement experimental work, but can be used to provide a starting point for it. Popular algorithms include FootPrinter and PhyloGibbs.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
Quantitative Reverse-Transcription PCR is a modification of PCR in which RNA is first reverse transcribed into cDNA and this is amplified measuring the product (qPCR) in real time. It therefore allows one to analyze transcription by directly measuring the product (RNA) of a gene's transcription. If the gene is transcribed more, the starting product for PCR is larger and the corresponding volume of amplification is also larger.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Electro-mobility shift-assays (or gel retardation assays) are a standard way of assessing TF-binding. A fragment of DNA of interest is amplified and labeled with a fluorophore. The fragment is left to incubate in a solution containing abundant TF and non-specific DNA (e.g. randomly cleaved DNA from salmon sperm, of all things) and then a gel is run with the incubated sample and a control (sample that has not been in contact with the TF). If the TF has bound the sample, the complex will migrate more slowly than unbound DNA through the gel, and this retarded band can be used as evidence of binding. The unspecific DNA ensures that the binding is specific to the fragment of interest and that any non-specific DNA-binding proteins left-over in the TF purification will bind there, instead of on the fragment of interest. EMSAs are typically carried out in a bunch of fragments, shown as multiple double (control+experiment) lanes in a wide picture. Certain additional controls are run in at least one of the fragments to ascertain specificity. In the most basic of these, specific competitor (the fragment of interest or a known positive control, unlabelled) is added to the reaction. This should sequester the TF and hence make the retardation band disappear, proving that the binding is indeed specific
In motif discovery, we are given a set of sequences that we suspect harbor binding sites for a given transcription factor. A typical scenario is data coming from expression experiments, in which we wish to analyze the promoter region of a bunch of genes that are up- or down-regulated under some condition. The goal of motif discovery is to detect the transcription factor binding motif (i.e. the sequence “pattern” bound by the TF), by assuming that it will be overrepresented in our sample of sequences. There are different strategies to accomplish this, but the standard approach uses expectation maximization (EM) and in particular Gibbs sampling or greedy search. Popular algorithms for motif discovery are MEME, Gibbs Motif Sampler or CONSENSUS. More recently, motif discovery algorithms that make use of phylogenetic foot-printing (the idea that TF-binding site will be conserved in the promoter sequences for the same gene in different species) have become available. These are not usually applied to complement experimental work, but can be used to provide a starting point for it. Popular algorithms include FootPrinter and PhyloGibbs.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Electro-mobility shift-assays (or gel retardation assays) are a standard way of assessing TF-binding. A fragment of DNA of interest is amplified and labeled with a fluorophore. The fragment is left to incubate in a solution containing abundant TF and non-specific DNA (e.g. randomly cleaved DNA from salmon sperm, of all things) and then a gel is run with the incubated sample and a control (sample that has not been in contact with the TF). If the TF has bound the sample, the complex will migrate more slowly than unbound DNA through the gel, and this retarded band can be used as evidence of binding. The unspecific DNA ensures that the binding is specific to the fragment of interest and that any non-specific DNA-binding proteins left-over in the TF purification will bind there, instead of on the fragment of interest. EMSAs are typically carried out in a bunch of fragments, shown as multiple double (control+experiment) lanes in a wide picture. Certain additional controls are run in at least one of the fragments to ascertain specificity. In the most basic of these, specific competitor (the fragment of interest or a known positive control, unlabelled) is added to the reaction. This should sequester the TF and hence make the retardation band disappear, proving that the binding is indeed specific
In motif discovery, we are given a set of sequences that we suspect harbor binding sites for a given transcription factor. A typical scenario is data coming from expression experiments, in which we wish to analyze the promoter region of a bunch of genes that are up- or down-regulated under some condition. The goal of motif discovery is to detect the transcription factor binding motif (i.e. the sequence “pattern” bound by the TF), by assuming that it will be overrepresented in our sample of sequences. There are different strategies to accomplish this, but the standard approach uses expectation maximization (EM) and in particular Gibbs sampling or greedy search. Popular algorithms for motif discovery are MEME, Gibbs Motif Sampler or CONSENSUS. More recently, motif discovery algorithms that make use of phylogenetic foot-printing (the idea that TF-binding site will be conserved in the promoter sequences for the same gene in different species) have become available. These are not usually applied to complement experimental work, but can be used to provide a starting point for it. Popular algorithms include FootPrinter and PhyloGibbs.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
Quantitative Reverse-Transcription PCR is a modification of PCR in which RNA is first reverse transcribed into cDNA and this is amplified measuring the product (qPCR) in real time. It therefore allows one to analyze transcription by directly measuring the product (RNA) of a gene's transcription. If the gene is transcribed more, the starting product for PCR is larger and the corresponding volume of amplification is also larger.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
Quantitative Reverse-Transcription PCR is a modification of PCR in which RNA is first reverse transcribed into cDNA and this is amplified measuring the product (qPCR) in real time. It therefore allows one to analyze transcription by directly measuring the product (RNA) of a gene's transcription. If the gene is transcribed more, the starting product for PCR is larger and the corresponding volume of amplification is also larger.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Electro-mobility shift-assays (or gel retardation assays) are a standard way of assessing TF-binding. A fragment of DNA of interest is amplified and labeled with a fluorophore. The fragment is left to incubate in a solution containing abundant TF and non-specific DNA (e.g. randomly cleaved DNA from salmon sperm, of all things) and then a gel is run with the incubated sample and a control (sample that has not been in contact with the TF). If the TF has bound the sample, the complex will migrate more slowly than unbound DNA through the gel, and this retarded band can be used as evidence of binding. The unspecific DNA ensures that the binding is specific to the fragment of interest and that any non-specific DNA-binding proteins left-over in the TF purification will bind there, instead of on the fragment of interest. EMSAs are typically carried out in a bunch of fragments, shown as multiple double (control+experiment) lanes in a wide picture. Certain additional controls are run in at least one of the fragments to ascertain specificity. In the most basic of these, specific competitor (the fragment of interest or a known positive control, unlabelled) is added to the reaction. This should sequester the TF and hence make the retardation band disappear, proving that the binding is indeed specific
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
Quantitative Reverse-Transcription PCR is a modification of PCR in which RNA is first reverse transcribed into cDNA and this is amplified measuring the product (qPCR) in real time. It therefore allows one to analyze transcription by directly measuring the product (RNA) of a gene's transcription. If the gene is transcribed more, the starting product for PCR is larger and the corresponding volume of amplification is also larger.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Electro-mobility shift-assays (or gel retardation assays) are a standard way of assessing TF-binding. A fragment of DNA of interest is amplified and labeled with a fluorophore. The fragment is left to incubate in a solution containing abundant TF and non-specific DNA (e.g. randomly cleaved DNA from salmon sperm, of all things) and then a gel is run with the incubated sample and a control (sample that has not been in contact with the TF). If the TF has bound the sample, the complex will migrate more slowly than unbound DNA through the gel, and this retarded band can be used as evidence of binding. The unspecific DNA ensures that the binding is specific to the fragment of interest and that any non-specific DNA-binding proteins left-over in the TF purification will bind there, instead of on the fragment of interest. EMSAs are typically carried out in a bunch of fragments, shown as multiple double (control+experiment) lanes in a wide picture. Certain additional controls are run in at least one of the fragments to ascertain specificity. In the most basic of these, specific competitor (the fragment of interest or a known positive control, unlabelled) is added to the reaction. This should sequester the TF and hence make the retardation band disappear, proving that the binding is indeed specific
In motif discovery, we are given a set of sequences that we suspect harbor binding sites for a given transcription factor. A typical scenario is data coming from expression experiments, in which we wish to analyze the promoter region of a bunch of genes that are up- or down-regulated under some condition. The goal of motif discovery is to detect the transcription factor binding motif (i.e. the sequence “pattern” bound by the TF), by assuming that it will be overrepresented in our sample of sequences. There are different strategies to accomplish this, but the standard approach uses expectation maximization (EM) and in particular Gibbs sampling or greedy search. Popular algorithms for motif discovery are MEME, Gibbs Motif Sampler or CONSENSUS. More recently, motif discovery algorithms that make use of phylogenetic foot-printing (the idea that TF-binding site will be conserved in the promoter sequences for the same gene in different species) have become available. These are not usually applied to complement experimental work, but can be used to provide a starting point for it. Popular algorithms include FootPrinter and PhyloGibbs.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Electro-mobility shift-assays (or gel retardation assays) are a standard way of assessing TF-binding. A fragment of DNA of interest is amplified and labeled with a fluorophore. The fragment is left to incubate in a solution containing abundant TF and non-specific DNA (e.g. randomly cleaved DNA from salmon sperm, of all things) and then a gel is run with the incubated sample and a control (sample that has not been in contact with the TF). If the TF has bound the sample, the complex will migrate more slowly than unbound DNA through the gel, and this retarded band can be used as evidence of binding. The unspecific DNA ensures that the binding is specific to the fragment of interest and that any non-specific DNA-binding proteins left-over in the TF purification will bind there, instead of on the fragment of interest. EMSAs are typically carried out in a bunch of fragments, shown as multiple double (control+experiment) lanes in a wide picture. Certain additional controls are run in at least one of the fragments to ascertain specificity. In the most basic of these, specific competitor (the fragment of interest or a known positive control, unlabelled) is added to the reaction. This should sequester the TF and hence make the retardation band disappear, proving that the binding is indeed specific
In motif discovery, we are given a set of sequences that we suspect harbor binding sites for a given transcription factor. A typical scenario is data coming from expression experiments, in which we wish to analyze the promoter region of a bunch of genes that are up- or down-regulated under some condition. The goal of motif discovery is to detect the transcription factor binding motif (i.e. the sequence “pattern” bound by the TF), by assuming that it will be overrepresented in our sample of sequences. There are different strategies to accomplish this, but the standard approach uses expectation maximization (EM) and in particular Gibbs sampling or greedy search. Popular algorithms for motif discovery are MEME, Gibbs Motif Sampler or CONSENSUS. More recently, motif discovery algorithms that make use of phylogenetic foot-printing (the idea that TF-binding site will be conserved in the promoter sequences for the same gene in different species) have become available. These are not usually applied to complement experimental work, but can be used to provide a starting point for it. Popular algorithms include FootPrinter and PhyloGibbs.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Electro-mobility shift-assays (or gel retardation assays) are a standard way of assessing TF-binding. A fragment of DNA of interest is amplified and labeled with a fluorophore. The fragment is left to incubate in a solution containing abundant TF and non-specific DNA (e.g. randomly cleaved DNA from salmon sperm, of all things) and then a gel is run with the incubated sample and a control (sample that has not been in contact with the TF). If the TF has bound the sample, the complex will migrate more slowly than unbound DNA through the gel, and this retarded band can be used as evidence of binding. The unspecific DNA ensures that the binding is specific to the fragment of interest and that any non-specific DNA-binding proteins left-over in the TF purification will bind there, instead of on the fragment of interest. EMSAs are typically carried out in a bunch of fragments, shown as multiple double (control+experiment) lanes in a wide picture. Certain additional controls are run in at least one of the fragments to ascertain specificity. In the most basic of these, specific competitor (the fragment of interest or a known positive control, unlabelled) is added to the reaction. This should sequester the TF and hence make the retardation band disappear, proving that the binding is indeed specific
In motif discovery, we are given a set of sequences that we suspect harbor binding sites for a given transcription factor. A typical scenario is data coming from expression experiments, in which we wish to analyze the promoter region of a bunch of genes that are up- or down-regulated under some condition. The goal of motif discovery is to detect the transcription factor binding motif (i.e. the sequence “pattern” bound by the TF), by assuming that it will be overrepresented in our sample of sequences. There are different strategies to accomplish this, but the standard approach uses expectation maximization (EM) and in particular Gibbs sampling or greedy search. Popular algorithms for motif discovery are MEME, Gibbs Motif Sampler or CONSENSUS. More recently, motif discovery algorithms that make use of phylogenetic foot-printing (the idea that TF-binding site will be conserved in the promoter sequences for the same gene in different species) have become available. These are not usually applied to complement experimental work, but can be used to provide a starting point for it. Popular algorithms include FootPrinter and PhyloGibbs.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
Quantitative Reverse-Transcription PCR is a modification of PCR in which RNA is first reverse transcribed into cDNA and this is amplified measuring the product (qPCR) in real time. It therefore allows one to analyze transcription by directly measuring the product (RNA) of a gene's transcription. If the gene is transcribed more, the starting product for PCR is larger and the corresponding volume of amplification is also larger.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
In motif discovery, we are given a set of sequences that we suspect harbor binding sites for a given transcription factor. A typical scenario is data coming from expression experiments, in which we wish to analyze the promoter region of a bunch of genes that are up- or down-regulated under some condition. The goal of motif discovery is to detect the transcription factor binding motif (i.e. the sequence “pattern” bound by the TF), by assuming that it will be overrepresented in our sample of sequences. There are different strategies to accomplish this, but the standard approach uses expectation maximization (EM) and in particular Gibbs sampling or greedy search. Popular algorithms for motif discovery are MEME, Gibbs Motif Sampler or CONSENSUS. More recently, motif discovery algorithms that make use of phylogenetic foot-printing (the idea that TF-binding site will be conserved in the promoter sequences for the same gene in different species) have become available. These are not usually applied to complement experimental work, but can be used to provide a starting point for it. Popular algorithms include FootPrinter and PhyloGibbs.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Electro-mobility shift-assays (or gel retardation assays) are a standard way of assessing TF-binding. A fragment of DNA of interest is amplified and labeled with a fluorophore. The fragment is left to incubate in a solution containing abundant TF and non-specific DNA (e.g. randomly cleaved DNA from salmon sperm, of all things) and then a gel is run with the incubated sample and a control (sample that has not been in contact with the TF). If the TF has bound the sample, the complex will migrate more slowly than unbound DNA through the gel, and this retarded band can be used as evidence of binding. The unspecific DNA ensures that the binding is specific to the fragment of interest and that any non-specific DNA-binding proteins left-over in the TF purification will bind there, instead of on the fragment of interest. EMSAs are typically carried out in a bunch of fragments, shown as multiple double (control+experiment) lanes in a wide picture. Certain additional controls are run in at least one of the fragments to ascertain specificity. In the most basic of these, specific competitor (the fragment of interest or a known positive control, unlabelled) is added to the reaction. This should sequester the TF and hence make the retardation band disappear, proving that the binding is indeed specific
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Electro-mobility shift-assays (or gel retardation assays) are a standard way of assessing TF-binding. A fragment of DNA of interest is amplified and labeled with a fluorophore. The fragment is left to incubate in a solution containing abundant TF and non-specific DNA (e.g. randomly cleaved DNA from salmon sperm, of all things) and then a gel is run with the incubated sample and a control (sample that has not been in contact with the TF). If the TF has bound the sample, the complex will migrate more slowly than unbound DNA through the gel, and this retarded band can be used as evidence of binding. The unspecific DNA ensures that the binding is specific to the fragment of interest and that any non-specific DNA-binding proteins left-over in the TF purification will bind there, instead of on the fragment of interest. EMSAs are typically carried out in a bunch of fragments, shown as multiple double (control+experiment) lanes in a wide picture. Certain additional controls are run in at least one of the fragments to ascertain specificity. In the most basic of these, specific competitor (the fragment of interest or a known positive control, unlabelled) is added to the reaction. This should sequester the TF and hence make the retardation band disappear, proving that the binding is indeed specific
In motif discovery, we are given a set of sequences that we suspect harbor binding sites for a given transcription factor. A typical scenario is data coming from expression experiments, in which we wish to analyze the promoter region of a bunch of genes that are up- or down-regulated under some condition. The goal of motif discovery is to detect the transcription factor binding motif (i.e. the sequence “pattern” bound by the TF), by assuming that it will be overrepresented in our sample of sequences. There are different strategies to accomplish this, but the standard approach uses expectation maximization (EM) and in particular Gibbs sampling or greedy search. Popular algorithms for motif discovery are MEME, Gibbs Motif Sampler or CONSENSUS. More recently, motif discovery algorithms that make use of phylogenetic foot-printing (the idea that TF-binding site will be conserved in the promoter sequences for the same gene in different species) have become available. These are not usually applied to complement experimental work, but can be used to provide a starting point for it. Popular algorithms include FootPrinter and PhyloGibbs.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
In motif discovery, we are given a set of sequences that we suspect harbor binding sites for a given transcription factor. A typical scenario is data coming from expression experiments, in which we wish to analyze the promoter region of a bunch of genes that are up- or down-regulated under some condition. The goal of motif discovery is to detect the transcription factor binding motif (i.e. the sequence “pattern” bound by the TF), by assuming that it will be overrepresented in our sample of sequences. There are different strategies to accomplish this, but the standard approach uses expectation maximization (EM) and in particular Gibbs sampling or greedy search. Popular algorithms for motif discovery are MEME, Gibbs Motif Sampler or CONSENSUS. More recently, motif discovery algorithms that make use of phylogenetic foot-printing (the idea that TF-binding site will be conserved in the promoter sequences for the same gene in different species) have become available. These are not usually applied to complement experimental work, but can be used to provide a starting point for it. Popular algorithms include FootPrinter and PhyloGibbs.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Electro-mobility shift-assays (or gel retardation assays) are a standard way of assessing TF-binding. A fragment of DNA of interest is amplified and labeled with a fluorophore. The fragment is left to incubate in a solution containing abundant TF and non-specific DNA (e.g. randomly cleaved DNA from salmon sperm, of all things) and then a gel is run with the incubated sample and a control (sample that has not been in contact with the TF). If the TF has bound the sample, the complex will migrate more slowly than unbound DNA through the gel, and this retarded band can be used as evidence of binding. The unspecific DNA ensures that the binding is specific to the fragment of interest and that any non-specific DNA-binding proteins left-over in the TF purification will bind there, instead of on the fragment of interest. EMSAs are typically carried out in a bunch of fragments, shown as multiple double (control+experiment) lanes in a wide picture. Certain additional controls are run in at least one of the fragments to ascertain specificity. In the most basic of these, specific competitor (the fragment of interest or a known positive control, unlabelled) is added to the reaction. This should sequester the TF and hence make the retardation band disappear, proving that the binding is indeed specific
In motif discovery, we are given a set of sequences that we suspect harbor binding sites for a given transcription factor. A typical scenario is data coming from expression experiments, in which we wish to analyze the promoter region of a bunch of genes that are up- or down-regulated under some condition. The goal of motif discovery is to detect the transcription factor binding motif (i.e. the sequence “pattern” bound by the TF), by assuming that it will be overrepresented in our sample of sequences. There are different strategies to accomplish this, but the standard approach uses expectation maximization (EM) and in particular Gibbs sampling or greedy search. Popular algorithms for motif discovery are MEME, Gibbs Motif Sampler or CONSENSUS. More recently, motif discovery algorithms that make use of phylogenetic foot-printing (the idea that TF-binding site will be conserved in the promoter sequences for the same gene in different species) have become available. These are not usually applied to complement experimental work, but can be used to provide a starting point for it. Popular algorithms include FootPrinter and PhyloGibbs.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
Quantitative Reverse-Transcription PCR is a modification of PCR in which RNA is first reverse transcribed into cDNA and this is amplified measuring the product (qPCR) in real time. It therefore allows one to analyze transcription by directly measuring the product (RNA) of a gene's transcription. If the gene is transcribed more, the starting product for PCR is larger and the corresponding volume of amplification is also larger.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Electro-mobility shift-assays (or gel retardation assays) are a standard way of assessing TF-binding. A fragment of DNA of interest is amplified and labeled with a fluorophore. The fragment is left to incubate in a solution containing abundant TF and non-specific DNA (e.g. randomly cleaved DNA from salmon sperm, of all things) and then a gel is run with the incubated sample and a control (sample that has not been in contact with the TF). If the TF has bound the sample, the complex will migrate more slowly than unbound DNA through the gel, and this retarded band can be used as evidence of binding. The unspecific DNA ensures that the binding is specific to the fragment of interest and that any non-specific DNA-binding proteins left-over in the TF purification will bind there, instead of on the fragment of interest. EMSAs are typically carried out in a bunch of fragments, shown as multiple double (control+experiment) lanes in a wide picture. Certain additional controls are run in at least one of the fragments to ascertain specificity. In the most basic of these, specific competitor (the fragment of interest or a known positive control, unlabelled) is added to the reaction. This should sequester the TF and hence make the retardation band disappear, proving that the binding is indeed specific
In motif discovery, we are given a set of sequences that we suspect harbor binding sites for a given transcription factor. A typical scenario is data coming from expression experiments, in which we wish to analyze the promoter region of a bunch of genes that are up- or down-regulated under some condition. The goal of motif discovery is to detect the transcription factor binding motif (i.e. the sequence “pattern” bound by the TF), by assuming that it will be overrepresented in our sample of sequences. There are different strategies to accomplish this, but the standard approach uses expectation maximization (EM) and in particular Gibbs sampling or greedy search. Popular algorithms for motif discovery are MEME, Gibbs Motif Sampler or CONSENSUS. More recently, motif discovery algorithms that make use of phylogenetic foot-printing (the idea that TF-binding site will be conserved in the promoter sequences for the same gene in different species) have become available. These are not usually applied to complement experimental work, but can be used to provide a starting point for it. Popular algorithms include FootPrinter and PhyloGibbs.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
In motif discovery, we are given a set of sequences that we suspect harbor binding sites for a given transcription factor. A typical scenario is data coming from expression experiments, in which we wish to analyze the promoter region of a bunch of genes that are up- or down-regulated under some condition. The goal of motif discovery is to detect the transcription factor binding motif (i.e. the sequence “pattern” bound by the TF), by assuming that it will be overrepresented in our sample of sequences. There are different strategies to accomplish this, but the standard approach uses expectation maximization (EM) and in particular Gibbs sampling or greedy search. Popular algorithms for motif discovery are MEME, Gibbs Motif Sampler or CONSENSUS. More recently, motif discovery algorithms that make use of phylogenetic foot-printing (the idea that TF-binding site will be conserved in the promoter sequences for the same gene in different species) have become available. These are not usually applied to complement experimental work, but can be used to provide a starting point for it. Popular algorithms include FootPrinter and PhyloGibbs.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
In motif discovery, we are given a set of sequences that we suspect harbor binding sites for a given transcription factor. A typical scenario is data coming from expression experiments, in which we wish to analyze the promoter region of a bunch of genes that are up- or down-regulated under some condition. The goal of motif discovery is to detect the transcription factor binding motif (i.e. the sequence “pattern” bound by the TF), by assuming that it will be overrepresented in our sample of sequences. There are different strategies to accomplish this, but the standard approach uses expectation maximization (EM) and in particular Gibbs sampling or greedy search. Popular algorithms for motif discovery are MEME, Gibbs Motif Sampler or CONSENSUS. More recently, motif discovery algorithms that make use of phylogenetic foot-printing (the idea that TF-binding site will be conserved in the promoter sequences for the same gene in different species) have become available. These are not usually applied to complement experimental work, but can be used to provide a starting point for it. Popular algorithms include FootPrinter and PhyloGibbs.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Electro-mobility shift-assays (or gel retardation assays) are a standard way of assessing TF-binding. A fragment of DNA of interest is amplified and labeled with a fluorophore. The fragment is left to incubate in a solution containing abundant TF and non-specific DNA (e.g. randomly cleaved DNA from salmon sperm, of all things) and then a gel is run with the incubated sample and a control (sample that has not been in contact with the TF). If the TF has bound the sample, the complex will migrate more slowly than unbound DNA through the gel, and this retarded band can be used as evidence of binding. The unspecific DNA ensures that the binding is specific to the fragment of interest and that any non-specific DNA-binding proteins left-over in the TF purification will bind there, instead of on the fragment of interest. EMSAs are typically carried out in a bunch of fragments, shown as multiple double (control+experiment) lanes in a wide picture. Certain additional controls are run in at least one of the fragments to ascertain specificity. In the most basic of these, specific competitor (the fragment of interest or a known positive control, unlabelled) is added to the reaction. This should sequester the TF and hence make the retardation band disappear, proving that the binding is indeed specific
In motif discovery, we are given a set of sequences that we suspect harbor binding sites for a given transcription factor. A typical scenario is data coming from expression experiments, in which we wish to analyze the promoter region of a bunch of genes that are up- or down-regulated under some condition. The goal of motif discovery is to detect the transcription factor binding motif (i.e. the sequence “pattern” bound by the TF), by assuming that it will be overrepresented in our sample of sequences. There are different strategies to accomplish this, but the standard approach uses expectation maximization (EM) and in particular Gibbs sampling or greedy search. Popular algorithms for motif discovery are MEME, Gibbs Motif Sampler or CONSENSUS. More recently, motif discovery algorithms that make use of phylogenetic foot-printing (the idea that TF-binding site will be conserved in the promoter sequences for the same gene in different species) have become available. These are not usually applied to complement experimental work, but can be used to provide a starting point for it. Popular algorithms include FootPrinter and PhyloGibbs.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Electro-mobility shift-assays (or gel retardation assays) are a standard way of assessing TF-binding. A fragment of DNA of interest is amplified and labeled with a fluorophore. The fragment is left to incubate in a solution containing abundant TF and non-specific DNA (e.g. randomly cleaved DNA from salmon sperm, of all things) and then a gel is run with the incubated sample and a control (sample that has not been in contact with the TF). If the TF has bound the sample, the complex will migrate more slowly than unbound DNA through the gel, and this retarded band can be used as evidence of binding. The unspecific DNA ensures that the binding is specific to the fragment of interest and that any non-specific DNA-binding proteins left-over in the TF purification will bind there, instead of on the fragment of interest. EMSAs are typically carried out in a bunch of fragments, shown as multiple double (control+experiment) lanes in a wide picture. Certain additional controls are run in at least one of the fragments to ascertain specificity. In the most basic of these, specific competitor (the fragment of interest or a known positive control, unlabelled) is added to the reaction. This should sequester the TF and hence make the retardation band disappear, proving that the binding is indeed specific
In motif discovery, we are given a set of sequences that we suspect harbor binding sites for a given transcription factor. A typical scenario is data coming from expression experiments, in which we wish to analyze the promoter region of a bunch of genes that are up- or down-regulated under some condition. The goal of motif discovery is to detect the transcription factor binding motif (i.e. the sequence “pattern” bound by the TF), by assuming that it will be overrepresented in our sample of sequences. There are different strategies to accomplish this, but the standard approach uses expectation maximization (EM) and in particular Gibbs sampling or greedy search. Popular algorithms for motif discovery are MEME, Gibbs Motif Sampler or CONSENSUS. More recently, motif discovery algorithms that make use of phylogenetic foot-printing (the idea that TF-binding site will be conserved in the promoter sequences for the same gene in different species) have become available. These are not usually applied to complement experimental work, but can be used to provide a starting point for it. Popular algorithms include FootPrinter and PhyloGibbs.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.
The principle of ChIP-chip is simple. The first step is to cross-link the protein-DNA complex. This is done using a fixating agent, such as formaldehyde. The cross-linking can later be reversed with heat. Cross-linking kills the cell, giving a snapshot of the bound TF at a given time. The cell is then lysed, the DNA sheared by sonication and the chromatin[2] (TF-DNA complexes) is pulled down using an antibody (i.e. immunoprecipitated). If an antibody for the TF is available, then it is used; otherwise, the TF is tagged with an epitope targeted by commercially available antibodies (the latter option is cheaper, but runs the risk of altering the TF's functionality). Cross-linking is then reversed to free the bound DNA, which is then amplified, labeled with a fluorophore and dumped onto a DNA-array. The scanned array reveals the genomic regions bound by the TF. The resolution is around ~500 bp as a result of the sonication step.
DNA-arrays (or DNA-chips or microarrays) are flat slabs of glass, silicon or plastic onto which thousands of multiple short single-stranded (ss) DNA sequences (corresponding to small regions of a genome) have been attached. After performing a mRNA extraction in induced and non-induced cells, the mRNA is again reverse transcribed, but here the reaction is tweaked, so that the emerging cDNA contains nucleotides marked with different fluorophores for controls and experiment. Targets will hybridize by base-pairing with those probes that resemble them the most. The array can then be stimulated by a laser and scanned for fluorescence at two different wavelengths (control and induced). The ratio or log-ratio between the two fluorescence intensities corresponds to the induction level.
Electro-mobility shift-assays (or gel retardation assays) are a standard way of assessing TF-binding. A fragment of DNA of interest is amplified and labeled with a fluorophore. The fragment is left to incubate in a solution containing abundant TF and non-specific DNA (e.g. randomly cleaved DNA from salmon sperm, of all things) and then a gel is run with the incubated sample and a control (sample that has not been in contact with the TF). If the TF has bound the sample, the complex will migrate more slowly than unbound DNA through the gel, and this retarded band can be used as evidence of binding. The unspecific DNA ensures that the binding is specific to the fragment of interest and that any non-specific DNA-binding proteins left-over in the TF purification will bind there, instead of on the fragment of interest. EMSAs are typically carried out in a bunch of fragments, shown as multiple double (control+experiment) lanes in a wide picture. Certain additional controls are run in at least one of the fragments to ascertain specificity. In the most basic of these, specific competitor (the fragment of interest or a known positive control, unlabelled) is added to the reaction. This should sequester the TF and hence make the retardation band disappear, proving that the binding is indeed specific
In motif discovery, we are given a set of sequences that we suspect harbor binding sites for a given transcription factor. A typical scenario is data coming from expression experiments, in which we wish to analyze the promoter region of a bunch of genes that are up- or down-regulated under some condition. The goal of motif discovery is to detect the transcription factor binding motif (i.e. the sequence “pattern” bound by the TF), by assuming that it will be overrepresented in our sample of sequences. There are different strategies to accomplish this, but the standard approach uses expectation maximization (EM) and in particular Gibbs sampling or greedy search. Popular algorithms for motif discovery are MEME, Gibbs Motif Sampler or CONSENSUS. More recently, motif discovery algorithms that make use of phylogenetic foot-printing (the idea that TF-binding site will be conserved in the promoter sequences for the same gene in different species) have become available. These are not usually applied to complement experimental work, but can be used to provide a starting point for it. Popular algorithms include FootPrinter and PhyloGibbs.
Once the binding motif for a TF is known, this motif (which essentially defines a pattern) can be used to scan sequences in order to search for putative TF-binding site. This is useful, for instance, when trying to identify TF-binding site in ChIP-chip data. Searching for TF-binding site can be done in numerous ways. The most basic method is consensus search, in sequences are scored according to how many mismatches they have with the consensus sequence for the motif. A more elaborate way of searching involves using regular expressions, which allow to search for more loosely defined motifs [e.g. C(C/G)AT]. Common algorithms for this type of search include Pattern Locator and the DNA Pattern Find method of the SMS2 suite, but also some word processors. Finally, the mainstream way of conducting TF-binding site search is through the use of position-specific scoring matrices, which basically count the occurrences of each base at each position of the motif and use the inferred frequencies to score candidate sites. Algorithms in this last category include TFSEARCH, FITOM, CONSITE, TESS and MatInspector.