About CollecTF

CollecTF, a play on words using the French collectif [collective] and the acronym for transcription factor [TF], is a database of transcription factor binding sites (TFBS) in the Bacteria domain.

CollecTF aims at becoming a reference, highly-accessed database by relying on its ability to customize navigation and data extraction, its relevance to the community, the quality and detail of the stored data and the up-to-date nature of the stored information.


  • CollecTF exclusively stores data on experimentally-validated TFBS and places special emphasis on providing a transparent curation process that captures the experimental support for sites as reported by authors in peer-reviewed publications.
  • The CollecTF curation model combines direct author submissions and in-house curation. If you have data to submit, please do contribute it!
  • Reported binding sites are mapped to NCBI RefSeq complete genome records to provide genomic context and the experimental process is detailed during curation. Curation records are open for inspection. Please, let us know if you spot an error.
  • CollecTF focuses on providing fully customizable access to the stored data. The database can be browsed by transcription factor families, NCBI taxonomy or experimental support, or through customized searches integrating these three elements that allow users to specify the desired level of experimental support.
  • CollecTF reports export to common FASTA and flat-file formats, and periodic flat-file releases of the database are available for download.
  • CollecTF data is periodically submitted to NCBI, where it is integrated into RefSeq complete genome records through db_xref links.
  • CollecTF generates specific records for all UniProtKB identifiers in the database, and these records are cross-linked in the corresponding UniProtKB entry through a db_xref field.
  • CollecTF generates Gene Ontology (GO) annotations to formalize the relationship between a transcription factor and the chromosomal DNA it binds and the genes upon which such binding has a transcriptionalregulatory effect. The suporting experimental evidence is mapped to standardized Evidence Ontology (ECO) terms.