C2C2-GATA Genomic sequences

Minimum number of tobacco C2C2-GATA genes: 28

Count of tobacco C2C2-GATA sequences: 31

Pfam accession: GATA

SHOULD possess GATA domain and COULD   possess CCT FAR1 Zim domains

This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contains a single copy of the domain.

A number of transcription factors (including erythroid-specific transcription factor and nitrogen regulatory proteins), specifically bind the DNA sequence (A/T)GATA(A/G) in the regulatory regions of genes. They are consequently termed GATA-binding transcription factors. The interactions occur via highly-conserved zinc finger domains in which the zinc ion is coordinated by 4 cysteine residues. NMR studies have shown the core of the zinc finger to comprise 2 irregular anti-parallel beta-sheets and an alpha-helix, followed by a long loop to the C-terminal end of the finger. The N-terminal part, which includes the helix, is similar in structure, but not sequence, to the N-terminal zinc module of the glucocorticoid receptor DNA-binding domain. The helix and the loop connecting the 2 beta-sheets interact with the major groove of the DNA, while the C-terminal tail wraps around into the minor groove. It is this tail that is the essential determinant of specific binding. Interactions between the zinc finger and DNA are mainly hydrophobic, explaining the preponderance of thymines in the binding site; a large number of interactions with the phosphate backbone have also been observed.Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins that only contain a single copy of the domain.

GATA factors were first identified as proteins that interact with conserved WGATAR (W = T or A; R = G or A) motifs involved in erythroid-specific gene expression in vertebrates.

GATA factors are characterised by the presence of conserved, type-IV zinc-finger motifs Animal factors typically contain two C-x2-Cx17-C-x2-C zinc-finger domains.The majority of known fungal GATA factors contain a single C-x2-C-x17-C-x2-C finger with greatest similarity to the carboxyl (C) terminal finger of animal GATA factors. Several examples of fungal GATA factors containing a variant C-x2-C-x18-C-x2-C DNA-binding domain are also known.

Examples of both C-x2-C-x17-Cx2-C (Type IVa) and C-x2-C-x18-C-x2-C (Type IVb) GATA factors are found within fungi; animals onlycontain the former configuration, and plants only the latter. Plant GATA factors typically contain a single zinc finger. The Arabidopsis type-IV zinc-finger proteins may represent the previously defined family of nuclear GATA-binding proteins implicated in light-responsive transcription.



Papers

Sugimoto,K., Takeda,S. and Hirochika,H. Transcriptional activation mediated by binding of a plant GATA-type zinc finger protein AGP1 to the AG-motif (AGATCCAA)of the wound-inducible Myb gene NtMyb2 PLANT JOURNAL 2003 36 (4): 550-564 PMID: 14617085

Teakle GR, Kay SA The GATA-binding protein CGF-1 is closely related to GT-1 PLANT MOLECULAR BIOLOGY 1995 29 (6): 1253-1266 PMID: 8616222

Lowry, JA; Atchley, WR. Molecular evolution of the GATA family of transcription factors: conservation within the DNA-binding domain. J. Mol. Evol. 2000. 50(2):103-15 PMID: 10684344

Omichinski, JG; Clore, GM; Schaad, O; Felsenfeld, G; Trainor, C; Appella, E; Stahl, SJ; Gronenborn, AM. NMR structure of a specific DNA complex of Zn-containing DNA binding domain of GATA-1. Science 1993. 261(5120):438-46 PMID: 8332909

Takatsuji, H. Zinc-finger transcription factors in plants. Cell. Mol. Life Sci. 1998. 54(6):582-96 PMID: 9676577



Number of contigs: 31

Number of singlets: 5

Number of N terminal – 8

Number of c terminal – 3

Number of full – 20

Total minimum number – 28




Search sequences and info


Transcription factor sequences
Locus
Description
NCBI
C2C2-GATA_1 [comment=full] ET045435
ET051057
C2C2-GATA_2 [comment=full] ET046927
ET042576
C2C2-GATA_4 [comment=full] ET045167
ET043287
C2C2-GATA_7 [comment=full] ET043853
ET050614
C2C2-GATA_8 [comment=N terminal] ET048460
ET047453
C2C2-GATA_9 [comment=N terminal] ET047443
ET041901
ET050416
C2C2-GATA_10 [comment=N terminal] ET045594
ET042653
ET044898
C2C2-GATA_11 [comment=N terminal] ET044072
ET050514
ET050515
C2C2-GATA_12 [comment=C terminal] ET042511
ET049434
ET044332
C2C2-GATA_13 [comment=full Nicotiana tabacum AGP3 for AG-motif binding protein-3] ET044619
ET044279
ET047544
ET048766
C2C2-GATA_14 [comment=full] ET043188
ET045212
ET047723
ET041878
C2C2-GATA_15 [comment=full] ET047313
ET047314
ET045632
ET045631
C2C2-GATA_16 [comment=full Nicotiana tabacum AGP1 for AG-motif binding protein-1] ET044002
ET051857
ET043056
ET045630
C2C2-GATA_17 [comment=full] ET046128
ET045711
ET051830
ET049481
C2C2-GATA_18 [comment=full] ET049620
ET045894
ET049611
ET047244
C2C2-GATA_19 [comment=full Nicotiana tabacum AGP4 for AG-motif binding protein-4] ET045964
ET047589
ET050965
ET043894
ET051334
C2C2-GATA_20 [comment=full] ET043851
ET044345
ET043935
ET043700
ET043832
C2C2-GATA_21 [comment=full] ET045862
ET044357
ET044358
ET042003
ET051451
C2C2-GATA_22 [comment=full] ET048665
ET049272
ET045939
ET045967
ET046187
ET049542
C2C2-GATA_23 [comment=N terminal] ET048379
ET049089
ET048429
ET048223
ET048378
ET044126
C2C2-GATA_24 [comment=C terminal] ET047442
ET048461
ET041668
ET049228
ET047534
ET051420
C2C2-GATA_25 [comment=full] ET048288
ET048997
ET049910
ET041759
ET041760
ET048287
C2C2-GATA_26 [comment=full] ET047321
ET041864
ET045785
ET050771
ET047107
ET047467
ET047257
C2C2-GATA_27 [comment=C terminal] ET046810
ET051487
ET046812
ET047294
ET048120
ET046809
ET044821
C2C2-GATA_28 [comment=full] ET045033
ET045262
ET041841
ET045263
ET042327
ET042325
ET041840
C2C2-GATA_29 [comment=full] ET050551
ET049084
ET045892
ET045290
ET045879
ET045966
ET047454
C2C2-GATA_30 [comment=full] ET045687
ET050896
ET043288
ET050544
ET050545
ET045434
ET042516
ET047190
C2C2-GATA_31 [comment=full] ET041728
ET042559
ET042558
ET045926
ET042262
ET043516
ET041867
ET045971
ET044248
C2C2-GATA_32 [comment=N terminal] ET049433
C2C2-GATA_33 [comment=N terminal] ET044331
C2C2-GATA_34 [comment=N terminal] ET043635


Tobacco published genes related to transcription factor family C2C2-GATA
Family Genbank ID Name
C2C2-GATA AB107693 AGP5
C2C2-GATA AB107692 AGP4
C2C2-GATA AB107691 AGP3
C2C2-GATA AB107690 AGP2
C2C2-GATA AB107689 AGP1
C2C2-GATA X73111 GATA-1
Authors of this site:

Paul J Rushton
Marta T. Bokowiec
Xianfeng (Jeff) Chen
Thomas (Tom) W Laudeman
Jennifer F. Brannock
Michael P. Timko

Contact:

pr8y@virginia.edu