Searched with the B3 domain from VP1 (from Zea mays) and ABI3. B3 domains are also found in RAV (AP2 family) factors but only 6 of 147 Arabidopsis ERF genes are of this type.
>ABI3 B3 tffamily=ABI3-VP1 drrqgwkpeknlrfllqkvlkqsdvgnlgrivlpkkeaethlpeleardg islamedigtsrvwnmryrfwpnnksrmyllentgdfvktnglqegdfiv iysdvkcgkylirgvkvrq >VP1 B3 tffamily=ABI3-VP1 dkrqgakadknlrfllqkvlkqsdvgslgrivlpkkeaevhlpelktrdg isipmedigtsrvwnmryrfwpnnksrmyllentgefvrsnelqegdfiv iysdvksgkylirgvkv
These proteins are related to the PHD finger proteins and a subgroup may not be merited, especially as the seven Arabidopsis proteins are described as PHD finger family proteins.
Searched with the Arabidopsis genes
>AT1G14510.1 Arabidopsis Alfin family transcription factor, protein sequence 252AA tffamily=Alfin MEGIQHPIPRTVEEVFSDFRGRRAGLIKALSTDVQKFYHQCDPEKENLCLYGLPNETWEVNLPVEEVPPELPEPALGINF ARDGMQEKDWISLVAVHSDSWLISVAFYFGARFGFGKNERKRLFQMINDLPTIFEVVTGNAKQSKDQSANHNSSRSKSSG GKPRHSESHTKASKMSPPPRKEDESGDEDEDDEQGAVCGACGDNYGGDEFWICCDACEKWFHGKCVKITPAKAEHIKHYK CPSCTTSKKMKA >AT2G02470.1 Arabidopsis Alfin family transcription factor, protein sequence 256AA tffamily=Alfin MEGITHPIPRTVEEVFSDFRGRRAGLIKALTNDMVKFYQTCDPEKENLCLYGLPNETWEVNLPVEEVPPELPEPALGINF ARDGMQEKDWVSLVAVHSDSWLLSVAFYFGARFGFGKNERKRLFQMINELPTIFEVVSGNAKQSKDLSVNNNNSKSKPSG VKSRQSESLSKVAKMSSPPPKEEEEEEDESEDESEDDEQGAVCGACGDNYGTDEFWICCDACEKWFHGKCVKITPAKAEH IKHYKCPTCSNKRARP >AT3G11200.1 Arabidopsis Alfin family transcription factor, protein sequence 246AA tffamily=Alfin MAAAAVSSNPRTVEEIFKDYSARRAALLRALTKDVDDFYSQCDPEKENLCLYGHPNESWEVNLPAEEVPPELPEPALGIN FARDGMQRKDWLSLVAVHSDCWLLSVSFYFGARLNRNERKRLFSLINDLPTLFDVVTGRKAMKDNKPSSDSGSKSRNGTK RSIDGQTKSSTPKLMEESYEEEEEEDEHGDTLCGSCGGHYTNEEFWICCDVCERWYHGKCVKITPAKAESIKQYKCPPCC AKKGRQ >AT3G11200.2 Arabidopsis Alfin family transcription factor, protein sequence 233AA tffamily=Alfin MRSGYERFRLLDTLLCVLLRFDFNFWVFVVIEKENLCLYGHPNESWEVNLPAEEVPPELPEPALGINFARDGMQRKDWLS LVAVHSDCWLLSVSFYFGARLNRNERKRLFSLINDLPTLFDVVTGRKAMKDNKPSSDSGSKSRNGTKRSIDGQTKSSTPK LMEESYEEEEEEDEHGDTLCGSCGGHYTNEEFWICCDVCERWYHGKCVKITPAKAESIKQYKCPPCCAKKGRQ >AT3G42790.1 Arabidopsis Alfin family transcription factor, protein sequence 250AA tffamily=Alfin MEGGAALYNPRTVEEVFKDFKGRRTAIVKALTTDVQEFYQQCDPEKENLCLYGLPNEEWEVNLPAEEVPPELPEPALGIN FARDGLSEKEWLSLVAIHSDAWLLSVSFYFGSRFSFHKEERKRLFNMINDVPTIFEVVTGMAKAKDKSSAANQNGNKSKS NSKVRTSEGKSSKTKQPKEEDEEIDEDDEDDHGETLCGACGDSDGADEFWICCDLCEKWFHGKCVKITPARAEHIKQYKC PSCSNKRARA >AT5G05610.1 Arabidopsis Alfin family transcription factor, protein sequence 241AA tffamily=Alfin MAAESSNPRTVEEIFKDFSGRRSGFLRALSVDVDKFYSLCDPEMENLCLYGHPNGTWEVNLPAEEVPPELPEPALGINFA RDGMQRKDWLSLVAVHSDCWLLSVSSYFGARLNRNERKRLFSLINDLPTLFEVVTGRKPIKDGKPSMDLGSKSRNGVKRS IEGQTKSTPKLMEESYEDEDDEHGDTLCGSCGGNYTNDEFWICCDVCERWYHGKCVKITPAKAESIKQYKCPSCCTKKGR Q >AT5G05610.2 Arabidopsis Alfin family transcription factor, protein sequence 241AA tffamily=Alfin MAAESSNPRTVEEIFKDFSGRRSGFLRALSVDVDKFYSLCDPEMENLCLYGHPNGTWEVNLPAEEVPPELPEPALGINFA RDGMQRKDWLSLVAVHSDCWLLSVSSYFGARLNRNERKRLFSLINDLPTLFEVVTGRKPIKDGKPSMDLGSKSRNGVKRS IEGQTKSTPKLMEESYEDEDDEHGDTLCGSCGGNYTNDEFWICCDVCERWYHGKCVKITPAKAESIKQYKCPSCCTKKGR Q >AT5G20510.1 Arabidopsis Alfin family transcription factor, protein sequence 260AA tffamily=Alfin MEGGTAHYSPRTVEEVFRDFKGRRAGIIQALTTDVEDFFQQCDPEKQNLCLYGFPNEVWEVNLPAEEVPPELPEPALGIN FARDGMQERNWLSLVAVHSDAWLLSVSFYFGSRFGFDRADRKRLFSMINEVPTVYEVVTGNAEKQTKEMPSSANQNGNRS KSNSKMRGLESKSSKTIHAKDEEEGLELEEGEEEEDEDEDEHGETLCGACGDNYASDEFWICCDMCEKWFHGECVKITPA RAEHIKHYKCPTCSNKRARP >AT5G26210.1 Arabidopsis Alfin family transcription factor, protein sequence 255AA tffamily=Alfin MEAGGAYNPRTVEEVFRDFKGRRAGMIKALTTDVQEFFRLCDPEKENLCLYGHPNEHWEVNLPAEEVPPELPEPVLGINF ARDGMAEKDWLSLVAVHSDAWLLAVAFFFGARFGFDKADRKRLFNMVNDLPTIFEVVAGTAKKQGKDKSSVSNNSSNRSK SSSKRGSESRAKFSKPEPKDDEEEEEEGVEEEDEDEQGETQCGACGESYAADEFWICCDLCEMWFHGKCVKITPARAEHI KQYKCPSCSNKRARS
AP2 domain
This contains the AP2-like domains but excludes the ERFs. Typically, they have two AP2 domains and lack the conserved WLG and the AEIRD parts of ERFs. Instead they have a conserved YLG.
Searches
R1 repeat contains WESHI or YEAH at the 5 prime end and LAALKY or YDRAA at the 3 prime end.
R2 repeat contains WEAR or WQAR at the 5 prime end and YDIAAI or NAVT at the 3 prime end.
>AP2 search_1 tffamily=AP2 SSVHRGVTRHRWTGRYEAHLWDKNSWNETQTKKGRQVYLGAYDEEDAAARAYDLAALKYWGRDTILNFP >AP2 search_2 tffamily=AP2 TSIYRGVTRHRWTGRYEAHLWDNSCRREGQSRKGRQVYLGGYDKEEKAARAYDLAALKYRGLNAVTNFE >AP2 search_3 tffamily=AP2 VSKYRGVAKHHHNGRWEARIGRVFGNKYLYLGTYATQEEAAIAYDIAAIEYRGLNAVTNFDISRYL >AP2 search_4 tffamily=AP2 ASIYRGVTRHHQHGRWQARIGRVAGNKDLYLGTFGTQEEAAEAYDVAAIKFRGTNAVTNFDITRYD >AP2 search_5 tffamily=AP2 SSQYRGVTFYRRTGRWESHIWDCGKQVYLGGFDTAHAAARAYDRAAIKFRGVDADINFDIEDYL
Auxin response factors (ARFs) contain an amino-terminal DNA-binding domain, which has some sequence similarity to the B3 domain found in maize VP1. They also contain a carboxyl-terminal domain related to motifs III and IV found in the CTDs of Aux/IAA proteins. However, they also have a conserved central region.
Searched with a conserved region of auxin-responsive transcription factors, pfam06507, found in the middle of the protein.
>AT5G20730.1 tffamily=ARF AAHANANNSPFTIFYNPRWAAPAEFVVPLAKYTKAMYAQVSLGMRFRMIFETEECGVRRYMGTVTGISDLDPVRWKNSQWRNLQ >AT1G34390.1 tffamily=ARF AKHAFDNQCMFIVVYKPRSSQFIVSYDKFLDAVNNKFNVGSRFTMRFEGDDFSERRYFGTIIGVSDFSPHWKCSEWRNLE >AT1G34170.1 tffamily=ARF VVNAFKTKCMFNVVYKPSSSQFVISYDKFVDAMNNNYIVGSRFRMQFEGKDFSEKRYDGTIIGVNDMSPHWKDSEWRSLK >AT1G59750.1 tffamily=ARF AAHAITTGTIFSVFYKPRTSRSEFIVSVNRYLEAKTQKLSVGMRFKMRFEGEEAPEKRFSGTIVGVQENKSSVWHDSEWRSLK >AT2G46530.1 tffamily=ARF ASHAVTTTTIFVVFYKPRISQFIISVNKYMMAMKNGFSLGMRYRMRFEGEESPERIFTGTIIGSGDLSSQWPASKWRSLQ >AT2G33860.1 tffamily=ARF VAHAISTHSVFSISYNPKASWSNFIIPAPKFLKVVDYPFCIGMRFKARVESEDASERRSPGIISGISDLDPIRWPGSKWRCLL >AT1G77850.1 tffamily=ARF AINRASQGLPFEVVFYPAAGWSEFVVRAEDVESSMSMYWTPGTRVKMAMETEDSSRITWFQGIVSSTYQETGPWRGSPWKQLQ >AT4G30080.1 tffamily=ARF AATLAISGRPFEVVYYPRASTSEFCVKALDARAAMRIPWCSGMRFKMAFETEDSSRISWFMGTVSAVNVSDPIRWPNSPWRLLQ
Searched with the Arabidopsis genes
>AT1G76510.1 tffamily=ARID AGAPQDQEAFIKEVEAFNKENFLEFKAPKFYGQPLNCLKLWRAVIKLGGYDVVTTSKLWRQVGESFHPPKTCTTVSWTFRIFYEKALLEYEKHLRQNGELNLPGSASL >AT1G20910.1 tffamily=ARID EAGTPVEQVAFLREVEAFYKESFLEFKPPKFYGQPLNILKLWRAVVNLGGYEVVTTNKLWRQVGESFNPPKTCTTVSYTFRNFYEKALLEYEKCLRNNGELNLPGS >AT2G17410.1 tffamily=ARID SGTEEDQSAFMKELDSFFRERNMDFKPPKFYGEGLNCLKLWRAVTRLGGYDKVTGSKLWRQVGESFRPPKTCTTVSWTFRGFYEKALLEYERHKVSEGELQIPLPLE >AT1G04880.1 tffamily=ARID EAVVADPRLFMTSLERLHSLLGTKFMVPIIGGRDLDLHKLFVEVTSRGGINKILNERRWKEVTATFVFPPTATNASYVLRKYYFSLLNNYEQIYFFRSNGQIPPDSMQ >AT1G55650.1 tffamily=ARID QDIVRNPELFWEMLRDFHESSDKKFKIPIVGGKSLDLHRLFNEVTSRGGLEKVIKDRRCKEVIDAFNFKTTITNSAFVLRKSYLKMLFEFEHLYYFQAPLSTFWEKEK >AT2G46040.1 tffamily=ARID ELISLFRPLLDSFLAEFCSADGFLPLPAMTGEGRTVDLFNLFLNVTHKGGFDAVSENGSWDEVVQESGLESYDSASAKLIYVKYLDAFGRWLNRVVAGDTDVSSVE >AT4G11400.1 tffamily=ARID ECEERLRRLFDQALLVFLEEEGSIKPLPAVIGDGKNVDLFKLFVLVREREGFDTVSRKRLWEVVAEKLGFDCSLVPSLILIYLKYLNRMEKWAVEESRIVNWDNKDSEK
Searched with 15 full length Arabidopsis proteins.
>AT1G36000.1 Arabidopsis AS2 family transcription factor, protein sequence 122AA tffamily=AS2 MEPLGNRRPCSVCITKNRNCPRFCEYAEYFPYELQSQYESANELFGTPNIITMMQHAPEEKKQMLATSIIMEGNAWTEDP ISGGFGMIQKLMWKIMLHKAYLRELQEKIKEEKEKKPASSLY >AT2G19510.1 Arabidopsis AS2 family transcription factor, protein sequence 120AA tffamily=AS2 MEPLGDRRPCCVCITKNRNCPRFCEYAEYFPYELRSHYESTNELFGTPKIIKMMRHAPEEKKQMLATSIIMEGNAWTNDP VSGGFGMVQKIMWKIMLHKAYLHELEEKIKEEKEKIELHL >AT1G06280.1 Arabidopsis AS2 family transcription factor, protein sequence 206AA tffamily=AS2 MMQRNSNNTSITSNISNNSSSHQACASCKHQRKKCNNECILSPYFPARKTKEFQAVHKVFGVSNVQKMVRTVREEDRTKL SDSLTWEALWRQKDPVLGSYGEYRRICEELKLYKSLVHNQPLIGWDNNQRVFNNNSNNKNGLAMTNSSGSGGFSVNNNGV GVNREIVNGGYASRNVQGGWENLKHDQRQQCYAVINNGFKQHYLPL >AT1G72980.1 Arabidopsis AS2 family transcription factor, protein sequence 214AA tffamily=AS2 MSLSTFSGGSTTACAACKHQRKKCKKNCILARYFPQDGTNKFLNAHKLFGVSNITKMLKRIEESQRDIAMENLIYHANAR ALDPVGGVYRTICDLKCKIEFVQTELNLTRQQIDMCRSLAQEQHRQRQNLPYRCNSFESLLQQDGDEYVNVDGLDHQNMQ QQQEMQQQQQNPSNYDMFLEMPEQTSKVKLEEEKISDQRKNNLMRQILMSSAII >AT1G16530.1 Arabidopsis AS2 family transcription factor, protein sequence 165AA tffamily=AS2 MRQKGHRHGRTVSPCAGCKLLRRKCVKDSCVFAPYFPAKEPYKFAIVHKIFGASNVNKMLQELSENHRSDAVDSMVYEAN ARIQDPVYGCVGTISSLHRQLETLQTQLAFAQAELIHIRTLHRIHTKPPPYTASTVTFPSNKDFYSDIDMAVAYTDDAGD FLWSC >AT1G07900.1 Arabidopsis AS2 family transcription factor, protein sequence 190AA tffamily=AS2 MESKSDASVATTPIISSSSSPPPSLSPRVVLSPCAACKILRRRCAERCVLAPYFPPTDPAKFTIAHRVFGASNIIKFLQE LPESQRTDAVNSMVYEAEARIRDPVYGCAGAIYHLQRQVSELQAQLAKAQVEMVNMQFQRSNLLELIYNMDQQQKQEQDN MSFESNDLGFLEDKSNTNSSMLWWDPLWTC >AT5G63090.1 Arabidopsis AS2 family transcription factor, protein sequence 186AA tffamily=AS2 MASSSNSYNSPCAACKFLRRKCMPGCIFAPYFPPEEPHKFANVHKIFGASNVTKLLNELLPHQREDAVNSLAYEAEARVR DPVYGCVGAISYLQRQVHRLQKELDAANADLAHYGLSTSAAGAPGNVVDLVFQPQPLPSQQLPPLNPVYRLSGASPVMNQ MPRGTGGSYGTFLPWNNGHDQQGGNM >AT3G11090.1 Arabidopsis AS2 family transcription factor, protein sequence 165AA tffamily=AS2 MRGHEPRSSSSCAACKLLKRRCTPTCIFAPYFRSSDLITFAKVHKVFGASNVSKLLGEVPEEQRQETVNSLAYEAEVRLK DPVYGCIGAIASLQKKMLELQHDLAVARTRLLAHSGVNNSQVSPLDDSPELAAFLDLVPYSDLMLLDGSTNLDAYLYDLG QPPFV >AT4G00220.1 Arabidopsis AS2 family transcription factor, protein sequence 228AA tffamily=AS2 MSSSGNPSSSSGGGGGPCGACKFLRRKCVAGCIFAPYFDSEQGAAHFAAVHKVFGASNVSKLLHHVPEHKRPDAVVSICF EAQARLRDPIYGCVSHIVSLQQQVVSLQTELSYLQAHLATLELPQPQPPQVPVSSSGSLQALSITDLPTISPSVYDLSSI FEPVMSSTWAMQQQPRPSDHLFGVPSSSNMGGGGELQALAREFIHGGQMPAQPSPGTSGSASSVIKRE >AT5G06080.1 Arabidopsis AS2 family transcription factor, protein sequence 177AA tffamily=AS2 MASHGSSCGACKFLRRKCNRDCVFSPYFSYEQASSHFAAVHKVFGASNVSKHLLHLPQHQRNIAAITISYEALSRMRDPV YGCVAHIFALHQQVVTLQEEIEFLGSQMKNFSYSNQNGSQLNNIPEFVNQMTMATTNFVDESVLNNADGRNCYDGFFTNS EEMLVNHQWLQNMDYYY >AT4G00210.1 Arabidopsis AS2 family transcription factor, protein sequence 215AA tffamily=AS2 MSGSTTGCGGPCGACKFLRRKCVADCVFAPYFDSVEGTSHFTAVHKVFGASNASKLLMMIPASRRLDAVVTLTYEALARL RDPVYGCVGHIFALQHQAELAYVQTQLSTLQGLPPPNSQNNSRTEAASSSNVPLISSVDSKDNMSSSSSHIPCMSQQQEQ EQPKEAIEVPTESVDLSTFFGLENPVDEDGDLNALAREFFTKYLTGGKYRPSSLI >AT3G50510.1 Arabidopsis AS2 family transcription factor, protein sequence 198AA tffamily=AS2 MMFHQMDKISTPCAACKHLRRKCTEDCVFAPYFPSTKLDNYEAVHKVFGASHVATLINSLHPCQREFAMDTLAWEAQVQA NDPVNGCLGIIYNLLSQIKDLEEQLAIVKNELASYCIVPTFVPPPSMTNLEMHNNPMMIPEHTPNNGGCLTGQQLYNEAQ RFASTQSAQMQETQMQHDEESYRDKSSYQKFGPCFNLH >AT1G67100.1 Arabidopsis AS2 family transcription factor, protein sequence 233AA tffamily=AS2 MRMSCNGCRVLRKGCSENCSIRPCLQWIKSAESQANATVFLAKFYGRAGLMNLLNTGPDHLRPAIFRSLLYEACGRIVNP IYGSVGLLWSGNWHLCQAAVEAVMRGSPVTPIACDAAVTGQAPPFNNKLCDIRHVSSRDENVKRRSRGACKEERNVRSLS HESSLSHESPVSSEETTTEEPKTWIGLELTLGLEPLARGNHVVVPMKKRKLERCGTSEDEDTCKIELGLVCSE >AT3G49940.1 Arabidopsis AS2 family transcription factor, protein sequence 247AA tffamily=AS2 MSCNGCRVLRKGCSENCILRPCIQWIESPEAQGHATVFVAKFFGRAGLMSFISAVPESQCPALFQSLLYEACGRTVNPVN GAVGLLWTGNWNVCQAAVETVLRGGSLKPIPELLNGGGFAGFPSPTSDEASEICTEMLNLRKADDSGDRNIYHHCRFSSS RSRSRSTASPPKRKRLSSEQQPSSELDLSLIPIYPIKTLPFKEDTPSMYSEESVTTVSFQNNNAGDRYVRCGGGGGGATT KLLNLFA >AT3G27940.1 Arabidopsis AS2 family transcription factor, protein sequence 153AA tffamily=AS2 MNANPCEVCRFQNKQCVNNCMFALLFPSSDLEKFDVVNRIFGLETLTFYLKDLSPMERIDTTRTLYYEAKPCFLNPPKNP SKFLEALLNYPYQKAEEVSKTKKLLASYSRPCVVLALPAPKYTQSKSKPSVLRKRKRKTKSSDESAIRVVEDS
Searched with the Arabidopsis genes
>AT1G15050.1 tffamily=AUX-IAA QTTEFGGVIDLGLSLRTIQHEIYHSSGQRYCSNEGYRRKWGYVKVTMDGLVVGRKVCVLDHGSYSTLAHQLEDMFGMQSVSGLRLFQMESEFCLVYRDEEGLWRNAGDVPWNEFIESVERLRITRRNDAVLP >AT1G04100.1 tffamily=AUX-IAA ADSSPAAASNATRQVAVGWPPLRTYRINSLVNQAKSLATEGGLSETTKSVVVAAKNDDACFIKSSRTSMLVKVTMDGVIIGRKVDLNALDSYAALEKTLDLMFFQIPSPVTRSNTQGYKTIKETCTSKLLDGSSEYIITYQDKDGDWMLVGDVPWQMFLGSVTRLRIMKTSIGAGVGK >AT4G28640.1 tffamily=AUX-IAA ADSMAATSGQVVGWPPIRTYRMNSMVNQAKASATEDPNLEISQAVNKNRSDSTKMRNSMFVKVTMDGIPIGRKIDLNAHKCYESLSNTLEEMFLKPKLGSRTLETDGHMETPVKILPDGSSGLVLTYEDKEGDWMLVGDVPWGMFIGSVRRLRIMKTSEATGKAQ >AT1G04550.1 tffamily=AUX-IAA AESSSHQGASPPRSSQVVGWPPIGLHRMNSLVNNQAMASATEDPNLEISQAVNKNRSDSTKMRNSMFVKVTMDGIPIGRKIDLNAHKSYENLAQTLEEMFFGMTGLYCFQ >AT2G46990.1 tffamily=AUX-IAA TDLRLGLSFGTSSGTQYFNGGYGYSVAAPAVEDAEYVAAVEEEEENECNSVGSFYVKVNMEGVPIGRKIDLMSLNGYRDLIRTLD FMFNASILWAEEEDMCNEKSHVLTYADKEGDWMMVGDVPWEMFLSTVRRLKISRA >AT3G17600.1 tffamily=AUX-IAA PSESSVNLSLSLTFPSTSPQREARQDWPPIKSRLRDTLKGRRLLRRGDDTSLFVKVYMEGVPIGRKLDLCVFSGYESLLENLSHMFDTSIICGNRDRKHHVLTYEDKDGDWMMVGDIPWDMFLETVRRLKITRP >AT3G23030.1 tffamily=AUX-IAA EETRDEEESTPPTKTQIVGWPPVRSSRKNNNSVSYVKVSMDGAPYLRKIDLKTYKNYPELLKALENMFKVMIGEYCEREGYKGSGFVPTYEDKDGDWMLVGDVPWDMFSSSCKRLRIMKG >AT3G23050.1 tffamily=AUX-IAA EKTTLKDPSKPPAKAQVVGWPPVRNYRKNMMTQQKTSSGAEEASSEKAGNFGGGAAGAGLVKVSMDGAPYLRKVDLKMYKSYQDLSDALAKMFSSFTMGNYGAQGMIDFMNESKLMNLLNSSEYVPSYEDKDGDWMLVGDVPWEMFVESCKRLRIMKG >AT3G04730.1 tffamily=AUX-IAA ENMKEKVVKPPAKAQVVGWPPVRSFRKNVMSGQKPTTGDATEGNDKTSGSSGATSSASACATVAYVKVSMDGAPYLRKIDLKLYKTYQDLSNALSKMFSSFTIGNYGPQGMKDFMNESKLIDLLNGSDYVPTYEDKDGDWMLVGDVPWEMFVDSCKRIRIMKG >AT1G80390.1 tffamily=AUX-IAA ENNYISSMVTNDQLVGWPPVATARKTVRRKYVKVALDGAAYLRKVDLGMYDCYGQLFTALENMFQGIITICRVTELERKGEFVATY EDKDGDLMLVGDVPWMMFVESCKRMRLMKT >AT5G65670.1 tffamily=AUX-IAA KGQSSTTNNSSSPPAAKAQIVGWPPVRSYRKNTLATTCKNSDEVDGRPGSGALFVKVSMDGAPYLRKVDLRSYTNYGELSSALEKMFTTFTLGQCGSNGAAGKDMLSETKLKDLLNGKDYVLTYEDKDGDWMLVGDVPWEMFIDVCKKLKIMKG >AT3G15540.1 tffamily=AUX-IAA GGDAEKVNDSPAAKSQVVGWPPVCSYRKKNSCKEASTTKVGLGYVKVSMDGVPYLRKMDLGSSQGYDDLAFALDKLFGFRGIGVALKDGDNCEYVTIYEDKDGDWMLAGDVPWGMFLESCKRLRIMKR >AT3G16500.1 tffamily=AUX-IAA SNKTTSVPHISQKRTAPGPVVGWPPVRSFRKNLASTSSSKLGNESSHGGQINKSDDGEKQVETKKEGMFVKINMDGVPIGRKVDLNAYNSYEQLSFVVDKLFRGLLAAQRDISDGQGEEKPIIGLLDGKGEFTLTYEDNEGDKMLVGDVPWQMFVSSVKRLRVIK >AT5G57420.1 tffamily=AUX-IAA ASKNHNNSNSSSGAAGRSFQGFGLNVEDDLVSSVVPPVTVVLEGRSICQRISLDKHGSYQSLASALRQMFVDGADSTDDLDLSNAIPGHLIAYEDMENDLLLAGDLTWKDFVRVAKRIRILPV
Searched with the 3 prime half of four Arabidopsis genes. The DNA-binding domain is located in the C-terminal half and is the most highly conserved portion.
>AT2G35550.1 tffamily=BBR-BPC KRSVSNKSKKTPSIPETKREKKNLDINIDISSFDTSGVPPPVCSCTGVSRVCYKWGMGGWQSSCCTISISTYPLPMST TRPGARLAGRKMSNGAYVKLLARLADEGYDLSHPLDLKNHWARHGTNKFVTIK >AT2G01930.1 tffamily=BBR-BPC RKPKEERDVTNNNVQQQQQRVKPVKKSVDLVINGVSMDISGLPVPVCTCTGTPQQCYRWGCGGWQSACCTTNISVYPLPMSTKRRGARISGRKMSQGAFKKVLEKLSEGYSFGNAIDLKSHWARHGTNKFVTIR >AT2G21240.1 tffamily=BBR-BPC KVKKVGEDLNRRVPAPGKKSRTDWDSQDVGLNLVTFDETTMPVPMCSCTGSTRQCYKWGNGGWQSSCCTTTLSQYPLPQMPNKRHSRMGGRKMSGNVFSRLLSRLSAEGYDLSCPVDLKDYWARHGTNRYITIK >AT5G42520.1 tffamily=BBR-BPC NQRKVKKESEDDLNKIMFVKTTHSKSDWKSQEMVGLNQVVYDETTMPPPVCSCTGVLRQCYKWGNGGWQSSCCTTTLSMYPLPALPNKRHARVGGRKMSGSAFNKLLSRLAAEGHHDLSNPVDLKDHWAKHGTNRYITIK
Searched with the Arabidopsis genes
>AT1G78700.1 tffamily=BES TSGTRMPTWRERENNKRRERRRRAIAAKIFTGLRMYGNYELPKHCDNNEVLKALCNEAGWIVEPDGTTYRKGCSRPVER >AT1G75080.1 tffamily=BES AAARRKPSWRERENNRRRERRRRAVAAKIYTGLRAQGDYNLPKHCDNNEVLKALCVEAGWVVEEDGTTYRKGCKPLPGE >AT1G19350.1 tffamily=BES MATRRKPSWRERENNRRRERRRRAVAAKIYTGLRAQGNYNLPKHCDNNEVLKALCSEAGWVVEEDGTTYRKGHKPLPGD >AT3G50750.1 tffamily=BES AATGRMPTWKERENNKKRERRRRAIAAKIFTGLRSQGNYKLPKHCDNNEVLKALCLEAGWIVHEDGTTYRKGSRP >AT5G45300.1 tffamily=BES GGKGKREREKEKERTKLRERHRRAITSRMLAGLRQYGNFPLPARADMNDVIAALAREAGWSVEADGTTYRQSQQPNHVV >AT2G45880.1 tffamily=BES GGSRRSRPLEEKERTKLRERHRRAITARILGGLRRHGNYNLRVRADINDVIAALAREAGWVVLPDGTTFPSKSQGTKPT
Searched with 25 bHLH domains from different groups to ensure isolation of all possible genes.
>AT1G01260.1 IIId tffamily=bHLH EALNHVEAERQRREKLNQRFYALRSVVPNISKMDKASLLGDAVSYINEL >AT1G32640.1 IIIe tffamily=bHLH EPLNHVEAERQRREKLNQRFYALRAVVPNVSKMDKASLLGDAIAYINEL >AT5G54680.1 IVc tffamily=bHLH ATSSKACREKQRRDRLNDKFMELGAILEPGNPPKTDKAAILVDAVRMVTQL >AT2G22770.1 IVa tffamily=bHLH LLKEHVLAERKRRQKLNERLIALSALLPGLKKTDKATVLEDAIKHLKQL >AT5G56960.1 IVd tffamily=bHLH TQLQHMISERKRREKLNESFQALRSLLPPGTKKDKASVLSIAREQLSSL >AT2G31220.1 II tffamily=bHLH RKSRTSPTERERRVHFNDRFFDLKNLIPNPTKIDRASIVGEAIDYIKEL >AT2G31210.1 II tffamily=bHLH RKNKPFTTERERRCHLNERYEALKLLIPSPSKGDRASILQDGIDYINEL >AT1G10610.1 IIIc tffamily=bHLH FKSKNLHSERKRRERINQAMYGLRAVVPKITKLNKIGIFSDAVDYINEL >AT5G65640.1 IIIb tffamily=bHLH QPSKNLMAERRRRKRLNDRLSMLRSIVPKISKMDRTSILGDAIDYMKEL >AT4G21330.1FKSPNLEAER IIIa tffamily=bHLH RRREKLHCRLMALRSHVPIVTNMTKASIVEDAITYIGEL >AT1G68810.1 Vb tffamily=bHLH ASKSHSEAERRRRERINNHLAKLRSILPNTTKTDKASLLAEVIQHVKEL >AT5G08130.1 Va tffamily=bHLH PRSKHSATEQRRRSKINDRFQMLRQLIPNSDQKRDKASFLLEVIEYIQFL >AT2G43010.2 VIIa tffamily=bHLH AAEVHNLSERRRRDRINERMKALQELIPHCSKTDKASILDEAIDYLKSL >AT5G67110.1 VIIb tffamily=bHLH DAQFHNLSEKKRRSKINEKMKALQKLIPNSNKTDKASMLDEAIEYLKQL >AT2G24260.1 XI tffamily=bHLH ATDPHSIAERLRRERIAERMKALQELVPNGNKTDKASMLDEIIDYVKFL >AT1G18400.1 XII tffamily=bHLH ATDSHSLAERVRRGKINERLRCLQDMVPGCYKAMGMATMLDEIINYVQSL >AT2G42280.1 IX tffamily=bHLH HPRSIAERVRRTRISERMRKLQELVPNMDKQTNTSDMLDLAVDYIKD >AT1G27740.1 VIIIc tffamily=bHLH DPQSLYARKRREKINERLKTLQNLVPNGTKVDISTMLEEAVHYVKFL >AT3G21330.1 VIIIb tffamily=bHLH DPQTVAARQRRERISEKIRVLQTLVPGGTKMDTASMLDEAANYLKFL >AT1G30670.1 VIIIa tffamily=bHLH ELSAQSIAARKRRRRITEKTQELGKLIPGSQKHNTAEMFNAAAKYVKFL >AT1G68240.1 VI tffamily=bHLH YRMMMEKKRRKEIKDKVDILQGLMPNHCTKPDLASKLENIIEYIKSL >AT4G01460.1 Ia tffamily=bHLH QRMTHIAVERNRRRQMNEHLNSLRSLMPPSFLQRGDQASIVGGAIDFIKEL >AT5G46690.1 Ia tffamily=bHLH QRMTHIAVERNRRRQMNQHLSVLRSLMPQPFAHKGDQASIVGGAIDFIKEL >AT5G04150.1 Ib tffamily=bHLH KKLNHNASERDRRRKLNALYSSLRALLPKLSIPMTVARVVKYIPEQ >AT1G12540.1 Ib tffamily=bHLH KRAKHKELERQRRQENTSLFKILRYLLPGKRSSADHVLEAVNYIKDL
Searched with the Arabidopsis genes
>AT1G03970.1 tffamily=bZIP KAAAQRQKRMIKNRESAARSRERKQAYQVELETLAAKLEEENEQLLKEIEESTKERY >AT1G49720.1 tffamily=bZIP KVVERRQKRMIKNRESAARSRARKQAYTLELEAEIESLKLVNQDLQKKQAEIMKTHN >AT3G54620.1 tffamily=bZIP PTDVKRARRMLSNRESARRSRRRKQEQMNEFDTQVGQLRAEHSTLINRLSDMNHKYD >AT3G54620.1 tffamily=bZIP PTDVKRARRMLSNRESARRSRRRKQEQMNEFDTQVGQLRAEHSTLINRLSDMNHKYD >AT4G34590.1 tffamily=bZIP LMEQRKRKRMLSNRESARRSRMKKQKLLDDLTAQVNHLKKENTEIVTSVSITTQHYL >AT2G18160.1 tffamily=bZIP TVDERKRKRMLSNRESARRSRMRKQKHVDDLTAQINQLSNDNRQILNSLTVTSQLYM >AT4G36730.1 tffamily=bZIP ERELKRQKRKQSNRESARRSRLRKQAECEQLQQRVESLSNENQSLRDELQRLSSECD >AT4G36730.2 tffamily=bZIP ERELKRQKRKQSNRESARRSRLRKQAECEQLQQRVESLSNENQSLRDELQRLSSECD >AT4G01120.1 tffamily=bZIP EKEVKREKRKQSNRESARRSRLRKQAETEQLSVKVDALVAENMSLRSKLGQLNNESE >AT1G19490.1 tffamily=bZIP EREERRIRRILANRESARQTIRRRQAMCEELSKKAADLTYENENLRREKDWALKEFQ >AT1G22070.1 tffamily=bZIP RINDKMKRRLAQNREAARKSRLRKKAHVQQLEESRLKLSQLEQELVRARQQGLCVRN >AT2G16770.1 tffamily=bZIP TSESSGKKRPLGNREAVRKYREKKKAKAASLEDEVMRLKAVNNQLLKRLQGQAALEA >AT4G35040.1 tffamily=bZIP SCGKKGEKRPLGNREAVRKYREKKKAKAASLEDEVARLRAVNQQLVKRLQNQATLEA >AT1G43700.1 tffamily=bZIP DPKRAKRILANRQSAARSKERKIRYTGELERKVQTLQNEATTLSAQVTMLQRGTS >AT2G42380.2 tffamily=bZIP DPKRVKRILANRQSAQRSRVRKLQYISELERSVTSLQAEVSVLSPRVAFLDHQRL >AT3G58120.1 tffamily=bZIP IHDPKRVKRILANRQSAQRSRVRKLQYISELERSVTSLQTEVSVLSPRVAFLDHQRL >AT2G40950.1 tffamily=bZIP EEDEKKRARLMRNRESAQLSRQRKKHYVEELEEKVRNMHSTITDLNGKISYFMAENA >AT3G10800.1 tffamily=bZIP DDDKRKLIRQIRNRESAQLSRLRKKQQTEELERKVKSMNATIAELNGKIAYVMAENV >AT1G42990.1 tffamily=bZIP DAVAKKRRRRVRNRDAAVRSRERKKEYVQDLEKKSKYLERECLRLGRMLECFVAENQ >AT5G11260.1 tffamily=bZIP EKENKRLKRLLRNRVSAQQARERKKAYLSELENRVKDLENKNSELEERLSTLQNENQ
Searched with the Arabidopsis genes
>AT2G18380.1 tffamily=bZIP CASCDTTSTPLWRNGPKGPKSLCNACGIRFKKEERR >AT3G20750.1 tffamily=bZIP CTNMNCNALNTPMWRRGPLGPKSLCNACGIKFRKEEER >AT1G08000.1 tffamily=bZIP CTHCETITTPQWRQGPSGPKTLCNACGVRFKSGRLV >AT3G45170.1 tffamily=bZIP CSHCGTRKTPLWREGPRGAGTLCNACGMRYRTGRLL >AT4G17570.1 tffamily=bZIP CYHCGVTNTPLWRNGPPEKPVLCNACGSRWRTKGTL
The C2H2 zinc finger family are a large superfamily of largely unrelated genes that contain between one and at least seven fingers. Searches were performed with all the subgroups to ensure identification of all the disparate members.
>A1a At1g03840 (whole protein) tffamily=C2H2 mttedqtisssggyvqsssttdhvdhhhhdqheslnpplvkkkrnlpgnpdpeaevials pktlmatnrflceicgkgfqrdqnlqlhrrghnlpwklkqrtskevrkrvyvcpekscvh hhptralgdltgikkhfcrkhgekkwkcekcakryavqsdwkahsktcgtreyrcdcgti fsrrdsfithrafcdalaeetarlnaashlksfaatagsnlnyhylmgtlipspslpqpp sfpfgppqpqhhhhhqfpittnnfdhqdvmkpastlslwsggninhhqqvtiedrmapqp hspqedynwvfgnannhgelittsdslithdnninivqskenangatslsvpslfssvdq itqdanaasvavanmsatallqkaaqmgatsstsptttittdqsaylqsfasksnqived ggsdrffasfgsnsvelmsnnnnglheignprngvtvvsgmgelqnypwkrrrvdignag gggqtrdflgvgvqtichsssingwi >A1b At1g68130 (whole protein) tffamily=C2H2 mrtdqvmlsnkntntccvvsssssdpflsssengvtttntstqkrkrrpagtpdpdaevv slsprtllesdryiceicnqgfqrdqnlqmhrrrhkvpwkllkrdnnievkkrvyvcpep tclhhnpchalgdlvgikkhfrrkhsnhkqwvcercskgyavqsdykahlktcgtrghsc dcgffssfrvesfiehqdncsarrvhrepprppqtavtvpacssrtastvstpssetnyg gtvavttpqplegrpihqrisssiltnssnnlnlelqllplssnqnpnqenqqqkvkeps hhhnhnhdttnlnlsiapsssyqhynnfdrikeimaseqimkiamkekayaeeakreakr qreiaenefanakkirqkaqaelerakflkeqsmkkisstimqvtcqtckgqfqavavpa atadetslvvsymssantdgelengf >A1c AT1g34370 (whole protein) tffamily=C2H2 mstapgpftgqpgsavfpyvreannvasqsqnnnncgarefdlpkpvlvdereghvveeh emkdeddveegenlppgsyeilqlekeeilaphthfcticgkgfkrdanlrmhmrghgde yktaaalakpnkesvpgsepmlikryscpflgckrnkehkkfqplktilcvknhykrthc dksftcsrchtkkfsviadlkthekhcgknkwlcscgttfsrkdklfghialfqghtpai pleetkpsaststqrgsseggnnnqgmvgfnlgsasnanqettqpgmtdgricfeesfsp mnfdtcnfggfhefprlmfddsessfqmlianacgfsprnvgesvsdtsl >A1d At1g51220 (whole protein) tffamily=C2H2 msnpacsnlfnngcdhnsfnystslsyiynshgsyyysnttnpnyinhthttstspnspp lrealpllslspirhqeqqdqhyfmdthqisssnflddplvtvdlhlglpnygvgesirs niapdattdeqdqdhdrgvevtveshldddddhhgdlhrghhywiptpsqiligptqftc plcfktfnrynnmqmhmwghgsqyrkgpeslrgtqptgmlrlpcfccapgcknnidhpra kplkdfrtlqthykrkhgskpfacrmcgkafavkgdwrthekncgklwycscgsdfkhkr slkdhvkafgnghvpcgidsfggdhedyydaasdieq >A3 At2g23740 (region spanning the 3 fingers) tffamily=C2H2 WSFSGFACAICLDSFVRRKLLEIHVEERHHVQFAEKCMLLQCIPCGSHFGDKEQLLVHVQAVHPSECKSLTVASECNLTNGEFSQKPEAGSSQIVVSQNNENTSGVHKFVCKFCGLKFNLLPDLGRHHQAEHMGPSLVGS >A4 at1g30970 (two fingers) tffamily=C2H2 KVWCYYCDREFDDEKILVQHQKAKHFKCHVCHKKLSTASGMVIHVLQVHKENVTKVPNAK >B1 at1g72050 (whole protein) tffamily=C2H2 maeeakvdvktsakkdirnylcqycgisrsknylitkhiqshhqmeleeerddeacevde esssnhtcqecgaefkkpahlkqhmqshslersftcyvddcaasyrrkdhlnrhllthkg klfkcpkencksefsvqgnvgrhvkkyhsndnrdkdntglgdgdkdntckgdddkeksgs ggcekenegnggsgkdnngngdsqpaecstgqkqvvckeigcgkafkypsqlqkhqdshv kldsveafcsepgcmkyftneeclkshirschqhinceicgskhlkknikrhlrthdeds spgeikcevegcsstfskasnlqkhmkavhddirpfvcgfpgcgmrfaykhvrnkhensg yhvytcgdfvetdedftsrprgglkrkqvtaemlvrkrvmpprfdaeehetc >C1-1i At1g66140 (single finger) tffamily=C2H2 SKRVFSCNYCQRKFYSSQALGGHQNAHKRE >C1-Sa AT2G15740 (single finger) tffamily=C2H2 YGPYTCPKCNGVFNTSQKFAAHMSSHYKNE >C1-Q At2g36475 (single finger) tffamily=C2H2 FDDLPHLCTSCSVRLKQKEELDRHMELHDK >C1-2i AT2G37430 (two fingers) tffamily=C2H2 SHTSNQFECKTCNKRFSSFQALGGHRASHKKPKLTVEQKDVKHLSNDYKGNHFHKCSICSQSFGTGQALGGHMRRHRSSM >C1-3i AT1G02030 (All three fingers) tffamily=C2H2 MEERHKCKLCWKSFANGRALGGHMRSHMLIHPLPSQPESYSSSMADPGFVLQDRESETES SKKPSRKRSRLNRRSISSLRHQQSNEEGKSETARAADIKIGVQELSESCTEQEPMSSVSD AATTEEDVALSLMLLSRDKWEKEEEESDEERWKKKRNKWFECETCEKVFKSYQALGGHRA SHKKKIAETDQLGSDELKKKKKKSTSSHHECPICAKVFTSGQALGGHKRSHASANNEFTR >C1-4i AT1G49900 (two searches with the two double finger regions) tffamily=C2H2 DLFKCSICEKVFTSYQALGGHKASHSIKAAQLENAGADAGEKTRSKMLSPSGKIHKCDICHVLFPTGQALGGHKRRHYEG QCNVCGRELPSYQALGGHKASHRTKPPVENATGEKMRPKKLAPSGKIHKCSICHREFSTGQSLGGHKRLH >C1-5i AT3G29340 (whole protein - six fingers) tffamily=C2H2 MDLDGVELLLDLREMVSQSGFEKSTTCSGVIALRSNLQSKSSHKCKICGKSFECYQALGG HQRIHRPIKEKLSKQEFSEVYPRKSKLQKRPESSSSCYECKVCGKIFGCYRGLGGHTKLH RSTKRELASTQDENSLLDSSEAKKIVSQPSSFKVSQEEKFLHCVELKQDFSEPLSHSGAL PSTLRSKLQTKTQWKSSCHCKICGKSFVCSQGLGNHKRVHREISGKLACKRKYTEDYNPF SDSLKAKKIVKKPSSFEVSQEEKILHCVELKQDFGELLAHSGFDKSISCSKSIKVKKVAR KNEKTEDSTSLFGVFVGEMSQRLHGCKTCGRKFGTLKGVYGHQRMHSGNHNRIEDENGLE RIWGLKKKSRVCSVSAFDRFKGSSFMAEIEKHEVIEAALNLVMLCQGVYDFASISNLPLG DGFMDLELKPCPLRRKLQKKSRSSYKCSICEKSFVCSQALGSHQRLHRWKLVPKPEYIED DSSLLDSSEAKKIVSKPSSFEHAQEEKILQCVEPKLEFHEQLAHSGFDKFDTCSKIRFSA LPSPPEAKKIVSQPPSFEVSVDEKILYRAEPKLNFSEPLAHSCFDNSSSYRSIICGKSFV CSQALGGHQTLHRSIKGQLAGTEDGNSLSVTDSEASKIVAQPSSYKSQGI >C2-B AT1G65110 (single finger) tffamily=C2H2 WKFWMCRTCSQTFFYPKKFKNHLEQVHDAK >C2-sb AT4G26030 (single finger) tffamily=C2H2 PFEKDSSFICLKCNSLFDTSQMLVVHTELIHSKNETKKRL >C2-pairs AT4G12240 (single finger) tffamily=C2H2 VKPPEPYFCGVCDRRFYTNEKLINHFKQIH >C2-unique AT1G04445 (single finger) tffamily=C2H2 MPFSEPQECAVCKRVFLSSHQLISHYNAAH >C2-cons SF AT2G27100 (single finger) tffamily=C2H2 DEKYGWKYGCGAKGCTKLFHAAEFVYKHLKLKHTELVTEL >C3-PL SF AT1G01350 (single finger) tffamily=C2H2 NALPFACFICREPFVDPVVTKCKHYFCEHC
Searched with 32 C3H domains from DATB
>AT1G03790.1 tffamily=C3H YSGEVCPEFRRGGDCSRGDDCEFAHGV >AT2G40140.1 tffamily=C3H YTCVPCPEFRKGSCPKGDSCEYAHGV >AT2G41900.1 tffamily=C3H YSCVPCPDFRKGACRRGDMCEYAHGV >AT5G06420.1 tffamily=C3H YQPDICKDYKETGYCGYGDSCKFLHDR >AT1G32360.1 tffamily=C3H FKGRHCKKFYTEEGCPYGESCTFLHDE >AT1G04990.1 tffamily=C3H PGERDCQFYLRTGLCGYGSSCRYNHPT >AT3G12680.1 tffamily=C3H PGEPDCPYYIKTQRCKYGSKCKFNHPR >AT2G32930.1 tffamily=C3H VGQPDCETGACKYGPTCKYHHPK >AT1G48195.1 tffamily=C3H PGEPECSYYLRTGNCYLKQNCKYHHPK >AT5G18550.1 tffamily=C3H MGQPVCQHFMRTGTCKFGASCKYHHPR >AT1G21570.1 tffamily=C3H NCPVFEATGSCSQGLKCKLHHPK >AT2G47850.1 tffamily=C3H PGVQRCTFYVQNGFCKFGSTCKFDHPM >AT3G08505.1 tffamily=C3H PPNNVCTFYQKRICLYGSRCRYDHVR >AT3G08505.1 tffamily=C3H LRSIDCKHFNFGNGNCPFGASCFYKHAY >AT3G19360.1 tffamily=C3H LRMKLCRKFCFGEECPYGDRCNFIHED >AT3G51950.1 tffamily=C3H FGGVPCSYFARGFCKNGASCRFVHSD >AT2G47680.1 tffamily=C3H EAPVCVYFLNGYCNRGGQCTFTHTL >AT2G02160.1 tffamily=C3H KWNTDCVYFLASPLTCKKGPECEYRHSE >AT3G12130.1 tffamily=C3H SKSKPCTKFFSTSGCPFGENCHFLHYV >AT3G08505.1 tffamily=C3H SDRILCKFFVHGSCLKGENCEFSHDS >AT2G47680.1 tffamily=C3H STRPACKFFASSQGCRNGESCLFSHAM >AT3G47120.1 tffamily=C3H EARGVCRAFQRGECTRGDSCKFSHDE >AT5G51980.1 tffamily=C3H KTEKVCNFWVDGNCTYGDKCRYLHCW >AT5G42820.1 tffamily=C3H FREATCRQYEENSCNRGGYCNFMHVK >AT1G29600.1 tffamily=C3H GKKLDCKAGACKRGSNCPFNHPK >AT5G07060.1 tffamily=C3H NRPKICSFYTIGQCKRGAECSFRHEM >AT2G35430.1 tffamily=C3H WKTRICNKWQTTGYCPFGSHCHFAHGP >AT2G35430.1 tffamily=C3H FKTKLCFKFRAGTCPYSASSCHFAHSA >AT3G12130.1 tffamily=C3H FKTKICERFSKGNCTFGDRCHFAHGE >AT2G19810.1 tffamily=C3H RTQPCKDGGNCRRRVCFFAHSP >AT3G21810.1 tffamily=C3H YKTKLCILFNKTGDCSRPNCTFAHGN >AT2G28450.1 tffamily=C3H WKTSLCSYFRREASCSHGNECKYAHGE
Searched with the Arabidopsis genes
>AT5G64220.1 tffamily=CAMTA LLSEAQHRWLRPAEICEILRNHQKFHIASEPPNRPPSGSLFLFDRKVLRYFRKDGHNWRKKKDGKTVKEAHEKLKVGSIDVLHCYYAHGEDNENFQRRCYWMLEQDLMHIVFVHYLEVK >AT3G16940.1 tffamily=CAMTA MLEEAKSRWLRPNEIHAILYNPKYFTINVKPVNLPNSGRIILFDRKMLRNFRKDGHNWKKKKDGRTVKEAHEHLKVGNEERIHVYYAHGEDNTTFVRRCYWLLDKARENIVLVHYRDTQ >AT4G16150.1 tffamily=CAMTA MLDEAYSRWLRPNEIHALLCNHKFFTINVKPCGTIVLFDRKMLRNFRKDGHNWKKKKDGKTIKEAHEHLKVGNEERIHVYYAHGEDTPTFVRRCYWLLDKSQEHIVLVHYRETH >AT1G67310.1 tffamily=CAMTA LYQEAHSRWLKPPEVLFILQNHESLTLTNTAPQRPTSGSLLLFNKRVLKFFRKDGHQWRRKRDGRAIAEAHERLKVGNAEALNCYYAHGEQDPTFRRRIYWMLDPEYEHIVLVHYRDVS
Searched with the full length of the two Arabidopsis genes
>AT5G23090.2 Arabidopsis CCAAT-Dr1 family transcription factor, protein sequence tffamily=CCAAT-DR1 MDPMDIVGKSKEDASLPKATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNDVCNKEDKRTIAPEHVLKALQ VLGFGEYIEEVYAAYEQHKYETMQDTQRSVKWNPGAQMTEEEAAAEQQRMFAEARARMNGGVSVPQPEHPETDQRSPQS >AT5G08190.1 Arabidopsis CCAAT-Dr1 family transcription factor, protein sequence tffamily=CCAAT-DR1 MDPMDIVGKSKEDASLPKATMTKIIKEMLPADVRVARDAQDLLIECCVEFINLISSESNEVCNKEDKRTIAPEHVLKALQ VLGFGEYVEEVYAAYEQHKYETMQDSQRSVKMNSGAEMTEEEAAAEQQRMFAEARARMNGGVTVPQPEQLEEPQQQQQTS LQS
Searched with the Arabidopsis genes
>AT1G30500.1 tffamily=CCAAT-HAP2 AVEEPVFVNAKQYHGILRRRQSRARLESQNKVIKSRKPYLHESRHLHAIRRPRGCGGRFLNAKKED >AT1G72830.1 tffamily=CCAAT-HAP2 TETDPVFVNAKQYHAIMRRRQQRAKLEAQNKLIRARKPYLHESRHVHALKRPRGSGGRFLNTKKLL >AT1G17590.1 tffamily=CCAAT-HAP2 IENEPVFVNAKQFHAIMRRRQQRAKLEAQNKLIKARKPYLHESRHVHALKRPRGSGGRFLNTKKLQ >AT5G12840.1 tffamily=CCAAT-HAP2 MAQEPVYVNAKQYEGILRRRKARAKAELERKVIRDRKPYLHESRHKHAMRRARASGGRFAKKSEVE >AT5G06510.1 tffamily=CCAAT-HAP2 EEDGTIYVNSKQYHGIIRRRQSRAKAEKLSRCRKPYMHHSRHLHAMRRPRGSGGRFLNTKT >AT3G05690.1 tffamily=CCAAT-HAP2 EDSTIYVNSKQYHGIIRRRQSRAKAAAVLDQKKLSSRCRKPYMHHSRHLHALRRPRGSGGRFLNTKSQN
Searched with the Arabidopsis genes
>AT1G21970.1 tffamily=CCAAT-HAP3 CVAREQDQYMPIANVIRIMRKTLPSHAKISDDAKETIQECVSEYISFVTGEANERCQREQRKTITAEDILWAMSKLGFDNYVDPLTVFINRYREIETDRG >AT2G13570.1 tffamily=CCAAT-HAP3 NNKEQDRFLPIANVGRIMKKVLPGNGKISKDAKETVQECVSEFISFVTGEASDKCQREKRKTINGDDIIWAITTLGFEDYVAPLKVYLCKYRDTEGEKV >AT2G47810.1 tffamily=CCAAT-HAP3 MMVKEQDRLLPIANVGRIMKNILPANAKVSKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVNGDDICWAMANLGFDDYAAQLKKYLHRYRVLEGEKP >AT2G37060.1 tffamily=CCAAT-HAP3 LHVREQDRFLPIANISRIMKRGLPANGKIAKDAKEIVQECVSEFISFVTSEASDKCQREKRKTINGDDLLWAMATLGFEDYMEPLKVYLMRYREMEGDTK >AT3G53340.1 tffamily=CCAAT-HAP3 LNVREQDRFLPIANISRIMKRGLPLNGKIAKDAKETMQECVSEFISFVTSEASDKCQREKRKTINGDDLLWAMATLGFEDYIDPLKVYLMRYREMEGDTK >AT2G38880.1 tffamily=CCAAT-HAP3 GSVREQDRYLPIANISRIMKKALPPNGKIGKDAKDTVQECVSEFISFITSEASDKCQKEKRKTVNGDDLLWAMATLGFEDYLEPLKIYLARYRELEGDNK >AT1G09030.1 tffamily=CCAAT-HAP3 MTDEDRLLPIANVGRLMKQILPSNAKISKEAKQTVQECATEFISFVTCEASEKCHRENRKTVNGDDIWWALSTLGLDNYADAVGRHLHKYREAERERT >AT2G27470.1 tffamily=CCAAT-HAP3 ESEKVVVDELPLAIVRRVVKKKLSECSSIHKEALLAFSESARIFIHYLSATANDFCKDARRQTMKADDVFKALEEMDFSEFLEPLKSSLEDFKKKNAGKK
Searched with the Arabidopsis genes
>AT1G08970.1 tffamily=CCAAT-HAP5 AFWENQFKEIEKTTDFKNHSLPLARIKKIMKADEDVRMISAEAPVVFARACEMFILELTLRSWNHTEENKRRTLQKNDIAAAVTRTDIFDFLVDIVPREDL >AT1G54830.1 tffamily=CCAAT-HAP5 SFWETQFKEIEKTTDFKNHSLPLARIKKIMKADEDVRMISAEAPVVFARACEMFILELTLRSWNHTEENKRRTLQKNDIAAAVTRTDIFDFLVDIVPREDL >AT5G50480.1 tffamily=CCAAT-HAP5 NYWIEQMETVSDFKNRQLPLARIKKIMKADPDVHMVSAEAPIIFAKACEMFIVDLTMRSWLKAEENKRHTLQKSDISNAVASSFTYDFLLDVVPK >AT5G27910.1 tffamily=CCAAT-HAP5 SFWSKEMEGNLDFKNHDLPITRIKKIMKYDPDVTMIASEAPILLSKACEMFIMDLTMRSWLHAQESKRVTLQKSNVDAAVAQTVIFDFLLDDDIEVKR >AT5G43250.1 tffamily=CCAAT-HAP5 MEEEEGSIRPEFPIGRVKKIMKLDKDINKINSEALHVITYSTELFLHFLAEKSAVVTAEKKRKTVNLDHLRIAVKRHQPTSDFLLDSLPLP >AT5G38140.1 tffamily=CCAAT-HAP5 VFWNNQREQLGNFAGQTHLPLSRVRKILKSDPEVKKISCDVPALFSKACEYFILEVTLRAWMHTQSCTRETIRRCDIFQAVKNSGTYDFLIDRVPFG >AT3G12480.1 tffamily=CCAAT-HAP5 KVPDYGHSQGQGHGDVTMDDRSISKRRKVNDSDEEYKKSKTQEIGSAKTSGRGGRGRGRGRGRGGRAAKAAEREGLNRENSGQPPPEDNVKMHASESSPQEDE >AT1G07980.1 tffamily=CCAAT-HAP5 TKTSKNREEDDGGAEDAKIKFPMNRIRRIMRSDNSAPQIMQDAVFLVNKATEMFIERFSEEAYDSSVKDKKKFIHYKHLSSVVSNDQRYEFLADSVPEK
Searched with the complete sequences of the Arabidopsis genes CO (X94937), COL6 (AC011915) and COL9 (AC009176). The B1 or B1 and B2 domains (depending on the gene) were manually excised and used for phylogenetic analysis.
>CO tffamily=CONSTANS MLKQESNDIGSGENNRARPCDTCRSNACTVYCHADSAYLCMSCD AQVHSANRVASRHKRVRVCESCERAPAAFLCEADDASLCTACDSEVHSANPLARRHQR VPILPISGNSFSSMTTTHHQSEKTMTDPEKRLVVDQEEGEEGDKDAKEVASWLFPNSD KNNNNQNNGLLFSDEYLNLVDYNSSMDYKFTGEYSQHQQNCSVPQTSYGGDRVVPLKL EESRGHQCHNQQNFQFNIKYGSSGTHYNDNGSINHNAYISSMETGVVPESTACVTTAS HPRTPKGTVEQQPDPASQMITVTQLSPMDREARVLRYREKRKTRKFEKTIRYASRKAY AEIRPRVNGRFAKREIEAEEQGFNTMLMYNTGYGIVPSF >COL6 tffamily=CONSTANS MKSLASAVGGKTARACDSCVKRRARWYCAADDAFLCHACDGSVH SANPLARRHERVRLKSASAGKYRHASPPHQATWHQGFTRKARTPRGGKKSHTMVFHDL VPEMSTEDQAESYEVEEQLIFEVPVMNSMVEEQCFNQSLEKQNEFPMMPLSFKSSDEE DDDNAESCLNGLFPTDMELAQFTADVETLLGGGDREFHSIEELGLGEMLKIEKEEVEE EGVVTREVHDQDEGDETSPFEISFDYEYTHKTTFDEGEEDEKEDVMKNVMEMGVNEMS GGIKEEKKEKALMLRLDYESVISTWGGQGIPWTARVPSEIDLDMVCFPTHTMGESGAE AHHHNHFRGLGLHLGDAGDGGREARVSRYREKRRTRLFSKKIRYEVRKLNAEKRPRMK GRFVKRSSIGVAH >COL9 tffamily=CONSTANS MGYMCDFCGEQRSMVYCRSDAACLCLSCDRSVHSANALSKRHSR TLVCERCNAQPATVRCVEERVSLCQNCDWSGHNNSNNNNSSSSSTSPQQHKRQTISCY SGCPSSSELASIWSFCLDLAGQSICEQELGMMNIDDDGPTDKKTCNEDKKDVLVGSSS IPETSSVPQGKSSSAKDVGMCEDDFYGNLGMDEVDMALENYEELFGTAFNPSEELFGH GGIDSLFHKHQTAPEGGNSVQPAGSNDSFMSSKTEPIICFASKPAHSNISFSGVTGES SAGDFQECGASSSIQLSGEPPWYPPTLQDNNACSHSVTRNNAVMRYKEKKKARKFDKR VRYASRKARADVRRRVKGRFVKAGEAYDYDPLTPTRSY
Searched with the Arabidopsis genes
>AT3G22780.1 tffamily=CPP ESCKRCNCKKSKCLKLYCECFAAGVYCIEPCSCIDCFNKPIH >AT4G14770.1 tffamily=CPP SPKKKSYCECFAAGVYCIEPCSCIDCFNKPIH >AT3G22760.1 tffamily=CPP SSCKRCNCKKSKCLKLYCECFAAGFYCIEPCSCINCFNKPIH >AT5G25790.1 tffamily=CPP KQQKHCNCKNSKCLKLYCECFASGSYCNGCNCVNCHNKLEN >AT3G16160.1 tffamily=CPP RKHKGCRCKQSKCLKLYCDCFASGVVCTDCDCVDCHNNSEK >AT2G20110.1 tffamily=CPP RHNKGCHCKKSGCLKKYCECFQANILCSENCKCLDCKNFEGS >AT3G16160.1 tffamily=CPP LLSRGCKCKRTRCLKKYCECFQANLLCSDNCKCINCKNVSEA
Searched with thirteen different dof domains.
>Dof search_1 tffamily=Dof KPDKILPCPRCNSMDTKFCYYNNYNVNQPRHFCKNCQRYWTAGGTMRNVPVGAGRRKSKS >Dof search_2 tffamily=Dof AAAAPLPCPRCRSRDTKFCYFNNYNVNQPRHFCKACHRYWTAGGALRNVPVGAGRRKNRP >Dof search_3 tffamily=Dof AEQAPLRCPRCNSSNTKFCYYNNYNLTQPRHFCKTCRRYWTKGGALRNVPIGGGCRKPRP >Dof search_4 tffamily=Dof TEAEGLACPRCESTNTKFCYYNNYNLAQPRHFCKACRRYWTRGGALRNVPVGGGTRNKVA >Dof search_5 tffamily=Dof LPEPGLKCPRCDSTNTKFCYFNNYSLSQPRHFCRACRRYWTRGGALRNVPVGGGYRRHAK >Dof search_6 tffamily=Dof QPEPGLKCPRCESTNTKFCYFNNYSLSQPRHFCKTCRRYWTRGGALRNVPVGGGCRRNKR >Dof search_7 tffamily=Dof PQEQGLRCPRCDSPNTKFCYYNNYSLSQPRHFCKTCRRYWTKGGALRNVPVGGGCRKNKR >Dof search_8 tffamily=Dof QKEKALNCPRCNSTNTKFCYYNNYSLQQPRYFCKTCRRYWTEGGSLRNVPVGGGSRKNKR >Dof search_9 tffamily=Dof GAEAAPNCPRCDSPNTKFCYYNNYSLSQPRYFCKGCRRYWTKGGSLRNVPVGGGCRKNRR >Dof search_10 tffamily=Dof PRPPPRQCPRCGSANTKFCYYNNYSRTQPRYLCKACRRHWTEGGTLRDVPVGGGRKNSKR >Dof search_11 tffamily=Dof KKQQQLECPRCRSTNTKFCYYNNYSTSQPRHFCRACRRYWTHGGTLRDVPVGGASRRGGG >Dof search_12 tffamily=Dof GGGGREQCPRCASRDTKFCYYNNYNTAQPRHFCRACRRYWTLGGSLRNGRLGGGIGVDLL >Dof search_13 tffamily=Dof AAGVGDPCPRCESRDTKFCYYNNYNTSQPRHFCKSCRRYWTKGGSLRNVPVGGGSRKSST
Searched with the Arabidopsis genes
>AT3G01330.1 tffamily=E2F KKEKSLWLLAQNFVKMFLCSDDDLITLDSAAKALLSDSPDSVHMRTKVRRLYDIANVFASMNLIEKTHIPVTRKPAYRWLG >AT3G48160.1 tffamily=E2F RREKSLGLLTQNFIKLFICSEAIRIISLDDAAKLLLGDAHNTSIMRTKVRRLYDIANVLSSMNLIEKTHTLDSRKPAFKWLG >AT5G22220.1 tffamily=E2F RYDSSLGLLTKKFINLIKQAEDGILDLNKAADTLEVQKRRIYDITNVLEGIGLIEKTLKNRIQWKG >AT5G02470.1 tffamily=E2F TSGGGLRQFSVMVCQKLEAKKITTYKEVADEIISDFATIKQNAEKPLNENEYNEKNIRRRVYDALNVFMALDIIARDKKEIRWKG >AT5G03415.1 tffamily=E2F KTGRGLRQFSMKVCEKVESKGRTTYNEVADELVAEFALPNNDGTSPDQQQYDEKNIRRRVYDALNVLMAMDIISKDKKEIQWRG
Searched with the Arabidopsis genes
>AT5G21120.1 tffamily=EIL DSHTALCDDLSSDEEMEIEELEKKIWRDKQRLKRLKEMAKNGLGTRLLLKQQHDDFPEHSSKRTMYKAQDGILKYMSKTMERYKAQGFVYGIVLENGKTVAGSSDNLREWWKDKVRFDRNGPAAIIKHQRDINLSDGSDSGSEVGDSTAQKLLELQDTTLGALLSALFPHCNPPQRRFPLEKGVTPPWWPTGKEDWWDQLSLPVDFRGVPPPYKKPHDLKKLWKIGVLIGVIRHMASDISNIPNLVRRSRSLQEKMTSREGALWLAALYREKAIVDQIA >AT1G73730.1 tffamily=EIL LASDNVAEIDVSDEEIDADDLERRMWKDRVRLKRIKERQKAGSQGAQTKETPKKISDQAQRKKMSRAQDGILKYMLKLMEVCKVRGFVYGIIPEKGKPVSGSSDNIRAWWKEKVKFDKNGPAAIAKYEEECLAFGKSDGNRNSQFVLQDLQDATLGSLLSSLMQHCDPPQRKYPLEKGTPPPWWPTGNEEWWVKLGLPKSQSPPYRKPHDLKKMWKVGVLTAVINHMLPDIAKIKRHVRQSKCLQDKMTAKESAIWLAVLNQEESLIQQPS
Searched with a representative domain from each subfamily.
>ERF I search_1 tffamily=ERF LYRGVRQRHWGKWVAEIRLPRNRTRLWLGTFDTAEEAALAYDKAAYKLRGDFARLNFP >ERF II search_2 tffamily=ERF RYKGIRMRKWGKWVAEIREPNKRSRIWLGSYKTAVAAARAYDTAVFYLRGPSARLNFP >ERF III search_3 tffamily=ERF IYRGVRQRNSGKWVSEVREPNKKTRIWLGTFQTAEMAARAHDVAALALRGRSACLNFA >ERF IV search_4 tffamily=ERF SFRGVRQRIWGKWVAEIREPNRGSRLWLGTFPTAQEAASAYDEAAKAMYGPLARLNFP >ERF V search_5 tffamily=ERF KFRGVRQRHWGSWVAEIRHPLLKRRIWLGTFETAEEAARAYDEAAVLMSGRNAKTNFP >ERF VI search_6 tffamily=ERF KFRGVRQRPWGKWAAEIRDPSRRVRVWLGTFDTAEEAAIVYDNAAIQLRGPNAELNFP >ERF VII search_7 tffamily=ERF VYRGIRKRPWGKWAAEIRDPRKGVRVWLGTFNTAEEAAMAYDVAAKQIRGDKAKLNFP >ERF VIII search_8 tffamily=ERF RFLGVRRRPWGRYAAEIRDPTTKERHWLGTFDTAEEAALAYDRAARSMRGTRARTNFV >ERF IX search_9 tffamily=ERF HYRGVRQRPWGKFAAEIRDPAKNGARVWLGTFETAEDAALAYDRAAFRMRGSRALLNFP >ERF X search_10 tffamily=ERF KYRGVRQRPWGKWAAEIRDPHKATRVWLGTFETAEAAARAYDAAALRFRGSKAKLNFP >ERF VI-L search_11 tffamily=ERF KPVGVRQRKWGKWAAEIRHPITKVRTWLGTYETLEQAADAYATKKLAFDALAAATSAA >ERF XB-L search_12 tffamily=ERF KHKGVRKKPSGKWAAEIWDPSLKVRRWLGTFPTAEMAAKAYNDAAAEFVGRRSARRGT
Searched with the Arabidopsis genes
>AT3G20550.1 tffamily=FHA YLFGRERRIADIPTDHPSCSKQHAVIQYREKPDGKPYIMDLGSTNKTYINESPIEPQRYYELFEKDTIKFG >AT5G47790.1 tffamily=FHA HIFGRQHQTCDFVLDHQSVSRQHAAVVPHKNGSIFVIDLGSAHGTFVANERLTKDTPVELEVGQSLRFA >AT5G19280.1 tffamily=FHA VKLGRVSPSDLALKDSEVSGKHAQITWNSTKFKWELVDMGSLNGTLVNSHSISHWGNPVELASDDIITLG >AT3G54350.1 tffamily=FHA VLVGRSTEDLAVDIDLGREKRGSKISRRQAIIRLGDDGSFHIKNLGKYSISVNEKEVDPGQSLILKSDCLVEIR >AT1G60700.1 tffamily=FHA VIIGRSSGGLNVDIDLGKYNYGSKISRRQALVKLENYGSFSLKNLGKQHILVNGGKLDRGQIVTLTSCSSINIR >AT5G07400.1 tffamily=FHA YTIGRSSSDGFCDFVIDHSSISRKHCQILFDSQSHKLYIFDLGNLNGVYVNRVRVRKSKVQEVSIDDEVLFF >AT5G67030.1 tffamily=FHA CIVGSEPDQDFPGMRIVIPSSQVSKMHARVIYKDGAFFLMDLRSEHGTYVTDNEGRRPNFPARFRSSDIIEFG >AT2G21530.1 tffamily=FHA VTIGRLPEKADVVIPVATVSGVHATINTNEKNLLVTDMNSTNGTFIEDKRLIPGVAAPAFPGTRITFG >AT2G45460.1 tffamily=FHA HCLGRLPCHASYQVESNAISGNHCKVFRKPDGDDVTVFMVDTSTNGTFLNWERLTKNGPEVRVQHGDIISLA
Searched with the Arabidopsis genes
>AT1G49190.1 tffamily=GARP-ARR-B TNVLVVDTNFTTLLNMKQIMKQYAYQVSIETDAEKALAFLTSCKHEINIVIWDFHMPGIDGLQALKSITSKLDLPVVIMSDDNQTESVMKATFYGACDYVVKPVKEEVMANI >AT3G62670.1 tffamily=GARP-ARR-B NRVLLVGADSNSSLKNLMTQYSYQVTKYESGEEAMAFLMKNKHEIDLVIWDFHMPDINGLDALNIIGKQMDLPVVIMSHEYKKETVMESIKYGACDFLVKPVSKEVIAVL >AT3G16857.1 tffamily=GARP-ARR-B LRVLVVDDDPTCLMILERMLRTCLYEVTKCNRAEMALSLLRKNKHGFDIVISDVHMPDMDGFKLLEHVGLEMDLPVIMMSADDSKSVVLKGVTHGAVDYLIKPVRMEALKNI >AT2G01760.1 tffamily=GARP-ARR-B LRILVVDDDTSCLFILEKMLLRLMYQVTICSQADVALTILRERKDSFDLVLSDVHMPGMNGYNLLQQVGLLEMDLPVIMMSVDGRTTTVMTGINHGACDYLIKPIRPEELKNI >AT2G27070.1 tffamily=GARP-ARR-B INVMVVDDNRVFLDIWSRMLEKSKYREITVIAVDYPKKALSTLKNQRDNIDLIITDYYMPGMNGLQLKKQITQEFGNLSVLVMSSDPNKEEESLSCGAMGFIPKPIAPTDLPKI >AT5G07210.1 tffamily=GARP-ARR-B INVMVVDDDHVFLDIMSRMLQHSKYRVIAVDDPKKALSTLKIQRDNIDLIITDYYMPGMNGLQLKKQITQEFGNLPVLVMSSD TNKEEESLSCGAMGFIPKPIHPTDLTKI
Searched with 10 DNA-binding domain sequences n.b these are MYB-like
Searched with the MYB-like domain from these GARP-G2 genes.
The consensus is different from R2R3MYBS and MYB-like proteins.
PR/KL_WTP_LH_RFV_AV__LGG___(intron)ATPK_____M___GLT____SHLQ_YR___
n.b SHLQ not SHAQ (those are MYB-related).
>AT2G02060.1 tffamily=GARP-G2 NKFHGVRPYVRSPVPRLRWTPDLHRCFVHAVEILGGQHRATPKLVLKMMDVKGLTISHVKSHLQMYRGGS >AT4G13640.1 tffamily=GARP-G2 DACLVLTTDPKPRLRWTSELHERFVDAVTQLGGPDKATPKTIMRTMGVKGLTLYHLKSHLQKFRLGR >AT2G20400.1 tffamily=GARP-G2 TVSSNSNNNSNS(5)AAKGRMRWTPELHEVFVDAVNQLGGSNEATPKGVLKHMKVEGLTIFHVKSHLQKYRTAK >AT2G01060.1 tffamily=GARP-G2 NGGPNSSHASKQRLRWTHELHERFVDAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAK >AT4G37180. tffamily=GARP-G2 HHHHQFNKPSSQSHHIQKKEQRRRWSQELHRKFVDALHRLGGPQVATPKQIRDLMKVDGLTNDEVKSHLQKYRMHI >AT3G46640.1 tffamily=GARP-G2 MAAEEGDSGTEDLSGKTLKRPRLVWTPQLHKRFVDVVAHLGIKNAVPKTIMQLMNVEGLTRENVASHLQKYRLYL >AT2G20570.1 tffamily=GARP-G2 KNNRISNNEGKRKVKVDWTPELHRRFVEAVEQLGVDKAVPSRILELMGVHCLTRHNVASHLQKYRSHR >AT4G18020.3 tffamily=GARP-G2 TKPINKSSGIKNVSGNKTSRKKVDWTPELHKKFVQAVEQLGVDQAIPSRILELMKVGTLTRHNVASHLQKFRQHR >AT5G49240.1 tffamily=GARP-G2 DLAMIQVNNAEGDIFRFLSEIGSEMDLPIIIISEDDSVKSVKKWMINGAADYLIKPIRPEDLRIVFKH >AT2G06020.1 tffamily=GARP-G2 ITPCIFYTSDEKARLRWSSDLHDCFVNAVEKLGGPNKATPKSVKEAMEVEGIALHHVKSHLQKFRLGK
Searched with the Arabidopsis genes
>AT2G25650.1 tffamily=GeBP PLIVRIWNEEDELSILKGLVDYRAKTGFNPKIDWDAFYSFLGSSIVAKFSKEQVLSKIRKLKRRFHVHWEKISEGNDPKFTRSSDSEAFGFSSMIWGQ >AT2G36340.1 tffamily=GeBP SASKMNWSKNDELVILGGIVDYENETKLSYRSDWDALYRYIKDCVEAKFSKIQLINKVKNMKRKFTYNQGRSNHGEQLSFTNTDDDEIFKSLIIWDK >AT5G28040.1 tffamily=GeBP RLFQRLWTDEDEIELLRGFLDYITNHRGNSSHPPDTAPFYEQIKSKLQLEFNKNQLVEKLRRLKKKYRNVMSKFSSGKEVFFKSPHDQATFDISRKIWNQ >AT4G00130.1 tffamily=GeBP MLFQRLFSEADEIALLQGLIDFTSTKGDPYEILMLFAFMLKKKFNNAVKNARKKGQTEDEVEYAKESEKKRFDLSIMIWGS >AT4G01260.1 tffamily=GeBP NLFVRLFTEEDEAILLQGFLDFATKKENPSDHIDDFYESIKNSISFDVTKPQLVTKIGNLKKKFNGRVSKGLKKGKNEEVMVFSKASDQNCFDLSRKIWGS >AT1G11510.1 tffamily=GeBP TYFQRLWTEDDEIVVLQGLIDDKKDTGVSNTNKVYELVKKSISFDVSKNQLMEKLRALKKKYENNLGKAKDGVEPTFVKPHDRKAFELSKLVWGG >AT4G00250.1 tffamily=GeBP ANPQRVWSEEDEISLLQAVIDFKAETGTSPWDHKNAFFDIAKKSISFDVSHVQFFDKIRRLKNKYFVNRKNKSGESNHDKKCLGLAVLIWGS >AT4G00610.1 tffamily=GeBP LLFQRLWTDEDEIVFLQGMIKFAKDTGKNVSEDMNGFFEKLKDSISFEVKTDQFVNKIRSMKRKYIENKKTTTEHDKKCYELAEIIWVS >AT1G66420.1 tffamily=GeBP PYFQRLWSEEDEIVMLQGIIKFEDVTGKSPFEDRHGFIEFVKNSISFEASVQQYIGKISQLKRKYTRKRKNGFSEGHEQKCFKLAMSIWGT >AT5G41765.1 tffamily=GeBP KSMMDFKALTRHNPSDDMTGAYNFLHEYISVDVYSYEFVEKMKSLKKKLIEKMGINAKDLSSSLLKLIWRY
Searched with the all three Arabidopsis genes
>AT1G01160.1 tffamily=GIF ANNITTEQIQKYLDENKKLIMAIMENQNLGKLAECAQYQALLQKNLMYLA >AT4G00850.1 tffamily=GIF TNNITTEQIQKYLDENKKLIMAILENQNLGKLAECAQYQALLQKNLMYLA >AT5G28640.1 tffamily=GIF PSNVTSDHIQQYLDENKSLILKIVESQNSGKLSECAENQARLQRNLMYLA
Searched with SCL3, SCL6, GAI and SCR. Genes were considered as GRAS factors based on the presence of the conserved SAW domain.
>SCL3 tffamily=GRAS mvamfqedngtssvassplqvfstmslnrptllassspfhclkdlkpeerglylihlllt canhvasgslqnanaaleqlshlaspdgdtmqriaayftealanrilkswpglykalnat qtrtnnvseeihvrrlffemfpilkvsylltnraileamegekmvhvidldasepaqwla llqafnsrpegpphlritgvhhqkevleqmahrlieeaekldipfqfnpvvsrldclnve qlrvktgealavssvlqlhtflasdddlmrkncalrfqnnpsgvdlqrvlmmshgsaaea rendmsnnngyspsgdsasslplpssgrtdsflnaiwglspkvmvvteqdsdhngstlme rlleslytyaalfdcletkvprtsqdrikvekmlfgeeikniiscegferrerheklekw sqridlagfgnvplsyyamlqarrllqgcgfdgyrikeesgcavicwqdrplysvsawrc rk >SCL6 tffamily=GRAS aaifyghhhhtpppakrlnpgpvgiteqlvkaaeviesdtclaqgilarlnqqlsspvgk pleraafyfkealnnllhnvsqtlnpyslifkiaayksfseispvlqfanftsnqalles fhgfhrlhiidfdigyggqwaslmqelvlrdnaaplslkitvfaspanhdqlelgftqdn lkhfaseinisldiqvlsldllgsiswpnssekeavavnisaasfshlplvlrfvkhlsp tiivcsdrgcertdlpfsqqlahslhshtalfesldavnanldamqkierfliqpeiekl vldrsrpierpmmtwqamflqmgfspvthsnftesqaeclvqrtpvrgfhvekkhnslll cwqrtelvgvsawrcrss >GAI tffamily=GRAS mkrdhhhhhqdkktmmmneeddgngmdellavlgykvrssemadvaqkleqlevmmsnvq eddlsqlatetvhynpaelytwldsmltdlnppssnaeydlkaipgdailnqfaidsass snqggggdtyttnkrlkcsngvvetttataestrhvvlvdsqengvrlvhallacaeavq kenltvaealvkqigflavsqigamrkvatyfaealarriyrlspsqspidhslsdtlqm hfyetcpylkfahftanqaileafqgkkrvhvidfsmsqglqwpalmqalalrpggppvf rltgigppapdnfdylhevgcklahlaeaihvefeyrgfvantladldasmlelrpseie svavnsvfelhkllgrpgaidkvlgvvnqikpeiftvveqesnhnspifldrfteslhyy stlfdslegvpsgqdkvmsevylgkqicnvvacdgpdrverhetlsqwrnrfgsagfaaa higsnafkqasmllalfnggegyrveesdgclmlgwhtrpliatsawklstn >SCR tffamily=GRAS maesgdfnggqppphsplrttssgssssnnrgpppppppplvmvrkrlasemssnpdynn ssrpprrvshlldsnyntvtpqqppsltaaatvssqpnpplsvcgfsglpvfpsdrggrn vmmsvqpmdqdsssssasptvwvdaiirdlihsstsvsipqliqnvrdiifpcnpnlgal leyrlrslmlldpssssdpspqtfeplyqisnnpsppqqqqqhqqqqqqhkpppppiqqq erensstdappqpetvtatvpavqtntaealrerkeeikrqkqdeeglhlltlllqcaea vsadnleeankllleisqlstpygtsaqrvaayfseamsarllnsclgiyaalpsrwmpq thslkmvsafqvfngisplvkfshftanqaiqeafekedsvhiidldimqglqwpglfhi lasrpggpphvrltglgtsmealqatgkrlsdfadklglpfefcplaekvgnldterlnv rkreavavhwlqhslydvtgsdahtlwllqrlapkvvtvveqdlshagsflgrfveaihy ysalfdslgasygeeseerhvveqqllskeirnvlavggpsrsgevkfeswrekmqqcgf kgislagnaatqatlllgmfpsdgytlvddngtlklgwkdlslltasawtprs
Searched with the Arabidopsis genes
>AT2G22840.1 tffamily=GRF TQWAELEQQALIYKYITANVPVPSSLLLSLKKSFFPYGSLPPNSFGWGSFHLGFSGGNMDPEPGRCRRTDGKKWRCSRDAVPDQKYCERHINRGRHRSRK >AT2G36400.1 tffamily=GRF AQWQELELQALIYRYMLAGAAVPQELLLPIKKSLLHLSPSYFLHHPLQHLPHYQPAWYLGRAAMDPEPGRCRRTDGKKWRCSRDVFAGHKYCERHMHRGRNRSRK >AT2G45480.1 tffamily=GRF AQLMEFRMQALVYRYIEAGLRVPHHLVVPIWNSLALSSSSNYNYHSSSLLSNKGVTHIDTLETEPTRCRRTDGKKWRCSNTVLLFEKYCERHMHRGRKRSRK
Searched with the Arabidopsis genes
>AT1G20693.1 tffamily=HMG PKRPASAFFVFMEDFRETFKKENPKNKSVATVGKAAGDKWKSLSDSEKAPYVAKAEKRKVEYEKNIKAYN >AT3G51880.1 tffamily=HMG PKRAPSAFFVFLEDFRVTFKKENPNVKAVSAVGKAGGQKWKSMSQAEKAPYEEKAAKRKAEYEKQMDAYN >AT2G34450.1 tffamily=HMG PKKPATAFFFFLDDFRKQYQEENPDVKSMREIGKTCGEKWKTMTYEEKVKYYDIATEKREEFHRAMTEYT >AT5G23405.1 tffamily=HMG LTDFAVFMNHFRKSFRTDYNGALVKEGSKIGWEMWKSMTEDEKKDYLDKAADEEDEDEDTVEEQA >AT5G23420.1 tffamily=HMG PKRPLTAFFIFMSDFRKTFKSEHNGSLAKDAAKIGGEKWKSLTEEEKKVYLDKAAELKAEYNKSLESND >AT4G11080.1 tffamily=HMG PKQPISAYLIYANERRAALKGENKSVIEVAKMAGEEWKNLSEEKKAPYDQMAKKNKEIYLQEMEGYK >AT3G28730.1 tffamily=HMG PKRAMSGFMFFSQMERDNIKKEHPGIAFGEVGKVLGDKWRQMSADDKEPYEAKAQVDKQRYKDEISDYK
Homeodomain genes were isolated from the tobacco GSSs by searches with the homeodomain from at least one member of each of the major subgroups of plant homeodomain protein. This approach was crucial to success as, unlike the other transcription factor families, searches with a homeodomain from one family often only resulted in isolation of genes from that family and not the complete homeodomain family.
>knotted1 tffamily=Homeodomain KKRKKGKLPKDARTALLDWWNTHyrwPYPTEEEKNRLSEITGLDPKQINNWFINQRKRHWRPSEDM >KNAT1 tffamily=Homeodomain SKKKKKGKLPKEARQKLLTWWELHYKWPYPSESEKVALAESTGLDQKQINNWFINQRKRH >HAT1 tffamily=Homeodomain GGETCRKKLRLSKDQSAVLEDTFKEHNTLNPKQKLALAKKLGLTARQVEVWFQNRRARTK >GLABRA2 tffamily=Homeodomain KRKRKKYHRHTTDQIRHMEALFKETPHPDEKQRQQLSKQLGLAPRQVKFWFQNRRTQIK >PRHA tffamily=Homeodomain HIFCAECNSREAFPDNDIILCDGTCNRAFHQKCLDPPLETESIPPGDQGWFCKFCDCKI >HAT3.1 tffamily=Homeodomain EIEKSSSSACKQTDPKTQRLYISFQENQYPDKATKESLAKELQMTVKQVNNWFKHRRWSIN >BELL1 tffamily=Homeodomain PWRPQRGLPERAVTTLRAWLFEHFLHPYPSDVDKHILARQTGLSRSQVSNWFINARVRLW >knotted3 tffamily=Homeodomain KKKKKGKLPKDARQKLLSWWELHYKWPYPSESEKVALAETTGLDQKQINNWFINQRKRHW >knotted2 tffamily=Homeodomain KKKKKGKLPKDARQKLLSWWELHYKWPYPSESEKVALAETTGLDQKQINNWFINQRKRHW >H1 tffamily=Homeodomain KKRKKGKLPKEARQQLLDWWTRHYKWPYPSESQKLALAESTGLDQKQINNWFINQRKRHW >Hfi22 tffamily=Homeodomain KKRRLSVEQVKALEKNFEVENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQ >homeobox22 tffamily=Homeodomain KKKKKGKLPKEARQMLLAWWNDHYRWPYPTEADKNSLAESTGLDPKQINNWFINQRKRHW >homeobox20 tffamily=Homeodomain KKKKKGKLPKDARQKLLSWWELHYKWPYPSESEKVALAETTGLDQKQINNWFINQRKRHW >homeobox9 tffamily=Homeodomain KKNKKGKLPREARQILLNWWTTHYKWPYPTEGEKICLAESTGLDPKQINNWFINQRKRHW >NTH23 tffamily=Homeodomain RKRRAGKLPGDTTSVLKAWWQSHAKWPYPTEEDKAKLVQETGLQLKQINNWFINQRKRDW >NTH15 tffamily=Homeodomain KKRKKGKLPKEARQQLLDWWTRHYKWPYPSESQKLALAESTGLDQKQINNWFINQRKRHW >ATHB-7 tffamily=Homeodomain KKSNHNKNNQRRFSDEQIKSLEMMFESETRLEPRKKVQLARELGLQPRQVAIWFQNKRARWK >KNAT4 tffamily=Homeodomain LRKRRAGKLPGDTTSVLKSWWQSHSKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNW >WUSCHEL tffamily=Homeodomain QTSTRWTPTTEQIKILKELYYNNAIRSPTADQIQKITARLRQFGKIEGKNVFYWFQNHKARER >REVOLUTA tffamily=Homeodomain LDSSGKYVRYTAEQVEALERVYAECPKPSSLRRQQLIRECSILANIEPKQIKVWFQNRRCRDK >BELL1-RELATED tffamily=Homeodomain lskllsildevdrnykqyyhqmqivvssfdviagcgaakpytalalqtisrhfrclrdaisg
Searched with the parts of the two Arabidopsis genes that are conserved.
>AT4G26170.1 tffamily=HRT MIRCRSKPVSR >AT5G56780.1 tffamily=HRT MEPCNKRPVPG >AT4G26170.1 tffamily=HRT RKRCEDHKGMRVNAFFFLLNPTERDKAVNEDKSKPETSTG-MNQEGSGLLCEATTKNGLP >AT5G56780.1 tffamily=HRT RKRCEDHKGMRINAFLFLLNQTDREKTVKDEKPDPESHTESIEEEALTRFCEATTKNGLP >AT4G26170.1 tffamily=HRT CTRSAPEGSKRCWQHKDKTLNHGSSENVQSATASQVICGFKLYNGSVCEKSPVKGRKRCE >AT5G56780.1 tffamily=HRT CTRSSPKGSKRCWQHKEKTSSDTSPVYFQPEAAKNVACGVKLGNGLICERSPVKGRKRCE >AT4G26170.1 tffamily=HRT EHKGMRITS >AT5G56780.1 tffamily=HRT EHKGMRIT
Searched with the Arabidopsis genes
>AT1G46264.1 tffamily=HSF PAPFLTKTYQLVDDPATDHVVSWGDDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFFKRGEKHL LCEIHRRKTS >AT5G62020.1 tffamily=HSF TPFLTKTFNLVEDSSIDDVISWNEDGSSFIVWNPTDFAKDLLPKHFKHNNFSSFVRQLNTYGFKKVVPDRWEFSNDFFKRGEKRLLREIQRRKIT >AT5G16820.1 tffamily=HSF PPFLSKTYDMVDDPLTNEVVSWSSGNNSFVVWSAPEFSKVLLPKYFKHNNFSSFVRQLNTYGFRKVDPDRWEFANEGFLRGRKQLLKSIVRRKPS >AT4G17750.1 tffamily=HSF PPFLSKTYDMVEDPATDAIVSWSPTNNSFIVWDPPEFSRDLLPKYFKHNNFSSFVRQLNTYGFRKVDPDRWEFANEGFLRGQKHLLKKISRRKSV >AT3G63350.1 tffamily=HSF SPFLTKTFEMVGDPNTNHIVSWNRGGISFVVWDPHSFSATILPLYFKHNNFSSFVRQLNTYGFRKIEAERWEFMNEGFLMGQRDLLKSIKRRTSS >AT1G67970.1 tffamily=HSF APFLRKCYDMVDDSTTDSIISWSPSADNSFVILDTTVFSVQLLPKYFKHSNFSSFIRQLNIYGFRKVDADRWEFANDGFVRGQKDL LKNVIRRKN >AT3G24520.1 tffamily=HSF APFIVKTYQMVNDPSTDWLITWGPAHNSFIVVDPLDFSQRILPAYFKHNNFSSFVRQLNTYGFRKVDPDRWEFANEHFLRGQKHLLNNIARRKHA >AT1G77570.1 tffamily=HSF FYMRVYEVVDDASTDAIISWSESNNSFIIWNVGEFYRRILPKYVDLGTNLSRFFSNLRSHGFKIVKGRTGVLEFGHEDFIRDKLELMKKMVSDKR >AT4G18870.1 tffamily=HSF PFPTKIYEMVDDPSSDAIISWSQSGKSFIIWNPQEFCKDHLRRLFNTLHIHFFFYKLKIFGFKKINPKKWEFANDNFVRGQRHLVEIIISNDKK
Searched with seven Arabidopsis sequences from the DATF multisequence alignment. This part of the proteins appear to represent the JmjC domains.
>AT2G34880.1 tffamily=JUMONJI WLYVGMCFSTFCWHVEDNHLYSLNYHHFGEPKVWYGVPGSHATGLEKAMRKHLPDLFDEQPDLLHELVTQFSPTILKNEGVPVYRAVQNAGEYVLTFPRAYHSGFNCGFNCAEAVNV >AT3G48430.1 tffamily=JUMONJI MVYVAMMFSWFAWHVEDHDLHSLNYLHMGAGKTWYGVPKDAALAFEEVVRVHGYGEELNPLVTFSTLGEKTTVMSPEVFVKAGIPCCRLVQNPGEFVVTFPGAYHSGFSHGFNFGEASNI >AT2G38950.1 tffamily=JUMONJI RLSVGMCLSSQFWKSEKERLYSLCYLHVGAPRVWYSVAGCHRSKFKAAMKSFILEMSGEQPKKSHNPVMMMSPYQLSVEGIPVTRCVQHPGQYVIIFPGSYYSAFDCGFNCLEKANF >AT1G09060.1 tffamily=JUMONJI MESSCTSSCAGGAQWDVFRRQDVPKLSGYLQRTFQKPDNIQTDFVSRPLYEGLFLNEHHKRQLRDEFGVEPWTFEQHRGEAIFIPAGCPFQITNLQSNIQVALDF >AT1G11950.1 tffamily=JUMONJI VVYDETSGALWDIFKREDVPKLEEYLRKHCIEFRHTYCSRVTKVYHPIHDQSYFLTVEHKRKLKAEFGIEPWTFVQKLGEAVFIPAGCPHQVRNLKSCTKVAVDF >AT3G07610.1 tffamily=JUMONJI ENSRQQVQNVETDDGALWDIFRREDIPKLESYIEKHHKEFRHLYCCPVSQVVHPIHDQNFYLTRYHIMKLKEEYGIEPWTFNQKLGDAVLIPVGCPHQVRNLKSCNKVALDF >AT5G63080.1 tffamily=JUMONJI FVYMGGKGSWTPLHADVFRSYSWSANVCGKKRWLFLPPPQSHLVYDRYMVACPVSIIEWFMNFYDDTKDWEKKPIECICKAGEVMFVPNGWWHLVINLEESIAINHNW
Searched with the Arabidopsis LFY gene.
>AT5G61850.1 Arabidopsis LFY family transcription factor, protein sequence 420AA tffamily=LFY MDPEGFTSGLFRWNPTRALVQAPPPVPPPLQQQPVTPQTAAFGMRLGGLEGLFGPYGIRFYTAAKIAELGFTASTLVGMK DEELEEMMNSLSHIFRWELLVGERYGIKAAVRAERRRLQEEEEEESSRRRHLLLSAAGDSGTHHALDALSQEGLSEEPVQ QQDQTDAAGNNGGGGSGYWDAGQGKMKKQQQQRRRKKPMLTSVETDEDVNEGEDDDGMDNGNGGSGLGTERQREHPFIVT EPGEVARGKKNGLDYLFHLYEQCREFLLQVQTIAKDRGEKCPTKVTNQVFRYAKKSGASYINKPKMRHYVHCYALHCLDE EASNALRRAFKERGENVGSWRQACYKPLVNIACRHGWDIDAVFNAHPRLSIWYVPTKLRQLCHLERNNAVAAAAALVGGI SCTGSSTSGRGGCGGDDLRF
Searched with the Arabidopsis genes
>AT2G45800.1 tffamily=LIM CATCKKTVYPLEKVTMEGESYHKTCFRCTHSGCPLTHSSYASLNGVLYCKVHFNQL >AT2G45800.1 tffamily=LIM CKACDKTVYVMDLLTLEGNTYHKSCFRCTHCKGTLVISNYSSMDGVLYCKPHFEQL >AT1G10200.1 tffamily=LIM CMACDKTVYLVDKLTADNRVYHKACFRCHHCKGTLKLSNYNSFEGVLYCRPHFDQN >AT5G66620.1 tffamily=LIM CGGCNFAVEHGGSVNILGVLWHPGCFCCRACHKPIAIHDIENHVSNSRGKFHKSCYERY >AT5G17890.1 tffamily=LIM CKDCKSAIEDGISINAYGSVWHPQCFCCLRCREPIAMNEISDLRGMYHKPCYKEL >AT1G19270.1 tffamily=LIM CAGCNMEIGHGRFLNCLNSLWHPECFRCYGCSQPISEYEFSTSGNYPFHKACYRER >AT2G39830.1 tffamily=LIM CGGCNSDIGSGNYLGCMGTFFHPECFRCHSCGYAITEHEIPTNDAGLIEYRCHPFWNQK >AT4G36860.1 tffamily=LIM CNACDKPIIDYEFSMSGNRPYHKLCYKEQHHPKCDVCHNFIPTNPAGLIEYRAHPFWMQK
Searched with LUFS region of: AT2G32700.6, AT4G32551.1
This is the N-terminal portion to about amino acid 88. Outside this, LUF genes have Q-rich regions and WD repeats that are not specific to LUG.
>Gene_1 tffamily=LUG MSQTNWEADKMLDVYIHDYLVKRDLKATAQAFQAEGKVSSDPVAIDAPGGFLFEWWSVFWDIFIARTNEKHSEVAASYIETQMIKARE >Gene_2 tffamily=LUG FLMAQSNWEADKMLDVYIYDYLVKKKLHNTAKSFMTEGKVSPDPVAIDAPGGFLFEWWSVFWDIFIARTNEKHSEAAAAYIEAQQGKAKE
Searched with a representative domain from each subfamily
>MIKC tffamily=MADS LGRGKIEIKRIENTTNRQVTFCKRRNGLLKKAYELSVLCDAEVALVIFSTRG >Malpha tffamily=MADS GRRKVEIVKMTKESNLQVTFSKRKAGLFKKASEFCTLCDAKIAMIVFSPAG >Mbeta tffamily=MADS MGRKMVKMTRITNEKTRITTYKKRKACLYKKASEFSTLCGVDTCVIVYGPSRAG >Mdelta tffamily=MADS MGRVKLKIKKLENTNGRQSTFAKRKNGILKKANELSILCDIDIVLLMFSPTG >Mgamma tffamily=MADS TRKKVKLAYISNDSSRKATFKKRKKGLMKKVHELSTLCGITACAIIYSPYDTNPEVWPSNSG
Searched with the three full Arabidopsis genes
>AT2G42680.1 tffamily=MBF MAGIGPITQDWEPVVIRKKPANAAAKRDEKTVNAARRSGADIETVRKFNAGTNKAASSGTSLNTKMLDDDTENLTHERVPTELKKA IMQARTDKKLTQSQLAQIINEKPQVIQEYESGKAIPNQQILSKLERALGAKLRGKK >AT3G58680.1 tffamily=MBF MAGIGPITQDWEPVVIRKRAPNAAAKRDEKTVNAARRSGADIETVRKFNAGSNKAASSGTSLNTKKLDDDTENLSHDRVPTELKKA IMQARGEKKLTQSQLAHLINEKPQVIQEYESGKAIPNQQILSKLERALGAKLRGKK >AT3G24500.1 tffamily=MBF MPSRYPGAVTQDWEPVVLHKSKQKSQDLRDPKAVNAALRNGVAVQTVKKFDAGSNKKGKSTAVPVINTKKLEEETEPAAMDRVKAEVRLMIQKARLEKKMSQADLAKQINERTQVVQEYENGKAVPNQAVLAKMEKVLGVKLRGKIGK
n.b Genes that contain only a single canonical R3 domain have been grouped with the R2R3MYBS. The MYB-related category here only represents divergent single domains.
We consider R2 domains with the sequence GKSCRLRW and R3 domains with the sequence LPGRTDN not to be MYB-related.
These two are in the R2R3MYB category and are not among the MYB-related genes in DATB.
>AT4G01060.1 tffamily=MYB-related VVNMSQEEEDLVSRMHKLVGDRWELIAGRIPGRTAGEIERFWVMKN >AT5G59780.2 tffamily=MYB-related RGKMTPQEERLVLELHAKWGNRWSKIARKLPGRTDNEIKNYWRTHM >AT1G01060.1 tffamily=MYB-related RERWTEDEHERFLEALRLYGRAWQRIEEHIGTKTAVQIRSHAQKFF >AT5G02840.2 tffamily=MYB-related RESWTEGEHDKFLEALQLFDRDWKKIEDFVGSKTVIQIRSHAQKYF >AT1G19000.1 tffamily=MYB-related GVPWTENEHKRFLIGLQKVGKGDWKGISRNFVKSRTPTQVASHAQKYF >AT2G36960.1 tffamily=MYB-related WAAWTHQEEESFFTALRQVGKNFEKITSRVQSKNKDQVRHYYYRLV >AT3G05380.1 tffamily=MYB-related GPQWTRLELERFYDAYRKHGQEWRRVAAAIRNSRSVDMVEALFNMNR >AT5G58340.1 tffamily=MYB-related KKFWKPEEVEALREGVKEYGKSWKDIKNGNPTVFAERTEVDLKDKWRNLV >AT1G09710.1 tffamily=MYB-related RKRWSAEEDEELFAAVKRCGEGNWAHIVKGDFRGERTASQLSQRWALIR >AT2G44430.1 tffamily=MYB-related TQAWGTWEELLLACAVKRHGFGDWDSVATEVRSRSSLSHLLASANDCRHKYRDLK >AT2G47210.1 tffamily=MYB-related DSVWTKEETDQLFEFCQNFDLRFVVIADRFPVSRTVEELKDRYYSVN
Searched with the Arabidopsis NAC2 gene AAF09254
>NAC tffamily=NAC mgrgsvaslapgfrfhptdeelvryylkrkvcnkpfkfdaisvtdiyksepwdlpdkskl ksrdlewyffsmldkkysngsktnratekgywkttgkdreirngsrilgmkktlvyhkgr aprgektnwvmqeyrlsdedlkkagvpqeayvlcrifqksgtvpkngeqygapyleeewe edgmtyvpaqdafseglalnddvyvdiddidekpenlvvydavpilpnychgessnnves gnysdsgnyiqpgnnvvdsggyfeqpietfeedrkpiiregsiqpcslfpeeqigcgvqd envvnlessnnnvfvadtcysdipidhnylpdepfmdpnnnlplndglyletndlscaqq ddfnfedylsffddegltfdesllmgpedflpnpetleqkpapkemekergrrrqrssgg kgkwrkiffqnkytdfkdfdsapkypflkktshmlgaiptpssfasqfqtkdamrlhaaq ssgsvhvtagmmrisnmtlaadsgmgwsydkngnlnvvlsfgvvqqddamtasgsktgit atramlvfmclwvlllsvsfkivtmvsar
Searched with the Arabidopsis genes
>AT1G20640.1 tffamily=Nin TKADKTITLDVLRQYFAGSLKDAAKNIGVCPTTLKRICRQHGIQRWPSRKIKKV >AT2G17150.1 tffamily=Nin TEKTIGLEVLRQYFAGSLKDAAKSIGVCPTTLKRICRQHGIMRWPSRKIKKV >AT3G59580.1 tffamily=Nin TEKNVSLNVLQQYFSGSLKDAAKSLGVCPTTLKRICRQHGIMRWPSRKINKV >AT1G18790.1 tffamily=Nin VSKTLSKETISLYFYMPITQAARELNIGLTLLKKRCRELGIKRWPHRKLMSL >AT5G53040.1 tffamily=Nin RQDKLEMSEIKQFFDRPIMKAAKELNVGLTVLKKRCRELGIYRWPHRKLKSL >AT4G35590.1 tffamily=Nin HVAELSLEELSKYFDLTIVEASRNLKVGLTVLKKKCREFGIPRWPHRKIKSL
Searched with the single NZZ gene from Arabidopsis.
>AT4G27330.1 Arabidopsis NZZ family transcription factor, protein sequence 314AA tffamily=NZZ MATSLFFMSTDQNSVGNPNDLLRNTRLVVNSSGEIRTETLKSRGRKPGSKTGQQKQKKPTLRGMGVAKLERQRIEEEKKQ LAAATVGDTSSVASISNNATRLPVPVDPGVVLQGFPSSLGSNRIYCGGVGSGQVMIDPVISPWGFVETSSTTHELSSISN PQMFNASSNNRCDTCFKKKRLDGDQNNVVRSNGGGFSKYTMIPPPMNGYDQYLLQSDHHQRSQGFLYDHRIARAASVSAS STTINPYFNEATNHTGPMEEFGSYMEGNPRNGSGGVKEYEFFPGKYGERVSVVAKTSSLVGDCSPNTIDLSLKL
Searched with 16 Arabidopsis genes, full length except omiting the N-terminal extensions found in some proteins.
>AT3G03750.1 tffamily=PcG TQKGVSVSLKIVRDEKKGWCLYADQLIKQARRRQNIYDKLRSTQSFASALLVVREHLPSGQACLRINIDATRIGNVARFINHSCDGGNLSTVLLRSSGALLPRLCFFAAKDIIAEEELSFSYGDVSVAGE >AT5G43990.1 tffamily=PcG VQQGIHNKLQVFFTPNGRGWGLRTLEKLPKGAFVCELAGEILTIPELFQRISDRPTSPVILDAYWGSEDISGDDKALSLEGTHYGNISRFINHRCLDANLIEIPVHAETTDSHYYHLAFFTTREIDAMEELTWDYGVPFNQD >AT3G04380.1 tffamily=PcG VQRGIRCQLQVYFTQEGKGWGLRTLQDLPKGTFICEYIGEILTNTELYDRNVRSSSERHTYPVTLDADWGSEKDLKDEEALCLDATICGNVARFINHRCEDANMIDIPIEIETPDRHYYHIAFFTLRDVKAMDELTWDYMIDFNDK >AT2G22740.1 tffamily=PcG TQHGIKLPLEIFKTKSRGWGVRCLKSIPIGSFICEYVGELLEDSEAERRIGNDEYLFDIGNRYDNSLAQGMSELMLGTQAGRSMAEGDESSGFTIDAASKGNVGRFINHSCSPNLYAQNVLYDHEDSRIPHVMFFAQDNIPPLQELCYDYNYALDQVR >AT4G13460.1 tffamily=PcG TQKGLRNRLEVFRSLETGWGVRSLDVLHAGAFICEYAGVALTREQANILTMNGDTLVYPARFSSARWEDWGDLSQVLADFERPSYPDIPPVDFAMDVSKMRNVACYISHSTDPNVIVQFVLHDHNSLMFPRVMLFAAENIPPMTELSLDYGVVDDWN >AT5G13960.1 tffamily=PcG SQKRLRFNLEVFRSAKKGWAVRSWEYIPAGSPVCEYIGVVRRTADVDTISDNEYIFEIDCQQTMGRQRRLRDVAVPMNNGVSQSSEDENAPEFCIDAGSTGNFARFINHSCEPNLFVQCVLSSHQDIRLARVVLFAADNISPMQELTYDYGYALDSVH >AT1G17770.1 tffamily=PcG VQTGLKLHLEVFKTRNCGWGLRSWDPIRAGTFICEFAGLRKTKEEVEEDDDYLFDTSKIYQRFRWNYEPELLLEDSWEQVSEFINLPTQVLISAKEKGNVGRFMNHSCSPNVFWQPIEYENRGDVYLLIGLFAMKHIPPMTELTYDYGVSCVER >AT4G02020.1 tffamily=PcG LLLRQQQRILLGKSDVAGWGAFLKNSVSKNEYLGEYTGELISHHEADKRGKIYDRANSSFLFDLNDQYVLDAQRKGDKLKFANHSAKPNCYAKVMFVAGDHRVGIFANERIEASEELFYDYRYGPDQA >AT1G77300.1 tffamily=PcG FQKRKYVKFERFQSGKKGYGLRLLEDVREGQFLIEYVGEVLDMQSYETRQKEYAFKGQKHFYFMTLNGNEVIDAGAKGNLGRFINHSCEPNCRTEKWMVNGEICVGIFSMQDLKKGQELTFDYNYVRVFGA >AT1G01920.1 tffamily=PcG ERFLDWLQVNGGELRGCNIKYSDSLKGFGIFASTSTQASDEVLLVVPLDLAITPMRVLQDPECQKMFEQGQVDDRFLMILFLTLERLRINSSWKPYLDMLPTRFGLYHATELQKKKLLSLYHDKVEVLVTKLLILDGDSESKVSFEHFLWFWSRALNIPLPHSFVFPQSQDDTGECTSTSAQPAPSVGSGDTIWVEGLVPGIDFCNHDLKP >AT5G14260.1 tffamily=PcG LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSVWYPYIRELDRQRGLWSEAELDYLTGSPTKAEVLERAEGIKREYNELDTVWFMFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVHLQNVGLARRFALVPLGPPLLAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVVWCGPQPNAK >AT2G05900. tffamily=PcG PAFGIWKSIQNWRNGLSIRPGLILEDLSNGAENLKVCLVNEVDKENGPALFRYVTSLIHEVINNIPSMVDRCACGRRSCGSKHVFREKLSVSSSLVISAKKSGNVARFMNHSCSPNVFWQSIAREQNGLWCLYIGFFAMKHIPPLTELRYDYGKSRGGGK >AT5G17240.1 tffamily=PcG ETFLRWAAEIGISDSIDSSRFRDSCLGHSLSVSDFPDARELKKGELVLKVPRKALMTTESIIAKSDAVNLHNSLSSTQILSVCLLYEMSKEKKSFWYPYLFHIPRDYDLLATFGNFEKQALQVEDAVWATEKATAKCQSEWKEAGSLMKELELKSFQAWLWASATISSRTLHVPWDSAGCLCPVGDLFNYDAPGDYSNTPQGPESANNVEEAGLVVETGFEEDVNAYCLYARRNYQLGEQVLLCYGTYTNLE >AT2G18850.1 tffamily=PcG QDNGVKTKLQIAQIDGYGRGAIASEDLKFGDVALEIPVSSIISEESDMYPILETFDGITSETMLLLWTMREKHNLDSKFKPYFDSLQENFCTGLSFGVDAIMELDGTLLLDEIMQAKELLRERYDELIPLLSNHREVFPPELYTWEHYLWACELYYSNSMQIKFPDGKLKTCLIPVAGFLNHSIYPHIVKYGKVDIETSSLKFPVSRPCNKGEQCFLSYGNYSSSH >AT1G24610.1 tffamily=PcG PDLIRWIKREGGFVHHAVKLSQETQFGIGLISTEQISPGTDLISLPPHVPLRFESDDSSSLLSALARRVPEELWAMKLGLRLLQERANADSFWWPYISNLPETYTVPIFFPGEDIKNLQYAPLLHQVNKRCRFLLEFEQEIRRTLEDVKASDHPFSGQDVNASALGWTMSAVSTRAFRLHGNKKLQGGSSDDVPMMLPLIDMCNHSFKPNGADSNTLVKVVAETEVKENDPLLLNYGCLSNDF
Searched with PHD proteins from Arabidopsis
>AT1G32800.1 tffamily=PHD DCVCGVNDDDGTEMVKCDDCGVWVHTRCSRFVEGQELFTCHKCKSK >AT1G33420.1 tffamily=PHD DCKCGTKDDDGERMLACDGCGVWHHTRCIGINNADALPSKFLCFRCIEL >AT5G60410.1 tffamily=PHD RCVCGNSLETDSMIQCEDPRCHVWQHVGCVILPDKPMDGNPPLPESFYCEICRLT >AT3G24010.1 tffamily=PHD YCICNQVSFGEMVACDNNACKIEWFHFGCVGLKEQPKGKWYCPECATV >AT3G08020.1 tffamily=PHD YCPVCLKVYRDSESTPMVCCDICQRWVHCHCDGISDDKYMQFQVDGKLQYKCATCRGE >AT5G36670.1 tffamily=PHD TCGICGDGGDLICCDGCPSTFHQSCLDIKKFPSGAWYCYNCSCK >AT1G77250.1 tffamily=PHD LCRTCGTKVDSGGKYITCDHPFCPHKYYHIRCLTSRQIKLHGVRWYCSSCLCR >AT5G63900.1 tffamily=PHD VCCVCHWGGDLLLCDGCPSAFHHACLGLSSLPEEDLWFCPCCCCD >AT3G51120.1 tffamily=PHD VCFICFDGGDLVLCDRRNCPKAYHPACIKRDEAFFRTTAKWNCGWHICG >AT5G35210.1 tffamily=PHD ECRICGMDGTLLCCDGCPLAYHSRCIGVVKMYIPDGPWFCPECTIN >AT3G01460.1 tffamily=PHD VCKVCGVDKDDDSVLLCDTCDAEYHTYCLNPPLIRIPDGNWYCPSCVIA >AT3G52100.1 tffamily=PHD SCRICEGCGTLGDPKKFMFCKRCDDAYHCDCQHPRHKNVSSGPYLCPKHTKC >AT5G09790.1 tffamily=PHD TCEKCGSGEGDDELLLCDKCDRGFHMKCLRPIVVRVPIGTWLCVDCSDQ >AT3G05670.1 tffamily=PHD ICTECHQGDDDGLMLLCDLCDSSAHTYCVGLGREVPEGNWYCEGCRPV >AT1G43770.1 tffamily=PHD VCQTCGDIGFEEALVFCDSCMFESIHRYCLGITPIPFTEYITWICEDCDNS >AT2G25170.1 tffamily=PHD ACQACGESTNLVSCNTCTYAFHAKCLVPPLKDASVENWRCPECVSP >AT1G05830.1 tffamily=PHD KCNVCHMDEEYENNLFLQCDKCRMMVHTRCYGQLEPHNGILWLCNLCRPV >AT1G77800.1 tffamily=PHD FCCTGHHQLIVCTSCKATVHKKCYGLLEDSGKPWLCSWCELE >AT5G53430.1 tffamily=PHD RCAVCRWVEDWDYNKIIICNRCQIAVHQECYGTRNVRDFTSWVCKACETP >AT1G77800.1 tffamily=PHD TCDICRRSETIWNLIVVCSSCKVAVHIDCYKCAKESTGPWYCELCAES >AT3G20280.1 tffamily=PHD ACQICEVTINEMDTLLICDACEKAYHLKCLQGNNMKGVPKSEWHCSRCVQA >AT2G37520.1 tffamily=PHD GCVFCRSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENGFCDLKEIPQEKWFCCSNC >AT3G08020.1 tffamily=PHD MCRMCFLGEGEGSDRARRMLSCKDCGKKYHKNCLKSWAQHRDLFHWSSWSCPSCRVC >AT3G01460.1 tffamily=PHD SCGACGRPESIELVVVCDACERGFHMSCVNDGVEAAPSADWMCSDCRTG >AT4G39100.1 tffamily=PHD FCKCEMPYNPDDLMVQCEECSEWFHPSCIGTTEEAKKPDNFYCEECSPQ >AT4G23860.1 tffamily=PHD YCTCDRPYPDPNVEEQVEMIQCCLCEDWFHEEHLGLTPSDDEESEPIYEDFICQNCSPA >AT5G35210.1 tffamily=PHD VCGICLLPYNPGLTYIHCTKCEKWFHTEAVKLKDSQIPEVVGFKCCKCRRI >AT5G63900.1 tffamily=PHD ICGSMESPANSKLMACEQCQRRFHLTCLKEDSCIVSSRGWFCSSQCNR >AT4G10940.1 tffamily=PHD LCKIRNTFSYIEGDSNLDTSIACDSCDMWYHAICVGFDVENASEDTWVCPSKDTL
Searched with the Arabidopsis genes
>AT1G21000.1 tffamily=PLATZ YTSPPWLMPMLRGSYFVPCSIHVDSNKNECNLFCLDCAGNAFCSYCLVKHKDHRVVQIRRSSYHNVVRVNEIQKFIDIACVQTYIINSAKIVFLNERPQPRIGKGVTNTCEICCRSLLDSFRFCSLGC >AT1G43000.1 tffamily=PLATZ VMTPPWLTPMLRADYFVTCSIHSQSSKSECNLFCLDCSGNAFCSSCLAHHRTHRVIQIRRSSYHNVVRVSEIQKHIDISCIQTYVINSAKIFFLNARPQCRTGKSLNKTCQICSRNLLDSFLFCSLAC >AT2G27930.1 tffamily=PLATZ MEEPKWLEGLLRTNFFSICPRHRETPRNECNMFCLSCQNAAFCFYCRSSFHIDHPVLQIRRSSYHDVVRVSEIENALDIRGVQTYVINSARVLFLNERPQPKNSSHEPFLIPFASVPWVARFVFLHLFS >AT1G32700.2 tffamily=PLATZ MYCLDCTNGPLCSLCLSFHKDHHAIQIRRSSYHDVIRVSEIQKFLDITGVQTYVINSAKVVFLNERPQPRPGKGVINTCEVCYRSLVDSFRFCSLGC >AT2G12646.1 tffamily=PLATZ IQKPAWLDALYAEKFFVGCPYHETAKKNERNVCCLDCCTSLCPHCVPSHRFHRLLQVRRYVYHDVVRLEDLQKLIDCSNVQAYTINSAKVVFIKKRPQNRQFKGAGNYCTSCDRSLQEPYIHCSLGC >AT1G31040.1 tffamily=PLATZ MQVDYLSYQGDDLSSILYRIDESDFTFEGLRMDGHDQLGEISTMEDGEDILVISDESEQGNNSHKKEKKKSKKKKPESNYLPGMVLSSLGN
Searched with R2R3 first position consensus and second position consensus (Stracke et al., 2001)
>R2R3MYB search_1 (R2) tffamily=R2R3MYB LKKGPWTPEEDEKLINYILKHGEGNWRSLPKKAGLKRCGKSCRLRWTNYLRPD >R2R3MYB search_2 (R3) tffamily=R2R3MYB IKRGNFTEEEEELIIELHALLGNRWSKIAKRLPGRTDNEIKNYWNTHLKKK >R2R3MYB search_3 (R2) tffamily=R2R3MYB VRRGAWSAEEDQILVSFVQLYGHRCWNAVARLSGLGRSGKSCRLRWINQLKPN >R2R3MYB search_4 (R3) tffamily=R2R3MYB LRKGAISPQEDQTLLRAHSKWGNKWAAIARHLPGRTDNDVKNHWRSRIRRR
Searched with all three Arabidopsis genes
>AT2G37120.1 Arabidopsis S1Fa-like family transcription factor, protein sequence 76AA tffamily=S1Fa MSSDGSAGKAVVEAKGLNPGLIVLLVIGGLLVTFLIANYVMYMYAQKNLPPRKKKPLSKKKLKREKLKQGVPVPGE >AT3G09735.1 Arabidopsis S1Fa-like family transcription factor, protein sequence 73AA tffamily=S1Fa MAAEFDGKIESKGLNPGLIVLLVIGGLLLTFLVGNFILYTYAQKNLPPRKKKPVSKKKMKKEKMKQGVQVPGE >AT3G53370.1 Arabidopsis S1Fa-like family transcription factor, protein sequence 76AA tffamily=S1Fa MDGEDFAGKAAAEAKGLNPGLIVLLVVGGPLLVFLIANYVLYVYAQKNLPPRKKKPVSKKKLKREKLKQGVPVPGE
Searched with the single Arabidopsis gene.
>SAP tffamily=SAP MSTSSSSSDNGAGGSGGVFEAPSPSRPRRGANDVWPEPFLESLAVQVAVNASTSAGLLAAAPALANVFRVCTTWHAVSRSDHLWQLLSRQVWARTHLMHDTWRDEFIYRHRTARNFRTRTHTYFTLQFDPSDVDEPDSLSCRCLTLSDLYLAAGFADGTVRLFLLNNRLHVRTLRPPLRDRFGRFSRAVSGIVISDSRLTFATMDGDIHVAEIDGVGHTRTAYAGDIVNDGALVDFTGCGRWWVGLFAGVPGRAFHIWDCNSEETTFVGGTLTDPEAVMGWHTLTELTTSLGRLRISGNETAVACTRWRIMVIDLRNQGVIIGEDEEQRRGLIVTGFDANDEAYVRLDSRGNASVRRVNTQQTVCEFRVSGAAQRRVMGCVNRLHALMCAGGIMRVWEVERGEYLYSIRERVGEVDAIVADDRHVAVASASSTAQSIIHLWDFGAL
Searched with the Arabidopsis genes
>AT1G53160.1 tffamily=SBP LCQVDRCTADMKEAKLYHRRHKVCEVHAKASSVFLSGLNQRFCQQCSRFHDLQEFDEAKRSCRRRLAGHNERRRKSSGE >AT1G02065.2 tffamily=SBP RCQAEGCNADLSHAKHYHRRHKVCEFHSKASTVVAAGLSQRFCQQCSRFVPPKVATFDLF >AT2G42200.1 tffamily=SBP RCQVEGCGMDLTNAKGYYSRHRVCGVHSKTPKVTVAGIEQRFCQQCSRFHQLPEFDLEKRSCRRRLAGHNERRRKPQPA >AT1G27370.1 tffamily=SBP RCQIDGCELDLSSSKDYHRKHRVCETHSKCPKVVVSGLERRFCQQCSRFHAVSEFDEKKRSCRKRLSHHNARRRKPQGV >AT1G20980.1 tffamily=SBP MCQVDNCTEDLSHAKDYHRRHKVCEVHSKATKALVGKQMQRFCQQCSRFHLLSEFDEGKRSCRRRLAGHNRRRRKTTQP >AT5G18830.1 tffamily=SBP RCQVPDCEADISELKGYHKRHRVCLRCATASFVVLDGENKRYCQQCGKFHLLPDFDEGKRSCRRKLERHNNRRKRKPVD
Searched with the Arabidopsis genes
>AT1G19790.1 tffamily=SRS TVTRQGNMNCQDCGNQAKKDCPHMRCRTCCKSRGFDCQTHVKSTWVSAAKRRERQAQLAV >AT5G66350.1 tffamily=SRS SGSGSGGPSCQDCGNQSKKDCSHMRCRTCCKSRGLDCPTHVKSTWVPAAKRRERQQQLST >AT3G54430.1 tffamily=SRS DNNTVGEKVCRDCGNRAKKECLFERCRTCCKSRGYNCVTHVKSTWIPSSATR >AT2G21400.1 tffamily=SRS MMMIMGRKCEDCGNQAKKDCVYMRCRTCCKSKAFHCQTHIKSTWVPAYRRSHHKHQSQP
Searched with the Arabidopsis genes
>AT5G67480.2 tffamily=TAZ NDQRIYSQLYEAMEALVHICRDGCKTIGPHDKDFKPNHATCNYEACKGLESLIRHFAGCKLRVPGGCVHCKRMWQLLELHSRVCAGSDQCRVPLC >AT5G63160.1 tffamily=TAZ NLDNKSTCQAKPGPCSAFSTCYGLQLLIRHFAVCKKRVDGKGCVRCKRMIQLLRLHSSICDQSESCRVPLC >AT3G12980.1 tffamily=TAZ NTQTNQIQNAQLREVLLHVMTCCTAQCQYPRCRVIKGLIRHGLVCKTRGCIACKKMWSLFRLHSRNCRDPQCKVPKC >AT3G12980.1 tffamily=TAZ DCGLSYKNQRRWLLFLLHVRKCNAAEDNCESKYCFTAKTLLKHINCCKAPACAYQYCHQTRQLIHHYKHCGDEACPVCVF
Searched with TCP proteins from Arabidopsis
>AT3G27010.1 tffamily=TCP NQLGPKRSSNKDRHTKVEGRGRRIRMPALCAARIFQLTRELGHKSDGETIQWLLQQAEPSIIAATGSGTIPASALA >AT5G41030.1 tffamily=TCP YEKEKKKPNKDRHLKVEGRGRRVRLPPLCAARIYQLTKELGHKSDGETLEWLLQHAEPSILSATVNGIKPTESVV >AT1G35560.1 tffamily=TCP AKTPAKRPSKDRHIKVDGRGRRIRMPAICAARVFQLTRELQHKSDGETIEWLLQQAEPAIIAATGTGTIPANIST >AT1G69690.1 tffamily=TCP KKPPPKRTSTKDRHTKVEGRGRRIRMPAMCAARVFQLTRELGHKSDGETIEWLLQQAEPAVIAATGTGTIPANFTS >AT5G51910.1 tffamily=TCP TKPAPKRPTSKDRHTKVEGRGRRIRMPAGCAARVFQLTRELGHKSDGETIRWLLERAEPAIIEATGTGTVPAIAVS >AT1G30210.1 tffamily=TCP IIRVSRASGGKDRHSKVLTSKGLRDRRIRLSVATAIQFYDLQDRLGFDQPSKAVEWLINAASDSITDLPLLNTNFDHLD >AT3G02150.1| tffamily=TCP IVRVSRAFGGKDRHSKVCTLRGLRDRRVRLSVPTAIQLYDLQERLGVDQPSKAVDWLLDAAKEEIDELPPLPISPENFSI >AT5G60970.1 tffamily=TCP IVRVSRTFGGKDRHSKVCTVRGLRDRRIRLSVPTAIQLYDLQDRLGLSQPSKVIDWLLEAAKDDVDKLPPLQFPHGFNQ >AT3G45150.1 tffamily=TCP NNSQKARRTPKDRHLKIGGRDRRIRIPPSVAPQLFRLTKELGFKTDGETVSWLLQNAEPAIFAATGHGVTTTSNED >AT1G68800.1 tffamily=TCP EVQWRRTVKKRDRHSKICTAQGPRDRRMRLSLQIARKFFDLQDMLGFDKASKTIEWLFSKSKTSIKQLKERVAASEGGGK
Searched with the Arabidopsis genes
>AT1G76890.1 tffamily=Trihelix WPKTEVEALIRIRKNLEANYQENGTKGPLWEEISAGMRRLGYNRSAKRCKEKWENINKYFKKVK >AT5G28300.1 tffamily=Trihelix WPKDEVLALINIRRSISNMNDDDHKDENSLSTSSKAVPLWERISKKMLEIGYKRSAKRCKEKWENINKYFRKTK >AT5G47660.1 tffamily=Trihelix WPQEEVQALISSRSDVEEKTGINKGAIWDEISARMKERGYERSAKKCKEKWENMNKYYRRVT >AT1G76880.1 tffamily=Trihelix WPRQETLALLKIRSDMGIAFRDASVKGPLWEEVSRKMAEHGYIRNAKKCKEKFENVYKYHKRTK >AT3G10000.1 tffamily=Trihelix WPRQETLMLLEVRSRLDHKFKEANQKGPLWDEVSRIMSEEHGYTRSGKKCREKFENLYKYYKKTK >AT5G28300.1 tffamily=Trihelix WCSDEVLALLRFRSTVENWFPEFTWEHTSRKLAEVGFKRSPQECKEKFEEEERRYFNSN >AT5G03680.1 tffamily=Trihelix WGEQEILKLMEIRTSMDSTFQEILGGCSDEFLWEEIAAKLIQLGFDQRSALLCKEKWEWISNGMRKEK >AT3G25990.1 tffamily=Trihelix WAQDETRTLISLRREMDNLFNTSKSNKHLWEQISKKMREKGFDRSPSMCTDKWRNILKEFKKAK >AT1G76870.1 tffamily=Trihelix WMDKMVKLMITALSYIGEDSGSDKKFAVLQKKGKWRSVSKVMDERGYHVSPQQCEDKFNDLNKRYKKLN >AT3G10040.1 tffamily=Trihelix WTDTMVRLLIMAVFYIGDEAGLNDPVDAMLQKKGKWKSVSRAMVEKGFSVSPQQCEDKFNDLNKRYKRVN >AT3G11100.1 tffamily=Trihelix WSEDATATLIEAWGDRYVNLNRGNLRQNDWKEVADAVNSSHGNGRPKTDVQCKNRIDTLKKKYKTEK >AT1G54060.1 tffamily=Trihelix WSEEATKVLIEAWGDRFSEPGKGTLKQQHWKEVAEIVNKSRQCKYPKTDIQCKNRIDTVKKKYKQEK >AT3G24490.1 tffamily=Trihelix WREQEAFVLLEVWGDRFLQLGRRSLRNEDWNEVAEKVSEELRMEKSETQCRRMIDDLKRKYRKEK >AT2G33550.1 tffamily=Trihelix WTRQEILVLIQGKRVAENRVRRGRAAGMALGSGQMEPKWASVSSYCKRHGVNRGPVQCRKRWSNLAGDYKKIK
Searched with eight TUBBY proteins from Arabidopsis (110-395)
>AT2G47900.1 tffamily=TLP PGPRGSLVQCYIMRNRSNQTYYLYLGLNQAASNDDGKFLLAAKRFRRPTCTDYIISLNCDDVSRGSNTYIGKLRSNFLGTKFTVYDAQPTNPGTQVTRTRSSRLLSLKQVSPRIPSGNYPVAHISYELNVLGSRGPRRMQCVMDAIPASAVEPGGTAPTQTELVHSNLDSFPSFSFFRSKSIRAESLPSGPSSAAQKEGLLVLKNKAPRWHEQLQCWCLNFNGRVTVASVKNFQLVAAPENGPAGPEHENVILQFGKVGKDVFTMDYQYPISAFQAFTICLSSFDTKIACE >AT3G06380.1 tffamily=TLP SGPRDSLVQCFIKRNRNTQSYHLYLGLTTSLTDNGKFLLAASKLKRATCTDYIISLRSDDISKRSNAYLGRMRSNFLGTKFTVFDGSQTGAAKMQKSRSSNFIKVSPRVPQGSYPIAHISYELNVLGSRGPRRMRCIMDTIPMSIVESRGVVASTSISSFSSRSSPVFRSHSKPLRSNSASCSDSGNNLGDPPLVLSNKAPRWHEQLRCWCLNFHGRVTVASVKNFQLVAVSDCEAGQTSERIILQFGKVGKDMFTMDYGYPISAFQAFAICLSSFETRIACE >AT1G53320.1 tffamily=TLP PGPRDFSNQCLIKRNKKTSTFYLYLALTPSFTDKGKFLLAARRFRTGAYTEYIISLDADDFSQGSNAYVGKLRSDFLGTNFTVYDSQPPHNGAKPSNGKASRRFASKQISPQVPAGNFEVGHVSYKFNLLKSRGPRRMVSTLRCPSPSPSSSSAGLSSDQKPCDVTKIMKKPNKDGSSLTILKNKAPRWHEHLQCWCLNFHGRVTVASVKNFQLVATVDQSQPSGKGDEETVLLQFGKVGDDTFTMDYRQPLSAFQAFAICLTSFGTKLACE >AT1G25280.2|S1|E267 tffamily=TLP MDADNISRSSNSYLGKLRSNFLGTKFLVYDTQPPPNTSSSALITDRTSRSRFHSRRVSPKVPSGSYNIAQITYELNVLGTRGPRRMHCIMNSIPISSLEPGGSVPNQPEKLVPAPYSLDDSFRSNISFSKSSFDHRSLDFSSSRFSEMGISCDDNEEEASFRPLILKNKQPRWHEQLQCWCLNFRGRVTVASVKNFQLVAARQPQPQGTGAAAAPTSAPAHPEQDKVILQFGKVGKDMFTMDYRYPLSAFQAFAICLSSFDTKLACE >AT1G76900.1 tffamily=TLP PGPRDATMQCFIKRDKSNLTYHLYLCLSPALLVENGKFLLSAKRIRRTTYTEYVISMHADTISRSSNTYIGKIRSNFLGTKFIIYDTQPAYNSNIARAVQPVGLSRRFYSKRVSPKVPSGSYKIAQVSYELNVLGTRGPRRMHCAMNSIPASSLAEGGTVPGQPDIIVPRSILDESFRSITSSSSRKITYDYSNDFSSARFSDILGPLSEDQEEGKERNSPPLVLKNKPPRWHEQLQCWCLNFRGRVTVASVKNFQLIAANQPQPQPQPQPQPQPLTQPQPSGQDKIILQFGKVGKDMFTMDFRYPLSAFQAFAICLSSFDTKLACE >AT2G18280.1 tffamily=TLP PGPRDSPIQCFIKRNRATATYILYYGLMPSETENDKLLLAARRIRRATCTDFIISLSAKNFSRSSSTYVGKLRSGFLGTKFTIYDNQTASSTAQAQPNRRLHPKQAAPKLPTNSSTVGNITYELNVLRTRGPRRMHCAMDSIPLSSVIAEPSVVQGIEEEVSSSPSPKGETITTDKEIPDNSPSLRDQPLVLKNKSPRWHEQLQCWCLNFKGRVTVASVKNFQLVAEIDASLDAPPEEHERVILQFGKIGKDIFTMDYRYPLSAFQAFAICISSFDTKPACE >AT1G47270.1 tffamily=TLP PGPRDAPIQCFIKRERATGIYRLYLGLSPALSGDKSKLLLSAKRVRRATGAEFVVSLSGNDFSRSSSNYIGKLRSNFLGTKFTVYENQPPPFNRKLPPSMQVSPWVSSSSSSYNIASILYELNVLRTRGPRRMQCIMHSIPISAIQEGGKIQSPTEFTNQGKKKKKPLMDFCSGNLGGESVIKEPLILKNKSPRWHEQLQCWCLNFKGRVTVASVKNFQLVAAAAEAGKNMNIPEEEQDRVILQFGKIGKDIFTMDYRYPISAFQAFAICLSSFDTKPVCE >AT1G16070.1 tffamily=TLP GRCTCLIVKEQSPEGLSHGSVYSLYTHEGRGRKDRKLAVAYHSRRNGKSIFRVAQNVKGLLCSSDESYVGSMTANLLGSKYYIWDKGVRVGSVGKMVKPLLSVVIFTPTITTWTGSYRRMRTLLPKQQPMQKNNNKQVQQASKLPLDWLENKEKIQKLCSRIPHYNKISKQHELDFRDRGRTGLRIQSSVKNFQLTLTETPRQTILQMGRVDKARYVIDFRYPFSGYQAFCICLASIDSKLCCT
>AT2G20825.1 Arabidopsis ULT family transcription factor, protein sequence 228AA tffamily=ULT MERECGSKELFSKEELQEISGVHVGDDYVEVMCGCTSHRYGDAVARLKIFSDGELQITCQCTPACLEDKLTPAAFEKHSE RETSRNWRNNVWVFIEGDKVPLSKTVLLRYYNKALKNSNVSKVIHRDEFVGCSTCGKERRFRLRSRGECRMHHDAIAEPN WKCCDYPYDKITCEEEEERGSRKVFRGCTRSPSCKGCTSCVCFGCKLCRFSDCNCQTCLDFTTNAKPI >AT4G28190.1 Arabidopsis ULT family transcription factor, protein sequence 237AA tffamily=ULT MANNEGEMQCGSMLFKQEELQEMSGVNVGGDYVEVMCGCTSHRYGDAVARLRVFPTGDLEITCECTPGCDEDKLTPAAFE KHSGRETARKWKNNVWVIIGGEKVPLSKTVLLKYYNESSKKCSRSNRSQGAKVCHRDEFVGCNDCGKERRFRLRSRDECR LHHNAMGDPNWKCSDFPYDKITCEEEEERGSRKVYRGCTRSPSCKGCTSCVCFGCELCRFSECTCQTCVDFTSNVKA
Searched with complete sequences of: AT1G28520.1, AT2G42400.1
>AT1G28520.1 Arabidopsis VOZ family transcription factor, protein sequence 486AA tffamily=VOZ MTGKRSKTNCRSASHKLFKDKAKNRVDDLQGMLLDLQFARKESRPTDVTLLEEQVNQMLREWKSELNEPSPASSLQQGGT LGSFSSDICRLLQLCDEEDDATSKLAAPKPEPADQNLEAGKAAVFQRGYNLVQGKSEHGLPLVDNCKDLSLAAGNNFDGT APLEYHQQYDLQQEFEPNFNGGFNNCPSYGVVEGPIHISNFIPTICPPPSAFLGPKCALWDCPRPAQGFDWFQDYCSSFH AALAFNEGPPGMNPVVRPGGIGLKDGLLFAALSAKAGGKDVGIPECEGAATAKSPWNAPELFDLTVLESETLREWLFFDK PRRAFESGNRKQRSLPDYNGRGWHESRKQIMVEFGGLKRSYYMDPQPLHHFEWHLYEYEINKCDACALYRLELKLVDGKK TSKGKVSNDSVADLQKQMGRLTAEFPPENNTTNTTNNNKRCIKGRPKVSTKVATGNVQNTVEQANDYGVGEEFNYLVGNL SDYYIP >AT2G42400.1 Arabidopsis VOZ family transcription factor, protein sequence 450AA tffamily=VOZ MSNHPKITSAHQNVEEKLRELQERFCHLQAARKEGRHGDLALLEAQISQNIREWQAELTAPSPESSLLGEGISQFLEEFA PLLKLDEEDDATSTLKEHAGAKPDPEGFSQSLCPPEWTSENFSQSPFNGNFSCGFEDALNSTETHGQQLHYGYEGFDPSI NSAPDFHDQKLSSNLDITSQYDYIFSEVRQELDNSPSTKLDSSEEIDNFAEFSTPSSVRVPPSAFLGPKCALWDCTRPAQ GSEWYLDYCSNYHGTLALNEDSPGTAPVLRPGGISLKDNLLIDALRAKTQGKNVGIPVCEGAVNTKCPWNAAELFHLELV EGETIREWLFFDKPRRAYDSGNRKQRSLPDYSGRGWHESRKQLMKEQEGQKRSYYMDPQPPGPFEWHLFEYQINESDACA LYRLELKVGNGKKSPKGKISKDPLADLQKKMGQFKVASDKPSPPTKGRKE
Searched with complete sequences of: AT1G14410.1, AT2G02740.1
>AT1G14410.1 Arabidopsis Whirly family transcription factor, protein sequence 263AA tffamily=Whirly MSQLLSTPLMAVNSNPRFLSSSSVLVTGGFAVKRHGFALKPTTKTVKLFSVKSRQTDYFEKQRFGDSSSSPSPAEGLPAR FYVGHSIYKGKAALTVDPRAPEFVALDSGAFKLSKDGFLLLQFAPSAGVRQYDWSKKQVFSLSVTEIGTLVSLGPRESCE FFHDPFKGKSDEGKVRKVLKVEPLPDGSGHFFNLSVQNKLVNVDESIYIPITRAEFAVLISAFNFVLPYLIGWHAFANSI KPEETSRVNNASPNYGGDYEWNR >AT2G02740.1 Arabidopsis Whirly family transcription factor, protein sequence 268AA tffamily=Whirly MSQLLSSPPMAVFSKTFINHKFSDARFLSSHSILTSGGFAGKIIPLKPTARLKLTVKSRQSDYFEKQRFGDSSSSQNAEV SSPRFYVGHSIYKGKAALTIEPRAPEFVALESGAFKLTKEGFLLLQFAPAAGVRQYDWSRKQVFSLSVTEIGNLVSLGPR ESCEFFHDPFKGKGSDEGKVRKVLKVEPLPDGSGRFFNLSVQNKLLNVDESVYIPITKAEFAVLISAFNFVLPHLIGWSA FANSIKPEDSNRLNNASPKYGGDYEWSR
Searched with a representative domain from each sub family plus a number of others from characterized genes.
>WRKY_search_1 I N-terminal tffamily=WRKY DGYNWRKYGQKLVKGNEFVRSYYRCTHPNCKAKKQLERSAGGQVVDTVYFGEHDH >WRKY_search_2 I C-terminal tffamily=WRKY DGYRWRKYGQKSVKGSPYPRSYYRCSSPGCPVKKHVERSSHDTKLLITTYEGKHDH >WRKY_search_3 IIa tffamily=WRKY DGYQWRKYGQKVTRDNPSPRAYFKCACAPSCSVKKKVQRSVEDQSVLVATYEGEHNH >WRKY_search_4 IIb tffamily=WRKY DGCQWRKYGQKMAKGNPCPRAYYRCTMATGCPVRKQVQRCAEDRSILITTYEGNHNH >WRKY_search_5 IIc tffamily=WRKY DDGYRWRKYGQKVVKNTQHPRSYYRCTQDKCRVKKRVERLADDPRMVITTYEGRHLH >WRKY_search_6 IId tffamily=WRKY DEFSWRKYGQKPIKGSPHPRGYYKCSSVRGCPARKHVERALDDAMMLIVTYEGDHNH >WRKY_search_7 IIe tffamily=WRKY DVWAWRKYGQKPIKGSPYPRGYYRCSTSKGCLARKQVERNRSDPKMFIVTYTAEHNH >WRKY_search_8 III tffamily=WRKY DDGFSWRKYGQKDILGAKFPRGYYRCTYRKSQGCEATKQVQRSDENQMLLEISYRGIHSC >WRKY_search_9 WIZZ tffamily=WRKY KDGYQWRKYGQKVTRDNPSPRAYFRCSFAPGCPVKKKVQRSIEDQSVVVATYEGEHNH >WRKY_search_10 AtWRKY1 N-terminal tffamily=WRKY DGYNWRKYGQKQVKGSENPRSYYKCTFPNCPTKKKVERNLDGHITEIVYKGNHNH >WRKY_search_11 AtWRKY1 C-terminal tffamily=WRKY DGYRWRKYGQKVAKGNPNPRSYYKCTFTGCPVRKHVERASHDLRAVITTYEGKHNH >WRKY_search_12 AtWRKY2 N-terminal tffamily=WRKY DGYNWRKYGQKQVKGSENPRSYYKCTFPNCPTKKKVERSLDGQITEIVYKGNHNH >WRKY_search_13 AtWRKY2 C-terminal tffamily=WRKY DGYRWRKYGQKVVKGNPNPRGYYKCTSPGCPVRKHVERASQDIRSVITTYEGKHNH >WRKY_search_14 AtWRKY3 tffamily=WRKY DEYSWRKYGQKPIKGSPYPRGYYKCSSVRGCPARKHVERAMDDPAMLIVTYEGEHRH >WRKY_search_15 SUSIBA2 N-terminal tffamily=WRKY DGYNWRKYGQKHVKGSENPRSYYKCTHPNCEVKKLLERAVDGLITEVVYKGRHNH >WRKY_search_16 SUSIBA2 C-terminal tffamily=WRKY DGYRWRKYGQKVVKGNPNPRSYYKCTSTGCPVRKHVERASHDPKSVITTYEGKHNH >WRKY_search_17 AtWRKY4 N-terminal tffamily=WRKY DGYNWRKYGQKQVKGSEYPRSYYKCTHPNCPVKKKVERSHEGHITEIIYKGAHNH >WRKY_search_18 AtWRKY4 C-terminal tffamily=WRKY DGYRWRKYGQKVVKGNPNPRSYYKCTSAGCNVRKHVERASHDLKSVITTYEGKHNH >WRKY_search_19 ACRE126 tffamily=WRKY DGCQWRKYGQKISRGNPCPRSYYRCSVAPLCPVRKQVQRCVEDMSVLITTYEGTHNH
Searched with the Arabidopsis genes
>AT2G45190.1 tffamily=YABBY PDHFSPSDHLCYVQCNFCQTILAVNVPYTSLFKTVTVRCGCCTNLLSVNMRSYVLPASNQLQLQLGPHSYFNPQDILEELRDAPSNMNMMMMNQHPTMNDIPSFMDLHQQHEIPKAPPVNRPPEKRQRVPSAYNRFIKEEIQRIKAGNPDISHREAFSAAAKNWAHFPHIHFGL >AT4G00180.1 tffamily=YABBY PDHFSSTDQLCYVHCSFCDTVLAVSVPPSSLFKTVTVRCGHCSNLLSVTVSMRALLLPSVSNLGHSFLPPPPPPPPPNLLEEMRSGGQNINMNMMMSHHASAHHPNEHLVMATRNGQEMPRPPPANRPPEKRQRVPSAYNRFIKEEIQRIKAGNPDISHREAFSAAAKNWAHFPHIHFGL >AT2G26580.1 tffamily=YABBY ANSVMATEQLCYIPCNFCNIILAVNVPCSSLFDIVTVRCGHCTNLWSVNMAAALQSLSRPNFQATNYAVPEYGSSSRSHTKIPSRISTRTITEQRIVNRPPEKRQRVPSAYNQFIKEEIQRIKANNPDISHREAFSTAAKNWAHFPHIHFGL >AT1G23420.1 tffamily=YABBY NHLFDLPGQICHVQCGFCTTILLVSVPFTSLSMVVTVRCGHCTSLLSVNLMKASFIPLHLLASLSHLDETGKEEVAATDGVEEEAWKVNQEKENSPTTLVSSSDNEDEDVSRVYQVVNKPPEKRQRAPSAYNCFIKEEIRRLKAQNPSMAHKEAFSLAAKNWAHFPPAHNKR >AT1G69180.1 tffamily=YABBY SRASPQAEHLYYVRCSICNTILAVGIPLKRMLDTVTVKCGHCGNLSFLTTTPPLQGHVSLTLQMQSFGGSDYKKGSSSSSSSSTSSDQPPSPSPPFVVKPPEKKQRLPSAYNRFMRDEIQRIKSANPEIPHREAFSAAAKNWAKY
Searched with Arabidopsis protein domains.
>AT1G14440.1 tffamily=ZF-HD KPMIKYKECLKNHAAAMGGNATDGCGEFMPSGEDGSIEALTCSACNCHRNFHRKEVEG >AT1G75240.1 tffamily=ZF-HD KPTVRYRECLKNHAASVGGSVHDGCGEFMPSGEEGTIEALRCAACDCHRNFHRKEMDG >AT3G50890.1 tffamily=ZF-HD DQGAKYRECQKNHAASTGGHVVDGCCEFMAGGEEGTLGALKCAACNCHRSFHRKEVYG >AT3G28917.1 tffamily=ZF-HD VRTVRYGECQKNHAAAVGGYAVDGCREFMASRGEEGTVAALTCAACGCHRSFHRREIET >AT5G42780.1 tffamily=ZF-HD THKPHYYECRKNHAADIGTTAYDGCGEFVSSTGEEDSLNCAACGCHRNFHREELIP >AT5G60480.1 tffamily=ZF-HD MVVLYNECLKNHAVSLGGHALDGCGEFTPKSTTILTDPPSLRCDACGCHRNFHRRSPSD >AT5G39760.1 tffamily=ZF-HD PLLFTYKECLKNHAAALGGHALDGCGEFMPSPSSISSDPTSLKCAACGCHRNFHRRDPDN >AT1G14687.1 tffamily=ZF-HD QSTCVYRECMRNHAAKLGSYAIDGCREYSQPSTGDLCVACGCHRSYHRRIDVI
Searched with the Arabidopsis genes
>AT1G17380.1 tffamily=ZIM SQPGSSQLTIFFGGKVLVYNEFPVDKAKEIMEVAKQ >AT1G19180.1 tffamily=ZIM PESQTAPLTIFYAGQVIVFNDFSAEKAKEVINLASK >AT5G13220.1 tffamily=ZIM LVSGTVPMTIFYNGSVSVFQVSRNKAGEIMKVANE >AT1G51600.1 tffamily=ZIM GSEQGDQLTLSFQGQVYVFDSVLPEKVQAVLLLLGG >AT3G43440.1 tffamily=ZIM EPDASTQLTIIFGGSCRVFNGVPAQKVQEIIRIAFA >AT3G43440.1 tffamily=ZIM SMILPSQLTIIFGGSFSVFDGIPAEKVQEILHIAAA >AT1G30135.1 tffamily=ZIM PNEESQRITIFYNGKMCFSSDVTHLQARSIISIASR >AT2G34600.1 tffamily=ZIM PKQESQILTIFYNGHMCVSSDLTHLEANAILSLASR
Paul J Rushton
Marta T. Bokowiec
Xianfeng (Jeff) Chen
Thomas (Tom) W Laudeman
Jennifer F. Brannock
Michael P. Timko