1) ABI-VP1

Searched with the B3 domain from VP1 (from Zea mays) and ABI3. B3 domains are also found in RAV (AP2 family) factors but only 6 of 147 Arabidopsis ERF genes are of this type.

>ABI3 B3 tffamily=ABI3-VP1
drrqgwkpeknlrfllqkvlkqsdvgnlgrivlpkkeaethlpeleardg
islamedigtsrvwnmryrfwpnnksrmyllentgdfvktnglqegdfiv
iysdvkcgkylirgvkvrq

>VP1 B3 tffamily=ABI3-VP1
dkrqgakadknlrfllqkvlkqsdvgslgrivlpkkeaevhlpelktrdg
isipmedigtsrvwnmryrfwpnnksrmyllentgefvrsnelqegdfiv
iysdvksgkylirgvkv

2) Alfin

These proteins are related to the PHD finger proteins and a subgroup may not be merited, especially as the seven Arabidopsis proteins are described as PHD finger family proteins.

Searched with the Arabidopsis genes

>AT1G14510.1 Arabidopsis Alfin family transcription factor, protein sequence 252AA tffamily=Alfin
MEGIQHPIPRTVEEVFSDFRGRRAGLIKALSTDVQKFYHQCDPEKENLCLYGLPNETWEVNLPVEEVPPELPEPALGINF
ARDGMQEKDWISLVAVHSDSWLISVAFYFGARFGFGKNERKRLFQMINDLPTIFEVVTGNAKQSKDQSANHNSSRSKSSG
GKPRHSESHTKASKMSPPPRKEDESGDEDEDDEQGAVCGACGDNYGGDEFWICCDACEKWFHGKCVKITPAKAEHIKHYK
CPSCTTSKKMKA

>AT2G02470.1 Arabidopsis Alfin family transcription factor, protein sequence 256AA tffamily=Alfin
MEGITHPIPRTVEEVFSDFRGRRAGLIKALTNDMVKFYQTCDPEKENLCLYGLPNETWEVNLPVEEVPPELPEPALGINF
ARDGMQEKDWVSLVAVHSDSWLLSVAFYFGARFGFGKNERKRLFQMINELPTIFEVVSGNAKQSKDLSVNNNNSKSKPSG
VKSRQSESLSKVAKMSSPPPKEEEEEEDESEDESEDDEQGAVCGACGDNYGTDEFWICCDACEKWFHGKCVKITPAKAEH
IKHYKCPTCSNKRARP

>AT3G11200.1 Arabidopsis Alfin family transcription factor, protein sequence 246AA tffamily=Alfin
MAAAAVSSNPRTVEEIFKDYSARRAALLRALTKDVDDFYSQCDPEKENLCLYGHPNESWEVNLPAEEVPPELPEPALGIN
FARDGMQRKDWLSLVAVHSDCWLLSVSFYFGARLNRNERKRLFSLINDLPTLFDVVTGRKAMKDNKPSSDSGSKSRNGTK
RSIDGQTKSSTPKLMEESYEEEEEEDEHGDTLCGSCGGHYTNEEFWICCDVCERWYHGKCVKITPAKAESIKQYKCPPCC
AKKGRQ

>AT3G11200.2 Arabidopsis Alfin family transcription factor, protein sequence 233AA tffamily=Alfin
MRSGYERFRLLDTLLCVLLRFDFNFWVFVVIEKENLCLYGHPNESWEVNLPAEEVPPELPEPALGINFARDGMQRKDWLS
LVAVHSDCWLLSVSFYFGARLNRNERKRLFSLINDLPTLFDVVTGRKAMKDNKPSSDSGSKSRNGTKRSIDGQTKSSTPK
LMEESYEEEEEEDEHGDTLCGSCGGHYTNEEFWICCDVCERWYHGKCVKITPAKAESIKQYKCPPCCAKKGRQ

>AT3G42790.1 Arabidopsis Alfin family transcription factor, protein sequence 250AA tffamily=Alfin
MEGGAALYNPRTVEEVFKDFKGRRTAIVKALTTDVQEFYQQCDPEKENLCLYGLPNEEWEVNLPAEEVPPELPEPALGIN
FARDGLSEKEWLSLVAIHSDAWLLSVSFYFGSRFSFHKEERKRLFNMINDVPTIFEVVTGMAKAKDKSSAANQNGNKSKS
NSKVRTSEGKSSKTKQPKEEDEEIDEDDEDDHGETLCGACGDSDGADEFWICCDLCEKWFHGKCVKITPARAEHIKQYKC
PSCSNKRARA

>AT5G05610.1 Arabidopsis Alfin family transcription factor, protein sequence 241AA tffamily=Alfin
MAAESSNPRTVEEIFKDFSGRRSGFLRALSVDVDKFYSLCDPEMENLCLYGHPNGTWEVNLPAEEVPPELPEPALGINFA
RDGMQRKDWLSLVAVHSDCWLLSVSSYFGARLNRNERKRLFSLINDLPTLFEVVTGRKPIKDGKPSMDLGSKSRNGVKRS
IEGQTKSTPKLMEESYEDEDDEHGDTLCGSCGGNYTNDEFWICCDVCERWYHGKCVKITPAKAESIKQYKCPSCCTKKGR
Q

>AT5G05610.2 Arabidopsis Alfin family transcription factor, protein sequence 241AA tffamily=Alfin
MAAESSNPRTVEEIFKDFSGRRSGFLRALSVDVDKFYSLCDPEMENLCLYGHPNGTWEVNLPAEEVPPELPEPALGINFA
RDGMQRKDWLSLVAVHSDCWLLSVSSYFGARLNRNERKRLFSLINDLPTLFEVVTGRKPIKDGKPSMDLGSKSRNGVKRS
IEGQTKSTPKLMEESYEDEDDEHGDTLCGSCGGNYTNDEFWICCDVCERWYHGKCVKITPAKAESIKQYKCPSCCTKKGR
Q

>AT5G20510.1 Arabidopsis Alfin family transcription factor, protein sequence 260AA tffamily=Alfin
MEGGTAHYSPRTVEEVFRDFKGRRAGIIQALTTDVEDFFQQCDPEKQNLCLYGFPNEVWEVNLPAEEVPPELPEPALGIN
FARDGMQERNWLSLVAVHSDAWLLSVSFYFGSRFGFDRADRKRLFSMINEVPTVYEVVTGNAEKQTKEMPSSANQNGNRS
KSNSKMRGLESKSSKTIHAKDEEEGLELEEGEEEEDEDEDEHGETLCGACGDNYASDEFWICCDMCEKWFHGECVKITPA
RAEHIKHYKCPTCSNKRARP

>AT5G26210.1 Arabidopsis Alfin family transcription factor, protein sequence 255AA tffamily=Alfin
MEAGGAYNPRTVEEVFRDFKGRRAGMIKALTTDVQEFFRLCDPEKENLCLYGHPNEHWEVNLPAEEVPPELPEPVLGINF
ARDGMAEKDWLSLVAVHSDAWLLAVAFFFGARFGFDKADRKRLFNMVNDLPTIFEVVAGTAKKQGKDKSSVSNNSSNRSK
SSSKRGSESRAKFSKPEPKDDEEEEEEGVEEEDEDEQGETQCGACGESYAADEFWICCDLCEMWFHGKCVKITPARAEHI
KQYKCPSCSNKRARS
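
The Alfin headers above declare a length (for example 252AA). A minimal sketch, assuming records keep that header layout, of how the declared length could be compared with the sequence actually listed. The function names are illustrative and are not part of the original search procedure. The input is assumed to be FASTA records only, with section headings and notes removed.

```python
import re


def parse_fasta(text):
    """Yield (header, sequence) pairs from FASTA-formatted text."""
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif line and header is not None:
            # Remove internal whitespace so wrapped lines join cleanly.
            chunks.append("".join(line.split()))
    if header is not None:
        yield header, "".join(chunks)


def declared_length(header):
    """Return N from an 'NAA' token in the header, or None if absent."""
    match = re.search(r"\b(\d+)AA\b", header)
    return int(match.group(1)) if match else None


def length_mismatches(text):
    """List (header, declared, actual) for records whose lengths disagree."""
    mismatches = []
    for header, seq in parse_fasta(text):
        declared = declared_length(header)
        if declared is not None and declared != len(seq):
            mismatches.append((header, declared, len(seq)))
    return mismatches
```

Headers without a length token (as in some later sections) are skipped rather than reported.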


3) AP2

AP2 domain
This contains the AP2-like domains but excludes the ERFs. Typically, they have two AP2 domains and lack the conserved WLG and AEIRD motifs of ERFs; instead, they have a conserved YLG. Searches used the five AP2 repeat sequences below.

R1 repeat contains WESHI or YEAH at the N-terminal end and LAALKY or YDRAA at the C-terminal end.
R2 repeat contains WEAR or WQAR at the N-terminal end and YDIAAI or NAVT at the C-terminal end.
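
The two motif rules above can be written as a simple check. This is a sketch only: it labels a sequence R1 or R2 when one of the listed start motifs occurs before one of the listed end motifs. The function and constant names are hypothetical, and exact substring matching is an assumption (diverged repeats would be missed).

```python
# Motifs taken from the two rules stated in the text.
R1_START = ("WESHI", "YEAH")
R1_END = ("LAALKY", "YDRAA")
R2_START = ("WEAR", "WQAR")
R2_END = ("YDIAAI", "NAVT")


def _ordered_hit(seq, starts, ends):
    """True if any start motif begins before the last end motif in seq."""
    start_pos = [seq.find(m) for m in starts if m in seq]
    end_pos = [seq.rfind(m) for m in ends if m in seq]
    return bool(start_pos) and bool(end_pos) and min(start_pos) < max(end_pos)


def classify_repeat(seq):
    """Return the repeat labels ('R1', 'R2') whose motif pair is present."""
    seq = seq.upper()
    labels = []
    if _ordered_hit(seq, R1_START, R1_END):
        labels.append("R1")
    if _ordered_hit(seq, R2_START, R2_END):
        labels.append("R2")
    return labels
```

Applied to the query sequences in this section, the rules label search_1, search_2 and search_5 as R1 and search_3 and search_4 as R2.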

>AP2 search_1 tffamily=AP2
SSVHRGVTRHRWTGRYEAHLWDKNSWNETQTKKGRQVYLGAYDEEDAAARAYDLAALKYWGRDTILNFP

>AP2 search_2 tffamily=AP2
TSIYRGVTRHRWTGRYEAHLWDNSCRREGQSRKGRQVYLGGYDKEEKAARAYDLAALKYRGLNAVTNFE

>AP2 search_3 tffamily=AP2
VSKYRGVAKHHHNGRWEARIGRVFGNKYLYLGTYATQEEAAIAYDIAAIEYRGLNAVTNFDISRYL

>AP2 search_4 tffamily=AP2
ASIYRGVTRHHQHGRWQARIGRVAGNKDLYLGTFGTQEEAAEAYDVAAIKFRGTNAVTNFDITRYD

>AP2 search_5 tffamily=AP2
SSQYRGVTFYRRTGRWESHIWDCGKQVYLGGFDTAHAAARAYDRAAIKFRGVDADINFDIEDYL

4) ARF


Auxin response factors (ARFs) contain an amino-terminal DNA-binding domain, which has some sequence similarity to the B3 domain found in maize VP1. They also contain a carboxyl-terminal domain related to motifs III and IV found in the CTDs of Aux/IAA proteins. However, they also have a conserved central region.
Searched with a conserved region of auxin-responsive transcription factors, pfam06507, found in the middle of the protein.

>AT5G20730.1 tffamily=ARF
AAHANANNSPFTIFYNPRWAAPAEFVVPLAKYTKAMYAQVSLGMRFRMIFETEECGVRRYMGTVTGISDLDPVRWKNSQWRNLQ

>AT1G34390.1 tffamily=ARF
AKHAFDNQCMFIVVYKPRSSQFIVSYDKFLDAVNNKFNVGSRFTMRFEGDDFSERRYFGTIIGVSDFSPHWKCSEWRNLE

>AT1G34170.1 tffamily=ARF
VVNAFKTKCMFNVVYKPSSSQFVISYDKFVDAMNNNYIVGSRFRMQFEGKDFSEKRYDGTIIGVNDMSPHWKDSEWRSLK

>AT1G59750.1 tffamily=ARF
AAHAITTGTIFSVFYKPRTSRSEFIVSVNRYLEAKTQKLSVGMRFKMRFEGEEAPEKRFSGTIVGVQENKSSVWHDSEWRSLK

>AT2G46530.1 tffamily=ARF
ASHAVTTTTIFVVFYKPRISQFIISVNKYMMAMKNGFSLGMRYRMRFEGEESPERIFTGTIIGSGDLSSQWPASKWRSLQ

>AT2G33860.1 tffamily=ARF
VAHAISTHSVFSISYNPKASWSNFIIPAPKFLKVVDYPFCIGMRFKARVESEDASERRSPGIISGISDLDPIRWPGSKWRCLL

>AT1G77850.1 tffamily=ARF
AINRASQGLPFEVVFYPAAGWSEFVVRAEDVESSMSMYWTPGTRVKMAMETEDSSRITWFQGIVSSTYQETGPWRGSPWKQLQ

>AT4G30080.1 tffamily=ARF
AATLAISGRPFEVVYYPRASTSEFCVKALDARAAMRIPWCSGMRFKMAFETEDSSRISWFMGTVSAVNVSDPIRWPNSPWRLLQ

5) ARID

Searched with the Arabidopsis genes

>AT1G76510.1 tffamily=ARID
AGAPQDQEAFIKEVEAFNKENFLEFKAPKFYGQPLNCLKLWRAVIKLGGYDVVTTSKLWRQVGESFHPPKTCTTVSWTFRIFYEKALLEYEKHLRQNGELNLPGSASL

>AT1G20910.1 tffamily=ARID
EAGTPVEQVAFLREVEAFYKESFLEFKPPKFYGQPLNILKLWRAVVNLGGYEVVTTNKLWRQVGESFNPPKTCTTVSYTFRNFYEKALLEYEKCLRNNGELNLPGS

>AT2G17410.1 tffamily=ARID
SGTEEDQSAFMKELDSFFRERNMDFKPPKFYGEGLNCLKLWRAVTRLGGYDKVTGSKLWRQVGESFRPPKTCTTVSWTFRGFYEKALLEYERHKVSEGELQIPLPLE

>AT1G04880.1 tffamily=ARID
EAVVADPRLFMTSLERLHSLLGTKFMVPIIGGRDLDLHKLFVEVTSRGGINKILNERRWKEVTATFVFPPTATNASYVLRKYYFSLLNNYEQIYFFRSNGQIPPDSMQ

>AT1G55650.1 tffamily=ARID
QDIVRNPELFWEMLRDFHESSDKKFKIPIVGGKSLDLHRLFNEVTSRGGLEKVIKDRRCKEVIDAFNFKTTITNSAFVLRKSYLKMLFEFEHLYYFQAPLSTFWEKEK

>AT2G46040.1 tffamily=ARID
ELISLFRPLLDSFLAEFCSADGFLPLPAMTGEGRTVDLFNLFLNVTHKGGFDAVSENGSWDEVVQESGLESYDSASAKLIYVKYLDAFGRWLNRVVAGDTDVSSVE

>AT4G11400.1 tffamily=ARID
ECEERLRRLFDQALLVFLEEEGSIKPLPAVIGDGKNVDLFKLFVLVREREGFDTVSRKRLWEVVAEKLGFDCSLVPSLILIYLKYLNRMEKWAVEESRIVNWDNKDSEK

6) AS2

Searched with 15 full-length Arabidopsis proteins.

>AT1G36000.1 Arabidopsis AS2 family transcription factor, protein sequence 122AA tffamily=AS2
MEPLGNRRPCSVCITKNRNCPRFCEYAEYFPYELQSQYESANELFGTPNIITMMQHAPEEKKQMLATSIIMEGNAWTEDP
ISGGFGMIQKLMWKIMLHKAYLRELQEKIKEEKEKKPASSLY

>AT2G19510.1 Arabidopsis AS2 family transcription factor, protein sequence 120AA tffamily=AS2
MEPLGDRRPCCVCITKNRNCPRFCEYAEYFPYELRSHYESTNELFGTPKIIKMMRHAPEEKKQMLATSIIMEGNAWTNDP
VSGGFGMVQKIMWKIMLHKAYLHELEEKIKEEKEKIELHL

>AT1G06280.1 Arabidopsis AS2 family transcription factor, protein sequence 206AA tffamily=AS2
MMQRNSNNTSITSNISNNSSSHQACASCKHQRKKCNNECILSPYFPARKTKEFQAVHKVFGVSNVQKMVRTVREEDRTKL
SDSLTWEALWRQKDPVLGSYGEYRRICEELKLYKSLVHNQPLIGWDNNQRVFNNNSNNKNGLAMTNSSGSGGFSVNNNGV
GVNREIVNGGYASRNVQGGWENLKHDQRQQCYAVINNGFKQHYLPL

>AT1G72980.1 Arabidopsis AS2 family transcription factor, protein sequence 214AA tffamily=AS2
MSLSTFSGGSTTACAACKHQRKKCKKNCILARYFPQDGTNKFLNAHKLFGVSNITKMLKRIEESQRDIAMENLIYHANAR
ALDPVGGVYRTICDLKCKIEFVQTELNLTRQQIDMCRSLAQEQHRQRQNLPYRCNSFESLLQQDGDEYVNVDGLDHQNMQ
QQQEMQQQQQNPSNYDMFLEMPEQTSKVKLEEEKISDQRKNNLMRQILMSSAII

>AT1G16530.1 Arabidopsis AS2 family transcription factor, protein sequence 165AA tffamily=AS2
MRQKGHRHGRTVSPCAGCKLLRRKCVKDSCVFAPYFPAKEPYKFAIVHKIFGASNVNKMLQELSENHRSDAVDSMVYEAN
ARIQDPVYGCVGTISSLHRQLETLQTQLAFAQAELIHIRTLHRIHTKPPPYTASTVTFPSNKDFYSDIDMAVAYTDDAGD
FLWSC

>AT1G07900.1 Arabidopsis AS2 family transcription factor, protein sequence 190AA tffamily=AS2
MESKSDASVATTPIISSSSSPPPSLSPRVVLSPCAACKILRRRCAERCVLAPYFPPTDPAKFTIAHRVFGASNIIKFLQE
LPESQRTDAVNSMVYEAEARIRDPVYGCAGAIYHLQRQVSELQAQLAKAQVEMVNMQFQRSNLLELIYNMDQQQKQEQDN
MSFESNDLGFLEDKSNTNSSMLWWDPLWTC

>AT5G63090.1 Arabidopsis AS2 family transcription factor, protein sequence 186AA tffamily=AS2
MASSSNSYNSPCAACKFLRRKCMPGCIFAPYFPPEEPHKFANVHKIFGASNVTKLLNELLPHQREDAVNSLAYEAEARVR
DPVYGCVGAISYLQRQVHRLQKELDAANADLAHYGLSTSAAGAPGNVVDLVFQPQPLPSQQLPPLNPVYRLSGASPVMNQ
MPRGTGGSYGTFLPWNNGHDQQGGNM

>AT3G11090.1 Arabidopsis AS2 family transcription factor, protein sequence 165AA tffamily=AS2
MRGHEPRSSSSCAACKLLKRRCTPTCIFAPYFRSSDLITFAKVHKVFGASNVSKLLGEVPEEQRQETVNSLAYEAEVRLK
DPVYGCIGAIASLQKKMLELQHDLAVARTRLLAHSGVNNSQVSPLDDSPELAAFLDLVPYSDLMLLDGSTNLDAYLYDLG
QPPFV

>AT4G00220.1 Arabidopsis AS2 family transcription factor, protein sequence 228AA tffamily=AS2
MSSSGNPSSSSGGGGGPCGACKFLRRKCVAGCIFAPYFDSEQGAAHFAAVHKVFGASNVSKLLHHVPEHKRPDAVVSICF
EAQARLRDPIYGCVSHIVSLQQQVVSLQTELSYLQAHLATLELPQPQPPQVPVSSSGSLQALSITDLPTISPSVYDLSSI
FEPVMSSTWAMQQQPRPSDHLFGVPSSSNMGGGGELQALAREFIHGGQMPAQPSPGTSGSASSVIKRE

>AT5G06080.1 Arabidopsis AS2 family transcription factor, protein sequence 177AA tffamily=AS2
MASHGSSCGACKFLRRKCNRDCVFSPYFSYEQASSHFAAVHKVFGASNVSKHLLHLPQHQRNIAAITISYEALSRMRDPV
YGCVAHIFALHQQVVTLQEEIEFLGSQMKNFSYSNQNGSQLNNIPEFVNQMTMATTNFVDESVLNNADGRNCYDGFFTNS
EEMLVNHQWLQNMDYYY

>AT4G00210.1 Arabidopsis AS2 family transcription factor, protein sequence 215AA tffamily=AS2
MSGSTTGCGGPCGACKFLRRKCVADCVFAPYFDSVEGTSHFTAVHKVFGASNASKLLMMIPASRRLDAVVTLTYEALARL
RDPVYGCVGHIFALQHQAELAYVQTQLSTLQGLPPPNSQNNSRTEAASSSNVPLISSVDSKDNMSSSSSHIPCMSQQQEQ
EQPKEAIEVPTESVDLSTFFGLENPVDEDGDLNALAREFFTKYLTGGKYRPSSLI

>AT3G50510.1 Arabidopsis AS2 family transcription factor, protein sequence 198AA tffamily=AS2
MMFHQMDKISTPCAACKHLRRKCTEDCVFAPYFPSTKLDNYEAVHKVFGASHVATLINSLHPCQREFAMDTLAWEAQVQA
NDPVNGCLGIIYNLLSQIKDLEEQLAIVKNELASYCIVPTFVPPPSMTNLEMHNNPMMIPEHTPNNGGCLTGQQLYNEAQ
RFASTQSAQMQETQMQHDEESYRDKSSYQKFGPCFNLH

>AT1G67100.1 Arabidopsis AS2 family transcription factor, protein sequence 233AA tffamily=AS2
MRMSCNGCRVLRKGCSENCSIRPCLQWIKSAESQANATVFLAKFYGRAGLMNLLNTGPDHLRPAIFRSLLYEACGRIVNP
IYGSVGLLWSGNWHLCQAAVEAVMRGSPVTPIACDAAVTGQAPPFNNKLCDIRHVSSRDENVKRRSRGACKEERNVRSLS
HESSLSHESPVSSEETTTEEPKTWIGLELTLGLEPLARGNHVVVPMKKRKLERCGTSEDEDTCKIELGLVCSE

>AT3G49940.1 Arabidopsis AS2 family transcription factor, protein sequence 247AA tffamily=AS2
MSCNGCRVLRKGCSENCILRPCIQWIESPEAQGHATVFVAKFFGRAGLMSFISAVPESQCPALFQSLLYEACGRTVNPVN
GAVGLLWTGNWNVCQAAVETVLRGGSLKPIPELLNGGGFAGFPSPTSDEASEICTEMLNLRKADDSGDRNIYHHCRFSSS
RSRSRSTASPPKRKRLSSEQQPSSELDLSLIPIYPIKTLPFKEDTPSMYSEESVTTVSFQNNNAGDRYVRCGGGGGGATT
KLLNLFA

>AT3G27940.1 Arabidopsis AS2 family transcription factor, protein sequence 153AA tffamily=AS2
MNANPCEVCRFQNKQCVNNCMFALLFPSSDLEKFDVVNRIFGLETLTFYLKDLSPMERIDTTRTLYYEAKPCFLNPPKNP
SKFLEALLNYPYQKAEEVSKTKKLLASYSRPCVVLALPAPKYTQSKSKPSVLRKRKRKTKSSDESAIRVVEDS

7) AUX-IAA

Searched with the Arabidopsis genes

>AT1G15050.1 tffamily=AUX-IAA
QTTEFGGVIDLGLSLRTIQHEIYHSSGQRYCSNEGYRRKWGYVKVTMDGLVVGRKVCVLDHGSYSTLAHQLEDMFGMQSVSGLRLFQMESEFCLVYRDEEGLWRNAGDVPWNEFIESVERLRITRRNDAVLP

>AT1G04100.1 tffamily=AUX-IAA
ADSSPAAASNATRQVAVGWPPLRTYRINSLVNQAKSLATEGGLSETTKSVVVAAKNDDACFIKSSRTSMLVKVTMDGVIIGRKVDLNALDSYAALEKTLDLMFFQIPSPVTRSNTQGYKTIKETCTSKLLDGSSEYIITYQDKDGDWMLVGDVPWQMFLGSVTRLRIMKTSIGAGVGK

>AT4G28640.1 tffamily=AUX-IAA
ADSMAATSGQVVGWPPIRTYRMNSMVNQAKASATEDPNLEISQAVNKNRSDSTKMRNSMFVKVTMDGIPIGRKIDLNAHKCYESLSNTLEEMFLKPKLGSRTLETDGHMETPVKILPDGSSGLVLTYEDKEGDWMLVGDVPWGMFIGSVRRLRIMKTSEATGKAQ

>AT1G04550.1 tffamily=AUX-IAA
AESSSHQGASPPRSSQVVGWPPIGLHRMNSLVNNQAMASATEDPNLEISQAVNKNRSDSTKMRNSMFVKVTMDGIPIGRKIDLNAHKSYENLAQTLEEMFFGMTGLYCFQ

>AT2G46990.1 tffamily=AUX-IAA
TDLRLGLSFGTSSGTQYFNGGYGYSVAAPAVEDAEYVAAVEEEEENECNSVGSFYVKVNMEGVPIGRKIDLMSLNGYRDLIRTLDFMFNASILWAEEEDMCNEKSHVLTYADKEGDWMMVGDVPWEMFLSTVRRLKISRA

>AT3G17600.1 tffamily=AUX-IAA
PSESSVNLSLSLTFPSTSPQREARQDWPPIKSRLRDTLKGRRLLRRGDDTSLFVKVYMEGVPIGRKLDLCVFSGYESLLENLSHMFDTSIICGNRDRKHHVLTYEDKDGDWMMVGDIPWDMFLETVRRLKITRP

>AT3G23030.1 tffamily=AUX-IAA
EETRDEEESTPPTKTQIVGWPPVRSSRKNNNSVSYVKVSMDGAPYLRKIDLKTYKNYPELLKALENMFKVMIGEYCEREGYKGSGFVPTYEDKDGDWMLVGDVPWDMFSSSCKRLRIMKG

>AT3G23050.1 tffamily=AUX-IAA
EKTTLKDPSKPPAKAQVVGWPPVRNYRKNMMTQQKTSSGAEEASSEKAGNFGGGAAGAGLVKVSMDGAPYLRKVDLKMYKSYQDLSDALAKMFSSFTMGNYGAQGMIDFMNESKLMNLLNSSEYVPSYEDKDGDWMLVGDVPWEMFVESCKRLRIMKG 

>AT3G04730.1 tffamily=AUX-IAA
ENMKEKVVKPPAKAQVVGWPPVRSFRKNVMSGQKPTTGDATEGNDKTSGSSGATSSASACATVAYVKVSMDGAPYLRKIDLKLYKTYQDLSNALSKMFSSFTIGNYGPQGMKDFMNESKLIDLLNGSDYVPTYEDKDGDWMLVGDVPWEMFVDSCKRIRIMKG

>AT1G80390.1 tffamily=AUX-IAA
ENNYISSMVTNDQLVGWPPVATARKTVRRKYVKVALDGAAYLRKVDLGMYDCYGQLFTALENMFQGIITICRVTELERKGEFVATYEDKDGDLMLVGDVPWMMFVESCKRMRLMKT

>AT5G65670.1 tffamily=AUX-IAA
KGQSSTTNNSSSPPAAKAQIVGWPPVRSYRKNTLATTCKNSDEVDGRPGSGALFVKVSMDGAPYLRKVDLRSYTNYGELSSALEKMFTTFTLGQCGSNGAAGKDMLSETKLKDLLNGKDYVLTYEDKDGDWMLVGDVPWEMFIDVCKKLKIMKG

>AT3G15540.1 tffamily=AUX-IAA
GGDAEKVNDSPAAKSQVVGWPPVCSYRKKNSCKEASTTKVGLGYVKVSMDGVPYLRKMDLGSSQGYDDLAFALDKLFGFRGIGVALKDGDNCEYVTIYEDKDGDWMLAGDVPWGMFLESCKRLRIMKR

>AT3G16500.1 tffamily=AUX-IAA
SNKTTSVPHISQKRTAPGPVVGWPPVRSFRKNLASTSSSKLGNESSHGGQINKSDDGEKQVETKKEGMFVKINMDGVPIGRKVDLNAYNSYEQLSFVVDKLFRGLLAAQRDISDGQGEEKPIIGLLDGKGEFTLTYEDNEGDKMLVGDVPWQMFVSSVKRLRVIK

>AT5G57420.1 tffamily=AUX-IAA
ASKNHNNSNSSSGAAGRSFQGFGLNVEDDLVSSVVPPVTVVLEGRSICQRISLDKHGSYQSLASALRQMFVDGADSTDDLDLSNAIPGHLIAYEDMENDLLLAGDLTWKDFVRVAKRIRILPV


8) BBR-BPC

Searched with the 3 prime half of four Arabidopsis genes. The DNA-binding domain is located in the C-terminal half and is the most highly conserved portion.

>AT2G35550.1 tffamily=BBR-BPC
KRSVSNKSKKTPSIPETKREKKNLDINIDISSFDTSGVPPPVCSCTGVSRVCYKWGMGGWQSSCCTISISTYPLPMSTTRPGARLAGRKMSNGAYVKLLARLADEGYDLSHPLDLKNHWARHGTNKFVTIK

>AT2G01930.1 tffamily=BBR-BPC
RKPKEERDVTNNNVQQQQQRVKPVKKSVDLVINGVSMDISGLPVPVCTCTGTPQQCYRWGCGGWQSACCTTNISVYPLPMSTKRRGARISGRKMSQGAFKKVLEKLSEGYSFGNAIDLKSHWARHGTNKFVTIR

>AT2G21240.1 tffamily=BBR-BPC
KVKKVGEDLNRRVPAPGKKSRTDWDSQDVGLNLVTFDETTMPVPMCSCTGSTRQCYKWGNGGWQSSCCTTTLSQYPLPQMPNKRHSRMGGRKMSGNVFSRLLSRLSAEGYDLSCPVDLKDYWARHGTNRYITIK

>AT5G42520.1 tffamily=BBR-BPC
NQRKVKKESEDDLNKIMFVKTTHSKSDWKSQEMVGLNQVVYDETTMPPPVCSCTGVLRQCYKWGNGGWQSSCCTTTLSMYPLPALPNKRHARVGGRKMSGSAFNKLLSRLAAEGHHDLSNPVDLKDHWAKHGTNRYITIK

9) BES

Searched with the Arabidopsis genes

>AT1G78700.1 tffamily=BES
TSGTRMPTWRERENNKRRERRRRAIAAKIFTGLRMYGNYELPKHCDNNEVLKALCNEAGWIVEPDGTTYRKGCSRPVER

>AT1G75080.1 tffamily=BES
AAARRKPSWRERENNRRRERRRRAVAAKIYTGLRAQGDYNLPKHCDNNEVLKALCVEAGWVVEEDGTTYRKGCKPLPGE

>AT1G19350.1 tffamily=BES
MATRRKPSWRERENNRRRERRRRAVAAKIYTGLRAQGNYNLPKHCDNNEVLKALCSEAGWVVEEDGTTYRKGHKPLPGD

>AT3G50750.1 tffamily=BES
AATGRMPTWKERENNKKRERRRRAIAAKIFTGLRSQGNYKLPKHCDNNEVLKALCLEAGWIVHEDGTTYRKGSRP

>AT5G45300.1 tffamily=BES
GGKGKREREKEKERTKLRERHRRAITSRMLAGLRQYGNFPLPARADMNDVIAALAREAGWSVEADGTTYRQSQQPNHVV

>AT2G45880.1 tffamily=BES
GGSRRSRPLEEKERTKLRERHRRAITARILGGLRRHGNYNLRVRADINDVIAALAREAGWVVLPDGTTFPSKSQGTKPT

10) bHLH

Searched with 25 bHLH domains from different groups to ensure isolation of all possible genes.

>AT1G01260.1 IIId tffamily=bHLH
EALNHVEAERQRREKLNQRFYALRSVVPNISKMDKASLLGDAVSYINEL

>AT1G32640.1 IIIe tffamily=bHLH
EPLNHVEAERQRREKLNQRFYALRAVVPNVSKMDKASLLGDAIAYINEL

>AT5G54680.1 IVc tffamily=bHLH
ATSSKACREKQRRDRLNDKFMELGAILEPGNPPKTDKAAILVDAVRMVTQL

>AT2G22770.1 IVa tffamily=bHLH
LLKEHVLAERKRRQKLNERLIALSALLPGLKKTDKATVLEDAIKHLKQL

>AT5G56960.1 IVd tffamily=bHLH
TQLQHMISERKRREKLNESFQALRSLLPPGTKKDKASVLSIAREQLSSL

>AT2G31220.1 II tffamily=bHLH
RKSRTSPTERERRVHFNDRFFDLKNLIPNPTKIDRASIVGEAIDYIKEL

>AT2G31210.1 II tffamily=bHLH
RKNKPFTTERERRCHLNERYEALKLLIPSPSKGDRASILQDGIDYINEL

>AT1G10610.1 IIIc tffamily=bHLH
FKSKNLHSERKRRERINQAMYGLRAVVPKITKLNKIGIFSDAVDYINEL

>AT5G65640.1 IIIb tffamily=bHLH
QPSKNLMAERRRRKRLNDRLSMLRSIVPKISKMDRTSILGDAIDYMKEL

>AT4G21330.1 IIIa tffamily=bHLH
FKSPNLEAERRRREKLHCRLMALRSHVPIVTNMTKASIVEDAITYIGEL

>AT1G68810.1 Vb tffamily=bHLH
ASKSHSEAERRRRERINNHLAKLRSILPNTTKTDKASLLAEVIQHVKEL

>AT5G08130.1 Va tffamily=bHLH
PRSKHSATEQRRRSKINDRFQMLRQLIPNSDQKRDKASFLLEVIEYIQFL

>AT2G43010.2 VIIa tffamily=bHLH
AAEVHNLSERRRRDRINERMKALQELIPHCSKTDKASILDEAIDYLKSL

>AT5G67110.1 VIIb tffamily=bHLH
DAQFHNLSEKKRRSKINEKMKALQKLIPNSNKTDKASMLDEAIEYLKQL

>AT2G24260.1 XI tffamily=bHLH
ATDPHSIAERLRRERIAERMKALQELVPNGNKTDKASMLDEIIDYVKFL

>AT1G18400.1 XII tffamily=bHLH
ATDSHSLAERVRRGKINERLRCLQDMVPGCYKAMGMATMLDEIINYVQSL

>AT2G42280.1 IX tffamily=bHLH
HPRSIAERVRRTRISERMRKLQELVPNMDKQTNTSDMLDLAVDYIKD

>AT1G27740.1 VIIIc tffamily=bHLH
DPQSLYARKRREKINERLKTLQNLVPNGTKVDISTMLEEAVHYVKFL

>AT3G21330.1 VIIIb tffamily=bHLH
DPQTVAARQRRERISEKIRVLQTLVPGGTKMDTASMLDEAANYLKFL

>AT1G30670.1 VIIIa tffamily=bHLH
ELSAQSIAARKRRRRITEKTQELGKLIPGSQKHNTAEMFNAAAKYVKFL

>AT1G68240.1 VI tffamily=bHLH
YRMMMEKKRRKEIKDKVDILQGLMPNHCTKPDLASKLENIIEYIKSL

>AT4G01460.1 Ia tffamily=bHLH
QRMTHIAVERNRRRQMNEHLNSLRSLMPPSFLQRGDQASIVGGAIDFIKEL

>AT5G46690.1 Ia tffamily=bHLH
QRMTHIAVERNRRRQMNQHLSVLRSLMPQPFAHKGDQASIVGGAIDFIKEL

>AT5G04150.1 Ib tffamily=bHLH
KKLNHNASERDRRRKLNALYSSLRALLPKLSIPMTVARVVKYIPEQ

>AT1G12540.1 Ib tffamily=bHLH
KRAKHKELERQRRQENTSLFKILRYLLPGKRSSADHVLEAVNYIKDL
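
Each query header carries a tffamily= tag, so the counts quoted in the section notes (for example, 25 bHLH domains) can be checked by tallying headers per tag. A minimal sketch, assuming the tag format shown in the headers above; the function name and the "untagged" label are illustrative.

```python
import re
from collections import Counter

# Matches the family tag used in the headers, e.g. "tffamily=bHLH".
TAG = re.compile(r"tffamily=(\S+)")


def count_queries_by_family(text):
    """Count FASTA header lines per tffamily tag; other lines are ignored."""
    counts = Counter()
    for line in text.splitlines():
        if line.startswith(">"):
            match = TAG.search(line)
            counts[match.group(1) if match else "untagged"] += 1
    return counts
```

Because only lines starting with ">" are read, the function can be run on these notes as they stand, without removing headings or comments first.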

11) bZIP

Searched with the Arabidopsis genes

>AT1G03970.1 tffamily=bZIP
KAAAQRQKRMIKNRESAARSRERKQAYQVELETLAAKLEEENEQLLKEIEESTKERY

>AT1G49720.1 tffamily=bZIP
KVVERRQKRMIKNRESAARSRARKQAYTLELEAEIESLKLVNQDLQKKQAEIMKTHN

>AT3G54620.1 tffamily=bZIP
PTDVKRARRMLSNRESARRSRRRKQEQMNEFDTQVGQLRAEHSTLINRLSDMNHKYD

>AT4G34590.1 tffamily=bZIP
LMEQRKRKRMLSNRESARRSRMKKQKLLDDLTAQVNHLKKENTEIVTSVSITTQHYL

>AT2G18160.1 tffamily=bZIP
TVDERKRKRMLSNRESARRSRMRKQKHVDDLTAQINQLSNDNRQILNSLTVTSQLYM

>AT4G36730.1 tffamily=bZIP
ERELKRQKRKQSNRESARRSRLRKQAECEQLQQRVESLSNENQSLRDELQRLSSECD

>AT4G36730.2 tffamily=bZIP
ERELKRQKRKQSNRESARRSRLRKQAECEQLQQRVESLSNENQSLRDELQRLSSECD

>AT4G01120.1 tffamily=bZIP
EKEVKREKRKQSNRESARRSRLRKQAETEQLSVKVDALVAENMSLRSKLGQLNNESE

>AT1G19490.1 tffamily=bZIP
EREERRIRRILANRESARQTIRRRQAMCEELSKKAADLTYENENLRREKDWALKEFQ

>AT1G22070.1 tffamily=bZIP
RINDKMKRRLAQNREAARKSRLRKKAHVQQLEESRLKLSQLEQELVRARQQGLCVRN

>AT2G16770.1 tffamily=bZIP
TSESSGKKRPLGNREAVRKYREKKKAKAASLEDEVMRLKAVNNQLLKRLQGQAALEA

>AT4G35040.1 tffamily=bZIP
SCGKKGEKRPLGNREAVRKYREKKKAKAASLEDEVARLRAVNQQLVKRLQNQATLEA

>AT1G43700.1 tffamily=bZIP
DPKRAKRILANRQSAARSKERKIRYTGELERKVQTLQNEATTLSAQVTMLQRGTS

>AT2G42380.2 tffamily=bZIP
DPKRVKRILANRQSAQRSRVRKLQYISELERSVTSLQAEVSVLSPRVAFLDHQRL

>AT3G58120.1 tffamily=bZIP
IHDPKRVKRILANRQSAQRSRVRKLQYISELERSVTSLQTEVSVLSPRVAFLDHQRL

>AT2G40950.1 tffamily=bZIP
EEDEKKRARLMRNRESAQLSRQRKKHYVEELEEKVRNMHSTITDLNGKISYFMAENA

>AT3G10800.1 tffamily=bZIP
DDDKRKLIRQIRNRESAQLSRLRKKQQTEELERKVKSMNATIAELNGKIAYVMAENV

>AT1G42990.1 tffamily=bZIP
DAVAKKRRRRVRNRDAAVRSRERKKEYVQDLEKKSKYLERECLRLGRMLECFVAENQ

>AT5G11260.1 tffamily=bZIP
EKENKRLKRLLRNRVSAQQARERKKAYLSELENRVKDLENKNSELEERLSTLQNENQ
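
Some entries in this set share an identical sequence under different identifiers (for example AT4G36730.1 and AT4G36730.2). A small sketch of grouping identifiers by sequence, so that each distinct query is searched only once. The function name is hypothetical, and treating sequences as identical after removing whitespace and upper-casing is an assumption.

```python
def group_identical(records):
    """Group identifiers by sequence.

    records: iterable of (identifier, sequence) pairs.
    Returns a dict mapping each distinct sequence to the list of
    identifiers that carry it, in first-seen order.
    """
    groups = {}
    for ident, seq in records:
        key = "".join(seq.split()).upper()  # ignore spacing and case
        ids = groups.setdefault(key, [])
        if ident not in ids:  # skip repeated identifier/sequence pairs
            ids.append(ident)
    return groups
```

Any group with more than one identifier marks a set of redundant queries.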

12) C2C2-GATA

Searched with the Arabidopsis genes

>AT2G18380.1 tffamily=C2C2-GATA
CASCDTTSTPLWRNGPKGPKSLCNACGIRFKKEERR

>AT3G20750.1 tffamily=C2C2-GATA
CTNMNCNALNTPMWRRGPLGPKSLCNACGIKFRKEEER

>AT1G08000.1 tffamily=C2C2-GATA
CTHCETITTPQWRQGPSGPKTLCNACGVRFKSGRLV

>AT3G45170.1 tffamily=C2C2-GATA
CSHCGTRKTPLWREGPRGAGTLCNACGMRYRTGRLL

>AT4G17570.1 tffamily=C2C2-GATA
CYHCGVTNTPLWRNGPPEKPVLCNACGSRWRTKGTL

13) C2H2

The C2H2 zinc finger family is a large superfamily of largely unrelated genes that contain between one and at least seven fingers. Searches were performed with all the subgroups to ensure identification of all the disparate members.
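
As a rough illustration of the variable finger number, candidate fingers can be counted by pattern matching. The sketch below assumes the classical C2H2 spacing C-x(2,4)-C-x(12)-H-x(3,5)-H. That spacing is not stated in these notes, and fingers with atypical spacing will be missed, so the count is only an estimate. The function name is hypothetical.

```python
import re

# Assumed classical spacing: C-x(2,4)-C-x(12)-H-x(3,5)-H.
C2H2_PATTERN = re.compile(r"C.{2,4}C.{12}H.{3,5}H")


def count_c2h2_fingers(seq):
    """Count non-overlapping matches of the assumed C2H2 pattern."""
    return len(C2H2_PATTERN.findall(seq.upper()))
```

With this pattern, the C1-1i query (AT1G66140, listed as a single finger) gives one match and the C1-2i query (AT2G37430, listed as two fingers) gives two.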


>A1a At1g03840 (whole protein) tffamily=C2H2
mttedqtisssggyvqsssttdhvdhhhhdqheslnpplvkkkrnlpgnpdpeaevials
pktlmatnrflceicgkgfqrdqnlqlhrrghnlpwklkqrtskevrkrvyvcpekscvh
hhptralgdltgikkhfcrkhgekkwkcekcakryavqsdwkahsktcgtreyrcdcgti
fsrrdsfithrafcdalaeetarlnaashlksfaatagsnlnyhylmgtlipspslpqpp
sfpfgppqpqhhhhhqfpittnnfdhqdvmkpastlslwsggninhhqqvtiedrmapqp
hspqedynwvfgnannhgelittsdslithdnninivqskenangatslsvpslfssvdq
itqdanaasvavanmsatallqkaaqmgatsstsptttittdqsaylqsfasksnqived
ggsdrffasfgsnsvelmsnnnnglheignprngvtvvsgmgelqnypwkrrrvdignag
gggqtrdflgvgvqtichsssingwi

>A1b At1g68130 (whole protein) tffamily=C2H2
mrtdqvmlsnkntntccvvsssssdpflsssengvtttntstqkrkrrpagtpdpdaevv
slsprtllesdryiceicnqgfqrdqnlqmhrrrhkvpwkllkrdnnievkkrvyvcpep
tclhhnpchalgdlvgikkhfrrkhsnhkqwvcercskgyavqsdykahlktcgtrghsc
dcgffssfrvesfiehqdncsarrvhrepprppqtavtvpacssrtastvstpssetnyg
gtvavttpqplegrpihqrisssiltnssnnlnlelqllplssnqnpnqenqqqkvkeps
hhhnhnhdttnlnlsiapsssyqhynnfdrikeimaseqimkiamkekayaeeakreakr
qreiaenefanakkirqkaqaelerakflkeqsmkkisstimqvtcqtckgqfqavavpa
atadetslvvsymssantdgelengf

>A1c AT1g34370 (whole protein) tffamily=C2H2
mstapgpftgqpgsavfpyvreannvasqsqnnnncgarefdlpkpvlvdereghvveeh
emkdeddveegenlppgsyeilqlekeeilaphthfcticgkgfkrdanlrmhmrghgde
yktaaalakpnkesvpgsepmlikryscpflgckrnkehkkfqplktilcvknhykrthc
dksftcsrchtkkfsviadlkthekhcgknkwlcscgttfsrkdklfghialfqghtpai
pleetkpsaststqrgsseggnnnqgmvgfnlgsasnanqettqpgmtdgricfeesfsp
mnfdtcnfggfhefprlmfddsessfqmlianacgfsprnvgesvsdtsl

>A1d At1g51220 (whole protein) tffamily=C2H2
msnpacsnlfnngcdhnsfnystslsyiynshgsyyysnttnpnyinhthttstspnspp
lrealpllslspirhqeqqdqhyfmdthqisssnflddplvtvdlhlglpnygvgesirs
niapdattdeqdqdhdrgvevtveshldddddhhgdlhrghhywiptpsqiligptqftc
plcfktfnrynnmqmhmwghgsqyrkgpeslrgtqptgmlrlpcfccapgcknnidhpra
kplkdfrtlqthykrkhgskpfacrmcgkafavkgdwrthekncgklwycscgsdfkhkr
slkdhvkafgnghvpcgidsfggdhedyydaasdieq

>A3 At2g23740 (region spanning the 3 fingers) tffamily=C2H2
WSFSGFACAICLDSFVRRKLLEIHVEERHHVQFAEKCMLLQCIPCGSHFGDKEQLLVHVQAVHPSECKSLTVASECNLTNGEFSQKPEAGSSQIVVSQNNENTSGVHKFVCKFCGLKFNLLPDLGRHHQAEHMGPSLVGS

>A4 at1g30970 (two fingers) tffamily=C2H2
KVWCYYCDREFDDEKILVQHQKAKHFKCHVCHKKLSTASGMVIHVLQVHKENVTKVPNAK

>B1 at1g72050 (whole protein) tffamily=C2H2
maeeakvdvktsakkdirnylcqycgisrsknylitkhiqshhqmeleeerddeacevde
esssnhtcqecgaefkkpahlkqhmqshslersftcyvddcaasyrrkdhlnrhllthkg
klfkcpkencksefsvqgnvgrhvkkyhsndnrdkdntglgdgdkdntckgdddkeksgs
ggcekenegnggsgkdnngngdsqpaecstgqkqvvckeigcgkafkypsqlqkhqdshv
kldsveafcsepgcmkyftneeclkshirschqhinceicgskhlkknikrhlrthdeds
spgeikcevegcsstfskasnlqkhmkavhddirpfvcgfpgcgmrfaykhvrnkhensg
yhvytcgdfvetdedftsrprgglkrkqvtaemlvrkrvmpprfdaeehetc

>C1-1i At1g66140 (single finger) tffamily=C2H2
SKRVFSCNYCQRKFYSSQALGGHQNAHKRE

>C1-Sa AT2G15740 (single finger) tffamily=C2H2
YGPYTCPKCNGVFNTSQKFAAHMSSHYKNE

>C1-Q At2g36475 (single finger) tffamily=C2H2
FDDLPHLCTSCSVRLKQKEELDRHMELHDK

>C1-2i AT2G37430 (two fingers) tffamily=C2H2
SHTSNQFECKTCNKRFSSFQALGGHRASHKKPKLTVEQKDVKHLSNDYKGNHFHKCSICSQSFGTGQALGGHMRRHRSSM

>C1-3i AT1G02030 (All three fingers) tffamily=C2H2
MEERHKCKLCWKSFANGRALGGHMRSHMLIHPLPSQPESYSSSMADPGFVLQDRESETES
SKKPSRKRSRLNRRSISSLRHQQSNEEGKSETARAADIKIGVQELSESCTEQEPMSSVSD
AATTEEDVALSLMLLSRDKWEKEEEESDEERWKKKRNKWFECETCEKVFKSYQALGGHRA
SHKKKIAETDQLGSDELKKKKKKSTSSHHECPICAKVFTSGQALGGHKRSHASANNEFTR

>C1-4i AT1G49900 (two searches with the two double finger regions) tffamily=C2H2
DLFKCSICEKVFTSYQALGGHKASHSIKAAQLENAGADAGEKTRSKMLSPSGKIHKCDICHVLFPTGQALGGHKRRHYEG
QCNVCGRELPSYQALGGHKASHRTKPPVENATGEKMRPKKLAPSGKIHKCSICHREFSTGQSLGGHKRLH

>C1-5i AT3G29340 (whole protein - six fingers) tffamily=C2H2
MDLDGVELLLDLREMVSQSGFEKSTTCSGVIALRSNLQSKSSHKCKICGKSFECYQALGG
HQRIHRPIKEKLSKQEFSEVYPRKSKLQKRPESSSSCYECKVCGKIFGCYRGLGGHTKLH
RSTKRELASTQDENSLLDSSEAKKIVSQPSSFKVSQEEKFLHCVELKQDFSEPLSHSGAL
PSTLRSKLQTKTQWKSSCHCKICGKSFVCSQGLGNHKRVHREISGKLACKRKYTEDYNPF
SDSLKAKKIVKKPSSFEVSQEEKILHCVELKQDFGELLAHSGFDKSISCSKSIKVKKVAR
KNEKTEDSTSLFGVFVGEMSQRLHGCKTCGRKFGTLKGVYGHQRMHSGNHNRIEDENGLE
RIWGLKKKSRVCSVSAFDRFKGSSFMAEIEKHEVIEAALNLVMLCQGVYDFASISNLPLG
DGFMDLELKPCPLRRKLQKKSRSSYKCSICEKSFVCSQALGSHQRLHRWKLVPKPEYIED
DSSLLDSSEAKKIVSKPSSFEHAQEEKILQCVEPKLEFHEQLAHSGFDKFDTCSKIRFSA
LPSPPEAKKIVSQPPSFEVSVDEKILYRAEPKLNFSEPLAHSCFDNSSSYRSIICGKSFV
CSQALGGHQTLHRSIKGQLAGTEDGNSLSVTDSEASKIVAQPSSYKSQGI

>C2-B AT1G65110 (single finger) tffamily=C2H2
WKFWMCRTCSQTFFYPKKFKNHLEQVHDAK

>C2-sb AT4G26030 (single finger) tffamily=C2H2
PFEKDSSFICLKCNSLFDTSQMLVVHTELIHSKNETKKRL

>C2-pairs AT4G12240 (single finger) tffamily=C2H2
VKPPEPYFCGVCDRRFYTNEKLINHFKQIH

>C2-unique AT1G04445 (single finger) tffamily=C2H2
MPFSEPQECAVCKRVFLSSHQLISHYNAAH

>C2-cons SF AT2G27100 (single finger) tffamily=C2H2
DEKYGWKYGCGAKGCTKLFHAAEFVYKHLKLKHTELVTEL

>C3-PL SF AT1G01350 (single finger) tffamily=C2H2
NALPFACFICREPFVDPVVTKCKHYFCEHC


14) C3H

Searched with 32 C3H domains from DATB

>AT1G03790.1 tffamily=C3H
YSGEVCPEFRRGGDCSRGDDCEFAHGV

>AT2G40140.1 tffamily=C3H
YTCVPCPEFRKGSCPKGDSCEYAHGV

>AT2G41900.1 tffamily=C3H
YSCVPCPDFRKGACRRGDMCEYAHGV

>AT5G06420.1 tffamily=C3H
YQPDICKDYKETGYCGYGDSCKFLHDR

>AT1G32360.1 tffamily=C3H
FKGRHCKKFYTEEGCPYGESCTFLHDE

>AT1G04990.1 tffamily=C3H
PGERDCQFYLRTGLCGYGSSCRYNHPT

>AT3G12680.1 tffamily=C3H
PGEPDCPYYIKTQRCKYGSKCKFNHPR

>AT2G32930.1 tffamily=C3H
VGQPDCETGACKYGPTCKYHHPK

>AT1G48195.1 tffamily=C3H
PGEPECSYYLRTGNCYLKQNCKYHHPK

>AT5G18550.1 tffamily=C3H
MGQPVCQHFMRTGTCKFGASCKYHHPR

>AT1G21570.1 tffamily=C3H
NCPVFEATGSCSQGLKCKLHHPK

>AT2G47850.1 tffamily=C3H
PGVQRCTFYVQNGFCKFGSTCKFDHPM

>AT3G08505.1 tffamily=C3H
PPNNVCTFYQKRICLYGSRCRYDHVR

>AT3G08505.1 tffamily=C3H
LRSIDCKHFNFGNGNCPFGASCFYKHAY

>AT3G19360.1 tffamily=C3H
LRMKLCRKFCFGEECPYGDRCNFIHED

>AT3G51950.1 tffamily=C3H
FGGVPCSYFARGFCKNGASCRFVHSD

>AT2G47680.1 tffamily=C3H
EAPVCVYFLNGYCNRGGQCTFTHTL

>AT2G02160.1 tffamily=C3H
KWNTDCVYFLASPLTCKKGPECEYRHSE

>AT3G12130.1 tffamily=C3H
SKSKPCTKFFSTSGCPFGENCHFLHYV

>AT3G08505.1 tffamily=C3H
SDRILCKFFVHGSCLKGENCEFSHDS

>AT2G47680.1 tffamily=C3H
STRPACKFFASSQGCRNGESCLFSHAM

>AT3G47120.1 tffamily=C3H
EARGVCRAFQRGECTRGDSCKFSHDE

>AT5G51980.1 tffamily=C3H
KTEKVCNFWVDGNCTYGDKCRYLHCW

>AT5G42820.1 tffamily=C3H
FREATCRQYEENSCNRGGYCNFMHVK

>AT1G29600.1 tffamily=C3H
GKKLDCKAGACKRGSNCPFNHPK

>AT5G07060.1 tffamily=C3H
NRPKICSFYTIGQCKRGAECSFRHEM

>AT2G35430.1 tffamily=C3H
WKTRICNKWQTTGYCPFGSHCHFAHGP

>AT2G35430.1 tffamily=C3H
FKTKLCFKFRAGTCPYSASSCHFAHSA

>AT3G12130.1 tffamily=C3H
FKTKICERFSKGNCTFGDRCHFAHGE

>AT2G19810.1 tffamily=C3H
RTQPCKDGGNCRRRVCFFAHSP

>AT3G21810.1 tffamily=C3H
YKTKLCILFNKTGDCSRPNCTFAHGN

>AT2G28450.1 tffamily=C3H
WKTSLCSYFRREASCSHGNECKYAHGE


15) CAMTA

Searched with the Arabidopsis genes

>AT5G64220.1 tffamily=CAMTA
LLSEAQHRWLRPAEICEILRNHQKFHIASEPPNRPPSGSLFLFDRKVLRYFRKDGHNWRKKKDGKTVKEAHEKLKVGSIDVLHCYYAHGEDNENFQRRCYWMLEQDLMHIVFVHYLEVK

>AT3G16940.1 tffamily=CAMTA
MLEEAKSRWLRPNEIHAILYNPKYFTINVKPVNLPNSGRIILFDRKMLRNFRKDGHNWKKKKDGRTVKEAHEHLKVGNEERIHVYYAHGEDNTTFVRRCYWLLDKARENIVLVHYRDTQ

>AT4G16150.1 tffamily=CAMTA
MLDEAYSRWLRPNEIHALLCNHKFFTINVKPCGTIVLFDRKMLRNFRKDGHNWKKKKDGKTIKEAHEHLKVGNEERIHVYYAHGEDTPTFVRRCYWLLDKSQEHIVLVHYRETH

>AT1G67310.1 tffamily=CAMTA
LYQEAHSRWLKPPEVLFILQNHESLTLTNTAPQRPTSGSLLLFNKRVLKFFRKDGHQWRRKRDGRAIAEAHERLKVGNAEALNCYYAHGEQDPTFRRRIYWMLDPEYEHIVLVHYRDVS

16) CCAAT-DR1

Searched with the full length of the two Arabidopsis genes

>AT5G23090.2 Arabidopsis CCAAT-Dr1 family transcription factor, protein sequence  tffamily=CCAAT-DR1
MDPMDIVGKSKEDASLPKATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNDVCNKEDKRTIAPEHVLKALQ
VLGFGEYIEEVYAAYEQHKYETMQDTQRSVKWNPGAQMTEEEAAAEQQRMFAEARARMNGGVSVPQPEHPETDQRSPQS

>AT5G08190.1 Arabidopsis CCAAT-Dr1 family transcription factor, protein sequence  tffamily=CCAAT-DR1
MDPMDIVGKSKEDASLPKATMTKIIKEMLPADVRVARDAQDLLIECCVEFINLISSESNEVCNKEDKRTIAPEHVLKALQ
VLGFGEYVEEVYAAYEQHKYETMQDSQRSVKMNSGAEMTEEEAAAEQQRMFAEARARMNGGVTVPQPEQLEEPQQQQQTS
LQS

17) CCAAT-HAP2

Searched with the Arabidopsis genes

>AT1G30500.1 tffamily=CCAAT-HAP2
AVEEPVFVNAKQYHGILRRRQSRARLESQNKVIKSRKPYLHESRHLHAIRRPRGCGGRFLNAKKED

>AT1G72830.1 tffamily=CCAAT-HAP2
TETDPVFVNAKQYHAIMRRRQQRAKLEAQNKLIRARKPYLHESRHVHALKRPRGSGGRFLNTKKLL

>AT1G17590.1 tffamily=CCAAT-HAP2
IENEPVFVNAKQFHAIMRRRQQRAKLEAQNKLIKARKPYLHESRHVHALKRPRGSGGRFLNTKKLQ 

>AT5G12840.1 tffamily=CCAAT-HAP2
MAQEPVYVNAKQYEGILRRRKARAKAELERKVIRDRKPYLHESRHKHAMRRARASGGRFAKKSEVE 

>AT5G06510.1 tffamily=CCAAT-HAP2
EEDGTIYVNSKQYHGIIRRRQSRAKAEKLSRCRKPYMHHSRHLHAMRRPRGSGGRFLNTKT

>AT3G05690.1 tffamily=CCAAT-HAP2
EDSTIYVNSKQYHGIIRRRQSRAKAAAVLDQKKLSSRCRKPYMHHSRHLHALRRPRGSGGRFLNTKSQN 


18) CCAAT-HAP3

Searched with the Arabidopsis genes

>AT1G21970.1 tffamily=CCAAT-HAP3
CVAREQDQYMPIANVIRIMRKTLPSHAKISDDAKETIQECVSEYISFVTGEANERCQREQRKTITAEDILWAMSKLGFDNYVDPLTVFINRYREIETDRG

>AT2G13570.1 tffamily=CCAAT-HAP3
NNKEQDRFLPIANVGRIMKKVLPGNGKISKDAKETVQECVSEFISFVTGEASDKCQREKRKTINGDDIIWAITTLGFEDYVAPLKVYLCKYRDTEGEKV

>AT2G47810.1 tffamily=CCAAT-HAP3
MMVKEQDRLLPIANVGRIMKNILPANAKVSKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVNGDDICWAMANLGFDDYAAQLKKYLHRYRVLEGEKP

>AT2G37060.1 tffamily=CCAAT-HAP3
LHVREQDRFLPIANISRIMKRGLPANGKIAKDAKEIVQECVSEFISFVTSEASDKCQREKRKTINGDDLLWAMATLGFEDYMEPLKVYLMRYREMEGDTK

>AT3G53340.1 tffamily=CCAAT-HAP3
LNVREQDRFLPIANISRIMKRGLPLNGKIAKDAKETMQECVSEFISFVTSEASDKCQREKRKTINGDDLLWAMATLGFEDYIDPLKVYLMRYREMEGDTK

>AT2G38880.1 tffamily=CCAAT-HAP3
GSVREQDRYLPIANISRIMKKALPPNGKIGKDAKDTVQECVSEFISFITSEASDKCQKEKRKTVNGDDLLWAMATLGFEDYLEPLKIYLARYRELEGDNK

>AT1G09030.1 tffamily=CCAAT-HAP3
MTDEDRLLPIANVGRLMKQILPSNAKISKEAKQTVQECATEFISFVTCEASEKCHRENRKTVNGDDIWWALSTLGLDNYADAVGRHLHKYREAERERT

>AT2G27470.1 tffamily=CCAAT-HAP3
ESEKVVVDELPLAIVRRVVKKKLSECSSIHKEALLAFSESARIFIHYLSATANDFCKDARRQTMKADDVFKALEEMDFSEFLEPLKSSLEDFKKKNAGKK

19) CCAAT-HAP5

Searched with the Arabidopsis genes

>AT1G08970.1 tffamily=CCAAT-HAP5
AFWENQFKEIEKTTDFKNHSLPLARIKKIMKADEDVRMISAEAPVVFARACEMFILELTLRSWNHTEENKRRTLQKNDIAAAVTRTDIFDFLVDIVPREDL

>AT1G54830.1 tffamily=CCAAT-HAP5
SFWETQFKEIEKTTDFKNHSLPLARIKKIMKADEDVRMISAEAPVVFARACEMFILELTLRSWNHTEENKRRTLQKNDIAAAVTRTDIFDFLVDIVPREDL

>AT5G50480.1 tffamily=CCAAT-HAP5
NYWIEQMETVSDFKNRQLPLARIKKIMKADPDVHMVSAEAPIIFAKACEMFIVDLTMRSWLKAEENKRHTLQKSDISNAVASSFTYDFLLDVVPK

>AT5G27910.1 tffamily=CCAAT-HAP5
SFWSKEMEGNLDFKNHDLPITRIKKIMKYDPDVTMIASEAPILLSKACEMFIMDLTMRSWLHAQESKRVTLQKSNVDAAVAQTVIFDFLLDDDIEVKR

>AT5G43250.1 tffamily=CCAAT-HAP5
MEEEEGSIRPEFPIGRVKKIMKLDKDINKINSEALHVITYSTELFLHFLAEKSAVVTAEKKRKTVNLDHLRIAVKRHQPTSDFLLDSLPLP

>AT5G38140.1 tffamily=CCAAT-HAP5
VFWNNQREQLGNFAGQTHLPLSRVRKILKSDPEVKKISCDVPALFSKACEYFILEVTLRAWMHTQSCTRETIRRCDIFQAVKNSGTYDFLIDRVPFG

>AT3G12480.1 tffamily=CCAAT-HAP5
KVPDYGHSQGQGHGDVTMDDRSISKRRKVNDSDEEYKKSKTQEIGSAKTSGRGGRGRGRGRGRGGRAAKAAEREGLNRENSGQPPPEDNVKMHASESSPQEDE

>AT1G07980.1 tffamily=CCAAT-HAP5
TKTSKNREEDDGGAEDAKIKFPMNRIRRIMRSDNSAPQIMQDAVFLVNKATEMFIERFSEEAYDSSVKDKKKFIHYKHLSSVVSNDQRYEFLADSVPEK

20) CONSTANS

Searched with the complete sequences of the Arabidopsis genes CO (X94937), COL6 (AC011915) and COL9 (AC009176). The B1 or B1 and B2 domains (depending on the gene) were manually excised and used for phylogenetic analysis.

>CO tffamily=CONSTANS
MLKQESNDIGSGENNRARPCDTCRSNACTVYCHADSAYLCMSCD
AQVHSANRVASRHKRVRVCESCERAPAAFLCEADDASLCTACDSEVHSANPLARRHQR
VPILPISGNSFSSMTTTHHQSEKTMTDPEKRLVVDQEEGEEGDKDAKEVASWLFPNSD
KNNNNQNNGLLFSDEYLNLVDYNSSMDYKFTGEYSQHQQNCSVPQTSYGGDRVVPLKL
EESRGHQCHNQQNFQFNIKYGSSGTHYNDNGSINHNAYISSMETGVVPESTACVTTAS
HPRTPKGTVEQQPDPASQMITVTQLSPMDREARVLRYREKRKTRKFEKTIRYASRKAY
AEIRPRVNGRFAKREIEAEEQGFNTMLMYNTGYGIVPSF

>COL6 tffamily=CONSTANS
MKSLASAVGGKTARACDSCVKRRARWYCAADDAFLCHACDGSVH
SANPLARRHERVRLKSASAGKYRHASPPHQATWHQGFTRKARTPRGGKKSHTMVFHDL
VPEMSTEDQAESYEVEEQLIFEVPVMNSMVEEQCFNQSLEKQNEFPMMPLSFKSSDEE
DDDNAESCLNGLFPTDMELAQFTADVETLLGGGDREFHSIEELGLGEMLKIEKEEVEE
EGVVTREVHDQDEGDETSPFEISFDYEYTHKTTFDEGEEDEKEDVMKNVMEMGVNEMS
GGIKEEKKEKALMLRLDYESVISTWGGQGIPWTARVPSEIDLDMVCFPTHTMGESGAE
AHHHNHFRGLGLHLGDAGDGGREARVSRYREKRRTRLFSKKIRYEVRKLNAEKRPRMK
GRFVKRSSIGVAH

>COL9 tffamily=CONSTANS
MGYMCDFCGEQRSMVYCRSDAACLCLSCDRSVHSANALSKRHSR
TLVCERCNAQPATVRCVEERVSLCQNCDWSGHNNSNNNNSSSSSTSPQQHKRQTISCY
SGCPSSSELASIWSFCLDLAGQSICEQELGMMNIDDDGPTDKKTCNEDKKDVLVGSSS
IPETSSVPQGKSSSAKDVGMCEDDFYGNLGMDEVDMALENYEELFGTAFNPSEELFGH
GGIDSLFHKHQTAPEGGNSVQPAGSNDSFMSSKTEPIICFASKPAHSNISFSGVTGES
SAGDFQECGASSSIQLSGEPPWYPPTLQDNNACSHSVTRNNAVMRYKEKKKARKFDKR
VRYASRKARADVRRRVKGRFVKAGEAYDYDPLTPTRSY

21) CPP

Searched with the Arabidopsis genes

>AT3G22780.1 tffamily=CPP
ESCKRCNCKKSKCLKLYCECFAAGVYCIEPCSCIDCFNKPIH

>AT4G14770.1 tffamily=CPP
SPKKKSYCECFAAGVYCIEPCSCIDCFNKPIH

>AT3G22760.1 tffamily=CPP
SSCKRCNCKKSKCLKLYCECFAAGFYCIEPCSCINCFNKPIH

>AT5G25790.1 tffamily=CPP
KQQKHCNCKNSKCLKLYCECFASGSYCNGCNCVNCHNKLEN

>AT3G16160.1 tffamily=CPP
RKHKGCRCKQSKCLKLYCDCFASGVVCTDCDCVDCHNNSEK

>AT2G20110.1 tffamily=CPP
RHNKGCHCKKSGCLKKYCECFQANILCSENCKCLDCKNFEGS

>AT3G16160.1 tffamily=CPP
LLSRGCKCKRTRCLKKYCECFQANLLCSDNCKCINCKNVSEA

22) Dof

Searched with thirteen different Dof domains.

>Dof search_1 tffamily=Dof
KPDKILPCPRCNSMDTKFCYYNNYNVNQPRHFCKNCQRYWTAGGTMRNVPVGAGRRKSKS

>Dof search_2 tffamily=Dof
AAAAPLPCPRCRSRDTKFCYFNNYNVNQPRHFCKACHRYWTAGGALRNVPVGAGRRKNRP

>Dof search_3 tffamily=Dof
AEQAPLRCPRCNSSNTKFCYYNNYNLTQPRHFCKTCRRYWTKGGALRNVPIGGGCRKPRP

>Dof search_4 tffamily=Dof
TEAEGLACPRCESTNTKFCYYNNYNLAQPRHFCKACRRYWTRGGALRNVPVGGGTRNKVA

>Dof search_5 tffamily=Dof
LPEPGLKCPRCDSTNTKFCYFNNYSLSQPRHFCRACRRYWTRGGALRNVPVGGGYRRHAK

>Dof search_6 tffamily=Dof
QPEPGLKCPRCESTNTKFCYFNNYSLSQPRHFCKTCRRYWTRGGALRNVPVGGGCRRNKR

>Dof search_7 tffamily=Dof
PQEQGLRCPRCDSPNTKFCYYNNYSLSQPRHFCKTCRRYWTKGGALRNVPVGGGCRKNKR

>Dof search_8 tffamily=Dof
QKEKALNCPRCNSTNTKFCYYNNYSLQQPRYFCKTCRRYWTEGGSLRNVPVGGGSRKNKR

>Dof search_9 tffamily=Dof
GAEAAPNCPRCDSPNTKFCYYNNYSLSQPRYFCKGCRRYWTKGGSLRNVPVGGGCRKNRR

>Dof search_10 tffamily=Dof
PRPPPRQCPRCGSANTKFCYYNNYSRTQPRYLCKACRRHWTEGGTLRDVPVGGGRKNSKR

>Dof search_11 tffamily=Dof
KKQQQLECPRCRSTNTKFCYYNNYSTSQPRHFCRACRRYWTHGGTLRDVPVGGASRRGGG

>Dof search_12 tffamily=Dof
GGGGREQCPRCASRDTKFCYYNNYNTAQPRHFCRACRRYWTLGGSLRNGRLGGGIGVDLL

>Dof search_13 tffamily=Dof
AAGVGDPCPRCESRDTKFCYYNNYNTSQPRHFCKSCRRYWTKGGSLRNVPVGGGSRKSST


23) E2F

Searched with the Arabidopsis genes

>AT3G01330.1 tffamily=E2F
KKEKSLWLLAQNFVKMFLCSDDDLITLDSAAKALLSDSPDSVHMRTKVRRLYDIANVFASMNLIEKTHIPVTRKPAYRWLG

>AT3G48160.1 tffamily=E2F
RREKSLGLLTQNFIKLFICSEAIRIISLDDAAKLLLGDAHNTSIMRTKVRRLYDIANVLSSMNLIEKTHTLDSRKPAFKWLG

>AT5G22220.1 tffamily=E2F
RYDSSLGLLTKKFINLIKQAEDGILDLNKAADTLEVQKRRIYDITNVLEGIGLIEKTLKNRIQWKG

>AT5G02470.1 tffamily=E2F
TSGGGLRQFSVMVCQKLEAKKITTYKEVADEIISDFATIKQNAEKPLNENEYNEKNIRRRVYDALNVFMALDIIARDKKEIRWKG

>AT5G03415.1 tffamily=E2F
KTGRGLRQFSMKVCEKVESKGRTTYNEVADELVAEFALPNNDGTSPDQQQYDEKNIRRRVYDALNVLMAMDIISKDKKEIQWRG

24) EIL

Searched with the Arabidopsis genes

>AT5G21120.1 tffamily=EIL
DSHTALCDDLSSDEEMEIEELEKKIWRDKQRLKRLKEMAKNGLGTRLLLKQQHDDFPEHSSKRTMYKAQDGILKYMSKTMERYKAQGFVYGIVLENGKTVAGSSDNLREWWKDKVRFDRNGPAAIIKHQRDINLSDGSDSGSEVGDSTAQKLLELQDTTLGALLSALFPHCNPPQRRFPLEKGVTPPWWPTGKEDWWDQLSLPVDFRGVPPPYKKPHDLKKLWKIGVLIGVIRHMASDISNIPNLVRRSRSLQEKMTSREGALWLAALYREKAIVDQIA

>AT1G73730.1 tffamily=EIL
LASDNVAEIDVSDEEIDADDLERRMWKDRVRLKRIKERQKAGSQGAQTKETPKKISDQAQRKKMSRAQDGILKYMLKLMEVCKVRGFVYGIIPEKGKPVSGSSDNIRAWWKEKVKFDKNGPAAIAKYEEECLAFGKSDGNRNSQFVLQDLQDATLGSLLSSLMQHCDPPQRKYPLEKGTPPPWWPTGNEEWWVKLGLPKSQSPPYRKPHDLKKMWKVGVLTAVINHMLPDIAKIKRHVRQSKCLQDKMTAKESAIWLAVLNQEESLIQQPS

25) ERF

Searched with a representative domain from each subfamily.

>ERF I search_1 tffamily=ERF
LYRGVRQRHWGKWVAEIRLPRNRTRLWLGTFDTAEEAALAYDKAAYKLRGDFARLNFP

>ERF II search_2 tffamily=ERF
RYKGIRMRKWGKWVAEIREPNKRSRIWLGSYKTAVAAARAYDTAVFYLRGPSARLNFP

>ERF III search_3 tffamily=ERF
IYRGVRQRNSGKWVSEVREPNKKTRIWLGTFQTAEMAARAHDVAALALRGRSACLNFA

>ERF IV search_4 tffamily=ERF
SFRGVRQRIWGKWVAEIREPNRGSRLWLGTFPTAQEAASAYDEAAKAMYGPLARLNFP

>ERF V search_5 tffamily=ERF
KFRGVRQRHWGSWVAEIRHPLLKRRIWLGTFETAEEAARAYDEAAVLMSGRNAKTNFP

>ERF VI search_6 tffamily=ERF
KFRGVRQRPWGKWAAEIRDPSRRVRVWLGTFDTAEEAAIVYDNAAIQLRGPNAELNFP

>ERF VII search_7 tffamily=ERF
VYRGIRKRPWGKWAAEIRDPRKGVRVWLGTFNTAEEAAMAYDVAAKQIRGDKAKLNFP

>ERF VIII search_8 tffamily=ERF
RFLGVRRRPWGRYAAEIRDPTTKERHWLGTFDTAEEAALAYDRAARSMRGTRARTNFV

>ERF IX search_9 tffamily=ERF
HYRGVRQRPWGKFAAEIRDPAKNGARVWLGTFETAEDAALAYDRAAFRMRGSRALLNFP

>ERF X search_10 tffamily=ERF
KYRGVRQRPWGKWAAEIRDPHKATRVWLGTFETAEAAARAYDAAALRFRGSKAKLNFP

>ERF VI-L search_11 tffamily=ERF
KPVGVRQRKWGKWAAEIRHPITKVRTWLGTYETLEQAADAYATKKLAFDALAAATSAA

>ERF XB-L search_12 tffamily=ERF
KHKGVRKKPSGKWAAEIWDPSLKVRRWLGTFPTAEMAAKAYNDAAAEFVGRRSARRGT

26) FHA

Searched with the Arabidopsis genes

>AT3G20550.1 tffamily=FHA
YLFGRERRIADIPTDHPSCSKQHAVIQYREKPDGKPYIMDLGSTNKTYINESPIEPQRYYELFEKDTIKFG

>AT5G47790.1 tffamily=FHA
HIFGRQHQTCDFVLDHQSVSRQHAAVVPHKNGSIFVIDLGSAHGTFVANERLTKDTPVELEVGQSLRFA

>AT5G19280.1 tffamily=FHA
VKLGRVSPSDLALKDSEVSGKHAQITWNSTKFKWELVDMGSLNGTLVNSHSISHWGNPVELASDDIITLG

>AT3G54350.1 tffamily=FHA
VLVGRSTEDLAVDIDLGREKRGSKISRRQAIIRLGDDGSFHIKNLGKYSISVNEKEVDPGQSLILKSDCLVEIR

>AT1G60700.1 tffamily=FHA
VIIGRSSGGLNVDIDLGKYNYGSKISRRQALVKLENYGSFSLKNLGKQHILVNGGKLDRGQIVTLTSCSSINIR

>AT5G07400.1 tffamily=FHA
YTIGRSSSDGFCDFVIDHSSISRKHCQILFDSQSHKLYIFDLGNLNGVYVNRVRVRKSKVQEVSIDDEVLFF

>AT5G67030.1 tffamily=FHA
CIVGSEPDQDFPGMRIVIPSSQVSKMHARVIYKDGAFFLMDLRSEHGTYVTDNEGRRPNFPARFRSSDIIEFG

>AT2G21530.1 tffamily=FHA
VTIGRLPEKADVVIPVATVSGVHATINTNEKNLLVTDMNSTNGTFIEDKRLIPGVAAPAFPGTRITFG

>AT2G45460.1 tffamily=FHA
HCLGRLPCHASYQVESNAISGNHCKVFRKPDGDDVTVFMVDTSTNGTFLNWERLTKNGPEVRVQHGDIISLA

27) GARP-ARR-B

Searched with the Arabidopsis genes

>AT1G49190.1 tffamily=GARP-ARR-B
TNVLVVDTNFTTLLNMKQIMKQYAYQVSIETDAEKALAFLTSCKHEINIVIWDFHMPGIDGLQALKSITSKLDLPVVIMSDDNQTESVMKATFYGACDYVVKPVKEEVMANI

>AT3G62670.1 tffamily=GARP-ARR-B
NRVLLVGADSNSSLKNLMTQYSYQVTKYESGEEAMAFLMKNKHEIDLVIWDFHMPDINGLDALNIIGKQMDLPVVIMSHEYKKETVMESIKYGACDFLVKPVSKEVIAVL

>AT3G16857.1 tffamily=GARP-ARR-B
LRVLVVDDDPTCLMILERMLRTCLYEVTKCNRAEMALSLLRKNKHGFDIVISDVHMPDMDGFKLLEHVGLEMDLPVIMMSADDSKSVVLKGVTHGAVDYLIKPVRMEALKNI

>AT2G01760.1 tffamily=GARP-ARR-B
LRILVVDDDTSCLFILEKMLLRLMYQVTICSQADVALTILRERKDSFDLVLSDVHMPGMNGYNLLQQVGLLEMDLPVIMMSVDGRTTTVMTGINHGACDYLIKPIRPEELKNI

>AT2G27070.1 tffamily=GARP-ARR-B
INVMVVDDNRVFLDIWSRMLEKSKYREITVIAVDYPKKALSTLKNQRDNIDLIITDYYMPGMNGLQLKKQITQEFGNLSVLVMSSDPNKEEESLSCGAMGFIPKPIAPTDLPKI

>AT5G07210.1 tffamily=GARP-ARR-B
INVMVVDDDHVFLDIMSRMLQHSKYRVIAVDDPKKALSTLKIQRDNIDLIITDYYMPGMNGLQLKKQITQEFGNLPVLVMSSDTNKEEESLSCGAMGFIPKPIHPTDLTKI


28) GARP-G2


Searched with 10 DNA-binding domain sequences (n.b. these are MYB-like), taken from the GARP-G2 genes listed below.

The consensus differs from that of R2R3MYB and MYB-like proteins.

PR/KL_WTP_LH_RFV_AV__LGG___(intron)ATPK_____M___GLT____SHLQ_YR___

n.b. SHLQ, not SHAQ (SHAQ-type domains are MYB-related).
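
The SHLQ/SHAQ distinction noted above can be checked mechanically. The following is a minimal sketch, not the procedure used to build this dataset. The function name `classify_myb_like` is hypothetical, and the two example sequences are copied from the AT2G02060.1 (GARP-G2) and AT1G01060.1 (MYB-related) entries in this file.

```python
def classify_myb_like(domain: str) -> str:
    """Label a MYB-like domain by the SHLQ / SHAQ motif it carries.

    Assumption: a plain substring test is enough for illustration;
    domains carrying neither motif are reported as 'unclassified'.
    """
    seq = domain.replace(" ", "").upper()
    if "SHLQ" in seq:
        return "GARP-G2"
    if "SHAQ" in seq:
        return "MYB-related"
    return "unclassified"


# Sequences copied from entries in this file.
garp_g2_example = (
    "NKFHGVRPYVRSPVPRLRWTPDLHRCFVHAVEILGGQHRATPKLVLKMMDVKGLTISHVKSHLQMYRGGS"
)
myb_related_example = "RERWTEDEHERFLEALRLYGRAWQRIEEHIGTKTAVQIRSHAQKFF"

print(classify_myb_like(garp_g2_example))
print(classify_myb_like(myb_related_example))
```

A substring test ignores the rest of the consensus, so it is only a quick screen and not a replacement for inspecting the alignment.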


>AT2G02060.1 tffamily=GARP-G2
NKFHGVRPYVRSPVPRLRWTPDLHRCFVHAVEILGGQHRATPKLVLKMMDVKGLTISHVKSHLQMYRGGS

>AT4G13640.1 tffamily=GARP-G2
DACLVLTTDPKPRLRWTSELHERFVDAVTQLGGPDKATPKTIMRTMGVKGLTLYHLKSHLQKFRLGR

>AT2G20400.1 tffamily=GARP-G2
TVSSNSNNNSNS(5)AAKGRMRWTPELHEVFVDAVNQLGGSNEATPKGVLKHMKVEGLTIFHVKSHLQKYRTAK

>AT2G01060.1 tffamily=GARP-G2
NGGPNSSHASKQRLRWTHELHERFVDAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAK

>AT4G37180. tffamily=GARP-G2
HHHHQFNKPSSQSHHIQKKEQRRRWSQELHRKFVDALHRLGGPQVATPKQIRDLMKVDGLTNDEVKSHLQKYRMHI

>AT3G46640.1 tffamily=GARP-G2
MAAEEGDSGTEDLSGKTLKRPRLVWTPQLHKRFVDVVAHLGIKNAVPKTIMQLMNVEGLTRENVASHLQKYRLYL

>AT2G20570.1 tffamily=GARP-G2
KNNRISNNEGKRKVKVDWTPELHRRFVEAVEQLGVDKAVPSRILELMGVHCLTRHNVASHLQKYRSHR

>AT4G18020.3 tffamily=GARP-G2
TKPINKSSGIKNVSGNKTSRKKVDWTPELHKKFVQAVEQLGVDQAIPSRILELMKVGTLTRHNVASHLQKFRQHR

>AT5G49240.1 tffamily=GARP-G2
DLAMIQVNNAEGDIFRFLSEIGSEMDLPIIIISEDDSVKSVKKWMINGAADYLIKPIRPEDLRIVFKH

>AT2G06020.1 tffamily=GARP-G2
ITPCIFYTSDEKARLRWSSDLHDCFVNAVEKLGGPNKATPKSVKEAMEVEGIALHHVKSHLQKFRLGK


29) GeBP

Searched with the Arabidopsis genes

>AT2G25650.1 tffamily=GeBP
PLIVRIWNEEDELSILKGLVDYRAKTGFNPKIDWDAFYSFLGSSIVAKFSKEQVLSKIRKLKRRFHVHWEKISEGNDPKFTRSSDSEAFGFSSMIWGQ

>AT2G36340.1 tffamily=GeBP
SASKMNWSKNDELVILGGIVDYENETKLSYRSDWDALYRYIKDCVEAKFSKIQLINKVKNMKRKFTYNQGRSNHGEQLSFTNTDDDEIFKSLIIWDK

>AT5G28040.1 tffamily=GeBP
RLFQRLWTDEDEIELLRGFLDYITNHRGNSSHPPDTAPFYEQIKSKLQLEFNKNQLVEKLRRLKKKYRNVMSKFSSGKEVFFKSPHDQATFDISRKIWNQ

>AT4G00130.1 tffamily=GeBP
MLFQRLFSEADEIALLQGLIDFTSTKGDPYEILMLFAFMLKKKFNNAVKNARKKGQTEDEVEYAKESEKKRFDLSIMIWGS

>AT4G01260.1 tffamily=GeBP
NLFVRLFTEEDEAILLQGFLDFATKKENPSDHIDDFYESIKNSISFDVTKPQLVTKIGNLKKKFNGRVSKGLKKGKNEEVMVFSKASDQNCFDLSRKIWGS

>AT1G11510.1 tffamily=GeBP
TYFQRLWTEDDEIVVLQGLIDDKKDTGVSNTNKVYELVKKSISFDVSKNQLMEKLRALKKKYENNLGKAKDGVEPTFVKPHDRKAFELSKLVWGG

>AT4G00250.1 tffamily=GeBP
ANPQRVWSEEDEISLLQAVIDFKAETGTSPWDHKNAFFDIAKKSISFDVSHVQFFDKIRRLKNKYFVNRKNKSGESNHDKKCLGLAVLIWGS

>AT4G00610.1 tffamily=GeBP
LLFQRLWTDEDEIVFLQGMIKFAKDTGKNVSEDMNGFFEKLKDSISFEVKTDQFVNKIRSMKRKYIENKKTTTEHDKKCYELAEIIWVS

>AT1G66420.1 tffamily=GeBP
PYFQRLWSEEDEIVMLQGIIKFEDVTGKSPFEDRHGFIEFVKNSISFEASVQQYIGKISQLKRKYTRKRKNGFSEGHEQKCFKLAMSIWGT

>AT5G41765.1 tffamily=GeBP
KSMMDFKALTRHNPSDDMTGAYNFLHEYISVDVYSYEFVEKMKSLKKKLIEKMGINAKDLSSSLLKLIWRY


30) GIF

Searched with all three Arabidopsis genes

>AT1G01160.1 tffamily=GIF
ANNITTEQIQKYLDENKKLIMAIMENQNLGKLAECAQYQALLQKNLMYLA

>AT4G00850.1 tffamily=GIF
TNNITTEQIQKYLDENKKLIMAILENQNLGKLAECAQYQALLQKNLMYLA

>AT5G28640.1 tffamily=GIF
PSNVTSDHIQQYLDENKSLILKIVESQNSGKLSECAENQARLQRNLMYLA
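
Each query record in this file is a `>` header line carrying a `tffamily=` tag, followed by one or more sequence lines. The sketch below shows one way to read that layout, using two of the GIF records above. The function names `parse_records` and `_finish` are hypothetical, and the parser assumes that section headings and prose lines have already been removed from the input.

```python
def _finish(header, chunks):
    """Turn one header line and its sequence lines into a record tuple."""
    fields = header.split()
    identifier = fields[0]
    # The family is whatever follows 'tffamily=' in the header, if present.
    family = next(
        (f.split("=", 1)[1] for f in fields if f.startswith("tffamily=")),
        None,
    )
    return identifier, family, "".join(chunks)


def parse_records(text):
    """Return a list of (identifier, family, sequence) tuples."""
    records = []
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if line.startswith(">"):
            if header is not None:
                records.append(_finish(header, chunks))
            header, chunks = line[1:], []
        elif line and header is not None:
            # Join wrapped sequence lines and drop stray internal spaces.
            chunks.append(line.replace(" ", ""))
    if header is not None:
        records.append(_finish(header, chunks))
    return records


example = """>AT1G01160.1 tffamily=GIF
ANNITTEQIQKYLDENKKLIMAIMENQNLGKLAECAQYQALLQKNLMYLA

>AT4G00850.1 tffamily=GIF
TNNITTEQIQKYLDENKKLIMAILENQNLGKLAECAQYQALLQKNLMYLA
"""

records = parse_records(example)
for identifier, family, sequence in records:
    print(identifier, family, len(sequence))
```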

31) GRAS

Searched with SCL3, SCL6, GAI and SCR. Genes were considered as GRAS factors based on the presence of the conserved SAW domain.

>SCL3 tffamily=GRAS
mvamfqedngtssvassplqvfstmslnrptllassspfhclkdlkpeerglylihlllt
canhvasgslqnanaaleqlshlaspdgdtmqriaayftealanrilkswpglykalnat
qtrtnnvseeihvrrlffemfpilkvsylltnraileamegekmvhvidldasepaqwla
llqafnsrpegpphlritgvhhqkevleqmahrlieeaekldipfqfnpvvsrldclnve
qlrvktgealavssvlqlhtflasdddlmrkncalrfqnnpsgvdlqrvlmmshgsaaea
rendmsnnngyspsgdsasslplpssgrtdsflnaiwglspkvmvvteqdsdhngstlme
rlleslytyaalfdcletkvprtsqdrikvekmlfgeeikniiscegferrerheklekw
sqridlagfgnvplsyyamlqarrllqgcgfdgyrikeesgcavicwqdrplysvsawrc
rk

>SCL6 tffamily=GRAS
aaifyghhhhtpppakrlnpgpvgiteqlvkaaeviesdtclaqgilarlnqqlsspvgk
pleraafyfkealnnllhnvsqtlnpyslifkiaayksfseispvlqfanftsnqalles
fhgfhrlhiidfdigyggqwaslmqelvlrdnaaplslkitvfaspanhdqlelgftqdn
lkhfaseinisldiqvlsldllgsiswpnssekeavavnisaasfshlplvlrfvkhlsp
tiivcsdrgcertdlpfsqqlahslhshtalfesldavnanldamqkierfliqpeiekl
vldrsrpierpmmtwqamflqmgfspvthsnftesqaeclvqrtpvrgfhvekkhnslll
cwqrtelvgvsawrcrss

>GAI tffamily=GRAS
mkrdhhhhhqdkktmmmneeddgngmdellavlgykvrssemadvaqkleqlevmmsnvq
eddlsqlatetvhynpaelytwldsmltdlnppssnaeydlkaipgdailnqfaidsass
snqggggdtyttnkrlkcsngvvetttataestrhvvlvdsqengvrlvhallacaeavq
kenltvaealvkqigflavsqigamrkvatyfaealarriyrlspsqspidhslsdtlqm
hfyetcpylkfahftanqaileafqgkkrvhvidfsmsqglqwpalmqalalrpggppvf
rltgigppapdnfdylhevgcklahlaeaihvefeyrgfvantladldasmlelrpseie
svavnsvfelhkllgrpgaidkvlgvvnqikpeiftvveqesnhnspifldrfteslhyy
stlfdslegvpsgqdkvmsevylgkqicnvvacdgpdrverhetlsqwrnrfgsagfaaa
higsnafkqasmllalfnggegyrveesdgclmlgwhtrpliatsawklstn

>SCR tffamily=GRAS
maesgdfnggqppphsplrttssgssssnnrgpppppppplvmvrkrlasemssnpdynn
ssrpprrvshlldsnyntvtpqqppsltaaatvssqpnpplsvcgfsglpvfpsdrggrn
vmmsvqpmdqdsssssasptvwvdaiirdlihsstsvsipqliqnvrdiifpcnpnlgal
leyrlrslmlldpssssdpspqtfeplyqisnnpsppqqqqqhqqqqqqhkpppppiqqq
erensstdappqpetvtatvpavqtntaealrerkeeikrqkqdeeglhlltlllqcaea
vsadnleeankllleisqlstpygtsaqrvaayfseamsarllnsclgiyaalpsrwmpq
thslkmvsafqvfngisplvkfshftanqaiqeafekedsvhiidldimqglqwpglfhi
lasrpggpphvrltglgtsmealqatgkrlsdfadklglpfefcplaekvgnldterlnv
rkreavavhwlqhslydvtgsdahtlwllqrlapkvvtvveqdlshagsflgrfveaihy
ysalfdslgasygeeseerhvveqqllskeirnvlavggpsrsgevkfeswrekmqqcgf
kgislagnaatqatlllgmfpsdgytlvddngtlklgwkdlslltasawtprs

32) GRF

Searched with the Arabidopsis genes

>AT2G22840.1 tffamily=GRF
TQWAELEQQALIYKYITANVPVPSSLLLSLKKSFFPYGSLPPNSFGWGSFHLGFSGGNMDPEPGRCRRTDGKKWRCSRDAVPDQKYCERHINRGRHRSRK

>AT2G36400.1 tffamily=GRF
AQWQELELQALIYRYMLAGAAVPQELLLPIKKSLLHLSPSYFLHHPLQHLPHYQPAWYLGRAAMDPEPGRCRRTDGKKWRCSRDVFAGHKYCERHMHRGRNRSRK

>AT2G45480.1 tffamily=GRF
AQLMEFRMQALVYRYIEAGLRVPHHLVVPIWNSLALSSSSNYNYHSSSLLSNKGVTHIDTLETEPTRCRRTDGKKWRCSNTVLLFEKYCERHMHRGRKRSRK


33) HMG

Searched with the Arabidopsis genes

>AT1G20693.1 tffamily=HMG
PKRPASAFFVFMEDFRETFKKENPKNKSVATVGKAAGDKWKSLSDSEKAPYVAKAEKRKVEYEKNIKAYN

>AT3G51880.1 tffamily=HMG
PKRAPSAFFVFLEDFRVTFKKENPNVKAVSAVGKAGGQKWKSMSQAEKAPYEEKAAKRKAEYEKQMDAYN

>AT2G34450.1 tffamily=HMG
PKKPATAFFFFLDDFRKQYQEENPDVKSMREIGKTCGEKWKTMTYEEKVKYYDIATEKREEFHRAMTEYT

>AT5G23405.1 tffamily=HMG
LTDFAVFMNHFRKSFRTDYNGALVKEGSKIGWEMWKSMTEDEKKDYLDKAADEEDEDEDTVEEQA

>AT5G23420.1 tffamily=HMG
PKRPLTAFFIFMSDFRKTFKSEHNGSLAKDAAKIGGEKWKSLTEEEKKVYLDKAAELKAEYNKSLESND

>AT4G11080.1 tffamily=HMG
PKQPISAYLIYANERRAALKGENKSVIEVAKMAGEEWKNLSEEKKAPYDQMAKKNKEIYLQEMEGYK

>AT3G28730.1 tffamily=HMG
PKRAMSGFMFFSQMERDNIKKEHPGIAFGEVGKVLGDKWRQMSADDKEPYEAKAQVDKQRYKDEISDYK

34) Homeodomain

Homeodomain genes were isolated from the tobacco GSSs by searches with the homeodomain from at least one member of each of the major subgroups of plant homeodomain proteins. This approach was crucial to success because, unlike the other transcription factor families, a search with a homeodomain from one subgroup often recovered only genes from that subgroup and not the complete homeodomain family.

>knotted1 tffamily=Homeodomain
KKRKKGKLPKDARTALLDWWNTHyrwPYPTEEEKNRLSEITGLDPKQINNWFINQRKRHWRPSEDM

>KNAT1 tffamily=Homeodomain
SKKKKKGKLPKEARQKLLTWWELHYKWPYPSESEKVALAESTGLDQKQINNWFINQRKRH

>HAT1 tffamily=Homeodomain
GGETCRKKLRLSKDQSAVLEDTFKEHNTLNPKQKLALAKKLGLTARQVEVWFQNRRARTK

>GLABRA2 tffamily=Homeodomain
KRKRKKYHRHTTDQIRHMEALFKETPHPDEKQRQQLSKQLGLAPRQVKFWFQNRRTQIK

>PRHA tffamily=Homeodomain
HIFCAECNSREAFPDNDIILCDGTCNRAFHQKCLDPPLETESIPPGDQGWFCKFCDCKI

>HAT3.1 tffamily=Homeodomain
EIEKSSSSACKQTDPKTQRLYISFQENQYPDKATKESLAKELQMTVKQVNNWFKHRRWSIN

>BELL1 tffamily=Homeodomain
PWRPQRGLPERAVTTLRAWLFEHFLHPYPSDVDKHILARQTGLSRSQVSNWFINARVRLW

>knotted3 tffamily=Homeodomain
KKKKKGKLPKDARQKLLSWWELHYKWPYPSESEKVALAETTGLDQKQINNWFINQRKRHW

>knotted2 tffamily=Homeodomain
KKKKKGKLPKDARQKLLSWWELHYKWPYPSESEKVALAETTGLDQKQINNWFINQRKRHW

>H1 tffamily=Homeodomain
KKRKKGKLPKEARQQLLDWWTRHYKWPYPSESQKLALAESTGLDQKQINNWFINQRKRHW

>Hfi22 tffamily=Homeodomain
KKRRLSVEQVKALEKNFEVENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQ

>homeobox22 tffamily=Homeodomain
KKKKKGKLPKEARQMLLAWWNDHYRWPYPTEADKNSLAESTGLDPKQINNWFINQRKRHW

>homeobox20 tffamily=Homeodomain
KKKKKGKLPKDARQKLLSWWELHYKWPYPSESEKVALAETTGLDQKQINNWFINQRKRHW

>homeobox9 tffamily=Homeodomain
KKNKKGKLPREARQILLNWWTTHYKWPYPTEGEKICLAESTGLDPKQINNWFINQRKRHW

>NTH23 tffamily=Homeodomain
RKRRAGKLPGDTTSVLKAWWQSHAKWPYPTEEDKAKLVQETGLQLKQINNWFINQRKRDW

>NTH15 tffamily=Homeodomain
KKRKKGKLPKEARQQLLDWWTRHYKWPYPSESQKLALAESTGLDQKQINNWFINQRKRHW

>ATHB-7 tffamily=Homeodomain
KKSNHNKNNQRRFSDEQIKSLEMMFESETRLEPRKKVQLARELGLQPRQVAIWFQNKRARWK

>KNAT4 tffamily=Homeodomain
LRKRRAGKLPGDTTSVLKSWWQSHSKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNW

>WUSCHEL tffamily=Homeodomain
QTSTRWTPTTEQIKILKELYYNNAIRSPTADQIQKITARLRQFGKIEGKNVFYWFQNHKARER

>REVOLUTA tffamily=Homeodomain
LDSSGKYVRYTAEQVEALERVYAECPKPSSLRRQQLIRECSILANIEPKQIKVWFQNRRCRDK

>BELL1-RELATED tffamily=Homeodomain
lskllsildevdrnykqyyhqmqivvssfdviagcgaakpytalalqtisrhfrclrdaisg


35) HRT

Searched with the conserved parts of the two Arabidopsis genes.

>AT4G26170.1 tffamily=HRT
MIRCRSKPVSR

>AT5G56780.1 tffamily=HRT
MEPCNKRPVPG

>AT4G26170.1 tffamily=HRT
RKRCEDHKGMRVNAFFFLLNPTERDKAVNEDKSKPETSTG-MNQEGSGLLCEATTKNGLP

>AT5G56780.1 tffamily=HRT
RKRCEDHKGMRINAFLFLLNQTDREKTVKDEKPDPESHTESIEEEALTRFCEATTKNGLP

>AT4G26170.1 tffamily=HRT
CTRSAPEGSKRCWQHKDKTLNHGSSENVQSATASQVICGFKLYNGSVCEKSPVKGRKRCE

>AT5G56780.1 tffamily=HRT
CTRSSPKGSKRCWQHKEKTSSDTSPVYFQPEAAKNVACGVKLGNGLICERSPVKGRKRCE

>AT4G26170.1 tffamily=HRT
EHKGMRITS

>AT5G56780.1 tffamily=HRT
EHKGMRIT

36) HSF

Searched with the Arabidopsis genes

>AT1G46264.1 tffamily=HSF
PAPFLTKTYQLVDDPATDHVVSWGDDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFFKRGEKHLLCEIHRRKTS

>AT5G62020.1 tffamily=HSF
TPFLTKTFNLVEDSSIDDVISWNEDGSSFIVWNPTDFAKDLLPKHFKHNNFSSFVRQLNTYGFKKVVPDRWEFSNDFFKRGEKRLLREIQRRKIT 

>AT5G16820.1 tffamily=HSF
PPFLSKTYDMVDDPLTNEVVSWSSGNNSFVVWSAPEFSKVLLPKYFKHNNFSSFVRQLNTYGFRKVDPDRWEFANEGFLRGRKQLLKSIVRRKPS 

>AT4G17750.1 tffamily=HSF
PPFLSKTYDMVEDPATDAIVSWSPTNNSFIVWDPPEFSRDLLPKYFKHNNFSSFVRQLNTYGFRKVDPDRWEFANEGFLRGQKHLLKKISRRKSV 

>AT3G63350.1 tffamily=HSF
SPFLTKTFEMVGDPNTNHIVSWNRGGISFVVWDPHSFSATILPLYFKHNNFSSFVRQLNTYGFRKIEAERWEFMNEGFLMGQRDLLKSIKRRTSS 

>AT1G67970.1 tffamily=HSF
APFLRKCYDMVDDSTTDSIISWSPSADNSFVILDTTVFSVQLLPKYFKHSNFSSFIRQLNIYGFRKVDADRWEFANDGFVRGQKDLLKNVIRRKN

>AT3G24520.1 tffamily=HSF
APFIVKTYQMVNDPSTDWLITWGPAHNSFIVVDPLDFSQRILPAYFKHNNFSSFVRQLNTYGFRKVDPDRWEFANEHFLRGQKHLLNNIARRKHA 

>AT1G77570.1 tffamily=HSF
FYMRVYEVVDDASTDAIISWSESNNSFIIWNVGEFYRRILPKYVDLGTNLSRFFSNLRSHGFKIVKGRTGVLEFGHEDFIRDKLELMKKMVSDKR 

>AT4G18870.1 tffamily=HSF
PFPTKIYEMVDDPSSDAIISWSQSGKSFIIWNPQEFCKDHLRRLFNTLHIHFFFYKLKIFGFKKINPKKWEFANDNFVRGQRHLVEIIISNDKK

37) JUMONJI

Searched with seven Arabidopsis sequences from the DATF multisequence alignment. This part of the proteins appears to represent the JmjC domain.

>AT2G34880.1 tffamily=JUMONJI
WLYVGMCFSTFCWHVEDNHLYSLNYHHFGEPKVWYGVPGSHATGLEKAMRKHLPDLFDEQPDLLHELVTQFSPTILKNEGVPVYRAVQNAGEYVLTFPRAYHSGFNCGFNCAEAVNV

>AT3G48430.1 tffamily=JUMONJI
MVYVAMMFSWFAWHVEDHDLHSLNYLHMGAGKTWYGVPKDAALAFEEVVRVHGYGEELNPLVTFSTLGEKTTVMSPEVFVKAGIPCCRLVQNPGEFVVTFPGAYHSGFSHGFNFGEASNI

>AT2G38950.1 tffamily=JUMONJI
RLSVGMCLSSQFWKSEKERLYSLCYLHVGAPRVWYSVAGCHRSKFKAAMKSFILEMSGEQPKKSHNPVMMMSPYQLSVEGIPVTRCVQHPGQYVIIFPGSYYSAFDCGFNCLEKANF

>AT1G09060.1 tffamily=JUMONJI
MESSCTSSCAGGAQWDVFRRQDVPKLSGYLQRTFQKPDNIQTDFVSRPLYEGLFLNEHHKRQLRDEFGVEPWTFEQHRGEAIFIPAGCPFQITNLQSNIQVALDF

>AT1G11950.1 tffamily=JUMONJI
VVYDETSGALWDIFKREDVPKLEEYLRKHCIEFRHTYCSRVTKVYHPIHDQSYFLTVEHKRKLKAEFGIEPWTFVQKLGEAVFIPAGCPHQVRNLKSCTKVAVDF

>AT3G07610.1 tffamily=JUMONJI
ENSRQQVQNVETDDGALWDIFRREDIPKLESYIEKHHKEFRHLYCCPVSQVVHPIHDQNFYLTRYHIMKLKEEYGIEPWTFNQKLGDAVLIPVGCPHQVRNLKSCNKVALDF

>AT5G63080.1 tffamily=JUMONJI
FVYMGGKGSWTPLHADVFRSYSWSANVCGKKRWLFLPPPQSHLVYDRYMVACPVSIIEWFMNFYDDTKDWEKKPIECICKAGEVMFVPNGWWHLVINLEESIAINHNW

38) LFY

Searched with the Arabidopsis LFY gene.

>AT5G61850.1 Arabidopsis LFY family transcription factor, protein sequence 420AA tffamily=LFY
MDPEGFTSGLFRWNPTRALVQAPPPVPPPLQQQPVTPQTAAFGMRLGGLEGLFGPYGIRFYTAAKIAELGFTASTLVGMK
DEELEEMMNSLSHIFRWELLVGERYGIKAAVRAERRRLQEEEEEESSRRRHLLLSAAGDSGTHHALDALSQEGLSEEPVQ
QQDQTDAAGNNGGGGSGYWDAGQGKMKKQQQQRRRKKPMLTSVETDEDVNEGEDDDGMDNGNGGSGLGTERQREHPFIVT
EPGEVARGKKNGLDYLFHLYEQCREFLLQVQTIAKDRGEKCPTKVTNQVFRYAKKSGASYINKPKMRHYVHCYALHCLDE
EASNALRRAFKERGENVGSWRQACYKPLVNIACRHGWDIDAVFNAHPRLSIWYVPTKLRQLCHLERNNAVAAAAALVGGI
SCTGSSTSGRGGCGGDDLRF

39) LIM

Searched with the Arabidopsis genes

>AT2G45800.1 tffamily=LIM
CATCKKTVYPLEKVTMEGESYHKTCFRCTHSGCPLTHSSYASLNGVLYCKVHFNQL

>AT2G45800.1 tffamily=LIM
CKACDKTVYVMDLLTLEGNTYHKSCFRCTHCKGTLVISNYSSMDGVLYCKPHFEQL

>AT1G10200.1 tffamily=LIM
CMACDKTVYLVDKLTADNRVYHKACFRCHHCKGTLKLSNYNSFEGVLYCRPHFDQN

>AT5G66620.1 tffamily=LIM
CGGCNFAVEHGGSVNILGVLWHPGCFCCRACHKPIAIHDIENHVSNSRGKFHKSCYERY

>AT5G17890.1 tffamily=LIM
CKDCKSAIEDGISINAYGSVWHPQCFCCLRCREPIAMNEISDLRGMYHKPCYKEL

>AT1G19270.1 tffamily=LIM
CAGCNMEIGHGRFLNCLNSLWHPECFRCYGCSQPISEYEFSTSGNYPFHKACYRER

>AT2G39830.1 tffamily=LIM
CGGCNSDIGSGNYLGCMGTFFHPECFRCHSCGYAITEHEIPTNDAGLIEYRCHPFWNQK

>AT4G36860.1 tffamily=LIM
CNACDKPIIDYEFSMSGNRPYHKLCYKEQHHPKCDVCHNFIPTNPAGLIEYRAHPFWMQK

40) LUG

Searched with the LUFS region of AT2G32700.6 and AT4G32551.1.

This is the N-terminal portion, extending to about amino acid 88. Outside this region, LUG genes have Q-rich regions and WD repeats that are not specific to LUG.
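
As a rough illustration of the boundary described above, the sketch below trims a protein to its first 88 residues. The name `lufs_region`, the cut-off constant, and the poly-Q tail appended in the example are assumptions made for illustration. "About amino acid 88" is approximate, so the cut-off may need adjusting for each gene.

```python
LUFS_END = 88  # approximate boundary taken from the text; adjust per gene


def lufs_region(protein: str, end: int = LUFS_END) -> str:
    """Return the N-terminal portion of a protein, residues 1..end inclusive."""
    return protein.replace(" ", "").upper()[:end]


# Gene_1 query as listed in this section (88 residues).
gene_1 = (
    "MSQTNWEADKMLDVYIHDYLVKRDLKATAQAFQAEGKVSSDPVAIDAPGG"
    "FLFEWWSVFWDIFIARTNEKHSEVAASYIETQMIKARE"
)

# Made-up tail standing in for the Q-rich region of a full-length protein.
toy_full_length = gene_1 + "QQQQQQQQ"

print(lufs_region(toy_full_length) == gene_1)
```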


>Gene_1 tffamily=LUG
MSQTNWEADKMLDVYIHDYLVKRDLKATAQAFQAEGKVSSDPVAIDAPGGFLFEWWSVFWDIFIARTNEKHSEVAASYIETQMIKARE

>Gene_2 tffamily=LUG
FLMAQSNWEADKMLDVYIYDYLVKKKLHNTAKSFMTEGKVSPDPVAIDAPGGFLFEWWSVFWDIFIARTNEKHSEAAAAYIEAQQGKAKE

41) MADS

Searched with a representative domain from each subfamily

>MIKC tffamily=MADS
LGRGKIEIKRIENTTNRQVTFCKRRNGLLKKAYELSVLCDAEVALVIFSTRG

>Malpha tffamily=MADS
GRRKVEIVKMTKESNLQVTFSKRKAGLFKKASEFCTLCDAKIAMIVFSPAG

>Mbeta tffamily=MADS
MGRKMVKMTRITNEKTRITTYKKRKACLYKKASEFSTLCGVDTCVIVYGPSRAG

>Mdelta tffamily=MADS
MGRVKLKIKKLENTNGRQSTFAKRKNGILKKANELSILCDIDIVLLMFSPTG

>Mgamma tffamily=MADS
TRKKVKLAYISNDSSRKATFKKRKKGLMKKVHELSTLCGITACAIIYSPYDTNPEVWPSNSG

42) MBF

Searched with the three full-length Arabidopsis genes

>AT2G42680.1 tffamily=MBF
MAGIGPITQDWEPVVIRKKPANAAAKRDEKTVNAARRSGADIETVRKFNAGTNKAASSGTSLNTKMLDDDTENLTHERVPTELKKAIMQARTDKKLTQSQLAQIINEKPQVIQEYESGKAIPNQQILSKLERALGAKLRGKK

>AT3G58680.1 tffamily=MBF
MAGIGPITQDWEPVVIRKRAPNAAAKRDEKTVNAARRSGADIETVRKFNAGSNKAASSGTSLNTKKLDDDTENLSHDRVPTELKKAIMQARGEKKLTQSQLAHLINEKPQVIQEYESGKAIPNQQILSKLERALGAKLRGKK

>AT3G24500.1 tffamily=MBF
MPSRYPGAVTQDWEPVVLHKSKQKSQDLRDPKAVNAALRNGVAVQTVKKFDAGSNKKGKSTAVPVINTKKLEEETEPAAMDRVKAEVRLMIQKARLEKKMSQADLAKQINERTQVVQEYENGKAVPNQAVLAKMEKVLGVKLRGKIGK

43) MYB-related

n.b. Genes that contain only a single canonical R3 domain have been grouped with the R2R3MYBs. The MYB-related category here represents only divergent single domains.

We consider R2 domains with the sequence GKSCRLRW and R3 domains with the sequence LPGRTDN not to be MYB-related.

These two are in the R2R3MYB category and are not among the MYB-related genes in DATF.
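
The motif rule stated above can be written as a simple test. This is a minimal sketch under the assumption that an exact substring match on GKSCRLRW or LPGRTDN is what is meant. The function name `has_canonical_myb_motif` is hypothetical. The example sequences are copied from the R2R3MYB search_1 and search_2 queries and from the AT1G01060.1 MYB-related query in this file.

```python
R2_MOTIF = "GKSCRLRW"  # R2 signature quoted in the text
R3_MOTIF = "LPGRTDN"   # R3 signature quoted in the text


def has_canonical_myb_motif(domain: str) -> bool:
    """True if the domain carries either signature, i.e. is not MYB-related
    under the rule described in the text."""
    seq = domain.replace(" ", "").upper()
    return R2_MOTIF in seq or R3_MOTIF in seq


# Sequences copied from entries in this file.
r2_query = "LKKGPWTPEEDEKLINYILKHGEGNWRSLPKKAGLKRCGKSCRLRWTNYLRPD"
r3_query = "IKRGNFTEEEEELIIELHALLGNRWSKIAKRLPGRTDNEIKNYWNTHLKKK"
divergent_query = "RERWTEDEHERFLEALRLYGRAWQRIEEHIGTKTAVQIRSHAQKFF"

for name, seq in [("R2", r2_query), ("R3", r3_query), ("divergent", divergent_query)]:
    print(name, has_canonical_myb_motif(seq))
```

An exact match is strict by design: a single substitution in either signature makes a domain count as divergent, which may or may not match how borderline cases were handled by hand.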


>AT4G01060.1 tffamily=MYB-related
VVNMSQEEEDLVSRMHKLVGDRWELIAGRIPGRTAGEIERFWVMKN

>AT5G59780.2 tffamily=MYB-related
RGKMTPQEERLVLELHAKWGNRWSKIARKLPGRTDNEIKNYWRTHM

>AT1G01060.1 tffamily=MYB-related
RERWTEDEHERFLEALRLYGRAWQRIEEHIGTKTAVQIRSHAQKFF

>AT5G02840.2 tffamily=MYB-related
RESWTEGEHDKFLEALQLFDRDWKKIEDFVGSKTVIQIRSHAQKYF

>AT1G19000.1 tffamily=MYB-related
GVPWTENEHKRFLIGLQKVGKGDWKGISRNFVKSRTPTQVASHAQKYF

>AT2G36960.1 tffamily=MYB-related
WAAWTHQEEESFFTALRQVGKNFEKITSRVQSKNKDQVRHYYYRLV

>AT3G05380.1 tffamily=MYB-related
GPQWTRLELERFYDAYRKHGQEWRRVAAAIRNSRSVDMVEALFNMNR

>AT5G58340.1 tffamily=MYB-related
KKFWKPEEVEALREGVKEYGKSWKDIKNGNPTVFAERTEVDLKDKWRNLV

>AT1G09710.1 tffamily=MYB-related
RKRWSAEEDEELFAAVKRCGEGNWAHIVKGDFRGERTASQLSQRWALIR

>AT2G44430.1 tffamily=MYB-related
TQAWGTWEELLLACAVKRHGFGDWDSVATEVRSRSSLSHLLASANDCRHKYRDLK

>AT2G47210.1 tffamily=MYB-related
DSVWTKEETDQLFEFCQNFDLRFVVIADRFPVSRTVEELKDRYYSVN

44) NAC

Searched with the Arabidopsis NAC2 gene AAF09254

>NAC tffamily=NAC
mgrgsvaslapgfrfhptdeelvryylkrkvcnkpfkfdaisvtdiyksepwdlpdkskl
ksrdlewyffsmldkkysngsktnratekgywkttgkdreirngsrilgmkktlvyhkgr
aprgektnwvmqeyrlsdedlkkagvpqeayvlcrifqksgtvpkngeqygapyleeewe
edgmtyvpaqdafseglalnddvyvdiddidekpenlvvydavpilpnychgessnnves
gnysdsgnyiqpgnnvvdsggyfeqpietfeedrkpiiregsiqpcslfpeeqigcgvqd
envvnlessnnnvfvadtcysdipidhnylpdepfmdpnnnlplndglyletndlscaqq
ddfnfedylsffddegltfdesllmgpedflpnpetleqkpapkemekergrrrqrssgg
kgkwrkiffqnkytdfkdfdsapkypflkktshmlgaiptpssfasqfqtkdamrlhaaq
ssgsvhvtagmmrisnmtlaadsgmgwsydkngnlnvvlsfgvvqqddamtasgsktgit
atramlvfmclwvlllsvsfkivtmvsar


45) Nin

Searched with the Arabidopsis genes

>AT1G20640.1 tffamily=Nin
TKADKTITLDVLRQYFAGSLKDAAKNIGVCPTTLKRICRQHGIQRWPSRKIKKV

>AT2G17150.1 tffamily=Nin
TEKTIGLEVLRQYFAGSLKDAAKSIGVCPTTLKRICRQHGIMRWPSRKIKKV

>AT3G59580.1 tffamily=Nin
TEKNVSLNVLQQYFSGSLKDAAKSLGVCPTTLKRICRQHGIMRWPSRKINKV

>AT1G18790.1 tffamily=Nin
VSKTLSKETISLYFYMPITQAARELNIGLTLLKKRCRELGIKRWPHRKLMSL

>AT5G53040.1 tffamily=Nin
RQDKLEMSEIKQFFDRPIMKAAKELNVGLTVLKKRCRELGIYRWPHRKLKSL

>AT4G35590.1 tffamily=Nin
HVAELSLEELSKYFDLTIVEASRNLKVGLTVLKKKCREFGIPRWPHRKIKSL

46) NZZ

Searched with the single NZZ gene from Arabidopsis.

>AT4G27330.1 Arabidopsis NZZ family transcription factor, protein sequence 314AA tffamily=NZZ
MATSLFFMSTDQNSVGNPNDLLRNTRLVVNSSGEIRTETLKSRGRKPGSKTGQQKQKKPTLRGMGVAKLERQRIEEEKKQ
LAAATVGDTSSVASISNNATRLPVPVDPGVVLQGFPSSLGSNRIYCGGVGSGQVMIDPVISPWGFVETSSTTHELSSISN
PQMFNASSNNRCDTCFKKKRLDGDQNNVVRSNGGGFSKYTMIPPPMNGYDQYLLQSDHHQRSQGFLYDHRIARAASVSAS
STTINPYFNEATNHTGPMEEFGSYMEGNPRNGSGGVKEYEFFPGKYGERVSVVAKTSSLVGDCSPNTIDLSLKL

47) PcG

Searched with 16 Arabidopsis genes, full length except for the N-terminal extensions found in some proteins, which were omitted.

>AT3G03750.1  tffamily=PcG
TQKGVSVSLKIVRDEKKGWCLYADQLIKQARRRQNIYDKLRSTQSFASALLVVREHLPSGQACLRINIDATRIGNVARFINHSCDGGNLSTVLLRSSGALLPRLCFFAAKDIIAEEELSFSYGDVSVAGE

>AT5G43990.1 tffamily=PcG
VQQGIHNKLQVFFTPNGRGWGLRTLEKLPKGAFVCELAGEILTIPELFQRISDRPTSPVILDAYWGSEDISGDDKALSLEGTHYGNISRFINHRCLDANLIEIPVHAETTDSHYYHLAFFTTREIDAMEELTWDYGVPFNQD

>AT3G04380.1 tffamily=PcG
VQRGIRCQLQVYFTQEGKGWGLRTLQDLPKGTFICEYIGEILTNTELYDRNVRSSSERHTYPVTLDADWGSEKDLKDEEALCLDATICGNVARFINHRCEDANMIDIPIEIETPDRHYYHIAFFTLRDVKAMDELTWDYMIDFNDK

>AT2G22740.1 tffamily=PcG
TQHGIKLPLEIFKTKSRGWGVRCLKSIPIGSFICEYVGELLEDSEAERRIGNDEYLFDIGNRYDNSLAQGMSELMLGTQAGRSMAEGDESSGFTIDAASKGNVGRFINHSCSPNLYAQNVLYDHEDSRIPHVMFFAQDNIPPLQELCYDYNYALDQVR

>AT4G13460.1 tffamily=PcG
TQKGLRNRLEVFRSLETGWGVRSLDVLHAGAFICEYAGVALTREQANILTMNGDTLVYPARFSSARWEDWGDLSQVLADFERPSYPDIPPVDFAMDVSKMRNVACYISHSTDPNVIVQFVLHDHNSLMFPRVMLFAAENIPPMTELSLDYGVVDDWN

>AT5G13960.1 tffamily=PcG
SQKRLRFNLEVFRSAKKGWAVRSWEYIPAGSPVCEYIGVVRRTADVDTISDNEYIFEIDCQQTMGRQRRLRDVAVPMNNGVSQSSEDENAPEFCIDAGSTGNFARFINHSCEPNLFVQCVLSSHQDIRLARVVLFAADNISPMQELTYDYGYALDSVH

>AT1G17770.1  tffamily=PcG
VQTGLKLHLEVFKTRNCGWGLRSWDPIRAGTFICEFAGLRKTKEEVEEDDDYLFDTSKIYQRFRWNYEPELLLEDSWEQVSEFINLPTQVLISAKEKGNVGRFMNHSCSPNVFWQPIEYENRGDVYLLIGLFAMKHIPPMTELTYDYGVSCVER

>AT4G02020.1  tffamily=PcG
LLLRQQQRILLGKSDVAGWGAFLKNSVSKNEYLGEYTGELISHHEADKRGKIYDRANSSFLFDLNDQYVLDAQRKGDKLKFANHSAKPNCYAKVMFVAGDHRVGIFANERIEASEELFYDYRYGPDQA

>AT1G77300.1  tffamily=PcG
FQKRKYVKFERFQSGKKGYGLRLLEDVREGQFLIEYVGEVLDMQSYETRQKEYAFKGQKHFYFMTLNGNEVIDAGAKGNLGRFINHSCEPNCRTEKWMVNGEICVGIFSMQDLKKGQELTFDYNYVRVFGA

>AT1G01920.1  tffamily=PcG
ERFLDWLQVNGGELRGCNIKYSDSLKGFGIFASTSTQASDEVLLVVPLDLAITPMRVLQDPECQKMFEQGQVDDRFLMILFLTLERLRINSSWKPYLDMLPTRFGLYHATELQKKKLLSLYHDKVEVLVTKLLILDGDSESKVSFEHFLWFWSRALNIPLPHSFVFPQSQDDTGECTSTSAQPAPSVGSGDTIWVEGLVPGIDFCNHDLKP

>AT5G14260.1  tffamily=PcG
LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSVWYPYIRELDRQRGLWSEAELDYLTGSPTKAEVLERAEGIKREYNELDTVWFMFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVHLQNVGLARRFALVPLGPPLLAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVVWCGPQPNAK

>AT2G05900.  tffamily=PcG
PAFGIWKSIQNWRNGLSIRPGLILEDLSNGAENLKVCLVNEVDKENGPALFRYVTSLIHEVINNIPSMVDRCACGRRSCGSKHVFREKLSVSSSLVISAKKSGNVARFMNHSCSPNVFWQSIAREQNGLWCLYIGFFAMKHIPPLTELRYDYGKSRGGGK

>AT5G17240.1  tffamily=PcG
ETFLRWAAEIGISDSIDSSRFRDSCLGHSLSVSDFPDARELKKGELVLKVPRKALMTTESIIAKSDAVNLHNSLSSTQILSVCLLYEMSKEKKSFWYPYLFHIPRDYDLLATFGNFEKQALQVEDAVWATEKATAKCQSEWKEAGSLMKELELKSFQAWLWASATISSRTLHVPWDSAGCLCPVGDLFNYDAPGDYSNTPQGPESANNVEEAGLVVETGFEEDVNAYCLYARRNYQLGEQVLLCYGTYTNLE

>AT2G18850.1  tffamily=PcG
QDNGVKTKLQIAQIDGYGRGAIASEDLKFGDVALEIPVSSIISEESDMYPILETFDGITSETMLLLWTMREKHNLDSKFKPYFDSLQENFCTGLSFGVDAIMELDGTLLLDEIMQAKELLRERYDELIPLLSNHREVFPPELYTWEHYLWACELYYSNSMQIKFPDGKLKTCLIPVAGFLNHSIYPHIVKYGKVDIETSSLKFPVSRPCNKGEQCFLSYGNYSSSH

>AT1G24610.1  tffamily=PcG
PDLIRWIKREGGFVHHAVKLSQETQFGIGLISTEQISPGTDLISLPPHVPLRFESDDSSSLLSALARRVPEELWAMKLGLRLLQERANADSFWWPYISNLPETYTVPIFFPGEDIKNLQYAPLLHQVNKRCRFLLEFEQEIRRTLEDVKASDHPFSGQDVNASALGWTMSAVSTRAFRLHGNKKLQGGSSDDVPMMLPLIDMCNHSFKPNGADSNTLVKVVAETEVKENDPLLLNYGCLSNDF


48) PHD

Searched with PHD proteins from Arabidopsis

>AT1G32800.1 tffamily=PHD
DCVCGVNDDDGTEMVKCDDCGVWVHTRCSRFVEGQELFTCHKCKSK

>AT1G33420.1 tffamily=PHD
DCKCGTKDDDGERMLACDGCGVWHHTRCIGINNADALPSKFLCFRCIEL

>AT5G60410.1 tffamily=PHD
RCVCGNSLETDSMIQCEDPRCHVWQHVGCVILPDKPMDGNPPLPESFYCEICRLT 

>AT3G24010.1 tffamily=PHD
YCICNQVSFGEMVACDNNACKIEWFHFGCVGLKEQPKGKWYCPECATV 

>AT3G08020.1 tffamily=PHD
YCPVCLKVYRDSESTPMVCCDICQRWVHCHCDGISDDKYMQFQVDGKLQYKCATCRGE 

>AT5G36670.1 tffamily=PHD
TCGICGDGGDLICCDGCPSTFHQSCLDIKKFPSGAWYCYNCSCK 

>AT1G77250.1 tffamily=PHD
LCRTCGTKVDSGGKYITCDHPFCPHKYYHIRCLTSRQIKLHGVRWYCSSCLCR 

>AT5G63900.1 tffamily=PHD
VCCVCHWGGDLLLCDGCPSAFHHACLGLSSLPEEDLWFCPCCCCD 

>AT3G51120.1 tffamily=PHD
VCFICFDGGDLVLCDRRNCPKAYHPACIKRDEAFFRTTAKWNCGWHICG 

>AT5G35210.1 tffamily=PHD
ECRICGMDGTLLCCDGCPLAYHSRCIGVVKMYIPDGPWFCPECTIN 

>AT3G01460.1 tffamily=PHD
VCKVCGVDKDDDSVLLCDTCDAEYHTYCLNPPLIRIPDGNWYCPSCVIA 

>AT3G52100.1 tffamily=PHD
SCRICEGCGTLGDPKKFMFCKRCDDAYHCDCQHPRHKNVSSGPYLCPKHTKC 

>AT5G09790.1 tffamily=PHD
TCEKCGSGEGDDELLLCDKCDRGFHMKCLRPIVVRVPIGTWLCVDCSDQ 

>AT3G05670.1 tffamily=PHD
ICTECHQGDDDGLMLLCDLCDSSAHTYCVGLGREVPEGNWYCEGCRPV 

>AT1G43770.1 tffamily=PHD
VCQTCGDIGFEEALVFCDSCMFESIHRYCLGITPIPFTEYITWICEDCDNS 

>AT2G25170.1 tffamily=PHD
ACQACGESTNLVSCNTCTYAFHAKCLVPPLKDASVENWRCPECVSP 

>AT1G05830.1 tffamily=PHD
KCNVCHMDEEYENNLFLQCDKCRMMVHTRCYGQLEPHNGILWLCNLCRPV 

>AT1G77800.1 tffamily=PHD
FCCTGHHQLIVCTSCKATVHKKCYGLLEDSGKPWLCSWCELE 

>AT5G53430.1 tffamily=PHD
RCAVCRWVEDWDYNKIIICNRCQIAVHQECYGTRNVRDFTSWVCKACETP

>AT1G77800.1 tffamily=PHD
TCDICRRSETIWNLIVVCSSCKVAVHIDCYKCAKESTGPWYCELCAES

>AT3G20280.1 tffamily=PHD
ACQICEVTINEMDTLLICDACEKAYHLKCLQGNNMKGVPKSEWHCSRCVQA

>AT2G37520.1 tffamily=PHD
GCVFCRSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENGFCDLKEIPQEKWFCCSNC

>AT3G08020.1 tffamily=PHD
MCRMCFLGEGEGSDRARRMLSCKDCGKKYHKNCLKSWAQHRDLFHWSSWSCPSCRVC

>AT3G01460.1 tffamily=PHD
SCGACGRPESIELVVVCDACERGFHMSCVNDGVEAAPSADWMCSDCRTG 

>AT4G39100.1 tffamily=PHD
FCKCEMPYNPDDLMVQCEECSEWFHPSCIGTTEEAKKPDNFYCEECSPQ 

>AT4G23860.1 tffamily=PHD
YCTCDRPYPDPNVEEQVEMIQCCLCEDWFHEEHLGLTPSDDEESEPIYEDFICQNCSPA 

>AT5G35210.1 tffamily=PHD
VCGICLLPYNPGLTYIHCTKCEKWFHTEAVKLKDSQIPEVVGFKCCKCRRI 

>AT5G63900.1 tffamily=PHD
ICGSMESPANSKLMACEQCQRRFHLTCLKEDSCIVSSRGWFCSSQCNR 

>AT4G10940.1 tffamily=PHD
LCKIRNTFSYIEGDSNLDTSIACDSCDMWYHAICVGFDVENASEDTWVCPSKDTL

49) PLATZ

Searched with the Arabidopsis genes

>AT1G21000.1 tffamily=PLATZ
YTSPPWLMPMLRGSYFVPCSIHVDSNKNECNLFCLDCAGNAFCSYCLVKHKDHRVVQIRRSSYHNVVRVNEIQKFIDIACVQTYIINSAKIVFLNERPQPRIGKGVTNTCEICCRSLLDSFRFCSLGC

>AT1G43000.1 tffamily=PLATZ
VMTPPWLTPMLRADYFVTCSIHSQSSKSECNLFCLDCSGNAFCSSCLAHHRTHRVIQIRRSSYHNVVRVSEIQKHIDISCIQTYVINSAKIFFLNARPQCRTGKSLNKTCQICSRNLLDSFLFCSLAC

>AT2G27930.1 tffamily=PLATZ
MEEPKWLEGLLRTNFFSICPRHRETPRNECNMFCLSCQNAAFCFYCRSSFHIDHPVLQIRRSSYHDVVRVSEIENALDIRGVQTYVINSARVLFLNERPQPKNSSHEPFLIPFASVPWVARFVFLHLFS

>AT1G32700.2 tffamily=PLATZ
MYCLDCTNGPLCSLCLSFHKDHHAIQIRRSSYHDVIRVSEIQKFLDITGVQTYVINSAKVVFLNERPQPRPGKGVINTCEVCYRSLVDSFRFCSLGC

>AT2G12646.1 tffamily=PLATZ
IQKPAWLDALYAEKFFVGCPYHETAKKNERNVCCLDCCTSLCPHCVPSHRFHRLLQVRRYVYHDVVRLEDLQKLIDCSNVQAYTINSAKVVFIKKRPQNRQFKGAGNYCTSCDRSLQEPYIHCSLGC

>AT1G31040.1 tffamily=PLATZ
MQVDYLSYQGDDLSSILYRIDESDFTFEGLRMDGHDQLGEISTMEDGEDILVISDESEQGNNSHKKEKKKSKKKKPESNYLPGMVLSSLGN

50) R2R3MYB

Searched with the R2R3 first-position and second-position consensus sequences (Stracke et al., 2001)

>R2R3MYB search_1  (R2) tffamily=R2R3MYB
LKKGPWTPEEDEKLINYILKHGEGNWRSLPKKAGLKRCGKSCRLRWTNYLRPD

>R2R3MYB search_2  (R3) tffamily=R2R3MYB
IKRGNFTEEEEELIIELHALLGNRWSKIAKRLPGRTDNEIKNYWNTHLKKK

>R2R3MYB search_3  (R2) tffamily=R2R3MYB
VRRGAWSAEEDQILVSFVQLYGHRCWNAVARLSGLGRSGKSCRLRWINQLKPN

>R2R3MYB search_4  (R3) tffamily=R2R3MYB
LRKGAISPQEDQTLLRAHSKWGNKWAAIARHLPGRTDNDVKNHWRSRIRRR

51) S1Fa

Searched with all three Arabidopsis genes

>AT2G37120.1 Arabidopsis S1Fa-like family transcription factor, protein sequence 76AA tffamily=S1Fa
MSSDGSAGKAVVEAKGLNPGLIVLLVIGGLLVTFLIANYVMYMYAQKNLPPRKKKPLSKKKLKREKLKQGVPVPGE

>AT3G09735.1 Arabidopsis S1Fa-like family transcription factor, protein sequence 73AA tffamily=S1Fa
MAAEFDGKIESKGLNPGLIVLLVIGGLLLTFLVGNFILYTYAQKNLPPRKKKPVSKKKMKKEKMKQGVQVPGE

>AT3G53370.1 Arabidopsis S1Fa-like family transcription factor, protein sequence 76AA tffamily=S1Fa
MDGEDFAGKAAAEAKGLNPGLIVLLVVGGPLLVFLIANYVLYVYAQKNLPPRKKKPVSKKKLKREKLKQGVPVPGE

52) SAP

Searched with the single Arabidopsis gene.

>SAP tffamily=SAP
MSTSSSSSDNGAGGSGGVFEAPSPSRPRRGANDVWPEPFLESLAVQVAVNASTSAGLLAAAPALANVFRVCTTWHAVSRSDHLWQLLSRQVWARTHLMHDTWRDEFIYRHRTARNFRTRTHTYFTLQFDPSDVDEPDSLSCRCLTLSDLYLAAGFADGTVRLFLLNNRLHVRTLRPPLRDRFGRFSRAVSGIVISDSRLTFATMDGDIHVAEIDGVGHTRTAYAGDIVNDGALVDFTGCGRWWVGLFAGVPGRAFHIWDCNSEETTFVGGTLTDPEAVMGWHTLTELTTSLGRLRISGNETAVACTRWRIMVIDLRNQGVIIGEDEEQRRGLIVTGFDANDEAYVRLDSRGNASVRRVNTQQTVCEFRVSGAAQRRVMGCVNRLHALMCAGGIMRVWEVERGEYLYSIRERVGEVDAIVADDRHVAVASASSTAQSIIHLWDFGAL

53) SBP

Searched with the Arabidopsis genes

>AT1G53160.1 tffamily=SBP
LCQVDRCTADMKEAKLYHRRHKVCEVHAKASSVFLSGLNQRFCQQCSRFHDLQEFDEAKRSCRRRLAGHNERRRKSSGE

>AT1G02065.2 tffamily=SBP
RCQAEGCNADLSHAKHYHRRHKVCEFHSKASTVVAAGLSQRFCQQCSRFVPPKVATFDLF

>AT2G42200.1 tffamily=SBP
RCQVEGCGMDLTNAKGYYSRHRVCGVHSKTPKVTVAGIEQRFCQQCSRFHQLPEFDLEKRSCRRRLAGHNERRRKPQPA

>AT1G27370.1 tffamily=SBP
RCQIDGCELDLSSSKDYHRKHRVCETHSKCPKVVVSGLERRFCQQCSRFHAVSEFDEKKRSCRKRLSHHNARRRKPQGV

>AT1G20980.1 tffamily=SBP
MCQVDNCTEDLSHAKDYHRRHKVCEVHSKATKALVGKQMQRFCQQCSRFHLLSEFDEGKRSCRRRLAGHNRRRRKTTQP

>AT5G18830.1 tffamily=SBP
RCQVPDCEADISELKGYHKRHRVCLRCATASFVVLDGENKRYCQQCGKFHLLPDFDEGKRSCRRKLERHNNRRKRKPVD


54) SRS

Searched with the Arabidopsis genes

>AT1G19790.1 tffamily=SRS
TVTRQGNMNCQDCGNQAKKDCPHMRCRTCCKSRGFDCQTHVKSTWVSAAKRRERQAQLAV

>AT5G66350.1 tffamily=SRS
SGSGSGGPSCQDCGNQSKKDCSHMRCRTCCKSRGLDCPTHVKSTWVPAAKRRERQQQLST

>AT3G54430.1 tffamily=SRS
DNNTVGEKVCRDCGNRAKKECLFERCRTCCKSRGYNCVTHVKSTWIPSSATR

>AT2G21400.1 tffamily=SRS
MMMIMGRKCEDCGNQAKKDCVYMRCRTCCKSKAFHCQTHIKSTWVPAYRRSHHKHQSQP

55) TAZ

Searched with the Arabidopsis genes

>AT5G67480.2 tffamily=TAZ
NDQRIYSQLYEAMEALVHICRDGCKTIGPHDKDFKPNHATCNYEACKGLESLIRHFAGCKLRVPGGCVHCKRMWQLLELHSRVCAGSDQCRVPLC

>AT5G63160.1 tffamily=TAZ
NLDNKSTCQAKPGPCSAFSTCYGLQLLIRHFAVCKKRVDGKGCVRCKRMIQLLRLHSSICDQSESCRVPLC

>AT3G12980.1 tffamily=TAZ
NTQTNQIQNAQLREVLLHVMTCCTAQCQYPRCRVIKGLIRHGLVCKTRGCIACKKMWSLFRLHSRNCRDPQCKVPKC

>AT3G12980.1 tffamily=TAZ
DCGLSYKNQRRWLLFLLHVRKCNAAEDNCESKYCFTAKTLLKHINCCKAPACAYQYCHQTRQLIHHYKHCGDEACPVCVF


56) TCP

Searched with TCP proteins from Arabidopsis

>AT3G27010.1 tffamily=TCP
NQLGPKRSSNKDRHTKVEGRGRRIRMPALCAARIFQLTRELGHKSDGETIQWLLQQAEPSIIAATGSGTIPASALA 

>AT5G41030.1 tffamily=TCP
YEKEKKKPNKDRHLKVEGRGRRVRLPPLCAARIYQLTKELGHKSDGETLEWLLQHAEPSILSATVNGIKPTESVV

>AT1G35560.1 tffamily=TCP
AKTPAKRPSKDRHIKVDGRGRRIRMPAICAARVFQLTRELQHKSDGETIEWLLQQAEPAIIAATGTGTIPANIST

>AT1G69690.1 tffamily=TCP
KKPPPKRTSTKDRHTKVEGRGRRIRMPAMCAARVFQLTRELGHKSDGETIEWLLQQAEPAVIAATGTGTIPANFTS 

>AT5G51910.1 tffamily=TCP
TKPAPKRPTSKDRHTKVEGRGRRIRMPAGCAARVFQLTRELGHKSDGETIRWLLERAEPAIIEATGTGTVPAIAVS

>AT1G30210.1 tffamily=TCP
IIRVSRASGGKDRHSKVLTSKGLRDRRIRLSVATAIQFYDLQDRLGFDQPSKAVEWLINAASDSITDLPLLNTNFDHLD

>AT3G02150.1 tffamily=TCP
IVRVSRAFGGKDRHSKVCTLRGLRDRRVRLSVPTAIQLYDLQERLGVDQPSKAVDWLLDAAKEEIDELPPLPISPENFSI

>AT5G60970.1 tffamily=TCP
IVRVSRTFGGKDRHSKVCTVRGLRDRRIRLSVPTAIQLYDLQDRLGLSQPSKVIDWLLEAAKDDVDKLPPLQFPHGFNQ

>AT3G45150.1 tffamily=TCP
NNSQKARRTPKDRHLKIGGRDRRIRIPPSVAPQLFRLTKELGFKTDGETVSWLLQNAEPAIFAATGHGVTTTSNED

>AT1G68800.1 tffamily=TCP
EVQWRRTVKKRDRHSKICTAQGPRDRRMRLSLQIARKFFDLQDMLGFDKASKTIEWLFSKSKTSIKQLKERVAASEGGGK

57) Trihelix

Searched with the Arabidopsis genes

>AT1G76890.1 tffamily=Trihelix
WPKTEVEALIRIRKNLEANYQENGTKGPLWEEISAGMRRLGYNRSAKRCKEKWENINKYFKKVK

>AT5G28300.1 tffamily=Trihelix
WPKDEVLALINIRRSISNMNDDDHKDENSLSTSSKAVPLWERISKKMLEIGYKRSAKRCKEKWENINKYFRKTK

>AT5G47660.1 tffamily=Trihelix
WPQEEVQALISSRSDVEEKTGINKGAIWDEISARMKERGYERSAKKCKEKWENMNKYYRRVT

>AT1G76880.1 tffamily=Trihelix
WPRQETLALLKIRSDMGIAFRDASVKGPLWEEVSRKMAEHGYIRNAKKCKEKFENVYKYHKRTK

>AT3G10000.1 tffamily=Trihelix
WPRQETLMLLEVRSRLDHKFKEANQKGPLWDEVSRIMSEEHGYTRSGKKCREKFENLYKYYKKTK

>AT5G28300.1 tffamily=Trihelix
WCSDEVLALLRFRSTVENWFPEFTWEHTSRKLAEVGFKRSPQECKEKFEEEERRYFNSN

>AT5G03680.1 tffamily=Trihelix
WGEQEILKLMEIRTSMDSTFQEILGGCSDEFLWEEIAAKLIQLGFDQRSALLCKEKWEWISNGMRKEK

>AT3G25990.1 tffamily=Trihelix
WAQDETRTLISLRREMDNLFNTSKSNKHLWEQISKKMREKGFDRSPSMCTDKWRNILKEFKKAK

>AT1G76870.1 tffamily=Trihelix
WMDKMVKLMITALSYIGEDSGSDKKFAVLQKKGKWRSVSKVMDERGYHVSPQQCEDKFNDLNKRYKKLN

>AT3G10040.1 tffamily=Trihelix
WTDTMVRLLIMAVFYIGDEAGLNDPVDAMLQKKGKWKSVSRAMVEKGFSVSPQQCEDKFNDLNKRYKRVN

>AT3G11100.1 tffamily=Trihelix
WSEDATATLIEAWGDRYVNLNRGNLRQNDWKEVADAVNSSHGNGRPKTDVQCKNRIDTLKKKYKTEK

>AT1G54060.1 tffamily=Trihelix
WSEEATKVLIEAWGDRFSEPGKGTLKQQHWKEVAEIVNKSRQCKYPKTDIQCKNRIDTVKKKYKQEK

>AT3G24490.1 tffamily=Trihelix
WREQEAFVLLEVWGDRFLQLGRRSLRNEDWNEVAEKVSEELRMEKSETQCRRMIDDLKRKYRKEK

>AT2G33550.1 tffamily=Trihelix
WTRQEILVLIQGKRVAENRVRRGRAAGMALGSGQMEPKWASVSSYCKRHGVNRGPVQCRKRWSNLAGDYKKIK


58) TULP (TUBBY, TLP)

Searched with eight TUBBY proteins from Arabidopsis (residues 110-395)

>AT2G47900.1  tffamily=TLP
PGPRGSLVQCYIMRNRSNQTYYLYLGLNQAASNDDGKFLLAAKRFRRPTCTDYIISLNCDDVSRGSNTYIGKLRSNFLGTKFTVYDAQPTNPGTQVTRTRSSRLLSLKQVSPRIPSGNYPVAHISYELNVLGSRGPRRMQCVMDAIPASAVEPGGTAPTQTELVHSNLDSFPSFSFFRSKSIRAESLPSGPSSAAQKEGLLVLKNKAPRWHEQLQCWCLNFNGRVTVASVKNFQLVAAPENGPAGPEHENVILQFGKVGKDVFTMDYQYPISAFQAFTICLSSFDTKIACE

>AT3G06380.1  tffamily=TLP
SGPRDSLVQCFIKRNRNTQSYHLYLGLTTSLTDNGKFLLAASKLKRATCTDYIISLRSDDISKRSNAYLGRMRSNFLGTKFTVFDGSQTGAAKMQKSRSSNFIKVSPRVPQGSYPIAHISYELNVLGSRGPRRMRCIMDTIPMSIVESRGVVASTSISSFSSRSSPVFRSHSKPLRSNSASCSDSGNNLGDPPLVLSNKAPRWHEQLRCWCLNFHGRVTVASVKNFQLVAVSDCEAGQTSERIILQFGKVGKDMFTMDYGYPISAFQAFAICLSSFETRIACE

>AT1G53320.1  tffamily=TLP
PGPRDFSNQCLIKRNKKTSTFYLYLALTPSFTDKGKFLLAARRFRTGAYTEYIISLDADDFSQGSNAYVGKLRSDFLGTNFTVYDSQPPHNGAKPSNGKASRRFASKQISPQVPAGNFEVGHVSYKFNLLKSRGPRRMVSTLRCPSPSPSSSSAGLSSDQKPCDVTKIMKKPNKDGSSLTILKNKAPRWHEHLQCWCLNFHGRVTVASVKNFQLVATVDQSQPSGKGDEETVLLQFGKVGDDTFTMDYRQPLSAFQAFAICLTSFGTKLACE

>AT1G25280.2|S1|E267  tffamily=TLP
MDADNISRSSNSYLGKLRSNFLGTKFLVYDTQPPPNTSSSALITDRTSRSRFHSRRVSPKVPSGSYNIAQITYELNVLGTRGPRRMHCIMNSIPISSLEPGGSVPNQPEKLVPAPYSLDDSFRSNISFSKSSFDHRSLDFSSSRFSEMGISCDDNEEEASFRPLILKNKQPRWHEQLQCWCLNFRGRVTVASVKNFQLVAARQPQPQGTGAAAAPTSAPAHPEQDKVILQFGKVGKDMFTMDYRYPLSAFQAFAICLSSFDTKLACE

>AT1G76900.1  tffamily=TLP
PGPRDATMQCFIKRDKSNLTYHLYLCLSPALLVENGKFLLSAKRIRRTTYTEYVISMHADTISRSSNTYIGKIRSNFLGTKFIIYDTQPAYNSNIARAVQPVGLSRRFYSKRVSPKVPSGSYKIAQVSYELNVLGTRGPRRMHCAMNSIPASSLAEGGTVPGQPDIIVPRSILDESFRSITSSSSRKITYDYSNDFSSARFSDILGPLSEDQEEGKERNSPPLVLKNKPPRWHEQLQCWCLNFRGRVTVASVKNFQLIAANQPQPQPQPQPQPQPLTQPQPSGQDKIILQFGKVGKDMFTMDFRYPLSAFQAFAICLSSFDTKLACE

>AT2G18280.1  tffamily=TLP
PGPRDSPIQCFIKRNRATATYILYYGLMPSETENDKLLLAARRIRRATCTDFIISLSAKNFSRSSSTYVGKLRSGFLGTKFTIYDNQTASSTAQAQPNRRLHPKQAAPKLPTNSSTVGNITYELNVLRTRGPRRMHCAMDSIPLSSVIAEPSVVQGIEEEVSSSPSPKGETITTDKEIPDNSPSLRDQPLVLKNKSPRWHEQLQCWCLNFKGRVTVASVKNFQLVAEIDASLDAPPEEHERVILQFGKIGKDIFTMDYRYPLSAFQAFAICISSFDTKPACE

>AT1G47270.1  tffamily=TLP
PGPRDAPIQCFIKRERATGIYRLYLGLSPALSGDKSKLLLSAKRVRRATGAEFVVSLSGNDFSRSSSNYIGKLRSNFLGTKFTVYENQPPPFNRKLPPSMQVSPWVSSSSSSYNIASILYELNVLRTRGPRRMQCIMHSIPISAIQEGGKIQSPTEFTNQGKKKKKPLMDFCSGNLGGESVIKEPLILKNKSPRWHEQLQCWCLNFKGRVTVASVKNFQLVAAAAEAGKNMNIPEEEQDRVILQFGKIGKDIFTMDYRYPISAFQAFAICLSSFDTKPVCE

>AT1G16070.1  tffamily=TLP
GRCTCLIVKEQSPEGLSHGSVYSLYTHEGRGRKDRKLAVAYHSRRNGKSIFRVAQNVKGLLCSSDESYVGSMTANLLGSKYYIWDKGVRVGSVGKMVKPLLSVVIFTPTITTWTGSYRRMRTLLPKQQPMQKNNNKQVQQASKLPLDWLENKEKIQKLCSRIPHYNKISKQHELDFRDRGRTGLRIQSSVKNFQLTLTETPRQTILQMGRVDKARYVIDFRYPFSGYQAFCICLASIDSKLCCT



59) ULT

>AT2G20825.1 Arabidopsis ULT family transcription factor, protein sequence 228AA tffamily=ULT
MERECGSKELFSKEELQEISGVHVGDDYVEVMCGCTSHRYGDAVARLKIFSDGELQITCQCTPACLEDKLTPAAFEKHSE
RETSRNWRNNVWVFIEGDKVPLSKTVLLRYYNKALKNSNVSKVIHRDEFVGCSTCGKERRFRLRSRGECRMHHDAIAEPN
WKCCDYPYDKITCEEEEERGSRKVFRGCTRSPSCKGCTSCVCFGCKLCRFSDCNCQTCLDFTTNAKPI

>AT4G28190.1 Arabidopsis ULT family transcription factor, protein sequence 237AA tffamily=ULT
MANNEGEMQCGSMLFKQEELQEMSGVNVGGDYVEVMCGCTSHRYGDAVARLRVFPTGDLEITCECTPGCDEDKLTPAAFE
KHSGRETARKWKNNVWVIIGGEKVPLSKTVLLKYYNESSKKCSRSNRSQGAKVCHRDEFVGCNDCGKERRFRLRSRDECR
LHHNAMGDPNWKCSDFPYDKITCEEEEERGSRKVYRGCTRSPSCKGCTSCVCFGCELCRFSECTCQTCVDFTSNVKA

60) VOZ

Searched with complete sequences of: AT1G28520.1, AT2G42400.1

>AT1G28520.1 Arabidopsis VOZ family transcription factor, protein sequence 486AA tffamily=VOZ
MTGKRSKTNCRSASHKLFKDKAKNRVDDLQGMLLDLQFARKESRPTDVTLLEEQVNQMLREWKSELNEPSPASSLQQGGT
LGSFSSDICRLLQLCDEEDDATSKLAAPKPEPADQNLEAGKAAVFQRGYNLVQGKSEHGLPLVDNCKDLSLAAGNNFDGT
APLEYHQQYDLQQEFEPNFNGGFNNCPSYGVVEGPIHISNFIPTICPPPSAFLGPKCALWDCPRPAQGFDWFQDYCSSFH
AALAFNEGPPGMNPVVRPGGIGLKDGLLFAALSAKAGGKDVGIPECEGAATAKSPWNAPELFDLTVLESETLREWLFFDK
PRRAFESGNRKQRSLPDYNGRGWHESRKQIMVEFGGLKRSYYMDPQPLHHFEWHLYEYEINKCDACALYRLELKLVDGKK
TSKGKVSNDSVADLQKQMGRLTAEFPPENNTTNTTNNNKRCIKGRPKVSTKVATGNVQNTVEQANDYGVGEEFNYLVGNL
SDYYIP

>AT2G42400.1 Arabidopsis VOZ family transcription factor, protein sequence 450AA tffamily=VOZ
MSNHPKITSAHQNVEEKLRELQERFCHLQAARKEGRHGDLALLEAQISQNIREWQAELTAPSPESSLLGEGISQFLEEFA
PLLKLDEEDDATSTLKEHAGAKPDPEGFSQSLCPPEWTSENFSQSPFNGNFSCGFEDALNSTETHGQQLHYGYEGFDPSI
NSAPDFHDQKLSSNLDITSQYDYIFSEVRQELDNSPSTKLDSSEEIDNFAEFSTPSSVRVPPSAFLGPKCALWDCTRPAQ
GSEWYLDYCSNYHGTLALNEDSPGTAPVLRPGGISLKDNLLIDALRAKTQGKNVGIPVCEGAVNTKCPWNAAELFHLELV
EGETIREWLFFDKPRRAYDSGNRKQRSLPDYSGRGWHESRKQLMKEQEGQKRSYYMDPQPPGPFEWHLFEYQINESDACA
LYRLELKVGNGKKSPKGKISKDPLADLQKKMGQFKVASDKPSPPTKGRKE

61) Whirly

Searched with complete sequences of: AT1G14410.1, AT2G02740.1

>AT1G14410.1 Arabidopsis Whirly family transcription factor, protein sequence 263AA tffamily=Whirly
MSQLLSTPLMAVNSNPRFLSSSSVLVTGGFAVKRHGFALKPTTKTVKLFSVKSRQTDYFEKQRFGDSSSSPSPAEGLPAR
FYVGHSIYKGKAALTVDPRAPEFVALDSGAFKLSKDGFLLLQFAPSAGVRQYDWSKKQVFSLSVTEIGTLVSLGPRESCE
FFHDPFKGKSDEGKVRKVLKVEPLPDGSGHFFNLSVQNKLVNVDESIYIPITRAEFAVLISAFNFVLPYLIGWHAFANSI
KPEETSRVNNASPNYGGDYEWNR

>AT2G02740.1 Arabidopsis Whirly family transcription factor, protein sequence 268AA tffamily=Whirly
MSQLLSSPPMAVFSKTFINHKFSDARFLSSHSILTSGGFAGKIIPLKPTARLKLTVKSRQSDYFEKQRFGDSSSSQNAEV
SSPRFYVGHSIYKGKAALTIEPRAPEFVALESGAFKLTKEGFLLLQFAPAAGVRQYDWSRKQVFSLSVTEIGNLVSLGPR
ESCEFFHDPFKGKGSDEGKVRKVLKVEPLPDGSGRFFNLSVQNKLLNVDESVYIPITKAEFAVLISAFNFVLPHLIGWSA
FANSIKPEDSNRLNNASPKYGGDYEWSR

62) WRKY

Searched with a representative domain from each subfamily, plus several others from characterized genes.


>WRKY_search_1 I N-terminal tffamily=WRKY
DGYNWRKYGQKLVKGNEFVRSYYRCTHPNCKAKKQLERSAGGQVVDTVYFGEHDH

>WRKY_search_2 I C-terminal tffamily=WRKY
DGYRWRKYGQKSVKGSPYPRSYYRCSSPGCPVKKHVERSSHDTKLLITTYEGKHDH

>WRKY_search_3 IIa tffamily=WRKY
DGYQWRKYGQKVTRDNPSPRAYFKCACAPSCSVKKKVQRSVEDQSVLVATYEGEHNH

>WRKY_search_4 IIb tffamily=WRKY
DGCQWRKYGQKMAKGNPCPRAYYRCTMATGCPVRKQVQRCAEDRSILITTYEGNHNH

>WRKY_search_5 IIc tffamily=WRKY
DDGYRWRKYGQKVVKNTQHPRSYYRCTQDKCRVKKRVERLADDPRMVITTYEGRHLH

>WRKY_search_6 IId tffamily=WRKY
DEFSWRKYGQKPIKGSPHPRGYYKCSSVRGCPARKHVERALDDAMMLIVTYEGDHNH

>WRKY_search_7 IIe tffamily=WRKY
DVWAWRKYGQKPIKGSPYPRGYYRCSTSKGCLARKQVERNRSDPKMFIVTYTAEHNH

>WRKY_search_8 III tffamily=WRKY
DDGFSWRKYGQKDILGAKFPRGYYRCTYRKSQGCEATKQVQRSDENQMLLEISYRGIHSC

>WRKY_search_9 WIZZ tffamily=WRKY
KDGYQWRKYGQKVTRDNPSPRAYFRCSFAPGCPVKKKVQRSIEDQSVVVATYEGEHNH

>WRKY_search_10 AtWRKY1 N-terminal tffamily=WRKY
DGYNWRKYGQKQVKGSENPRSYYKCTFPNCPTKKKVERNLDGHITEIVYKGNHNH

>WRKY_search_11 AtWRKY1 C-terminal tffamily=WRKY
DGYRWRKYGQKVAKGNPNPRSYYKCTFTGCPVRKHVERASHDLRAVITTYEGKHNH

>WRKY_search_12 AtWRKY2 N-terminal tffamily=WRKY
DGYNWRKYGQKQVKGSENPRSYYKCTFPNCPTKKKVERSLDGQITEIVYKGNHNH

>WRKY_search_13 AtWRKY2 C-terminal tffamily=WRKY
DGYRWRKYGQKVVKGNPNPRGYYKCTSPGCPVRKHVERASQDIRSVITTYEGKHNH

>WRKY_search_14 AtWRKY3 tffamily=WRKY
DEYSWRKYGQKPIKGSPYPRGYYKCSSVRGCPARKHVERAMDDPAMLIVTYEGEHRH

>WRKY_search_15 SUSIBA2 N-terminal tffamily=WRKY
DGYNWRKYGQKHVKGSENPRSYYKCTHPNCEVKKLLERAVDGLITEVVYKGRHNH

>WRKY_search_16 SUSIBA2 C-terminal tffamily=WRKY
DGYRWRKYGQKVVKGNPNPRSYYKCTSTGCPVRKHVERASHDPKSVITTYEGKHNH

>WRKY_search_17 AtWRKY4 N-terminal tffamily=WRKY
DGYNWRKYGQKQVKGSEYPRSYYKCTHPNCPVKKKVERSHEGHITEIIYKGAHNH

>WRKY_search_18 AtWRKY4 C-terminal tffamily=WRKY
DGYRWRKYGQKVVKGNPNPRSYYKCTSAGCNVRKHVERASHDLKSVITTYEGKHNH

>WRKY_search_19 ACRE126 tffamily=WRKY
DGCQWRKYGQKISRGNPCPRSYYRCSVAPLCPVRKQVQRCVEDMSVLITTYEGTHNH
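
Every WRKY query listed above contains the string WRKYGQK, which gives a quick sanity check if the list is edited or extended. The sketch below is only an illustration in Python: `motif_offset` is a hypothetical helper, and the two sequences are copied from WRKY_search_1 and WRKY_search_8.

```python
WRKY_MOTIF = "WRKYGQK"  # string shared by every query in this section


def motif_offset(sequence, motif=WRKY_MOTIF):
    """Return the 0-based offset of the motif, or -1 if it is missing."""
    return sequence.upper().find(motif)


queries = {
    "WRKY_search_1": "DGYNWRKYGQKLVKGNEFVRSYYRCTHPNCKAKKQLERSAGGQVVDTVYFGEHDH",
    "WRKY_search_8": "DDGFSWRKYGQKDILGAKFPRGYYRCTYRKSQGCEATKQVQRSDENQMLLEISYRGIHSC",
}

for name, seq in queries.items():
    offset = motif_offset(seq)
    status = "ok" if offset >= 0 else "motif missing"
    print(name, offset, status)
```

A return value of -1 flags a query that lacks the exact string. That is not necessarily an error, since a deliberately included variant would also be flagged, so the result is best read as a prompt to look at the record.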


63) YABBY

Searched with the Arabidopsis genes

>AT2G45190.1 tffamily=YABBY
PDHFSPSDHLCYVQCNFCQTILAVNVPYTSLFKTVTVRCGCCTNLLSVNMRSYVLPASNQLQLQLGPHSYFNPQDILEELRDAPSNMNMMMMNQHPTMNDIPSFMDLHQQHEIPKAPPVNRPPEKRQRVPSAYNRFIKEEIQRIKAGNPDISHREAFSAAAKNWAHFPHIHFGL

>AT4G00180.1 tffamily=YABBY
PDHFSSTDQLCYVHCSFCDTVLAVSVPPSSLFKTVTVRCGHCSNLLSVTVSMRALLLPSVSNLGHSFLPPPPPPPPPNLLEEMRSGGQNINMNMMMSHHASAHHPNEHLVMATRNGQEMPRPPPANRPPEKRQRVPSAYNRFIKEEIQRIKAGNPDISHREAFSAAAKNWAHFPHIHFGL

>AT2G26580.1 tffamily=YABBY
ANSVMATEQLCYIPCNFCNIILAVNVPCSSLFDIVTVRCGHCTNLWSVNMAAALQSLSRPNFQATNYAVPEYGSSSRSHTKIPSRISTRTITEQRIVNRPPEKRQRVPSAYNQFIKEEIQRIKANNPDISHREAFSTAAKNWAHFPHIHFGL

>AT1G23420.1 tffamily=YABBY
NHLFDLPGQICHVQCGFCTTILLVSVPFTSLSMVVTVRCGHCTSLLSVNLMKASFIPLHLLASLSHLDETGKEEVAATDGVEEEAWKVNQEKENSPTTLVSSSDNEDEDVSRVYQVVNKPPEKRQRAPSAYNCFIKEEIRRLKAQNPSMAHKEAFSLAAKNWAHFPPAHNKR

>AT1G69180.1 tffamily=YABBY
SRASPQAEHLYYVRCSICNTILAVGIPLKRMLDTVTVKCGHCGNLSFLTTTPPLQGHVSLTLQMQSFGGSDYKKGSSSSSSSSTSSDQPPSPSPPFVVKPPEKKQRLPSAYNRFMRDEIQRIKSANPEIPHREAFSAAAKNWAKY

64) Zinc finger homeodomain, ZF-HD

Searched with Arabidopsis protein domains.

>AT1G14440.1 tffamily=ZF-HD
KPMIKYKECLKNHAAAMGGNATDGCGEFMPSGEDGSIEALTCSACNCHRNFHRKEVEG

>AT1G75240.1 tffamily=ZF-HD
KPTVRYRECLKNHAASVGGSVHDGCGEFMPSGEEGTIEALRCAACDCHRNFHRKEMDG

>AT3G50890.1 tffamily=ZF-HD
DQGAKYRECQKNHAASTGGHVVDGCCEFMAGGEEGTLGALKCAACNCHRSFHRKEVYG

>AT3G28917.1 tffamily=ZF-HD
VRTVRYGECQKNHAAAVGGYAVDGCREFMASRGEEGTVAALTCAACGCHRSFHRREIET

>AT5G42780.1 tffamily=ZF-HD
THKPHYYECRKNHAADIGTTAYDGCGEFVSSTGEEDSLNCAACGCHRNFHREELIP

>AT5G60480.1 tffamily=ZF-HD
MVVLYNECLKNHAVSLGGHALDGCGEFTPKSTTILTDPPSLRCDACGCHRNFHRRSPSD

>AT5G39760.1 tffamily=ZF-HD
PLLFTYKECLKNHAAALGGHALDGCGEFMPSPSSISSDPTSLKCAACGCHRNFHRRDPDN

>AT1G14687.1 tffamily=ZF-HD
QSTCVYRECMRNHAAKLGSYAIDGCREYSQPSTGDLCVACGCHRSYHRRIDVI


65) ZIM

Searched with the Arabidopsis genes

>AT1G17380.1 tffamily=ZIM
SQPGSSQLTIFFGGKVLVYNEFPVDKAKEIMEVAKQ

>AT1G19180.1 tffamily=ZIM
PESQTAPLTIFYAGQVIVFNDFSAEKAKEVINLASK

>AT5G13220.1 tffamily=ZIM
LVSGTVPMTIFYNGSVSVFQVSRNKAGEIMKVANE

>AT1G51600.1 tffamily=ZIM
GSEQGDQLTLSFQGQVYVFDSVLPEKVQAVLLLLGG

>AT3G43440.1 tffamily=ZIM
EPDASTQLTIIFGGSCRVFNGVPAQKVQEIIRIAFA

>AT3G43440.1 tffamily=ZIM
SMILPSQLTIIFGGSFSVFDGIPAEKVQEILHIAAA

>AT1G30135.1 tffamily=ZIM
PNEESQRITIFYNGKMCFSSDVTHLQARSIISIASR

>AT2G34600.1 tffamily=ZIM
PKQESQILTIFYNGHMCVSSDLTHLEANAILSLASR
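
Each query in this list is a FASTA-style record whose header carries a `tffamily=` tag, so the queries can be grouped by family with a few lines of code. The following is a minimal Python sketch, not the method used to build this site. The names `parse_queries` and `make_record` are hypothetical, and the parser assumes that a record ends at the first blank or non-alphabetic line after its header.

```python
import re
from collections import defaultdict

FAMILY_RE = re.compile(r"tffamily=(\S+)")


def make_record(header, chunks):
    """Build an (identifier, family, sequence) tuple from one record."""
    fields = header.split()
    identifier = fields[0] if fields else ""
    match = FAMILY_RE.search(header)
    family = match.group(1) if match else None
    return identifier, family, "".join(chunks)


def parse_queries(text):
    """Yield one tuple per '>' record; wrapped sequence lines are joined.

    Records are yielded rather than stored in a dict keyed by identifier,
    because the same identifier can appear more than once in the list.
    """
    header, chunks = None, []
    for raw in text.splitlines():
        line = raw.strip()
        if line.startswith(">"):
            if header is not None:
                yield make_record(header, chunks)
            header, chunks = line[1:], []
        elif header is not None and line.isalpha():
            chunks.append(line)
        elif header is not None:
            # Blank line, heading, or prose: the current record is complete.
            yield make_record(header, chunks)
            header = None
    if header is not None:
        yield make_record(header, chunks)


sample = """\
65) ZIM

Searched with the Arabidopsis genes

>AT3G43440.1 tffamily=ZIM
EPDASTQLTIIFGGSCRVFNGVPAQKVQEIIRIAFA

>AT3G43440.1 tffamily=ZIM
SMILPSQLTIIFGGSFSVFDGIPAEKVQEILHIAAA
"""

by_family = defaultdict(list)
for identifier, family, sequence in parse_queries(sample):
    by_family[family].append((identifier, sequence))
print({family: len(records) for family, records in by_family.items()})
```

Keeping both AT3G43440.1 records matters here: the two entries have different sequences, so collapsing them by identifier would silently drop one query.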






Authors of this site:

Paul J Rushton
Marta T. Bokowiec
Xianfeng (Jeff) Chen
Thomas (Tom) W Laudeman
Jennifer F. Brannock
Michael P. Timko

Contact:
pr8y@virginia.edu

Acknowledgements:

Tobacco leaf graphic used under Creative Commons license.