PF00624 - Flocculin repeat

PF00624 Protein family information

A7TTI5

A7TTI5 Interpro sequence information

Sequence:

>tr|A7TTI5|A7TTI5_VANPO Uncharacterized protein (Fragment) OS=Vanderwaltozyma polyspora (strain ATCC 22028 / DSM 70294 / BCRC 21397 / CBS 2163 / NBRC 10782 / NRRL Y-8283 / UCD 57-17) OX=436907 GN=Kpol_249p1 PE=4 SV=1
MKHFTRLLTFLNFVLFACSLSNHENNQALSLSELIDHEAILEGNTALVGDNPKSKLHSEK
KLLSIPLNINQNESIYTSVPSTKNQTYFISDHLATNVKNVDKKDITIKSNDISIITIRTQ
NLNILAETTSTELTWVTGHNGIESKLFIYYIEYPVDHFSFTFIRPMTVNNLEKRLVENED
ISSSSIVKPIVTESTKTIVNTITKSDNALVVETTYIVYSRSPYTSTNSKKTYWTGSYTTT
TKTEITTYIGTNGGVTTETIYFIATPTTAFETTSYTYWTGSTANTLSTVTTTFTGTDGIE
TTETIYIVETPTTAFETTSYTYWTGSTANTLSTVTTTFTGTDGIETTETIYIVETPTTAF
ETTSYTYWTGSTANTLSTVTTTFTGTDGIETTETIYIVETPTTAFETTSFTYWTGSTANT
LSTVTTTFTGTDGIETTETIYIVETPTTAFETTSYTYWTGSTANTLSTVTTTFTGTDGIE
TTETIYIVETPTTAFETTSYTYWTGSTANTLSTVTTTFTGTDGIETTETIYIVETPTTAF
ETTSYTYWTGSTANTLSTVTTTFTGTDGIETTETIYIVETPTTAFETTSYTYWTGSTANT
LSTVTTTFTGTDGIETTETIYIVETPTTAFETTSYTYWTGSTANTLSTVTTTFTGTDGIE
TTETIYIVETPTTAFETTSYTYWTGSTANTLSTVTTTFTGTDGIETTETIYIVETPTTAF
ETTSYTYWTGSTANTLSTVTTTFTGTDGIETTETIYIVETPTTAFETTSFTYWTGSTANT
LSTVTTTFTGTDGIETTETIYIVETPTTAFETTSYTYWTGSTANTLSTVTTTFTGTDGIE
TTETIYIVETPTTAFETTSFTYWTGSTANTLSTVTTTFTGTDGIETTETIYIVETPTTAF
ETTSYTYWTGSTANTLSTVTTTFTGTDGIETTETIYIVETPTTAFETTSYTYWTGSTANT
LSTVTTTFTGTDGIETTETIYIVETPTTAFETTSYTYWTGSTANTLSTVTTTFTGTDGIE
TTETIYIVETPTTAFETTSYTYWTGSTANTLSTVTTTFTGTDGIETTETIYIVETPTTAF
ETTSFTYWTGSTANTLSTVTTTFTGTDGIETTETIYIVETPTTAFETTSYTYWTGSTANT
LSTVTTTFTGTDGIETTETIYIVETPTTAFETTSFTYWTGSTANTLSTVTTTFTGTDGIE
TTETIYIV
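
The FASTA header above appears to follow the UniProt layout (`db|accession|entry_name`, a free-text description, then `OS=`, `OX=`, `GN=`, `PE=`, `SV=` tags). A minimal sketch of splitting such a header into fields, assuming that layout holds; the function name `parse_uniprot_header` is hypothetical and is not part of any tool used on this page.

```python
import re

def parse_uniprot_header(header: str) -> dict:
    """Split a UniProt-style FASTA header into its labelled fields (illustrative helper)."""
    line = header.lstrip(">").strip()
    db, accession, rest = line.split("|", 2)
    entry_name, _, tail = rest.partition(" ")
    # The description runs up to the first two-letter KEY= tag (OS=, OX=, ...).
    first_tag = re.search(r"\b[A-Z]{2}=", tail)
    description = tail[: first_tag.start()].strip() if first_tag else tail.strip()
    # Each tag value extends until the next " XX=" tag or the end of the line.
    tags = dict(re.findall(r"\b([A-Z]{2})=(.*?)(?=\s[A-Z]{2}=|$)", tail))
    return {"db": db, "accession": accession, "entry_name": entry_name,
            "description": description, **tags}

header = (
    ">tr|A7TTI5|A7TTI5_VANPO Uncharacterized protein (Fragment) "
    "OS=Vanderwaltozyma polyspora (strain ATCC 22028 / DSM 70294 / "
    "BCRC 21397 / CBS 2163 / NBRC 10782 / NRRL Y-8283 / UCD 57-17) "
    "OX=436907 GN=Kpol_249p1 PE=4 SV=1"
)
fields = parse_uniprot_header(header)
print(fields["accession"], fields["OX"], fields["GN"])  # A7TTI5 436907 Kpol_249p1
```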

MRF results:

Region 1: 207-1197, 47 aa length, 22 units

NALVVETTYIVYSRSPYTSTNSKK-TYWTGSYTTTTKTEITTYIGTN
GGVTTETIYFI--ATPTTAFETTSYTYWTGSTANTLSTVTTTFTGTD
GIETTETIYIV--ETPTTAFETTSYTYWTGSTANTLSTVTTTFTGTD
GIETTETIYIV--ETPTTAFETTSYTYWTGSTANTLSTVTTTFTGTD
GIETTETIYIV--ETPTTAFETTSFTYWTGSTANTLSTVTTTFTGTD
GIETTETIYIV--ETPTTAFETTSYTYWTGSTANTLSTVTTTFTGTD
GIETTETIYIV--ETPTTAFETTSYTYWTGSTANTLSTVTTTFTGTD
GIETTETIYIV--ETPTTAFETTSYTYWTGSTANTLSTVTTTFTGTD
GIETTETIYIV--ETPTTAFETTSYTYWTGSTANTLSTVTTTFTGTD
GIETTETIYIV--ETPTTAFETTSYTYWTGSTANTLSTVTTTFTGTD
GIETTETIYIV--ETPTTAFETTSYTYWTGSTANTLSTVTTTFTGTD
GIETTETIYIV--ETPTTAFETTSYTYWTGSTANTLSTVTTTFTGTD
GIETTETIYIV--ETPTTAFETTSFTYWTGSTANTLSTVTTTFTGTD
GIETTETIYIV--ETPTTAFETTSYTYWTGSTANTLSTVTTTFTGTD
GIETTETIYIV--ETPTTAFETTSFTYWTGSTANTLSTVTTTFTGTD
GIETTETIYIV--ETPTTAFETTSYTYWTGSTANTLSTVTTTFTGTD
GIETTETIYIV--ETPTTAFETTSYTYWTGSTANTLSTVTTTFTGTD
GIETTETIYIV--ETPTTAFETTSYTYWTGSTANTLSTVTTTFTGTD
GIETTETIYIV--ETPTTAFETTSYTYWTGSTANTLSTVTTTFTGTD
GIETTETIYIV--ETPTTAFETTSFTYWTGSTANTLSTVTTTFTGTD
GIETTETIYIV--ETPTTAFETTSYTYWTGSTANTLSTVTTTFTGTD
GIETTETIYIV--ETPTTAFETTSFTYWTGSTANTLSTVTTTFTGTD

Alphafold results - spectrum b

_images/A7TTI5alphafold.png

Alphafold results - units from MRF

_images/A7TTI5alphafoldUnits.png

G8C135

Repeat units annotated: 342-382, 387-427, 432-472, 477-517, 524-564, 571-611
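
As a quick sanity check on the annotated coordinates, the sketch below computes the length of each unit and the number of residues separating consecutive units. It assumes the coordinates are 1-based and inclusive (an assumption, not stated on this page); the helper name `unit_lengths_and_gaps` is hypothetical.

```python
def unit_lengths_and_gaps(units):
    """Return per-unit lengths and the gaps between consecutive units.

    Coordinates are treated as 1-based and inclusive (assumed), so a unit
    (start, end) spans end - start + 1 residues.
    """
    lengths = [end - start + 1 for start, end in units]
    gaps = [nxt[0] - cur[1] - 1 for cur, nxt in zip(units, units[1:])]
    return lengths, gaps

g8c135_units = [(342, 382), (387, 427), (432, 472),
                (477, 517), (524, 564), (571, 611)]
lengths, gaps = unit_lengths_and_gaps(g8c135_units)
print(lengths)  # [41, 41, 41, 41, 41, 41]
print(gaps)     # [4, 4, 4, 6, 6]
```

Under that assumption every annotated unit is 41 residues long, with 4 to 6 residues between neighbouring units.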

G8C135 Interpro sequence information

Sequence:

>tr|G8C135|G8C135_TETPH Candida_ALS_N domain-containing protein OS=Tetrapisispora phaffii (strain ATCC 24235 / CBS 4417 / NBRC 1672 / NRRL Y-8282 / UCD 70-5) OX=1071381 GN=TPHA0N00820 PE=4 SV=1
MTAKYFKFLLSLFLLSVAFAAELTNVTYSGLEFTPETSDHTPNNGWVASFNFDLDSATIV
EPGDYFDLKFPYIYRVKFNGQSTELPITLQDGTEVFTCNVLQQGASQYLESIVRCTSKVA
LSDQETLSGSISFSVSFNAGGSASEVDIEGAQYFKSGTNEVTALGGITAEISFNAVDFNY
DLYYAIRSIPSESFGSYYLSMVCPNGYLLGGTQDINYDFGNDGFTLDCATPEVFLSDNFN
DFWFLKAYESADANVICMGNDLNIEMSQADAGYGLWIDAQQSFPSDATAVTHFVSAKYSC
TDTLASTKYTNKIQTTIVYEIYGGTANAVALLDTIDIPVTVTSSTTTGWTGTYTTTYSTE
TTVTTGTDGLTTTEIIYHVETPSTIITSTTTTGWTGTYTTTYSTETTVTTGTDGLTTTEI
IYHVETPSTIITSTTTTGWTGTYTTTYSTETTVTTGTDGLTTTEIIYHVETPSTIITSTT
TTGWTGTYTTTYSTETTVTTGTDGLTTTEIIYHVETPSTIITSTTTTTTGWTGTYTTTYS
TETTVTTGTDGLITTEIIYHVETPSTIITSTTTTTTGWTGTYTTTYSTETTVTTGTDGLI
TTEIIYHVETPSNTYDKNLPPALASTTTSTLHNLVFLTTTFCPESSSDSLSFGNKFNSGN
IGNSSNKVTNLTTSATSTVSTISTISTASTISTSSTATTTSMTSSSTISNILVTKNTSTL
SIETYVPHITTQTLYTQESSETSLVTSYPSLEVYSESSTTTSYSLSIFEDMALQTKTNLG
SIVIALLSFLLLV

MRF results:

Region 1: 103-287, 60 aa length, 4 units
QGASQYLESIVRCTSKVALSDQETLSGSISF-SVSFNAGG-------------SASEVDI
EGA-QYFKSG--TNEVTALGG---ITAEISFNAVDFNYDLY-YAIRSIP----SESF---
-GS-YYLSMV--CPNGYLLGG----TQDINY---DFGNDGF-TLDCATPEVFLSDNFNDF
WFL-KAYESA--DANVICMGN----DLNIEMSQADAGYGLWIDAQQSFP----SDA----

Region 2: 342-613, 47 aa length, 6 units
--TSSTTTGWTGTYTTTYSTETTVTTGTDGLTTTEIIYHVETPSTII
--TSTTTTGWTGTYTTTYSTETTVTTGTDGLTTTEIIYHVETPSTII
--TSTTTTGWTGTYTTTYSTETTVTTGTDGLTTTEIIYHVETPSTII
--TSTTTTGWTGTYTTTYSTETTVTTGTDGLTTTEIIYHVETPSTII
TSTTTTTTGWTGTYTTTYSTETTVTTGTDGLITTEIIYHVETPSTII
TSTTTTTTGWTGTYTTTYSTETTVTTGTDGLITTEIIYHVETPSN--
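
The Region 2 header can be cross-checked against its alignment rows: the number of rows should equal the unit count, the row width should equal the stated length, and the residues left after removing gaps should add up to the region span. A minimal sketch, assuming `-` marks an alignment gap and that region coordinates are 1-based and inclusive; `ungapped_length` is a hypothetical helper.

```python
def ungapped_length(rows):
    """Count residues in aligned rows, ignoring '-' gap characters."""
    return sum(len(row.replace("-", "")) for row in rows)

# Rows copied from Region 2 of the G8C135 MRF output.
region2_rows = [
    "--TSSTTTGWTGTYTTTYSTETTVTTGTDGLTTTEIIYHVETPSTII",
    "--TSTTTTGWTGTYTTTYSTETTVTTGTDGLTTTEIIYHVETPSTII",
    "--TSTTTTGWTGTYTTTYSTETTVTTGTDGLTTTEIIYHVETPSTII",
    "--TSTTTTGWTGTYTTTYSTETTVTTGTDGLTTTEIIYHVETPSTII",
    "TSTTTTTTGWTGTYTTTYSTETTVTTGTDGLITTEIIYHVETPSTII",
    "TSTTTTTTGWTGTYTTTYSTETTVTTGTDGLITTEIIYHVETPSN--",
]
start, end = 342, 613
span = end - start + 1

print(len(region2_rows))                   # 6 units
print({len(row) for row in region2_rows})  # {47} aligned columns per unit
print(ungapped_length(region2_rows), span) # 272 272
```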

Region 3: 678-695, 3 aa length, 6 units
TVS
TIS
TIS
TAS
TIS
TSS

Region 4: 128-137, 2 aa length, 5 units
SG
SI
SF
SV
SF

TAPASS results:

protein_ID,prediction_type,prediction_tool,first_residue_involved,last_residue_involved,accession
tr_G8C135_OX=1071381_GN=TPHA0N00820_PE=4_SV=1,structured domain,CATH,171,213,2.30.22.10/FF/2204
tr_G8C135_OX=1071381_GN=TPHA0N00820_PE=4_SV=1,peptide signal,SignalP,1,20
tr_G8C135_OX=1071381_GN=TPHA0N00820_PE=4_SV=1,disordered region,IUPred,459,463
tr_G8C135_OX=1071381_GN=TPHA0N00820_PE=4_SV=1,disordered region,IUPred,504,508
tr_G8C135_OX=1071381_GN=TPHA0N00820_PE=4_SV=1,disordered region,IUPred,616,622
tr_G8C135_OX=1071381_GN=TPHA0N00820_PE=4_SV=1,disordered region,IUPred,667,670
tr_G8C135_OX=1071381_GN=TPHA0N00820_PE=4_SV=1,disordered region,BISMMpredictor,350,387
tr_G8C135_OX=1071381_GN=TPHA0N00820_PE=4_SV=1,disordered region,BISMMpredictor,398,434
tr_G8C135_OX=1071381_GN=TPHA0N00820_PE=4_SV=1,disordered region,BISMMpredictor,445,479
tr_G8C135_OX=1071381_GN=TPHA0N00820_PE=4_SV=1,disordered region,BISMMpredictor,490,526
tr_G8C135_OX=1071381_GN=TPHA0N00820_PE=4_SV=1,disordered region,BISMMpredictor,537,570
tr_G8C135_OX=1071381_GN=TPHA0N00820_PE=4_SV=1,disordered region,BISMMpredictor,586,617
tr_G8C135_OX=1071381_GN=TPHA0N00820_PE=4_SV=1,disordered region,BISMMpredictor,671,692
tr_G8C135_OX=1071381_GN=TPHA0N00820_PE=4_SV=1,disordered region,BISMMpredictor,711,732
tr_G8C135_OX=1071381_GN=TPHA0N00820_PE=4_SV=1,functional domain,PFAM,52,300,PF11766.9
tr_G8C135_OX=1071381_GN=TPHA0N00820_PE=4_SV=1,functional domain,PFAM,342,382,PF00624.19
tr_G8C135_OX=1071381_GN=TPHA0N00820_PE=4_SV=1,functional domain,PFAM,387,427,PF00624.19
tr_G8C135_OX=1071381_GN=TPHA0N00820_PE=4_SV=1,functional domain,PFAM,432,472,PF00624.19
tr_G8C135_OX=1071381_GN=TPHA0N00820_PE=4_SV=1,functional domain,PFAM,477,517,PF00624.19
tr_G8C135_OX=1071381_GN=TPHA0N00820_PE=4_SV=1,functional domain,PFAM,524,564,PF00624.19
tr_G8C135_OX=1071381_GN=TPHA0N00820_PE=4_SV=1,functional domain,PFAM,571,611,PF00624.19
tr_G8C135_OX=1071381_GN=TPHA0N00820_PE=4_SV=1,consensus ordered region,TAPASS,1,349
tr_G8C135_OX=1071381_GN=TPHA0N00820_PE=4_SV=1,consensus ordered region,TAPASS,623,666
tr_G8C135_OX=1071381_GN=TPHA0N00820_PE=4_SV=1,consensus ordered region,TAPASS,733,818
tr_G8C135_OX=1071381_GN=TPHA0N00820_PE=4_SV=1,consensus disordered region,TAPASS,350,622
tr_G8C135_OX=1071381_GN=TPHA0N00820_PE=4_SV=1,consensus disordered region,TAPASS,667,732
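
The rows above are comma-separated, and some of them (for example the SignalP and IUPred rows) omit the final `accession` column. The sketch below shows one way to load such rows with Python's standard `csv` module, padding the missing column; the function name `parse_tapass` is hypothetical, and the three sample rows are copied from the table above.

```python
import csv
import io

FIELDS = ["protein_ID", "prediction_type", "prediction_tool",
          "first_residue_involved", "last_residue_involved", "accession"]

def parse_tapass(text):
    """Parse headerless TAPASS-style rows; the accession column may be absent."""
    records = []
    for row in csv.reader(io.StringIO(text)):
        if not row:
            continue  # skip blank lines
        row = row + [""] * (len(FIELDS) - len(row))  # pad a missing accession
        rec = dict(zip(FIELDS, row))
        rec["first_residue_involved"] = int(rec["first_residue_involved"])
        rec["last_residue_involved"] = int(rec["last_residue_involved"])
        records.append(rec)
    return records

sample = """\
tr_G8C135_OX=1071381_GN=TPHA0N00820_PE=4_SV=1,peptide signal,SignalP,1,20
tr_G8C135_OX=1071381_GN=TPHA0N00820_PE=4_SV=1,functional domain,PFAM,342,382,PF00624.19
tr_G8C135_OX=1071381_GN=TPHA0N00820_PE=4_SV=1,consensus disordered region,TAPASS,350,622
"""
records = parse_tapass(sample)
pfam = [(r["first_residue_involved"], r["last_residue_involved"])
        for r in records if r["prediction_tool"] == "PFAM"]
print(pfam)  # [(342, 382)]
```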

Alphafold results - spectrum b

_images/G8C135alphafold.png

Alphafold results - units from MRF

_images/G8C135alphafoldUnits.png

Alphafold results G8C135

A0A1Q3ALI5

Repeat units annotated: 267-307, 314-353

A0A1Q3ALI5 Interpro sequence information

Sequence:

>tr|A0A1Q3ALI5|A0A1Q3ALI5_ZYGRO PA14 domain-containing protein (Fragment) OS=Zygosaccharomyces rouxii OX=4956 GN=ZYGR_0BQ00100 PE=4 SV=1
MVSHKSIFQWLLWFSVLGITKALAATACLPANGAQSGFKANFFQYNYGDMTTLRQPSFIA
GGYAKRQLLGTQNNVNNILIAYGMECQLSNGEVVTPTEPWNFDYSQCKNKRYFSQRHNGT
IFGFELTATNFTVELTGYLLAPQTGTYTFTFDHVDDSAILNFGEGIAFDCCNQDAAANGN
TQFSINAIKPDYGPTAHMNYSVDLVGNYYYPMRIVYTNRHVFGWLFTTLTLPDGTNIDND
FTGYVYSFVSEPEQPNCTVTSPLPFVTSTSTTPWTGSFTSTYSTQTNVNTDSDGDNAGTV
IIDVETPTTPPVLTTEYTGYSGSETSTYSTESTWVTGTDGKTTPETIYHVETPTIPPV

MRF results:

Region 1: 326-334, 3 aa length, 3 units, regex_SX3 0.86
STY
STE
STW

Region 2: 144-151, 2 aa length, 4 units, regex_TX2 0.88
TG
TY
TF
TF

TAPASS results:

tr_A0A1Q3ALI5OX=4956_GN=ZYGR_0BQ00100_PE=4_SV=1,structured domain,CATH,106,191,2.60.120.40/FF/1304
tr_A0A1Q3ALI5OX=4956_GN=ZYGR_0BQ00100_PE=4_SV=1,peptide signal,SignalP,1,22
tr_A0A1Q3ALI5OX=4956_GN=ZYGR_0BQ00100_PE=4_SV=1,transmembrane region,TMHMM,12,34
tr_A0A1Q3ALI5OX=4956_GN=ZYGR_0BQ00100_PE=4_SV=1,disordered region,IUPred,259,262
tr_A0A1Q3ALI5OX=4956_GN=ZYGR_0BQ00100_PE=4_SV=1,disordered region,IUPred,276,358
tr_A0A1Q3ALI5OX=4956_GN=ZYGR_0BQ00100_PE=4_SV=1,disordered region,BISMMpredictor,249,264
tr_A0A1Q3ALI5OX=4956_GN=ZYGR_0BQ00100_PE=4_SV=1,disordered region,BISMMpredictor,266,299
tr_A0A1Q3ALI5OX=4956_GN=ZYGR_0BQ00100_PE=4_SV=1,disordered region,BISMMpredictor,318,334
tr_A0A1Q3ALI5OX=4956_GN=ZYGR_0BQ00100_PE=4_SV=1,functional domain,PFAM,131,221,PF10528.10
tr_A0A1Q3ALI5OX=4956_GN=ZYGR_0BQ00100_PE=4_SV=1,functional domain,PFAM,267,307,PF00624.19
tr_A0A1Q3ALI5OX=4956_GN=ZYGR_0BQ00100_PE=4_SV=1,functional domain,PFAM,314,353,PF00624.19
tr_A0A1Q3ALI5OX=4956_GN=ZYGR_0BQ00100_PE=4_SV=1,consensus ordered region,TAPASS,1,248
tr_A0A1Q3ALI5OX=4956_GN=ZYGR_0BQ00100_PE=4_SV=1,consensus disordered region,TAPASS,249,358
tr_A0A1Q3ALI5OX=4956_GN=ZYGR_0BQ00100_PE=4_SV=1,eukaryotic SLiMs,ELM,258,264,LIG_FHA_1
tr_A0A1Q3ALI5OX=4956_GN=ZYGR_0BQ00100_PE=4_SV=1,eukaryotic SLiMs,ELM,297,303,LIG_FHA_1
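
To see how the PF00624.19 rows relate to the consensus disordered region listed above, the intervals can be compared directly. A minimal sketch, assuming 1-based inclusive coordinates; `overlap` is a hypothetical helper, and the intervals are taken from the PFAM and TAPASS rows of this table.

```python
def overlap(a, b):
    """Number of residues shared by two 1-based inclusive intervals."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)

disordered = (249, 358)                # consensus disordered region (TAPASS)
pfam_units = [(267, 307), (314, 353)]  # PF00624.19 rows
cath_domain = (106, 191)               # structured domain (CATH)

for unit in pfam_units:
    size = unit[1] - unit[0] + 1
    print(unit, overlap(unit, disordered), size)
# (267, 307) 41 41
# (314, 353) 40 40
print(overlap(cath_domain, disordered))  # 0
```

With these coordinates both repeat units lie entirely inside the consensus disordered region, while the CATH domain does not overlap it.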

Alphafold results - spectrum b

_images/A0A1Q3ALI5alphafold.png

Alphafold results - units from MRF

_images/A0A1Q3ALI5alphafoldUnits.png

Alphafold results A0A1Q3ALI5