Summary 2

PF04680 - Opioid growth factor receptor repeat

PF04680 Protein family information

Q05BV5

Q05BV5 Interpro sequence information

Sequence:

>Q05BV5 1-660
MRRTRRTRTARTARPPARGTRTQGTRTRSRRSRGRRGPARSRMTGSRNWRATRDMCRYRHNYPDLVERDCNGDTPNLSFY
RNEIRFLPNGCFIEDILQNWTDNYDLLEDNHSYIQWLFPLREPGVNWHAKPLTLREVEVFKSSQEIQERLVRAYELMLGF
YGIRLEDRGTGTVGRAQNYQKRFQNLNWRSHNNLRITRILKSLGELGLEHFQAPLVRFFLEETLVRRELPGVRQSALDYF
MFAVRCRHQRRQLVHFAWEHFRPRCKFVWGPQDKLRRFKPSSLPHPLEGSRKVEEEGSPGDPDHEASTQGRTCGPEHSKG
GGRVDEGPQPRSVEPQDAGPLERSQGDEAGGHGEDRPEPLSPKESKKRKLELSRREQPPTEPGPQSASEVEKIALNLEGC
ALSQGSLRTGTQEVGGQDPGEAVQPCRQPLGARVADKVRKRRKVDEGAGDSAAVASGGAQTLALAGSPAPSGHPKAGHSE
NGVEEDTEGRTGPKEGTPGSPSETPGPSPAGPAGDEPAESPSETPGPRPAGPAGDEPAESPSETPGPRPAGPAGDEPAKT
PSETPGPSPAGPTRDEPAESPSETPGPRPAGPAGDEPAESPSETPGPRPAGPAGDEPAESPSETPGPSPAGPTRDEPAKA
GEAAELQDAEVESSAKSGKP

MRF results:

_images/PF04680_Q05BV5_MRF.png

TAPAS results:

_images/PF04680_Q05BV5.png

PF04886 - PT repeat

PF04886 Protein family information

A0A0F6P044

A0A0F6P044 Interpro sequence information

Sequence:

>A0A0F6P044 1-358
VALLDNAIKCDEEVPIPDPPQSPDENPGGKDDPPGDSDPLPGEGAGAVEPAGGETDSGVEEGPAEQAVDQPLTQSTDQPA
DQPAEQPADQALTQPTDQPANQPVDQPTDQPIDQPTDQPVDQTTDQTTEQPAGEPLTQSTDEPVDQPLTQSTDQPAGEPL
THSTDQPADQPGDQSADQPQTTDQVTEQPTDQPTEQPTDQTTDQPTEQPTDEPLTQPTDEPLPQPIVEASDRAAAAAVKN
PNEIEAKCAQLKDQDGVKITGPCGAKFQVFLIPHVTINVETETNAIHLGKKLDDVVITKKMHKGVGGKSPPLLQFEEDAD
SLLNQCTEGKTFKFVVVVKGEELILKWKVYEKVPSPSD

MRF results:

_images/PF04886_A0A0F6P044_MRF.png

TAPAS results:

_images/PF04886_A0A0F6P044.png

PF05671 - GETHR pentapeptide repeat (5 copies)

PF05671 Protein family information

P91183

P91183 Interpro sequence information

Sequence:

>P91183 1-101
MFQNLLENSKKFWIVLEPSGKFLKMLLRDLRGETHRGETHHGETHRGEAHRGETHRGETHRGETRRGETHRGETRRGETQ
NFGGKFKFSVKNILAGNLNFL

MRF results:

_images/PF05671_P91183_MRF.png

TAPAS results:

_images/PF05671_P91183.png

PF06049 - Coagulation Factor V LSPD Repeat

PF06049 Protein family information

A0A833YAA7

A0A833YAA7 Interpro sequence information

Sequence:

>tr|A0A833YAA7|A0A833YAA7_9CHIR Coagulation factor V OS=Phyllostomus discolor OX=89673 GN=HJG60_004700 PE=3 SV=1
MRPRHGPSPEPGQAVSADLRHPSSPEDSGQMSPSLELEVWQTAISPDLTQTTLSPDLGQD
ALSPDFSQTTVSPDLGQTTLSPDPSQDALSPDLGQDALSPDFSQTTVSPDLDQETLSPDF
SQTTDSPDLTQTTLSPDLSQDALSPDFSQTTVSPDLDQETLSPDFSQTTDSPDLTQTTLS
PDLSQETLSPDLSQDALSPDFSQTTVSPDLGQTTLSPDPSQETVSPDLSQETDSPDLSQE
TLAPDLGQETLSPDLGQETISPDFSQTTDSPDLDQETLSPDLSQETLSPDLGQDALSPDF
SQTTVSLDLDQETLSPDFSQTTDSPGLGQETLSPDLGQTTLPPAVWQTSAPPDIDQTPDT
SEPAQTVPYSDLGQLSSLQPSPPLNDTFLTKEFNLPFVLGFSGDDGDYIEMIPSEREEND
EDDSYEFGHVAYDDPYQTDMRTDMNSSRNPDSIAEWYLRSNNGNRKYYYIAAEEISWDYS
KSAQSEMDSEDEDAVPEGTVYKKVVFRKYLDSTFTKRDPRGEYEEHLGILGPVIRAEVDD
VIQVRFKNLASRPYSLHAHGLSYEKSSEGKTYEDDSPEWFKEDNAIQPNRSYTYVWHATG
RSGPENPGSACRAWAYYSAVNTEKDIHSGLIGPLLICRKGTLHKESKMPVDTREFVLLFM
VFDEKKSWYYDKKPERSWRRTSSEVKHSHEFHAINGMVYNLPGLRMYEQEWVRLHLLNLG
GSRDIHVVHFHGQTLLENGTRQHQLGVWPLLPGSFKTLEMKASKPGWWLLDTQVGENQRA
GMRTPFLIMDRECRMPMGLSTGVVADRQIKASEFVGSWEPKLARLNNGGSYNAWMTEKMS
EFDTQPWIQVDMEREVLFTGIQSQGAKHYLKSYYTTEFRVAYSSDQTNWQIFKGNSTRNV
MYFDGNSDASSIKENRFDPPIVARYIRVYPTRFFNRPALRLELQGCEINGCSTPLGVESR
KIEDKQITASSFKTSWWGGSWEPSLARLNAQGRVNAWQAKANNNRQWLQIDLLKVKKITA
IVTQGCKSLSSEMYVKSYTVHYSDGGVDWTPYRQRSSMVDKIFEGNSNFKGHVKNFFNPP
IISRYIRIIPKTWNQSIALRLELFGCDID

Sequence used for the model:

WQTAISPDLTQTTLSPDLGQDALSPDFSQTTVSPDLGQTTLSPDPSQDALSPDLGQDALS
PDFSQTTVSPDLDQETLSPDFSQTTDSPDLTQTTLSPDLSQDALSPDFSQTTVSPDLDQE
TLSPDFSQTTDSPDLTQTTLSPDLSQETLSPDLSQDALSPDFSQTTVSPDLGQTTLSPDP
SQETVSPDLSQETDSPDLSQETLAPDLGQETLSPDLGQETISPDFSQTTDSPDLDQETLS
PDLSQETLSPDLGQDALSPDFSQTTVSLDLDQETLSPDFSQTTDSPGLGQETLSPDLGQT
TLPPAVWQTSAPPDIDQTPDTSEPAQTVPYSDLGQLSSLQPSPPLNDTFLTKEFNLPFVL

MRF results:

_images/mrfA0A833YAA7.png

TAPAS results:

_images/tapasA0A833YAA7.png

AlphaFold results - spectrum b

_images/alphafoldA0A833YAA7.png

AlphaFold results - units from MRF

_images/alphafoldUnitsA0A833YAA7.png

AlphaFold results A0A833YAA7

PF06392 - Acid shock protein repeat

PF06392 Protein family information

A0A0A3YPY8

A0A0A3YPY8 Interpro sequence information

Sequence:

>tr|A0A0A3YPY8|A0A0A3YPY8_9GAMM Acid shock protein OS=Erwinia typographi OX=371042 GN=asr PE=3 SV=1
MKKLFALVVAAAMGLSSVAFAADTTAAPATTPAATTAAPAKATTTKHHKKHKKATVQKAQ
AAKKVHHKKVAKKPVAQKAQAAKKVHHKKVAKKPVAQKAQAAKKVHHKKVAKKPVAQIAQ
AAKKVHHKKVAKKPVAQKAQAAKKVHHKKVTKKAAAPKA

Sequence Fragment:

HHKKHKKATVQKAQAAKKVHHKKVAKKPVAQKAQAAKKVHHKKVAKKPVAQKAQAAKKVH
HKKVAKKPVAQIAQAAKKVHHKKVAKKPVAQKAQAAKKVHHKKVTKKAAAPKA

MRF results:

Region 1: 51-150, 20 aa length, 5 units
HKKATVQKAQAAKKVHHKKV
AKKPVAQKAQAAKKVHHKKV
AKKPVAQKAQAAKKVHHKKV
AKKPVAQIAQAAKKVHHKKV
AKKPVAQKAQAAKKVHHKKV

Region 2: 24-39, 6 aa length, 3 units
TTAAPA
TTPAA-
TTAAP-
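
The Region 1 units can be reproduced directly from the full sequence. The following is a minimal sketch, assuming the reported coordinates are 1-based and inclusive; the helper name `split_region` is hypothetical and is not part of MRF. Region 2 contains alignment gaps, so fixed-width slicing like this does not apply to it.

```python
# Full A0A0A3YPY8 sequence as listed above (159 residues).
SEQUENCE = (
    "MKKLFALVVAAAMGLSSVAFAADTTAAPATTPAATTAAPAKATTTKHHKKHKKATVQKAQ"
    "AAKKVHHKKVAKKPVAQKAQAAKKVHHKKVAKKPVAQKAQAAKKVHHKKVAKKPVAQIAQ"
    "AAKKVHHKKVAKKPVAQKAQAAKKVHHKKVTKKAAAPKA"
)


def split_region(sequence, start, end, unit_length):
    """Cut the region [start, end] into fixed-width units.

    Assumption: start and end are 1-based and inclusive.
    """
    region = sequence[start - 1:end]
    if len(region) % unit_length != 0:
        raise ValueError("region length is not a multiple of the unit length")
    return [region[i:i + unit_length] for i in range(0, len(region), unit_length)]


# Region 1 as reported: residues 51-150, 20 aa per unit.
region1_units = split_region(SEQUENCE, 51, 150, 20)
for unit in region1_units:
    print(unit)
```

Under that coordinate assumption, the printed units match the five Region 1 units listed above.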

TAPAS results:

_images/tapasA0A0A3YPY8.png

AlphaFold results - spectrum b

_images/A0A0A3YPY8alphafold.png

AlphaFold results - units from MRF

_images/A0A0A3YPY8alphafoldUnits.png

AlphaFold results A0A0A3YPY8

PF06671 - Repeat of unknown function (DUF1174)

PF06671 Protein family information

This family does not have AlphaFold models.

A0A8R1DRW7

A0A8R1DRW7 Interpro sequence information

Sequence:

>A0A8R1DRW7 1-1421
MKPRWLSSSSPTTSQLLLLSSILLLANAKPLNLPQTITCADNIYVYVNETQADRSPYIFVEIKTETIHDCIDACFGNQFC
YSLKFDQSKTDSCSLYYFAAYNCTGHELRPAKSVTYNGGAVTIDCLRCPSNGDFVTAPPFSSFTEQTIQAIGLRGETLSE
KPLVEEITHNIDSKLESTTTTAAPTSGHSTATVDLHVQDTTPTSESPETTTVPVEPVTSTETAVTQAAQEGSKGNYYPAC
YINFQVEDISTQPNFEHYSVKPAKSANACARFCFVGLCTVAVYSPSRRECLLGKERTEQCTEADNKFSYSGTQDVVLQCF
RCSSRKLPPVTKPPVSFQKEEEVTTQATVDESTTTTEATTTTTNLKSETSTSQNESATTEKPEEPTVLTTDAVESATVAE
ANDPEPAIATKVEMAKDKEGVKTTQRKHCVIKFQARPLSQRPENLQAKFELNVPVDSIELCATRCYQDGCSGARFDPTDK
SCTLSYDDPQFCARGNVFIHYEANETTWLHCVNCYTVKPSDIDEARTGTTSAPHLTTNEETTTPAQSSETTTVVTTESSS
AIPSSEEPTTTIATSTVKASEPDSDFQKGCLIKFQARPLSERPKEFSAKFETEIKVESVEVCATRCYQDGCSGARFDPVW
STCSLSYDEKHFCARGDVFLQYMAKEVTWIHCVNCYAIKPSVAADVSKVPNKLNNENQVTTTTTAGPATNAWGEEISTTS
STNQKEEKTATIEPSAEESTTIMTIGQEVEDDSLLKGCIVHFQAQPIEQRSAEFTAPFELNLNVPTTETCAHRCYQDGCT
AARYDPETKKCSLAYEDKPFCGKGKLVNVDRSKSTVWIHCLSCVPLNHAKVAENTDENITEFPSGQEIPTTLSAETEGSG
EETAVPSTTTPAEASKDGEVTEASGEETTTTAVTEASGEETTTAAVTEGSGEDAAVSSTTAPAEASKDGEVTEASGEETT
TTAVTEASGEETTTAAVTEGSGEEIAVPTTTAPAEASKDGEVTEASGEETTNAAVTEGSGEDAAVSSTTAAAEASKDGEV
TEASGEETTTASVTEASGEESTTAAVTEGSGEDAAVSSTTAPAEASKDGQVTEASGEETTTTAVTEASGEETTTAAVTEG
SGEDAAVSSTTAAAEASKDGQVTEASGEETTTTAVTEASGEETTTAAVTEGSGEDGAVSSTTAAAEASKDGEVTEASGEE
TTTTAVTEASGEETTTAAVTEGSGEDAAVSSTTAPAEASKDGQVTEASGEETTTTAVTEASGEETTTAVVTEGSGEDAAV
SSTTAPAEASKDGEVTEASGEETTTTSVTEASGEETTTAAVTEGSGEDAAVSSTTAPVEASKDGQVTEASGEETTTTSVT
EASGEETTTAADTEGSGEDAAVSSTTAPTEASKDGEVTEASGEETTTTSVTEASGEETTTA

MRF results:

_images/PF06671_A0A8R1DRW7_MRF.png

TAPAS results:

_images/PF06671_A0A8R1DRW7.png

PF06740 - Microtubule associated protein Futsch

PF06740 Protein family information

This family has no AlphaFold models for the long sequences that contain many of these repeats. Models are generated only for the shorter sequences that have one or two repeats of this type.

B4R2P4

B4R2P4 Interpro sequence information

Sequence:

>B4R2P4 1-886
MSDEGGQKPHHSPHLRRHHHRHYRGALRVVAKVAGKVAPTRGNCASGDAALEAVETIKLDNSNPLDTPCVLESMSVPGSP
GIAYISGSTSDPSAIRERLIQYASENLVTEVLIHPQYNTLIQCMRNLLSSFTRHRHIIHAGYTFSGNGSWILHVDTEASR
PESVVDSVKDEAEKQESPQYIKDDKSTEHSRRESLADKSAVPSEKFVSRPVSVASDHEAAEAIEDDAKSSISPKDKSRPG
SVAETVSSPIEEAPIEFSKIEVVEKSNLALSLQAGSGGKLQTDSSPVDVAEGDFSHVVASVSTVTPTLTKPAELAKIGAA
ITVSSPVDEAPRTPSAPEHISRADSPAEYASEEIASQDKSPQVSKESSRPASVTESKDDAAQLKRSVEDLRSPVASTEIS
RPASVGETASSPIEEAPKDFAEFEQSVKAMLPLTIELKGSLPTLSSPVDVAHGDFPPTSTTSSPTAAAVQPAELSKVDIE
KTASSPIDEAPKSVLGSPAEERPESPAESAKDAAESVEKSKDASRPPSVVESTKADSTKGDISPSPESVLEGPKDDVEKS
RESSRPPSVSASITGDSTKDVSRPASVVESVRDEHDKAESRRDTSSATKDDSLKETVAEFLATEKIVSAKEAFSTEATKS
ADDCLKKATASTVSSTTASQRALFVGTDESRRESLLSQASESRLTHSDPEDEEPADDVDERSSVKESRSKSIATIMMTSI
YKPSEDMEPISKLVEEEHEHVEELTQEVTSTSKTTTLLQSSEQSSSTTTSSTTKTGASRVESITLTQMDQQTSQSQAEPA
DRKTPPTAPVSPGVKAMSSTGSAGSVIGAGAVAAGGKCESSAASIVSSSGPMSPKDISGKSSPGALTSESQSIPTPLGRE
SHTDTP

MRF results:

_images/PF06740_B4R2P4_MRF.png

TAPAS results:

_images/PF06740_B4R2P4.png