FASTA FORMAT

A sequence in FASTA format begins with a single-line description, followed by one or more lines of sequence data. The description line (defline) is distinguished from the sequence data by a greater-than (">") symbol at the beginning. Blank lines are not allowed in the middle of FASTA input.

The following example shows three protein sequences in FASTA format.

>SYC1_MYCTU
MTDRARLRLHDTAAGVVRDFVPLRPGHVSIYLCGATVQGLPHIGHVRSGVAFDILRRWLL
ARGYDVAFIRNVTDIEDKILAKAAAAGRPWWEWAATHERAFTAAYDALDVLPPSAEPRAT
GHITQMIEMIERLIQAGHAYTGGGDVYFDVLSYPEYGQLSGHKIDDVHQGEGVAAGKRDQ
RDFTLWKGEKPGEPSWPTPWGRGRPGWHLECSAMARSYLGPEFDIHCGGMDLVFPHHENE
IAQSRAAGDGFARYWLHNGWVTMGGEKMSKSLGNVLSMPAMLQRVRPAELRYYLGSAHYR
SMLEFSETAMQDAVKAYVGLEDFLHRVRTRVGAVCPGDPTPRFAEALDDDLSVPIALAEI
HHVRAEGNRALDAGDHDGALRSASAIRAMMGILGCDPLDQRWESRDETSAALAAVDVLVQ
AELQNREKAREQRNWALADEIRGRLKRAGIEVTDTADGPQWSLLGGDTK
>ARGR_ECOLI
MRSSAKQEELVKAFKALLKEEKFSSQGEIVAALQEQGFDNINQSKVSRMLTKFGAVRTRN
AKMEMVYCLPAELGVPTTSSPLKNLVLDIDYNDAVVVIHTSPGAAQLIARLLDSLGKAEG
ILGTIAGDDTIFTTPANGFTVKDLYEAILELFDQEL
>FABPL_PIG
MANASGFLGSSVPALRRATQPQHSISSSRGSSSDFVFKRVFCCSAVQGSDRQSLGDSRSP
RLVSRGCKLIGSGSAIPSLQISNDDLAKIVDTNDEWISVRTGIRNRRVLTGKDSLTNLAS
EAARKALEMAQIDADDVDMVLMCTSTPEDLFGSAPQISKALGCKKNPLSYDITAACSGFV
LGLVSAACHIRGGGFNNVLVIGADSLSRYVDWTDRGTCILFGDAAGAVVVQSCDAEEDGL
FAFDLHSDGDGQRHLKAAIKEDEVDKALGSNGSIRDFPPRRSSYSCIQMNGKEVFRFACR
CVPQSIESALGKAGLNGSNIDWLLLHQANQRIIDAVATRLEVPQERIISNLANYGNTSAA
SIPLALDEAVRSGNVKPGHVIATAGFGAGLTWGSAIIRWG
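
The rules described above (a defline starting with ">", followed by sequence lines, with no blank lines in the middle) can be illustrated with a small reader. This is only a minimal sketch in Python, not a reference implementation: the function name parse_fasta is hypothetical, and the choices to tolerate blank lines before the first record or after the last one, and to reject sequence data that precedes any defline, are assumptions made for the example.

```python
def parse_fasta(text):
    """Return a list of (defline, sequence) tuples from FASTA-formatted text.

    Hypothetical helper for illustration only.
    """
    records = []
    defline = None   # description of the record currently being read
    chunks = []      # sequence lines collected for that record

    # Assumption: leading and trailing blank lines are harmless, so strip
    # them first; any blank line that remains is "in the middle" of the input.
    for raw in text.strip().splitlines():
        line = raw.strip()
        if not line:
            raise ValueError("blank line in the middle of FASTA input")
        if line.startswith(">"):
            # A new defline closes the previous record, if there was one.
            if defline is not None:
                records.append((defline, "".join(chunks)))
            defline, chunks = line[1:], []
        elif defline is None:
            # Assumption: sequence data must be preceded by a defline.
            raise ValueError("sequence data found before any defline")
        else:
            chunks.append(line)

    # Store the final record once the input is exhausted.
    if defline is not None:
        records.append((defline, "".join(chunks)))
    return records


if __name__ == "__main__":
    example = """>ARGR_ECOLI
MRSSAKQEELVKAFKALLKEEKFSSQGEIVAALQEQGFDNINQSKVSRMLTKFGAVRTRN
AKMEMVYCLPAELGVPTTSSPLKNLVLDIDYNDAVVVIHTSPGAAQLIARLLDSLGKAEG
ILGTIAGDDTIFTTPANGFTVKDLYEAILELFDQEL
"""
    for name, sequence in parse_fasta(example):
        print(name, len(sequence))
```

The sequence lines of each record are joined into a single string, so the line breaks in the input do not affect the stored sequence. For real work, an established parser such as the one in Biopython is likely a better choice than this sketch.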