
(Solved) : Bioinformatics Regular Expression Python Files Uniprot Saved Fasta Format Specific Header Q32478806 . . .

***Bioinformatics: regular expression in Python***

Files from UniProt saved in the FASTA format have a specific header structure given by:

db|Id|Entry Protein OS = Organism [GN = Gene] PE = Existence SV = Version

Write a function that uses regular expressions to parse a string in this format and returns a dictionary with the different fields (the key should be the field name). Note that the part in square brackets is optional, the parts in italics are the values of the fields, while the parts in upper case are constant placeholders.

## I am really confused with the wording…

I hope info from this website helps: https://www.uniprot.org/help/fasta-headers


Expert Answer
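
One possible approach is sketched below; it is not an official solution. It assumes the header starts with three pipe-separated parts (database, identifier, entry name), as described on the UniProt help page linked in the question, followed by the protein name and the `OS`, `GN`, `PE`, `SV` fields. The function name `parse_uniprot_header` and the dictionary keys are choices made for this sketch, since the question does not pin down exact key names. The optional `OX` field does not appear in the question's template; it is included only because current UniProt headers carry it. Whitespace around `=` is accepted so that both `OS = Organism` and `OS=Organism` parse.

```python
import re

# Verbose regex with named groups; each group name becomes a dictionary key.
# Key names are an assumption of this sketch, modelled on the question's template.
HEADER_RE = re.compile(
    r"""
    ^>?                                   # optional FASTA '>' marker
    (?P<db>[^|\s]+)\|                     # database, e.g. sp or tr
    (?P<Id>[^|\s]+)\|                     # unique identifier (accession)
    (?P<Entry>\S+)\s+                     # entry name
    (?P<Protein>.+?)\s+                   # protein name, may contain spaces
    OS\s*=\s*(?P<Organism>.+?)\s+         # organism name, may contain spaces
    (?:OX\s*=\s*(?P<OrganismId>\d+)\s+)?  # optional taxonomy id (not in the question)
    (?:GN\s*=\s*(?P<Gene>.+?)\s+)?        # optional gene name
    PE\s*=\s*(?P<Existence>\d+)\s+        # protein existence level
    SV\s*=\s*(?P<Version>\d+)\s*$         # sequence version
    """,
    re.VERBOSE,
)


def parse_uniprot_header(header):
    """Return a dict of header fields; optional fields that are absent are omitted."""
    match = HEADER_RE.match(header.strip())
    if match is None:
        raise ValueError(f"not a recognised UniProt FASTA header: {header!r}")
    # groupdict() maps unmatched optional groups to None, so filter those out.
    return {key: value for key, value in match.groupdict().items() if value is not None}


if __name__ == "__main__":
    # Illustrative header strings written for this sketch.
    with_gene = "sp|P12345|AATM_RABIT Aspartate aminotransferase OS = Oryctolagus cuniculus GN = GOT2 PE = 1 SV = 2"
    without_gene = "tr|X00001|X00001_TEST Hypothetical protein OS=Some organism PE=4 SV=1"
    print(parse_uniprot_header(with_gene))
    print(parse_uniprot_header(without_gene))
```

The lazy quantifiers (`.+?`) let the protein and organism names contain spaces while still stopping at the next constant placeholder, and wrapping the `GN` part in an optional non-capturing group `(?: ... )?` is what makes the bracketed field optional. All values are returned as strings; converting `Existence` and `Version` to `int` would be a small extension.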

