FASTA-2-CSV

In bioinformatics FASTA format is a text-based file format to store and present nucleotide/peptide sequences. Used as preferred method for communicating sequences between projects, this format also allows for sequence names and comments to be included in header section.

While the format was originated from William Pearson’s FASTA software package, in recent years, it has become one of the key file formats used in Bioinformatics. Due to the nature of projects, in many cases, to store information from FASTA file into a RDBMS, it is handy to convert *.fasta into *.csv format.

For my projects, while searching the net I found a number of solutions suggested by research community around the world. Here I am providing a collection  together with my own solution.

Continue reading “FASTA-2-CSV”