Program gto_amino_acid_to_pseudo_dna

The gto_amino_acid_to_pseudo_dna converts an amino acid (protein) sequence to a pseudo DNA sequence.

For help type:

./gto_amino_acid_to_pseudo_dna -h


In the following subsections, we explain the input and output paramters.

Input parameters

The gto_amino_acid_to_pseudo_dna program needs two streams for the computation, namely the input and output standard. The input stream is an amino acid sequence.

The attribution is given according to:

Usage: ./gto_amino_acid_to_pseudo_dna [options] [[--] args]
or: ./gto_amino_acid_to_pseudo_dna [options]

It converts a protein sequence to a pseudo DNA sequence.

-h, --help show this help message and exit

Basic options
< input.prot Input amino acid sequence file (stdin)
> output.dna Output DNA sequence file (stdout)

Example: ./gto_amino_acid_to_pseudo_dna < input.prot > output.dna
Table:
Prot DNA
A GCA
C TGC
D GAC
E GAG
F TTT
G GGC
H CAT
I ATC
K AAA
L CTG
M ATG
N AAC
P CCG
Q CAG
R CGT
S TCT
T ACG
V GTA
W TGG
Y TAC
* TAG
X GGG


It can be used to generate pseudo-DNA with characteristics passed by amino acid (protein) sequences. An example of such an input file is:

IPFLLKKQFALADKLVLSKLRQLLGGRIKMMPCGGAKLEPAIGLFFHAIGINIKLGYGMTETTATVSCWHDFQFNPNSIG
TLMPKAEVKIGENNEILVRGGMVMKGYYKKPEETAQAFTEDGFLKTGDAGEFDEQGNLFITDRIKELMKTSNGKYIAPQY
IESKIGKDKFIEQIAIIADAKKYVSALIVPCFDSLEEYAKQLNIKYHDRLELLKNSDILKMFE


Output

The output of the gto_amino_acid_to_pseudo_dna program is a DNA sequence.

Using the input above, an output example for this is the following:

ATCCCGTTTCTGCTGAAAAAACAGTTTGCACTGGCAGACAAACTGGTACTGTCTAAACTGCGTCAGCTGCTGGGCGGCCG
TATCAAAATGATGCCGTGCGGCGGCGCAAAACTGGAGCCGGCAATCGGCCTGTTTTTTCATGCAATCGGCATCAACATCA
AACTGGGCTACGGCATGACGGAGACGACGGCAACGGTATCTTGCTGGCATGACTTTCAGTTTAACCCGAACTCTATCGGC
ACGCTGATGCCGAAAGCAGAGGTAAAAATCGGCGAGAACAACGAGATCCTGGTACGTGGCGGCATGGTAATGAAAGGCTA
CTACAAAAAACCGGAGGAGACGGCACAGGCATTTACGGAGGACGGCTTTCTGAAAACGGGCGACGCAGGCGAGTTTGACG
AGCAGGGCAACCTGTTTATCACGGACCGTATCAAAGAGCTGATGAAAACGTCTAACGGCAAATACATCGCACCGCAGTAC
ATCGAGTCTAAAATCGGCAAAGACAAATTTATCGAGCAGATCGCAATCATCGCAGACGCAAAAAAATACGTATCTGCACT
GATCGTACCGTGCTTTGACTCTCTGGAGGAGTACGCAAAACAGCTGAACATCAAATACCATGACCGTCTGGAGCTGCTGA
AAAACTCTGACATCCTGAAAATGTTTGAG