The gto_amino_acid_to_pseudo_dna converts an amino acid (protein) sequence to a pseudo DNA sequence.
For help type:
./gto_amino_acid_to_pseudo_dna -h
In the following subsections, we explain the input and output paramters.
The gto_amino_acid_to_pseudo_dna program needs two streams for the computation, namely the input and output standard. The input stream is an amino acid sequence.
The attribution is given according to:
Usage: ./gto_amino_acid_to_pseudo_dna [options] [[--] args]
or: ./gto_amino_acid_to_pseudo_dna [options]
It converts a protein sequence to a pseudo DNA sequence.
-h, --help show this help message and exit
Basic options
< input.prot Input amino acid sequence file (stdin)
> output.dna Output DNA sequence file (stdout)
Example: ./gto_amino_acid_to_pseudo_dna < input.prot > output.dna
Table:
Prot DNA
A GCA
C TGC
D GAC
E GAG
F TTT
G GGC
H CAT
I ATC
K AAA
L CTG
M ATG
N AAC
P CCG
Q CAG
R CGT
S TCT
T ACG
V GTA
W TGG
Y TAC
* TAG
X GGG
It can be used to generate pseudo-DNA with characteristics passed by amino acid (protein) sequences. An example of such an input file is:
IPFLLKKQFALADKLVLSKLRQLLGGRIKMMPCGGAKLEPAIGLFFHAIGINIKLGYGMTETTATVSCWHDFQFNPNSIG
TLMPKAEVKIGENNEILVRGGMVMKGYYKKPEETAQAFTEDGFLKTGDAGEFDEQGNLFITDRIKELMKTSNGKYIAPQY
IESKIGKDKFIEQIAIIADAKKYVSALIVPCFDSLEEYAKQLNIKYHDRLELLKNSDILKMFE
The output of the gto_amino_acid_to_pseudo_dna program is a DNA sequence.
Using the input above, an output example for this is the following:
ATCCCGTTTCTGCTGAAAAAACAGTTTGCACTGGCAGACAAACTGGTACTGTCTAAACTGCGTCAGCTGCTGGGCGGCCG
TATCAAAATGATGCCGTGCGGCGGCGCAAAACTGGAGCCGGCAATCGGCCTGTTTTTTCATGCAATCGGCATCAACATCA
AACTGGGCTACGGCATGACGGAGACGACGGCAACGGTATCTTGCTGGCATGACTTTCAGTTTAACCCGAACTCTATCGGC
ACGCTGATGCCGAAAGCAGAGGTAAAAATCGGCGAGAACAACGAGATCCTGGTACGTGGCGGCATGGTAATGAAAGGCTA
CTACAAAAAACCGGAGGAGACGGCACAGGCATTTACGGAGGACGGCTTTCTGAAAACGGGCGACGCAGGCGAGTTTGACG
AGCAGGGCAACCTGTTTATCACGGACCGTATCAAAGAGCTGATGAAAACGTCTAACGGCAAATACATCGCACCGCAGTAC
ATCGAGTCTAAAATCGGCAAAGACAAATTTATCGAGCAGATCGCAATCATCGCAGACGCAAAAAAATACGTATCTGCACT
GATCGTACCGTGCTTTGACTCTCTGGAGGAGTACGCAAAACAGCTGAACATCAAATACCATGACCGTCTGGAGCTGCTGA
AAAACTCTGACATCCTGAAAATGTTTGAG