Example 3:

 

How to Plot the Entropy for a Sample

In this example, you will take a sample generate a profile, compute the entropy and plot it.

 

Convert a Multi-FASTA file into a profile:

In this example, we will convert a multi-FASTA into a profile.

 

1.       Go (cd) into the ANDES/example_data directory.

2.       Run Clustalw2 on sequences to generate .aln file

 

clustalw2 -infile=20090416.fasta -quicktree

 

                This will generate the 20081201.dnd and the 20081201.aln file.

 

3.       Convert .aln file to profile. 

 

../ClustalALN_to_PositionProfile.pl -a 20090416.aln

 

This will generate the 20081201.prof. 

 

Compute the Variation for a profile:

 

4.       Compute the Normalized Shannon Entropy for the profile:

 

                                ../Compute_NormalizedShannonEntropy_for_Profile.pl -i 20090416.prof -o 20090416

 

                This will generate the positional entropy file, 20090416.ent

 

5.       Plot the entropy file:

 

../Plot_Entropy.r 20090416.ent

 

This will generate an Adobe Acrobat PDF File named 20090416.ent.pdf

 

6.       Visualize the PDF file with your favorite viewer:

 

acroread 20090416.ent.pdf

 

                The results should look something like this:

 

 

                From this plot you can see that the variation is fairly random along the aligned sequences.  There are a couple of peaks around 600-700 and near 1500, that may be interesting.