Reputation: 23
I need to perform K-means clustering and hierarchical clustering on DNA (nucleotide) sequences that I have downloaded in FASTA format. Before clustering, I need to encode the bases (a, t, c, g) as decimal numbers. How can I do that so that I can use the sequences as a matrix input in MATLAB?
Upvotes: 1
Views: 413
Reputation: 13984
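Use the nt2int function from the Bioinformatics Toolbox. It converts a nucleotide sequence into a vector of integers (A = 1, C = 2, G = 3, T = 4). Documentation: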
http://www.mathworks.com/help/bioinfo/ref/nt2int.html
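As a rough sketch of how the encoded sequences could be collected into a matrix, something like the following might work. The example sequences are made up for illustration, and padding shorter sequences with zeros is only an assumption on my part; for sequences of unequal length, aligning them first may be a better choice.

```matlab
% Requires the Bioinformatics Toolbox (nt2int).
% Hypothetical example data. With a real file you would typically do:
%   records = fastaread('yourfile.fasta');   % placeholder file name
%   seqs = {records.Sequence};
seqs = {'ACGT', 'GGTAC', 'TTA'};

numSeqs = numel(seqs);
maxLen  = max(cellfun(@numel, seqs));   % length of the longest sequence

% One row per sequence. Zero is used here as padding (an assumption),
% which does not collide with the codes A=1, C=2, G=3, T=4.
encoded = zeros(numSeqs, maxLen);
for k = 1:numSeqs
    codes = double(nt2int(seqs{k}));    % nt2int returns uint8 by default
    encoded(k, 1:numel(codes)) = codes;
end

disp(encoded)
```

The resulting numeric matrix has one row per sequence, which is the layout that kmeans and linkage (Statistics and Machine Learning Toolbox) expect for their data argument.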
Upvotes: 1