Scoring each row based on a matrix in Python

Question

I have a matrix as follows

  0   1   2   3   ...
A 0.1 0.2 0.3 0.1
C 0.5 0.4 0.2 0.1
G 0.6 0.4 0.8 0.3
T 0.1 0.1 0.4 0.2

The data is in a dataframe as shown

Genes   string
Gene1   ATGC
Gene2   GCTA
Gene3   ATCG

I need to write code that computes the score of each sequence. The score for the sequence ATGC is 0.1 + 0.1 + 0.8 + 0.1 = 1.1: A contributes 0.1 because it is in the first position and the matrix value for A at that position is 0.1, and the same lookup is repeated for every position along the sequence. The real sequences are 450 letters long.

The output should be as follows:

Genes  Score
Gene1  1.1
Gene2  1.5
Gene3  0.7

I tried using Biopython but could not get it right. Can anyone please help?


Answers (1)
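
One possible approach, sketched with pandas only (no Biopython): look up the matrix value for each base at its position and sum the values per row. The names `pwm` and `score_sequence` are made up for illustration. The sketch assumes every sequence contains only A, C, G and T and is no longer than the number of columns in the matrix; anything else would raise a `KeyError`.

```python
import pandas as pd

# Scoring matrix from the question: rows are bases, columns are positions 0..3.
pwm = pd.DataFrame(
    [[0.1, 0.2, 0.3, 0.1],
     [0.5, 0.4, 0.2, 0.1],
     [0.6, 0.4, 0.8, 0.3],
     [0.1, 0.1, 0.4, 0.2]],
    index=list("ACGT"),
)

# Sequences from the question.
df = pd.DataFrame({
    "Genes": ["Gene1", "Gene2", "Gene3"],
    "string": ["ATGC", "GCTA", "ATCG"],
})


def score_sequence(seq, matrix):
    """Sum matrix[base, position] over every position in seq (hypothetical helper)."""
    return sum(matrix.at[base, pos] for pos, base in enumerate(seq))


# Score each row; rounding only hides floating-point noise in the sums.
df["Score"] = df["string"].apply(lambda s: score_sequence(s, pwm)).round(6)
result = df[["Genes", "Score"]]
print(result)
```

For the three example genes this should reproduce the scores 1.1, 1.5 and 0.7 from the question. With the full 450-column matrix, only the construction of `pwm` would change, for example by loading it from a file with the bases as the index.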
