How can I set the value for a specific row for a Pandas DataFrame in a for loop?

Question

for petid in X['PetID']:
    sentiment_file = datapath + '/train_sentiment/' + petid + '.json'
    if os.path.isfile(sentiment_file):
        json_data = json.loads(open(sentiment_file).read())
        X['DescriptionLanguage'] = json_data['language']
        X['DescriptionMagnitude'] = json_data['documentSentiment']['magnitude']
        X['DescriptionScore'] = json_data['documentSentiment']['score']
        # print(petid, sentiment_file,
        #       json_data['documentSentiment']['magnitude'])
    else:
        X['DescriptionLanguage'] = 'Unknown'
        X['DescriptionMagnitude'] = 0
        X['DescriptionScore'] = 0

This is what I have, but this doesn't work. It sets EVERY row to have those values for DescriptionLanguage, DescriptionMagnitude and DescriptionScore.

Heikki Pulkkinen · Accepted Answer

You can use .loc to set a individual value instead of a whole column. Here is a contained example

import pandas as pd
import numpy as np

X = pd.DataFrame(np.arange(5), columns=['PetID'])

for ind, row in X.iterrows():
    petid = row['PetID']
    X.loc[ind, 'DescriptionLanguage'] = 'No description for {}'.format(petid)

How can I set the value for a specific row for a Pandas DataFrame in a for loop?

Answers (2)

Related Questions