Adding a new column or row as pd.Series

Question

I am trying to add one Column and one Row by using a pd.Series object. Here is what I have so far:

import pandas as pd
df = pd.DataFrame([
    {"Title": "Titanic",    "ReleaseYear": 1997, "Director": "James Cameron"},
    {"Title": "Spider-Man", "ReleaseYear": 2002, "Director": "Sam Raimi"}
])

# Add a new row
new_movie_row = pd.Series(['Jurassic Park', 1993, 'Steven Spielberg'])
df.loc[2] = new_row

# Add a new column
new_keyword_column = pd.Series(['Boat', 'Spider', 'Dinosaur'])
df['Keyword'] = new_keyword_column
df

This seems to add the Column fine, however the Row gives me all NaN:

What would be the correct way to do this?

Ch3steR · Accepted Answer

Pandas tries to align based on index/column names this is called Data Alignment, we can use .tolist here.

df.loc[2] = new_movie_row.tolist()
df
           Title  ReleaseYear          Director
0        Titanic         1997     James Cameron
1     Spider-Man         2002         Sam Raimi
2  Jurassic Park         1993  Steven Spielberg

This applies same for adding columns too

new_keyword_column = pd.Series(['Boat', 'Spider', 'Dinosaur'],index=[4,5,6])  # Notice the Index is 4, 5, 6.

df['new'] = new_keyword_column
df
           Title  ReleaseYear          Director  new
0        Titanic         1997     James Cameron  NaN
1     Spider-Man         2002         Sam Raimi  NaN
2  Jurassic Park         1993  Steven Spielberg  NaN

Since indexes don't align you get all NaN, to counter that you can use .tolist()

df['new'] = new_keyword_column.tolist()
df
           Title  ReleaseYear          Director       new
0        Titanic         1997     James Cameron      Boat
1     Spider-Man         2002         Sam Raimi    Spider
2  Jurassic Park         1993  Steven Spielberg  Dinosaur

Adding a new column or row as pd.Series

Answers (2)

Related Questions