Splitting a pandas DataFrame of email 'From' field into sender's name, email address

Question

I've a pandas Dataframe consisting of a single column which is the extraction from the From field of emails e.g.

                                                   From
0          Grey Caulfu 
1                   Deren Torculas 
2            Charlto Youna

I want to take advantage of the str accessor to split the data into two columns, such that the first column is, Name, contains the actual name (first name last name), and the second column, Email, contains the email address).

If I use:

df = pd.DataFrame(df.From.str.split(' ',1).tolist(),
                                   columns = ['Name','Email'])

This is almost what I need, but it puts the surname in the Email column (i.e. it places the last two items from split() into this column). How do I modify this so that split() knows to stop after the first space when populating the first column?

Once we achieve this, we then need to make it a little more robust, so that it can handle names that contain three elements e.g.

Billy R. Valentine 
Yurimov | Globosales

Anand S Kumar · Accepted Answer

You can use rsplit() instead of split() , to split from the reverse. Example -

In [12]: df1 = pd.DataFrame(df.From.str.rsplit(' ',1).tolist(), columns=['Name','Email'])

In [13]: df1
Out[13]:
             Name                        Email
0     Grey Caulfu      
1  Deren Torculas  
2   Charlto Youna

Splitting a pandas DataFrame of email 'From' field into sender's name, email address

Answers (2)

Related Questions

Splitting a pandas DataFrame of email &#39;From&#39; field into sender&#39;s name, email address

Answers (2)

Related Questions

Splitting a pandas DataFrame of email 'From' field into sender's name, email address