Merging/Joining datasets

Question

I have two datasets (csv files), A and B for simplicity. A contains all the products; B contains only unique values of products (no duplicates). A sample of A is:

Date          All_products
01/07/2020          tv
01/07/2020        radio
04/07/2020         pen
04/07/2020         pen
04/07/2020       notebook
04/08/2020         pen
...                ...
04/10/2020        radio

I have approx. 29000 rows

A sample of B is:

Unique_products     Col1    Col2 ...

tv                  300      24
radio               120      12
pen                 1000      7
notebook            60        2
...                   ....

I have only unique value from A and same other columns (Col1, Col2,...Col 5). Number of rows approx 2000.

I would like to merge the datasets in order to have for each product the values that come from dataset B (csv file):

 Date          All_products         Col1        Col2
01/07/2020          tv               300         24
01/07/2020        radio              120         12
04/07/2020         pen              1000          7
04/07/2020         pen              1000          7 
04/07/2020       notebook             60          2
04/08/2020         pen              1000          7
...                ...                ...        ...
04/10/2020        radio              120         12

Could you please tell me how to generate the output above? Should I merge/join tables? If yes, how could I do it?

Thank you for your help

Neo · Accepted Answer

a = pd.read_csv('a.txt', sep=" ")

b = pd.read_csv('b.txt', sep=" ")
b.rename(columns={'Unique_products':'All_products'}, inplace=True)

ab = pd.merge(a, b, on='All_products', how='left')

Merging/Joining datasets

Answers (1)

Related Questions