Michael Henry
Michael Henry

Reputation: 495

Passing a function with multiple arguments to DataFrame.apply

Suppose I have a dataframe like this:

df = pd.DataFrame([['foo', 'x'], ['bar', 'y']], columns=['A', 'B'])


       A    B
0    foo    x
1    bar    y

I know how to use a single argument function with Apply when it comes to dataframes, like this:

def some_func(row):
    return '{0}-{1}'.format(row['A'], row['B'])

df['C'] = df.apply(some_func, axis=1)

df


       A    B        C
0    foo    x    foo-x
1    bar    y    bar-y

How can I use apply on dataframes when they involve multiple input arguments? Here's an example of what I want:

def some_func(row, var1):
    return '{0}-{1}-{2}'.format(row['A'], row['B'], var1)

df['C'] = df.apply(some_func(row, var1='DOG'), axis=1)

df


       A    B            C
0    foo    x    foo-x-DOG
1    bar    y    bar-y-DOG

I'm not looking for work-arounds to solve this one particular example, just how to do something like this in general. Any advice would be well appreciated, thanks.

Upvotes: 33

Views: 41307

Answers (3)

cs95
cs95

Reputation: 402263

It's just the way you think it would be, apply accepts args and kwargs and passes them directly to some_func.

df.apply(some_func, var1='DOG', axis=1)

Or,

df.apply(some_func, args=('DOG', ), axis=1)
0    foo-x-DOG
1    bar-y-DOG
dtype: object

If for any reason that won't work for your use case, then you can always fallback to using a lambda:

df.apply(lambda row: some_func(row, 'DOG'), axis=1)

0    foo-x-DOG
1    bar-y-DOG
dtype: object

Upvotes: 50

jpp
jpp

Reputation: 164613

You should use vectorized logic:

df['C'] = df['A'] + '-' + df['B'] + '-DOG'

If you really want to use df.apply, which is just a thinly veiled loop, you can simply feed your arguments as additional parameters:

def some_func(row, var1):
    return '{0}-{1}-{2}'.format(row['A'], row['B'], var1)

df['C'] = df.apply(some_func, var1='DOG', axis=1)

As per the docs, df.apply accepts both positional and keyword arguments.

Upvotes: 9

BENY
BENY

Reputation: 323226

I think it can be

df.apply('-'.join,1)+'-DOG'
Out[157]: 
0    foo-x-DOG
1    bar-y-DOG
dtype: object

Upvotes: 2

Related Questions