learner
learner

Reputation: 344

In the pySpark what is the diffrence between Order By and sort

this is basic question but

Both are use for sorting data and both have the same functionality like performe asc by default the what isthe difrence

sort
Order by

Upvotes: 1

Views: 1334

Answers (1)

MaxU - stand with Ukraine
MaxU - stand with Ukraine

Reputation: 210812

I think they are synonyms:

look at this

def sort(self, *cols, **kwargs):
    """Returns a new :class:`DataFrame` sorted by the specified column(s).
    :param cols: list of :class:`Column` or column names to sort by.
    :param ascending: boolean or list of boolean (default True).
        Sort ascending vs. descending. Specify list for multiple sort orders.
        If a list is specified, length of the list must equal length of the `cols`.
    >>> df.sort(df.age.desc()).collect()
    [Row(age=5, name=u'Bob'), Row(age=2, name=u'Alice')]
    >>> df.sort("age", ascending=False).collect()
    [Row(age=5, name=u'Bob'), Row(age=2, name=u'Alice')]
    >>> df.orderBy(df.age.desc()).collect()
    [Row(age=5, name=u'Bob'), Row(age=2, name=u'Alice')]
    >>> from pyspark.sql.functions import *
    >>> df.sort(asc("age")).collect()
    [Row(age=2, name=u'Alice'), Row(age=5, name=u'Bob')]
    >>> df.orderBy(desc("age"), "name").collect()
    [Row(age=5, name=u'Bob'), Row(age=2, name=u'Alice')]
    >>> df.orderBy(["age", "name"], ascending=[0, 1]).collect()
    [Row(age=5, name=u'Bob'), Row(age=2, name=u'Alice')]
    """
    jdf = self._jdf.sort(self._sort_cols(cols, kwargs))
    return DataFrame(jdf, self.sql_ctx)

orderBy = sort

Upvotes: 2

Related Questions