Pandas Dataframe simple Stats

Question

I want to find the average number of points scored by the team per game in each season.

Is there an easy way to account for both cases: games the team won (its points are in Wscore) and games it lost (its points are in Lscore)?

   Season  Daynum  Wteam  Wscore  Lteam  Lscore Wloc  Numot
0    1985      20   1228      81   1328      64    N      0
1    1985      25   1106      77   1354      70    H      0

season - this is the year of the associated entry in seasons.csv (the year in which the final tournament occurs)

daynum - this integer always ranges from 0 to 132 and tells you what day the game was played on. It represents an offset from the dayzero date in the seasons.csv file. For example, the first game in the file was daynum=20. Combined with the fact from the seasons.csv file that day zero was 10/29/1984, that means the first game was played 20 days later, on 11/18/1984. No team ever played more than one game on a given date, so you can use this fact if you need a unique key. To guarantee this uniqueness, we had to adjust one game's date. In March 2008, the SEC postseason tournament had to reschedule one game (Georgia-Kentucky) to a subsequent day, so Georgia actually had to play two games on the same day. To preserve uniqueness, we moved the game date for the Georgia-Kentucky game back to its original date.
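
As a quick check of the daynum arithmetic described above, here is a minimal sketch using only the standard library. The helper name game_date is hypothetical, and the day-zero value is the one quoted for the 1985 season; other seasons would need their own dayzero from seasons.csv.

```python
from datetime import date, timedelta

# Day zero for the 1985 season, as quoted in the column description (10/29/1984).
day_zero = date(1984, 10, 29)

def game_date(day_zero, daynum):
    """Convert a daynum offset into a calendar date (hypothetical helper)."""
    return day_zero + timedelta(days=daynum)

# The first game in the file has daynum=20.
first_game = game_date(day_zero, 20)
print(first_game)  # 1984-11-18
```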

wteam - this identifies the id number of the team that won the game, as listed in the teams.csv file. No matter whether the game was won by the home team or visiting team, wteam always identifies the winning team.

wscore - this identifies the number of points scored by the winning team.

lteam - this identifies the id number of the team that lost the game.

lscore - this identifies the number of points scored by the losing team.

numot - this indicates the number of overtime periods in the game, an integer 0 or higher.

wloc - this identifies the location of the winning team. If the winning team was the home team, this value will be H. If the winning team was the visiting team, this value will be A. If it was played on a neutral court, then this value will be N. Sometimes it is unclear whether the site should be considered neutral, since it is near one team's home court, or even on their court during a tournament, but for this determination we have simply used the Kenneth Massey data in its current state, where the @ sign is either listed with the winning team, the losing team, or neither team.
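
One possible way to handle both the won and lost cases is to reshape the data so that each game contributes two rows, one per team, and then group by season and team. The sketch below assumes the column names shown in the sample (Season, Wteam, Wscore, Lteam, Lscore) and uses only the two sample rows; the names games, long_form, Team, Score, and avg_points are illustrative, not part of the dataset.

```python
import pandas as pd

# Two sample rows copied from the question; a real run would load the full games file.
games = pd.DataFrame({
    "Season": [1985, 1985],
    "Daynum": [20, 25],
    "Wteam": [1228, 1106],
    "Wscore": [81, 77],
    "Lteam": [1328, 1354],
    "Lscore": [64, 70],
    "Wloc": ["N", "H"],
    "Numot": [0, 0],
})

# Build one row per (team, game): winners and losers share the same column names.
wins = games[["Season", "Wteam", "Wscore"]].rename(
    columns={"Wteam": "Team", "Wscore": "Score"}
)
losses = games[["Season", "Lteam", "Lscore"]].rename(
    columns={"Lteam": "Team", "Lscore": "Score"}
)
long_form = pd.concat([wins, losses], ignore_index=True)

# Average points per game for every team in every season.
avg_points = long_form.groupby(["Season", "Team"])["Score"].mean()
print(avg_points)
```

For a single team, the same long-form table can be filtered first, for example long_form[long_form["Team"] == 1228].groupby("Season")["Score"].mean(), which gives that team's per-season average across wins and losses alike.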
