How to use list of values with statistics in python

Question

The code I am using so far is:

import os
import math
import statistics
def main ():
    infile = open('USPopulation.txt', 'r')
    values = infile.read()
    infile.close()
    values = values.split('
')
    index = 0
    for _ in values:
        values[index] = int(values[index])
    while index < len(values):
        index += 41
    print(values)
 main()

This code gives me the following output which appears to be a list of integer values from the text file that I am using.

[151868, '153982', '156393', '158956', '161884', '165069', '168088', '171187', '174149', '177135', '179979', '182992', '185771', '188483', '191141', '193526', '195576', '197457', '199399', '201385', '203984', '206827', '209284', '211357', '213342', '215465', '217563', '219760', '222095', '224567', '227225', '229466', '231664', '233792', '235825', '237924', '240133', '242289', '244499', '246819', '249623']

My tasks is to create a program which shows average change in population during the time period. The year with the greatest increase in population during the time period. The year with the smallest increase in population (from the previous year) during the time period.

I am totally lost on the logic for how to make this happen or where to check for resources, my textbook has not been very helpful on this.

For Example: When I add the following code:

pop = sum(values)
print(statistics.mean(pop))

I get this error:

TypeError: unsupported operand type(s) for +: 'int' and 'str'

Your help is greatly appreciated. Not sure what to do here.

prhmma · Accepted Answer

as @jan mentioned, one of your problem is when you try converting your list of string to list of int. you should do it this way:

values= [int(i) for i in values]

or the one that @jan said will work too. after that mean operation needs two values, or in this case, it gets a list and uses the length of it as the second value which you did not provide in your code. this gives you an average of the population:

print(statistics.mean(values))

but I think you want the mean of population increase, not just population. in this case, you need to have another list of differences, then calculate the mean of that.

diff=[second-first for first, second in zip(values,values[1:])]

the list "diff" will contain difference values for each consequative years. you can do operations like min,max and mean on this list to get what you want.

How to use list of values with statistics in python

Answers (2)

Related Questions