Reputation: 441
I'm trying to sort dataframe by values. got an AttributeError: 'Series' object has no attribute 'to_numeric'. version '0.20.3', so to numeric should work, but not. Please help.
import pandas as pd
tables = pd.read_html("https://www.sec.gov/Archives/edgar/data/949012/000156761919015285/xslForm13F_X01/form13fInfoTable.xml")
len(tables)
ren=tables[3]
ren.drop(ren.index[[0,1,2]], inplace=True)
ren.to_numeric(ren[3], errors='coerce')
#ren[3].convert_objects(convert_numeric=True)
ren.sort_values(by=[3],ascending=False)
Upvotes: 4
Views: 26385
Reputation: 1907
import pandas as pd
tables = pd.read_html("https://www.sec.gov/Archives/edgar/data/949012/000156761919015285/xslForm13F_X01/form13fInfoTable.xml")
len(tables)
ren=tables[3]
ren.drop(ren.index[[0,1,2]], inplace=True)
ren[3] = pd.to_numeric(ren[3], errors='coerce')
ren.sort_values([3],ascending=False, inplace=True)
ren
0 1 2 3 ...
101 JPMorgan COM 46625h100 48532 ...
44 Cisco COM 17275r102 47376 ...
204 Waste Management COM 94106L109 41558 ...
117 Microsoft COM 594918104 37492 ...
99 Johnson & Johnson COM 478160104 31491 ...
Upvotes: 7