Reputation: 87
I am using google sheets to hold data for a shared project. Using Google's Sheets API I access the data, process it in python, and I am trying to update the Sheets file using batchUpdate, in the function writer.
TypeError: Object of type DataFrame is not JSON serializable
.to_json()
, I get this:googleapiclient.errors.HttpError: <HttpError 400 when requesting https://sheets.googleapis.com/v4/spreadsheets/XXX/values:batchUpdate?alt=json returned "Invalid value at 'data[0].values' (type.googleapis.com/google.protobuf.ListValue), "{"0":{"0":1},"1":{"0":2},"2":{"0":3},"3":{"0":4}}"". Details: "[{'@type': 'type.googleapis.com/google.rpc.BadRequest', 'fieldViolations': [{'field': 'data[0].values', 'description': 'Invalid value at 'data[0].values' (type.googleapis.com/google.protobuf.ListValue), "{"0":{"0":1},"1":{"0":2},"2":{"0":3},"3":{"0":4}}"'}]}]">
Any pointers would be much appreciated.
import pickle
import os.path
import pandas as pd
from googleapiclient.discovery import build
from google_auth_oauthlib.flow import InstalledAppFlow
from google.auth.transport.requests import Request
from pprint import pprint
# If modifying these scopes, delete the file token.pickle.
SCOPES = ['https://www.googleapis.com/auth/spreadsheets']
# Spreadsheet ID: https://docs.google.com/spreadsheets/d/XXX/edit#gid=0
SPREADSHEET_ID = 'XXX'
RANGE_NAME = 'contacts'
def writer(df):
service = build('sheets', 'v4', credentials=gsheet_api(SCOPES))
sheet_name = 'contacts'
data = [{'range' : sheet_name, 'values' : df}]
batch_update_values_request_body = {
'value_input_option': 'RAW',
'data': data }
request = service.spreadsheets().values().batchUpdate(spreadsheetId=SPREADSHEET_ID,
body=batch_update_values_request_body)
response = request.execute()
pprint(response)
df = [[1, 2, 3, 4]]
writer(df)
Upvotes: 3
Views: 8896
Reputation: 201408
I believe your goal and situation as follows.
For this, how about this answer?
I'm not sure about the values of the dataframe. So in this answer, I would like to explain about the modification points using the following sample dataframe.
A B C
0 1 2 3
1 4 5 6
2 7 8 9
Unfortunately, the dataframe cannot be directly used for the request body of the method "spreadsheets.values.batchUpdate". So in this case, it is required to convert from the dataframe to the 2 dimensional array. For this, I used tolist()
.
When your script is modified using the sample dataframe, it becomes as follows.
df = [[1, 2, 3, 4]]
writer(df)
To:
sampleValue = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
df = pd.DataFrame(sampleValue, columns=list('ABC'))
values = [df.columns.values.tolist()]
values.extend(df.values.tolist())
writer(values)
From
values = [df.columns.values.tolist()]
values.extend(df.values.tolist())
To
values = df.values.tolist()
Upvotes: 8