Reputation: 618
I am currently working on the use case, where
JSON
format; to be more specific, what would be file name in S3 when the dataframe is saved.Upvotes: 1
Views: 255
Reputation: 7028
First of all, why do you want to write each partition in a separate bucket?
To your second question: The saved data would depend on the amount of partitions you are saving to S3. You can always repartition the data by calling .repartition()
on your data frame. Since the files are saved in the Hadoop File Format, the name will contain some specific numbers and a -part suffix similar to this: part-block-0-0-r-00000-.json
Upvotes: 1