StackOverflow Questions for Tag: apache-spark-dataset

Sanjit Jha
Sanjit Jha

Reputation: 1

Spark map() operation fails with NotSerializableException

Score: 0

Views: 13

Answers: 0

Read More
tsar2512
tsar2512

Reputation: 2994

Encoder for Row Type Spark Datasets

Score: 38

Views: 31798

Answers: 3

Read More
Georg Heiler
Georg Heiler

Reputation: 17724

Why do columns change to nullable in Apache Spark SQL?

Score: 11

Views: 11981

Answers: 2

Read More
Trango
Trango

Reputation: 21

Error when import VectorAssembler in Jupyter lab - for Pyspark

Score: 0

Views: 39

Answers: 2

Read More
Amir Bashir
Amir Bashir

Reputation: 1

Spark Using Wrong Catalog in Hive Metastore: How to Use a Specific Catalog Instead of Default 'hive'?

Score: 0

Views: 143

Answers: 0

Read More
Trango
Trango

Reputation: 21

PySpark error when "from pyspark.ml.feature import VectorAssembler"

Score: 0

Views: 21

Answers: 0

Read More
Lara
Lara

Reputation: 333

Why is Spark explode function much slower than a flat map function to split array?

Score: 8

Views: 3732

Answers: 0

Read More
Asif Hussain Shahid
Asif Hussain Shahid

Reputation: 1

Question regarding the behaviour of UserDefinedType

Score: 0

Views: 24

Answers: 0

Read More
Mork
Mork

Reputation: 23

Spark read from MongoDB and filter by objectId indexed field

Score: 0

Views: 2027

Answers: 2

Read More
kk jj
kk jj

Reputation: 17

ClassCastException in Spark 3.4.1 during Dataset reduce operation

Score: 0

Views: 81

Answers: 0

Read More
marios
marios

Reputation: 8996

Differences between Spark's Row and InternalRow types

Score: 12

Views: 2544

Answers: 2

Read More
Joy Cheng
Joy Cheng

Reputation: 1

RowEncoder.apply(schema).resolveAndBind() and Row/InternalRow serializer/deserializer equivalent in Spark 3.5

Score: 0

Views: 258

Answers: 1

Read More
Antonio Ye
Antonio Ye

Reputation: 51

Spark Dataframe na.fill for nested columns

Score: 0

Views: 111

Answers: 2

Read More
Antonio Ye
Antonio Ye

Reputation: 51

Question about Dataset/Dataframe mapPartitions iterator

Score: 0

Views: 150

Answers: 1

Read More
Yudovin Artsiom
Yudovin Artsiom

Reputation: 109

Spark DataFrame: find and set the main root for child

Score: 2

Views: 1737

Answers: 1

Read More
MSS
MSS

Reputation: 3633

Joining multiple Spark datasets in Java makes it very slow

Score: 0

Views: 73

Answers: 0

Read More
Capacytron
Capacytron

Reputation: 3739

deltalake scala api for unit-testing

Score: 0

Views: 119

Answers: 1

Read More
optimal substructure
optimal substructure

Reputation: 125

Dataframes and Datasets in Spark

Score: 1

Views: 827

Answers: 1

Read More
Capacytron
Capacytron

Reputation: 3739

Splitting Spark dataset / rdd into X smaller datasets, like randomSplit but w/o random

Score: 0

Views: 18

Answers: 0

Read More
David Regan
David Regan

Reputation: 319

Spark UDF doesn't get decoded dataset class using org.typelevel.frameless encoder injection

Score: 1

Views: 101

Answers: 1

Read More
PreviousPage 1Next