Tronald Dump

Reputation: 1350

Reading multiple SequenceFiles in PySpark?

Is there a way to read multiple SequenceFiles in a single call? For example, something like:

sc.sequenceFile(['filepath_1.seq', 'filepath_2.seq', 'filepath_3.seq'])

Upvotes: 0

Views: 70

Answers (1)

Alper t. Turker

Reputation: 35249

Use a glob pattern:

sc.sequenceFile('filepath_*.seq')

or a single comma-separated string of paths:

sc.sequenceFile(",".join(['filepath_1.seq', 'filepath_2.seq', 'filepath_3.seq']))
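If you already have the paths in a Python list, the comma-separated approach can be wrapped in a small helper. This is only a sketch: `join_paths` and `read_sequence_files` are hypothetical names, `sc` is assumed to be an existing `SparkContext`, and the comma check is an assumption that commas act as path separators here, so a path containing one would likely be split.

```python
def join_paths(paths):
    """Build the single comma-separated path string from a list of paths."""
    paths = list(paths)
    if not paths:
        raise ValueError("expected at least one path")
    for p in paths:
        # Assumption: a comma inside a path would be read as a separator.
        if "," in p:
            raise ValueError("path contains a comma: %r" % p)
    return ",".join(paths)


def read_sequence_files(sc, paths):
    """Read several SequenceFiles into one RDD with a single call.

    `sc` is assumed to be an already-created SparkContext.
    """
    return sc.sequenceFile(join_paths(paths))
```

Usage would be something like `rdd = read_sequence_files(sc, ['filepath_1.seq', 'filepath_2.seq', 'filepath_3.seq'])`. If the files share a naming scheme, the glob pattern is shorter and needs no helper.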

Upvotes: 1
