Reputation: 336
Spark has connectors for a variety of databases and data stores.
However, what would be required to create a connector for your own custom distributed database? From what I understand, Spark uses Hadoop connectors to fetch data from a distributed data store, but I wasn't able to find a good resource explaining how a Hadoop connector works or how to write one.
I'm looking to understand the semantics of a Hadoop connector so that I can create one for my custom database.
Upvotes: 0
Views: 92
Reputation: 39
You have to implement a RecordReader (and the InputFormat that creates it) in Java using the Hadoop API. Spark can then read your database through that InputFormat.
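A minimal sketch of what that looks like with the new Hadoop API (org.apache.hadoop.mapreduce). The class names MyDatabaseInputFormat, MyDatabaseSplit and MyDatabaseRecordReader are placeholders, and the rows are hard-coded where a real reader would query your database:

```java
import java.io.DataInput;
import java.io.DataOutput;
import java.io.IOException;
import java.util.Arrays;
import java.util.Collections;
import java.util.Iterator;
import java.util.List;

import org.apache.hadoop.io.NullWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.io.Writable;
import org.apache.hadoop.mapreduce.InputFormat;
import org.apache.hadoop.mapreduce.InputSplit;
import org.apache.hadoop.mapreduce.JobContext;
import org.apache.hadoop.mapreduce.RecordReader;
import org.apache.hadoop.mapreduce.TaskAttemptContext;

public class MyDatabaseInputFormat extends InputFormat<NullWritable, Text> {

    // One InputSplit per partition/shard of the database; Spark schedules one task per split.
    @Override
    public List<InputSplit> getSplits(JobContext context) throws IOException {
        // A real connector would ask the database for its shards and return one split each.
        return Collections.singletonList((InputSplit) new MyDatabaseSplit());
    }

    @Override
    public RecordReader<NullWritable, Text> createRecordReader(InputSplit split,
                                                               TaskAttemptContext context) {
        return new MyDatabaseRecordReader();
    }

    // A trivial split; a real one would carry shard ids, key ranges, host hints, etc.
    public static class MyDatabaseSplit extends InputSplit implements Writable {
        @Override public long getLength() { return 0; }
        @Override public String[] getLocations() { return new String[0]; }
        @Override public void write(DataOutput out) throws IOException { }
        @Override public void readFields(DataInput in) throws IOException { }
    }

    // Iterates over the rows that belong to one split.
    public static class MyDatabaseRecordReader extends RecordReader<NullWritable, Text> {
        private Iterator<String> rows;
        private final Text currentValue = new Text();

        @Override
        public void initialize(InputSplit split, TaskAttemptContext context) {
            // Placeholder data; a real reader would open a connection to the
            // database and fetch the rows covered by this split.
            rows = Arrays.asList("row1", "row2", "row3").iterator();
        }

        @Override
        public boolean nextKeyValue() {
            if (!rows.hasNext()) {
                return false;
            }
            currentValue.set(rows.next());
            return true;
        }

        @Override public NullWritable getCurrentKey() { return NullWritable.get(); }
        @Override public Text getCurrentValue() { return currentValue; }
        @Override public float getProgress() { return 0.0f; }
        @Override public void close() { }
    }
}
```

Spark can then consume it with something like JavaSparkContext.newAPIHadoopRDD(conf, MyDatabaseInputFormat.class, NullWritable.class, Text.class).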
My suggestion is to start by reading Tom White's book, Hadoop: The Definitive Guide, which covers InputFormats and RecordReaders in detail.
Upvotes: 1