Reputation: 49
We are setting up new project-level code directories, which will host PySpark, Hive, Sqoop, and shell wrapper scripts for different subprojects. We need to plan the structure of these directories with long-term goals in mind.
Currently I have structure like -
Conf/
Scripts/
- hql
- shell
- pyspark
...
but the above structure gets messy once multiple subprojects start adding code: too many files, too much to manage, and hard to search.
Can someone suggest, based on past experience, an ideal or better way to arrange the code directories?
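For illustration, one common way to scale the layout described above (a sketch only; the subproject names `ingestion` and `reporting` are hypothetical) is to make the subproject the top-level unit and repeat the per-technology folders inside each one, so every team's code stays in its own tree:

```
conf/
  common/
  ingestion/
  reporting/
scripts/
  ingestion/
    hql/
    pyspark/
    shell/
  reporting/
    hql/
    pyspark/
    shell/
lib/            # shared helper modules / jars
```

This inverts the original structure (technology-first) into subproject-first, which tends to keep growth local to one subtree as new teams come on board.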
Upvotes: 0
Views: 57
Reputation: 191710
Given that code is usually submitted from an edge node, I would suggest limiting SSH access to certain users, then dividing HDFS at least into user accounts... HDFS already has a /user directory, so start there.
Hortonworks at least puts common files for Hive in /apps/hive, Spark in /apps/spark, etc. So there is a landing spot for shared libraries.
If you have project-specific files that can't be placed in a single directory and need finer-grained ACLs than user directories, then /projects or just brand-new folders in the root of HDFS should be fine.
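As a minimal sketch of that /projects idea (the project name `etl` and the group names `etl-dev` and `analysts` are hypothetical; ACLs must be enabled on the cluster via `dfs.namenode.acls.enabled`), creating a project area with finer-grained access could look like:

```shell
# Create a dedicated project directory in HDFS
hdfs dfs -mkdir -p /projects/etl

# Give ownership to the project's dev group
hdfs dfs -chown -R hdfs:etl-dev /projects/etl
hdfs dfs -chmod 770 /projects/etl

# Grant a second group read-only access via an ACL,
# with a default ACL so new files inherit it
hdfs dfs -setfacl -m group:analysts:r-x /projects/etl
hdfs dfs -setfacl -m default:group:analysts:r-x /projects/etl

# Verify the resulting permissions
hdfs dfs -getfacl /projects/etl
```

This keeps the project isolated without resorting to Federation, while still allowing cross-team read access where needed.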
The OCD approach to dividing completely isolated projects would be to set up HDFS Federation and namespaces, where you'd have a NameNode for each major initiative within the company.
Upvotes: 1