Handling Dependent features in machine learning

Question

I have a dataset like

 Project | Area       | Feature 1 | Feature 2 |
---------+------------+-----------+-----------+...
 A       | Production |     X     |     X     |
 A       | Testing    |     Y     |     Y     |
 B       | Testing    |     Z     |     Z     |
 C       | QA         |     W     |     W     |

Here "Area" is dependent on project (i.e. Combination of Area and Project makes the identity of Area) and they have many to many relationship. I'm predicting Area using deep neural network using Keras. How i should preprocess this data?

Project is a very important feature.

Also is there any formula for approximating number of training data required for X number of features?

Handling Dependent features in machine learning

Answers (1)

Related Questions