Reputation: 759
I need to know which database model is better for performance.
First Database Model
There are three tables: features, products, and feature_values.
The features table:
+----+-----------+
| id | name      |
+----+-----------+
| 1  | Brand     |
| 2  | Color     |
| 3  | Dimension |
| 4  | Model     |
+----+-----------+
And the feature_values table:
+----+---------+------------+
| id | name    | feature_id |
+----+---------+------------+
| 1  | Sony    | 1          |
| 2  | Samsung | 1          |
| 3  | Red     | 2          |
| 4  | Blue    | 2          |
| 5  | 20 "    | 3          |
| 6  | 30 "    | 3          |
| 7  | Model A | 4          |
| 8  | Model B | 4          |
+----+---------+------------+
And the products table:
+----+--------------------+----------+
| id | product_name       | features |
+----+--------------------+----------+
| 1  | Sony Television    | 1-3-5-7  |
| 2  | Samsung Television | 2-4-6-8  |
+----+--------------------+----------+
As you can see, with this structure, if a user wants to search for products by feature, I have to use REGEXP or a fulltext search in my query.
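For example, finding every product that is Red (feature value 3) would need something like this (a sketch, assuming MySQL):

SELECT id, product_name
FROM products
WHERE features REGEXP '(^|-)3(-|$)';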
Second Database Model
In the second model, I drop the features column from the products table and add a new table called product_features.
+----+--------------------+
| id | product_name       |
+----+--------------------+
| 1  | Sony Television    |
| 2  | Samsung Television |
+----+--------------------+
And the new product_features table:
+----+------------+------------+
| id | feature_id | product_id |
+----+------------+------------+
| 1  | 1          | 1          |
| 2  | 3          | 1          |
| 3  | 5          | 1          |
| 4  | 7          | 1          |
| 5  | 2          | 2          |
| 6  | 4          | 2          |
| 7  | 6          | 2          |
| 8  | 8          | 2          |
+----+------------+------------+
Now if a user wants to search for products by feature, I search product_features and then join to products.
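For the same example, finding every product that is Red (feature value 3) would be something like:

SELECT p.id, p.product_name
FROM product_features pf
JOIN products p ON p.id = pf.product_id
WHERE pf.feature_id = 3;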
Question
My question: if I use the second model with over 20 billion rows in the products table, and every product has at least 10 features, then the product_features table will have over 200 billion rows, and queries by feature may be slow.
If I use the first model, then every feature search means running a fulltext or REGEXP query against 20 billion rows.
I don't know which way is better. What do you suggest?
Upvotes: 1
Views: 1381
Reputation: 647
As Rockse said, you should stick with the 2nd model. As for large data sets: once you grow past what one database instance can handle, you need to start scaling "horizontally" (across several instances). One common way to scale really large data sets is "sharding": split your data set into subsets and store them on different database servers, then come up with an algorithm that tells your application which database to query for information about a given product.
For example, let's split the data set into 4 chunks of ~5 billion rows each, and use "product_id % 4" (that's modulo 4) as the key that tells you which database instance holds a particular product. Very rough pseudocode might look like this:
// One connection (or pool) per shard
let connections = [];

function initConnections() {
    // ... connect to 4 different databases or create connection pools ...
    connections = [conn1, conn2, conn3, conn4];
}

// Modulo on the product id picks the shard that owns this product
function getProductDbConnection(productId) {
    return connections[productId % 4];
}

function getProductFeatures(productId) {
    const conn = getProductDbConnection(productId);
    // ... run whatever queries you need to get the features ...
}
Here is an article that talks about how Instagram sharded their data to keep up with demand: http://instagram-engineering.tumblr.com/post/10853187575/sharding-ids-at-instagram
Upvotes: 1
Reputation: 4166
First Model
It is not even in 1NF, since the features attribute holds non-atomic values. Moreover, it would be really difficult to add, update, or delete features in the products table. So it is not feasible at all.
Second Model
It is normalized up to 5NF and looks good. To optimize searches, use a subquery and index product_id and feature_id; try to avoid JOINs on data this large.
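A minimal sketch of that approach (assuming MySQL; the index name is illustrative):

CREATE INDEX idx_feature_product ON product_features (feature_id, product_id);

SELECT id, product_name
FROM products
WHERE id IN (SELECT product_id
             FROM product_features
             WHERE feature_id = 3);

With (feature_id, product_id) as a composite index, the subquery can be answered from the index alone, without touching the product_features rows themselves.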
Upvotes: 1