After taking a closer look at the parquet file of Open DB Canada, there are 3 different fields related to quantity i.e ('product_quantity_unit', 'product_quantity' and 'quantity'), and each of these fields serves a different purpose.
- Each of these fields has inconsistent null values.
Task:
- Perform data exploration on each of these fields and understand the variation in these fields.
- Propose data cleaning techniques on each of these fields to make them consistent and complete.
- Also, think about how this can help us in achieving product consolidation successfully.
After taking a closer look at the parquet file of Open DB Canada, there are 3 different fields related to quantity i.e ('product_quantity_unit', 'product_quantity' and 'quantity'), and each of these fields serves a different purpose.
Task: