This repository contains an analysis of mobile applications using Python and Jupyter Notebook. The analysis explores various aspects of app data extracted from the Google Play Store dataset.
The analysis focuses on understanding the relationship between app ratings, number of reviews, and number of downloads. We explore hypotheses related to app categories, pricing, and user engagement metrics.
We import the necessary libraries and load the Google Play Store dataset to begin our analysis.
We conduct exploratory data analysis to gain insights into the dataset's structure and contents. This includes visualizing distributions, correlations, and summary statistics.
We test several hypotheses related to app categories, pricing strategies, and user engagement metrics. Hypothesis testing involves statistical analysis to validate or refute our assumptions.
We utilize machine learning techniques to predict app ratings based on user engagement metrics. This includes feature engineering, model training, and evaluation.
We perform clustering analysis to identify distinct groups of apps based on their characteristics. This helps in understanding patterns and trends within the dataset.
The Jupyter Notebook file (data_science_app_analysis.ipynb) contains the complete code and analysis steps. You can run the notebook on your local environment to reproduce the results.
- Python 3.x
- Jupyter Notebook
- Pandas
- Matplotlib
- NumPy
- SciPy
- Scikit-learn