- Clone the entire repository
- Download the dataset from https://datascience-public.transvoyant.com/public/data/test_tasks/ocean_ais/parquet/parquet.zip and extrat to \datasets\parquet
- Open the Jupyter Notebook in your local computer
- Run the code "EDA - Exploratory Data Analysis and Answers.ipynb"
- To run this code successfully, you need to use Jupyter running a Kernel with Python 3.7 and Spark 2.4.7
- You need to install the libraries pandas, pandas_profiling, plotly, bioinfokit, scipy, h20, datetime, matplotlib, functools, glob
maxreis86/AgileEngine
Folders and files
| Name | Name | Last commit date | ||
|---|---|---|---|---|