MasegoM94/README.md

Masego Modibane

Product Data Scientist focused on building analytics that help teams understand behaviour, measure impact, and make better decisions.

I work across SQL, Python, statistics, forecasting, and BI to turn messy data into clear metrics, reusable analysis, and decision-ready products.

  • Focus areas: Product analytics • Forecasting • Experimentation/statistical thinking • Usage intelligence • Self-serve analytics
  • Tools: SQL • Python • Power BI • R • Applied ML/NLP
  • Based in: Toronto, Canada

Featured work

1) Power BI Usage Intelligence

A portfolio project focused on report usage forecasting, behavioural analytics, and decision support for analytics adoption.

  • Forecast usage patterns and detect engagement decline
  • Model report, user, and page-level behaviour
  • Combine analytics engineering, semantic modelling, and forecasting
  • Explore how GenAI can support insight generation on top of usage data

→ Repo: report-usage-forecasting
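
As a rough illustration of the baseline benchmarking idea behind this project, the sketch below scores a seasonal-naive forecast with mean absolute error. It is not taken from the repo: the function names, the weekly season length, and the `daily_views` numbers are all assumptions made up for the example. Any more complex model would need to beat this number on the same hold-out window to justify itself.

```python
from statistics import mean


def seasonal_naive_forecast(history, horizon, season_length=7):
    """Repeat the last observed season forward for `horizon` steps."""
    if len(history) < season_length:
        raise ValueError("history must cover at least one full season")
    last_season = history[-season_length:]
    return [last_season[i % season_length] for i in range(horizon)]


def mean_absolute_error(actual, predicted):
    """Average absolute gap between actual and predicted values."""
    return mean(abs(a - p) for a, p in zip(actual, predicted))


# Hypothetical daily report views (illustrative numbers only):
# two weeks used as history, the third week held out for evaluation.
daily_views = [
    120, 135, 128, 140, 110, 40, 35,
    125, 138, 130, 145, 115, 42, 38,
    118, 130, 126, 139, 108, 45, 36,
]
train, test = daily_views[:14], daily_views[14:]

baseline = seasonal_naive_forecast(train, horizon=len(test))
baseline_mae = mean_absolute_error(test, baseline)
print(f"Seasonal-naive MAE: {baseline_mae:.2f}")
```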

2) E-commerce SQL Portfolio

Product-style analysis across traffic, conversion, customer behaviour, and business performance.

  • Metric definition and business logic
  • Funnel and performance analysis
  • SQL for decision-oriented product questions

→ Repo: ecommerce-sql-portfolio
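
A minimal sketch of the kind of funnel query this project covers, run through Python's built-in `sqlite3` so it is self-contained. The `pageviews` table, its columns, the page paths, and the rows are hypothetical and are not the Maven Fuzzy Factory schema.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Hypothetical pageview log: one row per page seen in a session.
conn.executescript("""
CREATE TABLE pageviews (session_id INTEGER, page TEXT);
INSERT INTO pageviews VALUES
  (1, '/home'), (1, '/products'), (1, '/cart'), (1, '/thank-you'),
  (2, '/home'), (2, '/products'),
  (3, '/home'), (3, '/products'), (3, '/cart'),
  (4, '/home');
""")

# Flag each session by the funnel steps it reached, then count sessions per step.
# In SQLite a comparison evaluates to 1 or 0, so MAX() acts as "reached at least once".
funnel_sql = """
WITH session_flags AS (
  SELECT session_id,
         MAX(page = '/products')  AS saw_products,
         MAX(page = '/cart')      AS saw_cart,
         MAX(page = '/thank-you') AS ordered
  FROM pageviews
  GROUP BY session_id
)
SELECT COUNT(*)          AS sessions,
       SUM(saw_products) AS to_products,
       SUM(saw_cart)     AS to_cart,
       SUM(ordered)      AS to_order
FROM session_flags;
"""
sessions, to_products, to_cart, to_order = conn.execute(funnel_sql).fetchone()

# Step-to-step conversion rates.
step_rates = {
    "home -> products": to_products / sessions,
    "products -> cart": to_cart / to_products,
    "cart -> order": to_order / to_cart,
}
for step, rate in step_rates.items():
    print(f"{step}: {rate:.0%}")
```

Flagging at the session level first keeps the metric definition explicit: a session counts once per step no matter how many times it viewed that page.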

3) Survey Age Group Analysis

A reproducible statistics project showing how to test for differences across segments.

  • Kruskal–Wallis for ordinal outcomes
  • Chi-square for categorical relationships
  • Post-hoc analysis to identify what is driving differences

→ Repo: survey-age-group-analysis
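
To make the Kruskal–Wallis step concrete, here is a standard-library sketch of the H statistic with the usual tie correction, which matters for Likert-style responses. It is an illustration only, not the repo's code: the function name and the survey responses are invented for the example. In practice `scipy.stats.kruskal` returns both the statistic and a p-value.

```python
from collections import Counter


def kruskal_wallis_h(*groups):
    """Kruskal-Wallis H statistic, corrected for ties, for two or more groups."""
    pooled = sorted(value for group in groups for value in group)
    n = len(pooled)

    # Assign each distinct value the average of the ranks it occupies.
    rank_of = {}
    i = 0
    while i < n:
        j = i
        while j < n and pooled[j] == pooled[i]:
            j += 1
        rank_of[pooled[i]] = (i + 1 + j) / 2  # mean of ranks i+1 .. j
        i = j

    rank_term = sum(
        sum(rank_of[value] for value in group) ** 2 / len(group)
        for group in groups
    )
    h = 12 / (n * (n + 1)) * rank_term - 3 * (n + 1)

    # Tie correction: 1 - sum(t^3 - t) / (n^3 - n) over tied-value counts t.
    tie_counts = Counter(pooled).values()
    correction = 1 - sum(t**3 - t for t in tie_counts) / (n**3 - n)
    if correction == 0:
        raise ValueError("all observations are identical; H is undefined")
    return h / correction


# Hypothetical 1-5 satisfaction ratings by age group (illustrative only).
ratings_18_29 = [4, 5, 3, 4, 5, 4]
ratings_30_44 = [3, 3, 4, 2, 3, 4]
ratings_45_plus = [2, 3, 2, 1, 3, 2]

h_stat = kruskal_wallis_h(ratings_18_29, ratings_30_44, ratings_45_plus)
print(f"H = {h_stat:.3f}")
```

H is compared against a chi-square distribution with (number of groups − 1) degrees of freedom. A significant result only says that at least one group differs, which is why a post-hoc step is needed to find which pairs drive the difference.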

4) SA Political Manifesto Topic Modeling

An end-to-end NLP project using topic modelling and an interactive Shiny app.

  • Text preprocessing and LDA topic modelling
  • Exploratory analysis of manifesto themes
  • Interactive communication of model outputs

→ Repo: sa-political-manifesto-text-analysis

What this portfolio is built to show

I am intentionally building a portfolio that demonstrates how I:

  1. define business and product questions clearly
  2. structure data into useful analytical models
  3. analyse behaviour with SQL and Python
  4. apply statistics or forecasting where it improves decision-making
  5. communicate insights through dashboards, write-ups, and reusable projects

Other projects

Current interests

I am especially interested in roles and projects at the intersection of:

  • Product Data Science
  • Analytics Engineering
  • Forecasting and behavioural analytics
  • Experimentation and causal thinking
  • GenAI-enabled analytics workflows

Writing

I also write about analytics, data science, and practical portfolio-building:

Contact

Pinned

  1. report-usage-forecasting (Public)

    Time series forecasting project for report usage with Auto-ARIMA, baseline benchmarking, evaluation metrics, and deployment planning

    Jupyter Notebook

  2. ecommerce-sql-portfolio (Public)

    A walkthrough of analysing the Maven Fuzzy Factory e-commerce data with SQL

  3. survey-age-group-analysis (Public)

    Segment (age) differences in survey responses using nonparametric tests in Python.

    Python

  4. sa-political-manifesto-text-analysis (Public)

    An NLP text analysis of South African political party manifestos

    R

  5. gmail-mail-analysis (Public)

    Python code I use to manage my Gmail inbox and clear out unwanted emails

    Jupyter Notebook 5

  6. interactive-map-python (Public)

    Survey geo-visualization in Python: postal-code cleaning → lat/long join → interactive Folium map

    Jupyter Notebook 1