A Q-learning agent that learns to play Snake using reinforcement learning.
Learn2Slither is a reinforcement learning project that implements a Q-learning algorithm to teach an AI agent to play the classic Snake game. The agent learns through trial and error, gradually improving its performance over multiple episodes.
- Interactive visual display with Pygame
- Q-learning algorithm implementation
- Save and load trained models
- Adjustable game speed
- Real-time performance metrics
- Two types of food (green: grow, red: shrink)
- Python 3.12 (recommended)
- pip package manager
Note: Python 3.14+ is not compatible with Pygame. Use Python 3.12 or earlier.
1. Clone the repository

   ```bash
   git clone <your-repo-url>
   cd Learn2Slither
   ```

2. Create a virtual environment (recommended)

   ```bash
   # Windows
   py -3.12 -m venv venv
   venv\Scripts\activate

   # Linux/Mac
   python3 -m venv venv
   source venv/bin/activate
   ```

3. Install dependencies

   ```bash
   pip install -r requirements.txt
   ```
```bash
./snake [options]
```

| Option | Description | Default |
|---|---|---|
| `-visual on/off` | Enable or disable the game display | `on` |
| `-episodes <int>` | Number of training episodes | `1` |
| `-load <file>` | Load a pre-trained Q-table | None |
| `-save <file>` | Save the Q-table after training | False |
| `-dontlearn` | Disable learning (exploitation only) | False |
| `-step-by-step` | Wait for user input between steps | False |
| `-help` | Display the help message | - |
Train a new agent:

```bash
./snake -visual on -episodes 1000 -save models/my_model.csv
```

Load and watch a trained agent:

```bash
./snake -visual on -load models/q_table_100000.csv -episodes 10 -dontlearn
```

Step-by-step debugging:

```bash
./snake -visual on -load models/q_table_100000.csv -step-by-step -dontlearn
```

Train without visual display (faster):

```bash
./snake -visual off -episodes 10000 -save models/trained_model.csv
```

```
Learn2Slither/
├── agent/
│   ├── __init__.py
│   ├── q_learning.py      # Q-learning algorithm implementation
│   ├── q_data.py          # Q-table management
│   └── vision.py          # Snake vision processing
├── config/
│   ├── __init__.py
│   └── config.py          # Game and learning parameters
├── game/
│   ├── __init__.py
│   ├── snake.py           # Snake logic
│   ├── food.py            # Food management
│   └── display.py         # Pygame visualization
├── assets/                # Sprites and images
├── models/                # Saved Q-tables
├── snake                  # Main entry point
├── requirements.txt       # Python dependencies
└── README.md
```
The agent uses Q-learning, a model-free reinforcement learning algorithm:
- State: The snake's vision in 4 directions (UP, DOWN, LEFT, RIGHT)
- Actions: Move in one of the 4 directions
- Rewards:
- Green food: +10
- Red food: -10
- Wall/Self collision: -20
- Regular move: -1
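The state, actions, and reward scheme above feed a standard Q-learning update. A minimal sketch of that update, using illustrative hyperparameter values and function names (the project's actual implementation lives in `agent/q_learning.py` and may differ):

```python
import random
from collections import defaultdict

# Illustrative hyperparameters; the real values live in config/config.py.
ALPHA = 0.1    # learning rate
GAMMA = 0.9    # discount factor
EPSILON = 0.1  # exploration rate

ACTIONS = ["UP", "DOWN", "LEFT", "RIGHT"]

# Q-table: maps (state, action) pairs to estimated return, defaulting to 0.
q_table = defaultdict(float)

def choose_action(state):
    """Epsilon-greedy policy: explore with probability EPSILON, else exploit."""
    if random.random() < EPSILON:
        return random.choice(ACTIONS)
    return max(ACTIONS, key=lambda a: q_table[(state, a)])

def update(state, action, reward, next_state):
    """One Q-learning step: Q(s,a) += alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))."""
    best_next = max(q_table[(next_state, a)] for a in ACTIONS)
    td_target = reward + GAMMA * best_next
    q_table[(state, action)] += ALPHA * (td_target - q_table[(state, action)])
```

With an empty table, eating green food (`reward = +10`) nudges `Q(s, a)` from 0 toward the target by a factor of `ALPHA`.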
The snake perceives its environment through ray-casting in 4 directions, detecting:
- Walls (W)
- Own body (S)
- Green food (G)
- Red food (R)
- Empty space (0)
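One way to picture this ray-casting: from the head, walk each direction cell by cell and record what is seen until a wall stops the ray. The sketch below assumes a board of single-character cells bordered by `W`; the function names and board representation are illustrative, not the project's actual `vision.py` API:

```python
DIRECTIONS = {"UP": (-1, 0), "DOWN": (1, 0), "LEFT": (0, -1), "RIGHT": (0, 1)}

def cast_ray(board, head, direction):
    """Cells visible from the head along one direction, wall included."""
    dr, dc = DIRECTIONS[direction]
    r, c = head
    seen = []
    while True:
        r, c = r + dr, c + dc
        cell = board[r][c]
        seen.append(cell)
        if cell == "W":  # the board is assumed to be bordered by walls
            break
    return "".join(seen)

def snake_vision(board, head):
    """The agent's state: what the snake sees in each of the 4 directions."""
    return {d: cast_ray(board, head, d) for d in DIRECTIONS}
```

For example, on a 5x5 board with green food directly above the head, the `UP` ray would read `"GW"`: food first, then the wall behind it.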
Configurable in `config/config.py`:
- Learning rate
- Discount factor
- Exploration rate (decays over time)
- Exploration decay
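For illustration, a hypothetical shape such a config module could take; the actual parameter names and values in `config/config.py` may differ:

```python
# Hypothetical hyperparameters mirroring the list above (illustrative values).
LEARNING_RATE = 0.1    # alpha: how strongly new information overwrites old estimates
DISCOUNT_FACTOR = 0.9  # gamma: weight of future rewards vs. immediate ones
EPSILON_START = 1.0    # initial exploration rate (fully random at first)
EPSILON_MIN = 0.01     # floor so the agent never stops exploring entirely
EPSILON_DECAY = 0.995  # multiplicative decay applied after each episode

def decayed_epsilon(episode):
    """Exploration rate after a given number of episodes."""
    return max(EPSILON_MIN, EPSILON_START * EPSILON_DECAY ** episode)
```

Multiplicative decay with a floor is a common choice: early episodes are dominated by exploration, while late episodes mostly exploit the learned Q-table.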
- Space: Pause/Resume
- Left Arrow: Decrease game speed
- Right Arrow: Increase game speed
- Escape: Quit game
In step-by-step mode:
- Space: Advance to next step
The project follows Python best practices:
- Type hints for better code clarity
- Comprehensive docstrings
- PEP 8 style compliance
- Modular architecture
- Pygame community for the excellent game development framework
- Q-learning algorithm based on classical reinforcement learning literature
