Skip to content

Installation

Prerequisites

  • Python 3.9 or later
  • pip (included with Python)
  • Git (for cloning the repository)

Install from Source

git clone https://github.com/jman4162/Baseball-Pitch-Sequence-Prediction.git
cd Baseball-Pitch-Sequence-Prediction

Includes all optional dependencies and development tools:

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate
pip install -e ".[all,dev]"

Or using Make:

make install

Minimal Install

Core dependencies only (no AutoGluon or hmmlearn):

pip install -e .

Optional Extras

Extra Contents
all AutoGluon + hmmlearn
autogluon AutoGluon TabularPredictor
hmm hmmlearn for HMM model
docs MkDocs + Material theme + mkdocstrings
dev pytest + build + docs

Install specific extras:

pip install -e ".[hmm]"         # Just HMM support
pip install -e ".[autogluon]"   # Just AutoGluon
pip install -e ".[docs]"        # Documentation tools

Verify Installation

# Check the package is installed
python -c "import pitch_sequencing; print(pitch_sequencing.__version__)"

# Check CLI commands are available
pitch-generate --help
pitch-train --help
pitch-benchmark --help
pitch-ablation --help

Generate Training Data

After installation, generate the synthetic dataset:

pitch-generate --output-dir ./data

This creates two files in data/:

  • baseball_pitch_data.csv (~384K rows)
  • synthetic_pitch_sequences.csv (2,500 sequences)