Installation
Prerequisites
- Python 3.9 or later
- pip (included with Python)
- Git (for cloning the repository)
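The prerequisites above can be checked from Python before cloning. This is a minimal sketch, not part of the project: the helper name `check_prerequisites` is hypothetical, and it only confirms that a tool is present, not that it works.

```python
import importlib.util
import shutil
import sys

MIN_PYTHON = (3, 9)  # minimum version stated in the prerequisites


def check_prerequisites():
    """Return a dict mapping each prerequisite to True (found) or False."""
    return {
        # Tuple comparison: (3, 11, 2, ...) >= (3, 9) is True
        "python": sys.version_info >= MIN_PYTHON,
        # pip is importable as a module when installed for this interpreter
        "pip": importlib.util.find_spec("pip") is not None,
        # git must be on PATH to clone the repository
        "git": shutil.which("git") is not None,
    }


if __name__ == "__main__":
    for name, ok in check_prerequisites().items():
        print(f"{name}: {'ok' if ok else 'MISSING'}")
```

Run it with the same interpreter you plan to use for the virtual environment, since the Python and pip checks apply only to that interpreter.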
Install from Source
git clone https://github.com/jman4162/Baseball-Pitch-Sequence-Prediction.git
cd Baseball-Pitch-Sequence-Prediction
Development Install (Recommended)
Includes all optional dependencies and development tools:
python -m venv venv
source venv/bin/activate # On Windows: venv\Scripts\activate
pip install -e ".[all,dev]"
Or using Make:
Minimal Install
Core dependencies only (no AutoGluon or hmmlearn):
pip install -e .
Optional Extras
| Extra | Contents |
|---|---|
| all | AutoGluon + hmmlearn |
| autogluon | AutoGluon TabularPredictor |
| hmm | hmmlearn for HMM model |
| docs | MkDocs + Material theme + mkdocstrings |
| dev | pytest + build + docs |
Install specific extras:
pip install -e ".[hmm]" # Just HMM support
pip install -e ".[autogluon]" # Just AutoGluon
pip install -e ".[docs]" # Documentation tools
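To see which extras are already available in the active environment, one option is to probe for their top-level import names. The sketch below is an illustration under assumptions: the mapping from extra to module (`autogluon`, `hmmlearn`, `mkdocs`) reflects the usual import names of those libraries, not the project's packaging metadata, and `installed_extras` is a hypothetical helper.

```python
import importlib.util

# Assumed top-level import names for each extra; the project's
# packaging metadata is the authoritative source.
EXTRA_MODULES = {
    "autogluon": ["autogluon"],
    "hmm": ["hmmlearn"],
    "docs": ["mkdocs"],
}


def installed_extras(extra_modules=EXTRA_MODULES):
    """Map each extra to True if all of its modules can be located."""
    return {
        extra: all(importlib.util.find_spec(m) is not None for m in modules)
        for extra, modules in extra_modules.items()
    }


if __name__ == "__main__":
    for extra, ok in installed_extras().items():
        print(f"{extra}: {'installed' if ok else 'not installed'}")
```

`find_spec` only locates a module without importing it, so the check stays fast even for heavy packages such as AutoGluon.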
Verify Installation
# Check the package is installed
python -c "import pitch_sequencing; print(pitch_sequencing.__version__)"
# Check CLI commands are available
pitch-generate --help
pitch-train --help
pitch-benchmark --help
pitch-ablation --help
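The same checks can be scripted, which may be convenient in CI. This sketch assumes only the package name and the four command names shown above. The helpers `missing_commands` and `verify_install` are hypothetical and confirm presence on `PATH`, not that each command runs successfully.

```python
import importlib.util
import shutil

# Console commands listed in the verification steps above
CLI_COMMANDS = ["pitch-generate", "pitch-train", "pitch-benchmark", "pitch-ablation"]


def missing_commands(commands=CLI_COMMANDS):
    """Return the commands that cannot be found on PATH."""
    return [cmd for cmd in commands if shutil.which(cmd) is None]


def verify_install():
    """Return (package_found, list_of_missing_commands)."""
    package_found = importlib.util.find_spec("pitch_sequencing") is not None
    return package_found, missing_commands()


if __name__ == "__main__":
    found, missing = verify_install()
    print("package importable:", found)
    print("missing commands:", missing or "none")
```

If the package is found but commands are missing, the virtual environment is probably not activated in the current shell.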
Generate Training Data
After installation, generate the synthetic dataset:
This creates two files in data/:
- baseball_pitch_data.csv (~384K rows)
- synthetic_pitch_sequences.csv (2,500 sequences)
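After generation, a quick sanity check is to confirm both files exist and count their rows. The sketch below assumes it is run from the repository root, that `data/` is relative to it, and that each CSV starts with a header row. `summarize_outputs` is a hypothetical helper, not part of the package.

```python
import csv
from pathlib import Path

# File names taken from the list above
EXPECTED_FILES = ["baseball_pitch_data.csv", "synthetic_pitch_sequences.csv"]


def summarize_outputs(data_dir="data"):
    """Map each expected file to its data-row count, or None if it is missing."""
    summary = {}
    for name in EXPECTED_FILES:
        path = Path(data_dir) / name
        if not path.exists():
            summary[name] = None
            continue
        with path.open(newline="") as handle:
            total = sum(1 for _ in csv.reader(handle))
        # Assumption: the first row is a header, so it is not counted as data
        summary[name] = max(total - 1, 0)
    return summary


if __name__ == "__main__":
    for name, rows in summarize_outputs().items():
        print(f"{name}: {'missing' if rows is None else f'{rows} rows'}")
```

The row count for `synthetic_pitch_sequences.csv` will only match the 2,500 figure if each sequence occupies exactly one row, which this sketch does not verify.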