7.6 KiB
Saw Mill Knot Detection (YOLOX/YOLO)
This repository contains a complete wood defect detection system using YOLOX/YOLO models, trained to detect 10 different types of wood surface defects. The system includes a web-based annotation GUI, automated training pipeline, and is optimized for deployment on OAK-D cameras.
🎯 Project Overview
- Model: YOLOX-nano (Ultralytics YOLO framework)
- Dataset: 20,276 wood surface defect images with 10 defect categories
- Training: 5 epochs, mAP50: 0.612, mAP50-95: 0.357
- Deployment Target: OAK-D 4 Pro camera
- Framework: Ultralytics 8.3.240
📊 Dataset Information
Source: Kaggle Wood Surface Defects Dataset
Classes (10 total):
- Live knot
- Dead knot
- Knot with crack
- Crack
- Resin
- Marrow
- Quartzity
- Knot missing
- Blue stain
- Overgrown
Dataset Split:
- Train: 16,220 images
- Valid: 2,027 images
- Test: 2,029 images
Formats Available:
dataset_coco/→ COCO format for RF-DETRdataset_yolo/→ YOLO format for YOLOX, YOLOv6, YOLOv8
🚀 Quick Start
1. Environment Setup
# Clone the repository
git clone git@143.244.157.110:dillon_stuff/saw_mill_knot_detection.git
cd saw_mill_knot_detection
# Create virtual environment
python -m venv .venv
source .venv/bin/activate
# Install dependencies
pip install -r requirements.txt
2. Run the Annotation GUI
The repository includes an automated script that handles virtual environment activation:
# Run the GUI (automatically detects and activates venv/conda environment)
./run_gui.sh
# Or run manually
source .venv/bin/activate # or conda activate your_env
python annotation_gui.py
Install dependencies
pip install -U pip pip install ultralytics gradio rfdetr
### 2. Setup Datasets
```bash
# Download dataset from Kaggle (requires Kaggle API)
kaggle datasets download -d kirs0816/wood-surface-defects
unzip wood-surface-defects.zip
# Create multi-format datasets
python split_coco_dataset.py # Creates dataset_yolo/
python setup_datasets.py # Creates dataset_coco/ and updates configs
3. Launch Annotation GUI
python annotation_gui.py
Open http://localhost:7860 in your browser to access the web-based annotation interface with:
- Image navigation with index display
- Auto-labeling with trained YOLOX model
- Manual annotation tools
- Real-time result visualization
4. Train Models
Choose from three different frameworks:
RF-DETR (Highest accuracy, slower training)
python train_rfdetr.py \
--dataset-dir dataset_coco \
--output-dir runs/rfdetr_medium \
--model medium \
--epochs 50 \
--batch-size 4 \
--grad-accum-steps 4 \
--lr 1e-4
YOLOX (Balanced performance/speed)
python train_yolox.py \
--dataset-dir dataset_yolo \
--model yolox-nano \
--epochs 50 \
--batch-size 8
YOLOv6 (Fastest, edge-optimized)
python train_yolov6.py \
--dataset-dir dataset_yolo \
--model yolov6n \
--epochs 50 \
--batch-size 8
📁 Project Structure
saw_mill_knot_detection/
├── annotation_gui.py # Gradio web interface for annotation
├── train_rfdetr.py # RF-DETR training script
├── train_yolox.py # YOLOX training script
├── train_yolov6.py # YOLOv6 training script
├── setup_datasets.py # Multi-format dataset setup script
├── split_coco_dataset.py # Dataset splitting utility
├── config.py # Configuration settings
├── dataset_coco/ # RF-DETR dataset (COCO format)
│ ├── train/
│ │ ├── *.jpg # Training images
│ │ └── _annotations.coco.json
│ ├── valid/
│ │ ├── *.jpg # Validation images
│ │ └── _annotations.coco.json
│ └── test/
│ ├── *.jpg # Test images
│ └── _annotations.coco.json
├── dataset_yolo/ # YOLOX/YOLOv6/YOLOv8 dataset (YOLO format)
│ ├── train/
│ │ ├── images/ # Training images
│ │ └── labels/ # YOLO format labels
│ ├── valid/
│ │ ├── images/ # Validation images
│ │ └── labels/ # YOLO format labels
│ ├── test/
│ │ ├── images/ # Test images
│ │ └── labels/ # YOLO format labels
│ └── data.yaml # YOLO dataset configuration
├── runs/ # Training outputs (excluded from git)
├── bbox_coco_dataset.json # Original COCO annotations
├── requirements.txt # Python dependencies
├── .gitignore # Excludes large data files
└── README.md # This file
🤖 Framework Comparison
| Framework | Accuracy | Speed | Memory | Deployment | Best For |
|---|---|---|---|---|---|
| RF-DETR | ⭐⭐⭐⭐⭐ | ⭐⭐ | ⭐⭐⭐ | CPU/GPU | Highest accuracy, research |
| YOLOX | ⭐⭐⭐⭐ | ⭐⭐⭐⭐ | ⭐⭐⭐ | Edge devices | Balanced performance |
| YOLOv6 | ⭐⭐⭐ | ⭐⭐⭐⭐⭐ | ⭐⭐ | Mobile/Edge | Fast inference, production |
🛠️ Usage Guide
Annotation GUI Features
The Gradio-based annotation interface provides:
- Image Navigation: Browse through dataset with current index display
- Auto-Labeling: One-click defect detection using trained YOLOX model
- Manual Annotation: Draw bounding boxes for corrections
- Real-time Visualization: Immediate display of detection results
- Export Options: Save annotations in multiple formats
Training
# Basic training
python train_yolox.py --dataset-dir dataset_split --model yolox-nano --epochs 10
# Advanced training with custom parameters
python train_yolox.py \
--dataset-dir dataset_split \
--model yolox-nano \
--epochs 20 \
--batch-size 8 \
--img-size 640
Inference
from ultralytics import YOLO
# Load trained model
model = YOLO('runs/yolox_training/training/weights/best.pt')
# Predict on image
results = model.predict('path/to/image.jpg', conf=0.4)
# Process results
for result in results:
boxes = result.boxes # Bounding boxes
for box in boxes:
cls = int(box.cls) # Class index
conf = float(box.conf) # Confidence score
xyxy = box.xyxy.tolist()[0] # Box coordinates
🔧 Configuration
Key settings in config.py:
DEFAULT_MODEL_WEIGHTS = "runs/yolox_training/training/weights/best.pt"
DEFAULT_IMAGES_DIR = "IMAGE/"
WOOD_DEFECT_CLASSES = [
'Live knot', 'Dead knot', 'Knot with crack', 'Crack',
'Resin', 'Marrow', 'Quartzity', 'Knot missing',
'Blue stain', 'Overgrown'
]
📈 Model Performance
YOLOX-nano Results (5 epochs):
- mAP50: 0.612
- mAP50-95: 0.357
- Precision: 0.68
- Recall: 0.55
🎯 Deployment on OAK-D
The trained model can be exported for OAK-D deployment:
from ultralytics import YOLO
# Load and export model
model = YOLO('runs/yolox_training/training/weights/best.pt')
model.export(format='onnx') # Export to ONNX for OAK-D
🤝 Contributing
- Fork the repository
- Create a feature branch
- Make your changes
- Test thoroughly
- Submit a pull request
📄 License
This project uses the Kaggle Wood Surface Defects dataset. Please refer to the original dataset license for usage terms.
🙏 Acknowledgments
- Kaggle for providing the wood surface defects dataset
- Ultralytics for the YOLO framework
- Gradio for the web interface framework