2025-12-23 13:00:12 -07:00
2025-12-23 13:00:12 -07:00
2025-12-23 12:53:52 -07:00
2025-12-23 12:53:52 -07:00

Saw Mill Knot Detection (YOLOX/YOLO)

This repository contains a complete wood defect detection system using YOLOX/YOLO models, trained to detect 10 different types of wood surface defects. The system includes a web-based annotation GUI, automated training pipeline, and is optimized for deployment on OAK-D cameras.

🎯 Project Overview

  • Model: YOLOX-nano (Ultralytics YOLO framework)
  • Dataset: 20,276 wood surface defect images with 10 defect categories
  • Training: 5 epochs, mAP50: 0.612, mAP50-95: 0.357
  • Deployment Target: OAK-D 4 Pro camera
  • Framework: Ultralytics 8.3.240

📊 Dataset Information

Source: Kaggle Wood Surface Defects Dataset

Classes (10 total):

  • Live knot
  • Dead knot
  • Knot with crack
  • Crack
  • Resin
  • Marrow
  • Quartzity
  • Knot missing
  • Blue stain
  • Overgrown

Dataset Split:

  • Train: 16,220 images
  • Valid: 2,027 images
  • Test: 2,029 images

Formats Available:

  • dataset_coco/ → COCO format for RF-DETR
  • dataset_yolo/ → YOLO format for YOLOX, YOLOv6, YOLOv8

🚀 Quick Start

1. Environment Setup

# Clone the repository
git clone git@143.244.157.110:dillon_stuff/saw_mill_knot_detection.git
cd saw_mill_knot_detection

# Create virtual environment
python -m venv .venv
source .venv/bin/activate

# Install dependencies
pip install -U pip
pip install ultralytics gradio rfdetr

2. Setup Datasets

# Download dataset from Kaggle (requires Kaggle API)
kaggle datasets download -d kirs0816/wood-surface-defects
unzip wood-surface-defects.zip

# Create multi-format datasets
python split_coco_dataset.py  # Creates dataset_yolo/
python setup_datasets.py      # Creates dataset_coco/ and updates configs

3. Launch Annotation GUI

python annotation_gui.py

Open http://localhost:7860 in your browser to access the web-based annotation interface with:

  • Image navigation with index display
  • Auto-labeling with trained YOLOX model
  • Manual annotation tools
  • Real-time result visualization

4. Train Models

Choose from three different frameworks:

RF-DETR (Highest accuracy, slower training)

python train_rfdetr.py \
  --dataset-dir dataset_coco \
  --output-dir runs/rfdetr_medium \
  --model medium \
  --epochs 50 \
  --batch-size 4 \
  --grad-accum-steps 4 \
  --lr 1e-4

YOLOX (Balanced performance/speed)

python train_yolox.py \
  --dataset-dir dataset_yolo \
  --model yolox-nano \
  --epochs 50 \
  --batch-size 8

YOLOv6 (Fastest, edge-optimized)

python train_yolov6.py \
  --dataset-dir dataset_yolo \
  --model yolov6n \
  --epochs 50 \
  --batch-size 8

📁 Project Structure

saw_mill_knot_detection/
├── annotation_gui.py          # Gradio web interface for annotation
├── train_rfdetr.py           # RF-DETR training script
├── train_yolox.py            # YOLOX training script
├── train_yolov6.py           # YOLOv6 training script
├── setup_datasets.py         # Multi-format dataset setup script
├── split_coco_dataset.py     # Dataset splitting utility
├── config.py                 # Configuration settings
├── dataset_coco/             # RF-DETR dataset (COCO format)
│   ├── train/
│   │   ├── *.jpg             # Training images
│   │   └── _annotations.coco.json
│   ├── valid/
│   │   ├── *.jpg             # Validation images
│   │   └── _annotations.coco.json
│   └── test/
│       ├── *.jpg             # Test images
│       └── _annotations.coco.json
├── dataset_yolo/             # YOLOX/YOLOv6/YOLOv8 dataset (YOLO format)
│   ├── train/
│   │   ├── images/           # Training images
│   │   └── labels/           # YOLO format labels
│   ├── valid/
│   │   ├── images/           # Validation images
│   │   └── labels/           # YOLO format labels
│   ├── test/
│   │   ├── images/           # Test images
│   │   └── labels/           # YOLO format labels
│   └── data.yaml             # YOLO dataset configuration
├── runs/                     # Training outputs (excluded from git)
├── bbox_coco_dataset.json     # Original COCO annotations
├── requirements.txt           # Python dependencies
├── .gitignore                # Excludes large data files
└── README.md                 # This file

🤖 Framework Comparison

Framework Accuracy Speed Memory Deployment Best For
RF-DETR CPU/GPU Highest accuracy, research
YOLOX Edge devices Balanced performance
YOLOv6 Mobile/Edge Fast inference, production

🛠️ Usage Guide

Annotation GUI Features

The Gradio-based annotation interface provides:

  • Image Navigation: Browse through dataset with current index display
  • Auto-Labeling: One-click defect detection using trained YOLOX model
  • Manual Annotation: Draw bounding boxes for corrections
  • Real-time Visualization: Immediate display of detection results
  • Export Options: Save annotations in multiple formats

Training

# Basic training
python train_yolox.py --dataset-dir dataset_split --model yolox-nano --epochs 10

# Advanced training with custom parameters
python train_yolox.py \
  --dataset-dir dataset_split \
  --model yolox-nano \
  --epochs 20 \
  --batch-size 8 \
  --img-size 640

Inference

from ultralytics import YOLO

# Load trained model
model = YOLO('runs/yolox_training/training/weights/best.pt')

# Predict on image
results = model.predict('path/to/image.jpg', conf=0.4)

# Process results
for result in results:
    boxes = result.boxes  # Bounding boxes
    for box in boxes:
        cls = int(box.cls)  # Class index
        conf = float(box.conf)  # Confidence score
        xyxy = box.xyxy.tolist()[0]  # Box coordinates

🔧 Configuration

Key settings in config.py:

DEFAULT_MODEL_WEIGHTS = "runs/yolox_training/training/weights/best.pt"
DEFAULT_IMAGES_DIR = "IMAGE/"
WOOD_DEFECT_CLASSES = [
    'Live knot', 'Dead knot', 'Knot with crack', 'Crack',
    'Resin', 'Marrow', 'Quartzity', 'Knot missing',
    'Blue stain', 'Overgrown'
]

📈 Model Performance

YOLOX-nano Results (5 epochs):

  • mAP50: 0.612
  • mAP50-95: 0.357
  • Precision: 0.68
  • Recall: 0.55

🎯 Deployment on OAK-D

The trained model can be exported for OAK-D deployment:

from ultralytics import YOLO

# Load and export model
model = YOLO('runs/yolox_training/training/weights/best.pt')
model.export(format='onnx')  # Export to ONNX for OAK-D

🤝 Contributing

  1. Fork the repository
  2. Create a feature branch
  3. Make your changes
  4. Test thoroughly
  5. Submit a pull request

📄 License

This project uses the Kaggle Wood Surface Defects dataset. Please refer to the original dataset license for usage terms.

🙏 Acknowledgments

  • Kaggle for providing the wood surface defects dataset
  • Ultralytics for the YOLO framework
  • Gradio for the web interface framework
Description
No description provided
Readme 10 MiB
Languages
Python 99.8%
Shell 0.2%