# Saw Mill Knot Detection (YOLOX/YOLO)

This repository contains a complete wood defect detection system using YOLOX/YOLO models, trained to detect 10 different types of wood surface defects. The system includes a web-based annotation GUI and an automated training pipeline, and is optimized for deployment on OAK-D cameras.

## 🎯 Project Overview

- **Model**: YOLOX-nano (Ultralytics YOLO framework)
- **Dataset**: 20,276 wood surface defect images with 10 defect categories
- **Training**: 5 epochs, mAP50: 0.612, mAP50-95: 0.357
- **Deployment Target**: OAK-D 4 Pro camera
- **Framework**: Ultralytics 8.3.240

## 📊 Dataset Information

**Source**: [Kaggle Wood Surface Defects Dataset](https://www.kaggle.com/datasets/kirs0816/wood-surface-defects)

**Classes** (10 total):

- Live knot
- Dead knot
- Knot with crack
- Crack
- Resin
- Marrow
- Quartzity
- Knot missing
- Blue stain
- Overgrown

**Dataset Split**:

- Train: 16,220 images
- Valid: 2,027 images
- Test: 2,029 images

**Format**: YOLO format (`images/` and `labels/` subdirectories with a `data.yaml` configuration)

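For reference, each YOLO label file stores one object per line as `class x_center y_center width height`, with all four box values normalized to [0, 1]. A minimal sketch of the COCO-to-YOLO box conversion that `reorganize_dataset.py` performs (the helper name here is illustrative, not the script's actual API):

```python
def coco_to_yolo(bbox, img_w, img_h):
    """Convert a COCO [x, y, width, height] pixel box (top-left origin)
    to YOLO's normalized [x_center, y_center, width, height]."""
    x, y, w, h = bbox
    return [(x + w / 2) / img_w, (y + h / 2) / img_h, w / img_w, h / img_h]

# A 100x50 box with top-left corner at (200, 150) in a 640x480 image:
print(coco_to_yolo([200, 150, 100, 50], 640, 480))
```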
## 🚀 Quick Start

### 1. Environment Setup

```bash
# Clone the repository
git clone git@143.244.157.110:dillon_stuff/saw_mill_knot_detection.git
cd saw_mill_knot_detection

# Create virtual environment
python -m venv .venv
source .venv/bin/activate

# Install dependencies
pip install -U pip
pip install ultralytics gradio
```

### 2. Download Dataset

The dataset is not included in the repository due to its size. Download it from Kaggle and prepare it as follows:

```bash
# Download from Kaggle (requires the Kaggle API)
kaggle datasets download -d kirs0816/wood-surface-defects
unzip wood-surface-defects.zip

# Run the dataset preparation scripts
python split_coco_dataset.py
python reorganize_dataset.py
```

### 3. Launch Annotation GUI

```bash
python annotation_gui.py
```

Open http://localhost:7860 in your browser to access the web-based annotation interface, which provides:

- Image navigation with index display
- Auto-labeling with the trained YOLOX model
- Manual annotation tools
- Real-time result visualization

### 4. Train Model

```bash
python train_yolox.py --dataset-dir dataset_split --model yolox-nano --epochs 5 --batch-size 4
```

## 📁 Project Structure

```
saw_mill_knot_detection/
├── annotation_gui.py         # Gradio web interface for annotation
├── train_yolox.py            # YOLOX training script
├── split_coco_dataset.py     # Dataset splitting utility
├── reorganize_dataset.py     # Dataset reorganization to YOLO format
├── config.py                 # Configuration settings
├── dataset_split/            # Training data (excluded from git)
│   ├── train/
│   │   ├── images/           # Training images
│   │   └── labels/           # YOLO format labels
│   ├── valid/
│   │   ├── images/           # Validation images
│   │   └── labels/           # YOLO format labels
│   ├── test/
│   │   ├── images/           # Test images
│   │   └── labels/           # YOLO format labels
│   └── data.yaml             # YOLO dataset configuration
├── runs/                     # Training outputs (excluded from git)
│   └── yolox_training/
│       └── training/
│           └── weights/
│               ├── best.pt   # Best model weights
│               └── last.pt   # Latest model weights
├── bbox_coco_dataset.json    # Original COCO annotations
├── requirements.txt          # Python dependencies
├── .gitignore                # Excludes large data files
└── README.md                 # This file
```

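The `data.yaml` above ties the split directories to the class list. A sketch of generating one in the standard Ultralytics layout (the exact file written by `reorganize_dataset.py` may differ in details):

```python
# Wood defect class names, in index order (matches config.py)
CLASSES = ['Live knot', 'Dead knot', 'Knot with crack', 'Crack',
           'Resin', 'Marrow', 'Quartzity', 'Knot missing',
           'Blue stain', 'Overgrown']

# Standard Ultralytics dataset YAML: root path, split dirs, class names
lines = ['path: dataset_split', 'train: train/images',
         'val: valid/images', 'test: test/images', 'names:']
lines += [f'  {i}: {name}' for i, name in enumerate(CLASSES)]
data_yaml = '\n'.join(lines)
print(data_yaml)
```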
## 🛠️ Usage Guide

### Annotation GUI Features

The Gradio-based annotation interface provides:

- **Image Navigation**: Browse through the dataset with a current index display
- **Auto-Labeling**: One-click defect detection using the trained YOLOX model
- **Manual Annotation**: Draw bounding boxes for corrections
- **Real-time Visualization**: Immediate display of detection results
- **Export Options**: Save annotations in multiple formats

### Training

```bash
# Basic training
python train_yolox.py --dataset-dir dataset_split --model yolox-nano --epochs 10

# Advanced training with custom parameters
python train_yolox.py \
    --dataset-dir dataset_split \
    --model yolox-nano \
    --epochs 20 \
    --batch-size 8 \
    --img-size 640
```

### Inference

```python
from ultralytics import YOLO

# Load trained model
model = YOLO('runs/yolox_training/training/weights/best.pt')

# Predict on image
results = model.predict('path/to/image.jpg', conf=0.4)

# Process results
for result in results:
    boxes = result.boxes                  # Bounding boxes
    for box in boxes:
        cls = int(box.cls)                # Class index
        conf = float(box.conf)            # Confidence score
        xyxy = box.xyxy.tolist()[0]       # Box coordinates
```

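The integer `cls` above is an index into the class list defined in `config.py`. A small helper (hypothetical, not part of the repo) for turning raw `(class_index, confidence)` pairs into named detections:

```python
# Class names in index order, as in config.py
WOOD_DEFECT_CLASSES = [
    'Live knot', 'Dead knot', 'Knot with crack', 'Crack',
    'Resin', 'Marrow', 'Quartzity', 'Knot missing',
    'Blue stain', 'Overgrown'
]

def readable_detections(raw, min_conf=0.4):
    """Map (class_index, confidence) pairs to (name, confidence),
    dropping detections below the confidence threshold."""
    return [(WOOD_DEFECT_CLASSES[c], conf) for c, conf in raw if conf >= min_conf]

# e.g. class 0 at 0.91 confidence; class 3 at 0.22 is filtered out
print(readable_detections([(0, 0.91), (3, 0.22)]))
```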
## 🔧 Configuration

Key settings in `config.py`:

```python
DEFAULT_MODEL_WEIGHTS = "runs/yolox_training/training/weights/best.pt"
DEFAULT_IMAGES_DIR = "IMAGE/"
WOOD_DEFECT_CLASSES = [
    'Live knot', 'Dead knot', 'Knot with crack', 'Crack',
    'Resin', 'Marrow', 'Quartzity', 'Knot missing',
    'Blue stain', 'Overgrown'
]
```

## 📈 Model Performance

**YOLOX-nano Results** (5 epochs):

- mAP50: 0.612
- mAP50-95: 0.357
- Precision: 0.68
- Recall: 0.55

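For reference, these precision and recall figures correspond to an F1 score of roughly 0.61:

```python
# F1 is the harmonic mean of precision and recall
precision, recall = 0.68, 0.55
f1 = 2 * precision * recall / (precision + recall)
print(round(f1, 3))  # ≈ 0.608
```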
## 🎯 Deployment on OAK-D

The trained model can be exported for OAK-D deployment:

```python
from ultralytics import YOLO

# Load and export model
model = YOLO('runs/yolox_training/training/weights/best.pt')
model.export(format='onnx')  # Export to ONNX for OAK-D
```

## 🤝 Contributing

1. Fork the repository
2. Create a feature branch
3. Make your changes
4. Test thoroughly
5. Submit a pull request

## 📄 License

This project uses the Kaggle Wood Surface Defects dataset. Please refer to the original dataset license for usage terms.

## 🙏 Acknowledgments

- Kaggle for providing the wood surface defects dataset
- Ultralytics for the YOLO framework
- Gradio for the web interface framework