IDcardsGenerator/README.md

# ID Card Data Augmentation Pipeline

A comprehensive data augmentation pipeline for ID card images with YOLO-based detection and advanced augmentation techniques.

![Pipeline Overview](docs/images/yolov8_pipeline.png)

## 🚀 Features

### **YOLO-based ID Card Detection**
- Automatic detection and cropping of ID cards from large images
- Configurable confidence and IoU thresholds
- Multiple cropping modes (bbox, square, aspect_ratio)
- Padding and target size customization

### **Advanced Data Augmentation**
- **Geometric Transformations**: Rotation with multiple angles
- **Random Cropping**: Simulates partially visible cards
- **Noise Addition**: Simulates worn-out cards
- **Partial Blockage**: Simulates occluded card details
- **Blurring**: Simulates blurred but readable images
- **Brightness/Contrast**: Mimics different lighting conditions
- **Grayscale Conversion**: Final preprocessing step for all images

### **Flexible Configuration**
- YAML-based configuration system
- Command-line argument overrides
- Environment-specific settings
- Comprehensive logging

## 📋 Requirements

```bash
# Python 3.8+
conda create -n gpu python=3.8
conda activate gpu

# Install dependencies
pip install -r requirements.txt
```

### Dependencies
- `opencv-python>=4.5.0`
- `numpy>=1.21.0`
- `Pillow>=8.3.0`
- `PyYAML>=5.4.0`
- `ultralytics>=8.0.0` (for YOLO models)

## 🛠️ Installation

1. **Clone the repository**
```bash
git clone <repository-url>
cd IDcardsGenerator
```

2. **Install dependencies**
```bash
pip install -r requirements.txt
```

3. **Prepare YOLO model** (optional)
```bash
# Place your trained YOLO model at:
data/weights/id_cards_yolov8n.pt
```

## 📖 Usage

### **Basic Usage**

```bash
# Run with default configuration
python main.py

# Run with ID card detection enabled
python main.py --enable-id-detection

# Run with custom input/output directories
python main.py --input-dir "path/to/input" --output-dir "path/to/output"
```

### **Configuration Options**

#### **ID Card Detection**
```bash
# Enable detection with custom model
python main.py --enable-id-detection --model-path "path/to/model.pt"

# Adjust detection parameters
python main.py --enable-id-detection --confidence 0.3 --crop-mode square

# Set target size for cropped cards
python main.py --enable-id-detection --crop-target-size "640,640"
```

#### **Data Augmentation**
```bash
# Customize augmentation parameters
python main.py --num-augmentations 5 --target-size "512,512"

# Preview augmentation results
python main.py --preview
```

### **Configuration File**

Edit `config/config.yaml` for persistent settings:

```yaml
# ID Card Detection
id_card_detection:
  enabled: false  # Enable/disable YOLO detection
  model_path: "data/weights/id_cards_yolov8n.pt"
  confidence_threshold: 0.25
  iou_threshold: 0.45
  padding: 10
  crop_mode: "bbox"
  target_size: null

# Data Augmentation
augmentation:
  rotation:
    enabled: true
    angles: [30, 60, 120, 150, 180, 210, 240, 300, 330]
  random_cropping:
    enabled: true
    ratio_range: [0.7, 1.0]
  random_noise:
    enabled: true
    mean_range: [0.0, 0.7]
    variance_range: [0.0, 0.1]
  partial_blockage:
    enabled: true
    coverage_range: [0.0, 0.25]
  blurring:
    enabled: true
    kernel_ratio_range: [0.0, 0.0084]
  brightness_contrast:
    enabled: true
    alpha_range: [0.4, 3.0]
    beta_range: [1, 100]
  grayscale:
    enabled: true  # Applied as final step

# Processing
processing:
  target_size: [640, 640]
  num_augmentations: 3
  save_format: "jpg"
  quality: 95
```

## 🔄 Workflow

### **Two-Step Processing Pipeline**

#### **Step 1: ID Card Detection (Optional)**
When `id_card_detection.enabled: true`:
1. **Input**: Large images containing multiple ID cards
2. **YOLO Detection**: Locate and detect ID cards
3. **Cropping**: Extract individual ID cards with padding
4. **Output**: Cropped ID cards saved to `out/processed/`

#### **Step 2: Data Augmentation**
1. **Input**: Original images OR cropped ID cards
2. **Augmentation**: Apply 6 augmentation methods:
   - Rotation (9 different angles)
   - Random cropping (70-100% ratio)
   - Random noise (simulate wear)
   - Partial blockage (simulate occlusion)
   - Blurring (simulate motion blur)
   - Brightness/Contrast adjustment
3. **Grayscale**: Convert all images to grayscale (final step)
4. **Output**: Augmented images in main output directory

### **Direct Augmentation Mode**
When `id_card_detection.enabled: false`:
- Skips YOLO detection
- Applies augmentation directly to input images
- All images are converted to grayscale

## 📊 Output Structure

```
output_directory/
├── processed/                    # Cropped ID cards (if detection enabled)
│   ├── id_card_001.jpg
│   ├── id_card_002.jpg
│   └── processing_summary.json
├── im1__rotation_01.png         # Augmented images
├── im1__cropping_01.png
├── im1__noise_01.png
├── im1__blockage_01.png
├── im1__blurring_01.png
├── im1__brightness_contrast_01.png
└── augmentation_summary.json
```

## 🎯 Use Cases

### **Training Data Generation**
```bash
# Generate diverse training data
python main.py --enable-id-detection --num-augmentations 10
```

### **Quality Control**
```bash
# Preview results before processing
python main.py --preview
```

### **Batch Processing**
```bash
# Process large datasets
python main.py --input-dir "large_dataset/" --output-dir "augmented_dataset/"
```

## ⚙️ Advanced Configuration

### **Custom Augmentation Parameters**

```yaml
augmentation:
  rotation:
    angles: [45, 90, 135, 180, 225, 270, 315]  # Custom angles
  random_cropping:
    ratio_range: [0.8, 0.95]  # Tighter cropping
  random_noise:
    mean_range: [0.1, 0.5]    # More noise
    variance_range: [0.05, 0.15]
```

### **Performance Optimization**

```yaml
performance:
  num_workers: 4
  prefetch_factor: 2
  pin_memory: true
  use_gpu: false
```

## 📝 Logging

The system provides comprehensive logging:
- **File**: `logs/data_augmentation.log`
- **Console**: Real-time progress updates
- **Summary**: JSON files with processing statistics

### **Log Levels**
- `INFO`: General processing information
- `WARNING`: Non-critical issues (e.g., no cards detected)
- `ERROR`: Critical errors

## 🔧 Troubleshooting

### **Common Issues**

1. **No images detected**
   - Check input directory path
   - Verify image formats (jpg, png, bmp, tiff)
   - Ensure images are not corrupted

2. **YOLO model not found**
   - Place model file at `data/weights/id_cards_yolov8n.pt`
   - Or specify custom path with `--model-path`

3. **Memory issues**
   - Reduce `num_augmentations`
   - Use smaller `target_size`
   - Enable GPU if available

### **Performance Tips**

- **GPU Acceleration**: Set `use_gpu: true` in config
- **Batch Processing**: Use multiple workers for large datasets
- **Memory Management**: Process in smaller batches

## 🤝 Contributing

1. Fork the repository
2. Create a feature branch
3. Make your changes
4. Add tests if applicable
5. Submit a pull request

## 📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

## 🙏 Acknowledgments

- **YOLOv8**: Ultralytics for the detection framework
- **OpenCV**: Computer vision operations
- **NumPy**: Numerical computations

---

**For questions and support, please open an issue on GitHub.**
update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`# ID Card Data Augmentation Pipeline`
done 7 transformations 2025-08-05 21:42:23 +07:00
update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`A comprehensive data augmentation pipeline for ID card images with YOLO-based detection and advanced augmentation techniques.`
done 7 transformations 2025-08-05 21:42:23 +07:00
update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`![Pipeline Overview](docs/images/yolov8_pipeline.png)`
done 7 transformations 2025-08-05 21:42:23 +07:00
update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`## 🚀 Features`
done 7 transformations 2025-08-05 21:42:23 +07:00
update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`### YOLO-based ID Card Detection`
			`- Automatic detection and cropping of ID cards from large images`
			`- Configurable confidence and IoU thresholds`
			`- Multiple cropping modes (bbox, square, aspect_ratio)`
			`- Padding and target size customization`
done 7 transformations 2025-08-05 21:42:23 +07:00
update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`### Advanced Data Augmentation`
			`- Geometric Transformations: Rotation with multiple angles`
			`- Random Cropping: Simulates partially visible cards`
			`- Noise Addition: Simulates worn-out cards`
			`- Partial Blockage: Simulates occluded card details`
			`- Blurring: Simulates blurred but readable images`
			`- Brightness/Contrast: Mimics different lighting conditions`
			`- Grayscale Conversion: Final preprocessing step for all images`
done 7 transformations 2025-08-05 21:42:23 +07:00
update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`### Flexible Configuration`
			`- YAML-based configuration system`
			`- Command-line argument overrides`
			`- Environment-specific settings`
			`- Comprehensive logging`
done 7 transformations 2025-08-05 21:42:23 +07:00
update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`## 📋 Requirements`
done 7 transformations 2025-08-05 21:42:23 +07:00
update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			```bash
			`# Python 3.8+`
			`conda create -n gpu python=3.8`
			`conda activate gpu`
done 7 transformations 2025-08-05 21:42:23 +07:00
update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`# Install dependencies`
			`pip install -r requirements.txt`
			```
done 7 transformations 2025-08-05 21:42:23 +07:00
update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`### Dependencies`
			- `opencv-python>=4.5.0`
			- `numpy>=1.21.0`
			- `Pillow>=8.3.0`
			- `PyYAML>=5.4.0`
			- `ultralytics>=8.0.0` (for YOLO models)
done 7 transformations 2025-08-05 21:42:23 +07:00
update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`## 🛠️ Installation`
done 7 transformations 2025-08-05 21:42:23 +07:00
update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`1. Clone the repository`
done 7 transformations 2025-08-05 21:42:23 +07:00			```bash
			`git clone <repository-url>`
			`cd IDcardsGenerator`
			```

update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`2. Install dependencies`
done 7 transformations 2025-08-05 21:42:23 +07:00			```bash
update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`pip install -r requirements.txt`
done 7 transformations 2025-08-05 21:42:23 +07:00			```

update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`3. Prepare YOLO model (optional)`
done 7 transformations 2025-08-05 21:42:23 +07:00			```bash
update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`# Place your trained YOLO model at:`
			`data/weights/id_cards_yolov8n.pt`
done 7 transformations 2025-08-05 21:42:23 +07:00			```

update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`## 📖 Usage`

			`### Basic Usage`
done 7 transformations 2025-08-05 21:42:23 +07:00
update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			```bash
			`# Run with default configuration`
			`python main.py`

			`# Run with ID card detection enabled`
			`python main.py --enable-id-detection`

			`# Run with custom input/output directories`
			`python main.py --input-dir "path/to/input" --output-dir "path/to/output"`
done 7 transformations 2025-08-05 21:42:23 +07:00			```
update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00
			`### Configuration Options`

			`#### ID Card Detection`
			```bash
			`# Enable detection with custom model`
			`python main.py --enable-id-detection --model-path "path/to/model.pt"`

			`# Adjust detection parameters`
			`python main.py --enable-id-detection --confidence 0.3 --crop-mode square`

			`# Set target size for cropped cards`
			`python main.py --enable-id-detection --crop-target-size "640,640"`
			```

			`#### Data Augmentation`
			```bash
			`# Customize augmentation parameters`
			`python main.py --num-augmentations 5 --target-size "512,512"`

			`# Preview augmentation results`
			`python main.py --preview`
done 7 transformations 2025-08-05 21:42:23 +07:00			```

update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`### Configuration File`
done 7 transformations 2025-08-05 21:42:23 +07:00
update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			Edit `config/config.yaml` for persistent settings:
done 7 transformations 2025-08-05 21:42:23 +07:00
			```yaml
update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`# ID Card Detection`
			`id_card_detection:`
			`enabled: false # Enable/disable YOLO detection`
			`model_path: "data/weights/id_cards_yolov8n.pt"`
			`confidence_threshold: 0.25`
			`iou_threshold: 0.45`
			`padding: 10`
			`crop_mode: "bbox"`
			`target_size: null`

			`# Data Augmentation`
done 7 transformations 2025-08-05 21:42:23 +07:00			`augmentation:`
			`rotation:`
			`enabled: true`
			`angles: [30, 60, 120, 150, 180, 210, 240, 300, 330]`
			`random_cropping:`
			`enabled: true`
			`ratio_range: [0.7, 1.0]`
			`random_noise:`
			`enabled: true`
			`mean_range: [0.0, 0.7]`
			`variance_range: [0.0, 0.1]`
			`partial_blockage:`
			`enabled: true`
			`coverage_range: [0.0, 0.25]`
			`blurring:`
			`enabled: true`
			`kernel_ratio_range: [0.0, 0.0084]`
			`brightness_contrast:`
			`enabled: true`
			`alpha_range: [0.4, 3.0]`
			`beta_range: [1, 100]`
update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`grayscale:`
			`enabled: true # Applied as final step`
done 7 transformations 2025-08-05 21:42:23 +07:00
update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`# Processing`
done 7 transformations 2025-08-05 21:42:23 +07:00			`processing:`
			`target_size: [640, 640]`
			`num_augmentations: 3`
			`save_format: "jpg"`
			`quality: 95`
			```

update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`## 🔄 Workflow`

			`### Two-Step Processing Pipeline`

			`#### Step 1: ID Card Detection (Optional)`
			When `id_card_detection.enabled: true`:
			`1. Input: Large images containing multiple ID cards`
			`2. YOLO Detection: Locate and detect ID cards`
			`3. Cropping: Extract individual ID cards with padding`
			4. Output: Cropped ID cards saved to `out/processed/`

			`#### Step 2: Data Augmentation`
			`1. Input: Original images OR cropped ID cards`
			`2. Augmentation: Apply 6 augmentation methods:`
			`- Rotation (9 different angles)`
			`- Random cropping (70-100% ratio)`
			`- Random noise (simulate wear)`
			`- Partial blockage (simulate occlusion)`
			`- Blurring (simulate motion blur)`
			`- Brightness/Contrast adjustment`
			`3. Grayscale: Convert all images to grayscale (final step)`
			`4. Output: Augmented images in main output directory`

			`### Direct Augmentation Mode`
			When `id_card_detection.enabled: false`:
			`- Skips YOLO detection`
			`- Applies augmentation directly to input images`
			`- All images are converted to grayscale`

			`## 📊 Output Structure`
done 7 transformations 2025-08-05 21:42:23 +07:00
			```
update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`output_directory/`
			`├── processed/ # Cropped ID cards (if detection enabled)`
			`│ ├── id_card_001.jpg`
			`│ ├── id_card_002.jpg`
			`│ └── processing_summary.json`
			`├── im1__rotation_01.png # Augmented images`
			`├── im1__cropping_01.png`
			`├── im1__noise_01.png`
			`├── im1__blockage_01.png`
			`├── im1__blurring_01.png`
			`├── im1__brightness_contrast_01.png`
			`└── augmentation_summary.json`
done 7 transformations 2025-08-05 21:42:23 +07:00			```

update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`## 🎯 Use Cases`
done 7 transformations 2025-08-05 21:42:23 +07:00
update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`### Training Data Generation`
done 7 transformations 2025-08-05 21:42:23 +07:00			```bash
update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`# Generate diverse training data`
			`python main.py --enable-id-detection --num-augmentations 10`
done 7 transformations 2025-08-05 21:42:23 +07:00			```

update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`### Quality Control`
done 7 transformations 2025-08-05 21:42:23 +07:00			```bash
update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`# Preview results before processing`
			`python main.py --preview`
done 7 transformations 2025-08-05 21:42:23 +07:00			```

update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`### Batch Processing`
done 7 transformations 2025-08-05 21:42:23 +07:00			```bash
update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`# Process large datasets`
			`python main.py --input-dir "large_dataset/" --output-dir "augmented_dataset/"`
done 7 transformations 2025-08-05 21:42:23 +07:00			```

update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`## ⚙️ Advanced Configuration`
done 7 transformations 2025-08-05 21:42:23 +07:00
update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`### Custom Augmentation Parameters`
done 7 transformations 2025-08-05 21:42:23 +07:00
update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			```yaml
			`augmentation:`
			`rotation:`
			`angles: [45, 90, 135, 180, 225, 270, 315] # Custom angles`
			`random_cropping:`
			`ratio_range: [0.8, 0.95] # Tighter cropping`
			`random_noise:`
			`mean_range: [0.1, 0.5] # More noise`
			`variance_range: [0.05, 0.15]`
done 7 transformations 2025-08-05 21:42:23 +07:00			```

update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`### Performance Optimization`
done 7 transformations 2025-08-05 21:42:23 +07:00
update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			```yaml
			`performance:`
			`num_workers: 4`
			`prefetch_factor: 2`
			`pin_memory: true`
			`use_gpu: false`
done 7 transformations 2025-08-05 21:42:23 +07:00			```

update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`## 📝 Logging`
done 7 transformations 2025-08-05 21:42:23 +07:00
update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`The system provides comprehensive logging:`
			- File: `logs/data_augmentation.log`
			`- Console: Real-time progress updates`
			`- Summary: JSON files with processing statistics`
done 7 transformations 2025-08-05 21:42:23 +07:00
update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`### Log Levels`
			- `INFO`: General processing information
			- `WARNING`: Non-critical issues (e.g., no cards detected)
			- `ERROR`: Critical errors
done 7 transformations 2025-08-05 21:42:23 +07:00
update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`## 🔧 Troubleshooting`
done 7 transformations 2025-08-05 21:42:23 +07:00
update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`### Common Issues`
done 7 transformations 2025-08-05 21:42:23 +07:00
update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`1. No images detected`
			`- Check input directory path`
			`- Verify image formats (jpg, png, bmp, tiff)`
			`- Ensure images are not corrupted`
done 7 transformations 2025-08-05 21:42:23 +07:00
update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`2. YOLO model not found`
			- Place model file at `data/weights/id_cards_yolov8n.pt`
			- Or specify custom path with `--model-path`
done 7 transformations 2025-08-05 21:42:23 +07:00
update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`3. Memory issues`
			- Reduce `num_augmentations`
			- Use smaller `target_size`
			`- Enable GPU if available`
done 7 transformations 2025-08-05 21:42:23 +07:00
update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`### Performance Tips`
done 7 transformations 2025-08-05 21:42:23 +07:00
update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			- GPU Acceleration: Set `use_gpu: true` in config
			`- Batch Processing: Use multiple workers for large datasets`
			`- Memory Management: Process in smaller batches`
done 7 transformations 2025-08-05 21:42:23 +07:00
			`## 🤝 Contributing`

			`1. Fork the repository`
			`2. Create a feature branch`
			`3. Make your changes`
update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`4. Add tests if applicable`
done 7 transformations 2025-08-05 21:42:23 +07:00			`5. Submit a pull request`

			`## 📄 License`

			`This project is licensed under the MIT License - see the LICENSE file for details.`

			`## 🙏 Acknowledgments`

update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`- YOLOv8: Ultralytics for the detection framework`
			`- OpenCV: Computer vision operations`
			`- NumPy: Numerical computations`
done 7 transformations 2025-08-05 21:42:23 +07:00
			`---`

update augment + YOLO pipeline 2025-08-06 20:52:39 +07:00			`For questions and support, please open an issue on GitHub.`