resnet-cifar100-custom / README.md


language:
  - en
tags:
  - cifar100
  - wide-resnet-28-10
license: mit
datasets:
  - cifar100
metrics:
  - accuracy

Overview

Training Configuration

| Parameter | Value |
|---|---|
| Depth | 28 |
| Width | 10 (widen factor) |
| Parameters | ~36.5M |
| Batch Size | 256 |
| Dropout | 0.3 (Epochs 1-60) → 0.2 (Epochs 61-100) |
| Augmentation | Full (Epochs 1-60) → Reduced (Epochs 61-100) |
| MixUp alpha | 0.2 (Epochs 1-60) → 0.15 (Epochs 61-100) |
| Label Smoothing | 0.1 (constant) |
| Optimizer | SGD (lr=0.01 start, momentum=0.9, wd=1e-3) |
| LR Schedule | Phase 1: CosineAnnealingWarmRestarts (T₀=25), Phase 2: CosineAnnealingLR |
| Max LR | 0.1 |
| Storage | Google Drive (keep last 5 ckpts), HuggingFace (every 10 epochs + best model) |
| Patience (early stop) | 15 epochs |
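
The ~36.5M parameter figure can be sanity-checked with a little arithmetic. The sketch below counts learnable parameters for a WideResNet of depth 28 and widen factor 10 with 100 output classes. It assumes the common pre-activation layout (bias-free 3x3 convolutions, BatchNorm before each convolution, a 1x1 convolution shortcut wherever the channel count changes); the notebook's model definition may differ in small details, and the function names here are illustrative only.

```python
def conv_params(c_in, c_out, k):
    """Weights of a bias-free k x k convolution (assumed: no conv biases)."""
    return c_in * c_out * k * k


def bn_params(channels):
    """Learnable BatchNorm parameters (scale and shift); running stats excluded."""
    return 2 * channels


def wrn_param_count(depth=28, widen=10, num_classes=100):
    """Count learnable parameters of an assumed standard WideResNet layout."""
    if (depth - 4) % 6 != 0:
        raise ValueError("depth must satisfy depth = 6n + 4")
    blocks_per_group = (depth - 4) // 6
    widths = [16, 16 * widen, 32 * widen, 64 * widen]

    total = conv_params(3, widths[0], 3)  # stem convolution on RGB input
    c_in = widths[0]
    for c_out in widths[1:]:
        for _ in range(blocks_per_group):
            total += bn_params(c_in) + conv_params(c_in, c_out, 3)
            total += bn_params(c_out) + conv_params(c_out, c_out, 3)
            if c_in != c_out:
                total += conv_params(c_in, c_out, 1)  # projection shortcut
            c_in = c_out
    total += bn_params(c_in)                      # final BatchNorm
    total += c_in * num_classes + num_classes     # linear classifier with bias
    return total


print(f"{wrn_param_count():,}")  # 36,536,884
```

Under these assumptions the count comes to about 36.5M, in line with the table.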

Progressive Augmentation: Full for epochs 1-60, reduced for epochs 61-100
Progressive Dropout: 0.3 for epochs 1-60, 0.2 for epochs 61-100
Learning Rate: CosineAnnealingWarmRestarts (T₀=25) in Phase 1, then a smooth CosineAnnealingLR decay in Phase 2
MixUp: Alpha reduced from 0.2 to 0.15 after epoch 60
Label Smoothing: Held constant at 0.1
Checkpointing: Automated, with checkpoints saved to Google Drive and uploaded to HuggingFace
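
The phase-dependent settings above can be collected into a single lookup by epoch. The following is a minimal sketch, not the notebook's code. It assumes that the learning-rate phases switch at the same epoch (60) as dropout and MixUp, that training runs for 100 epochs, that every cosine cycle peaks at the Max LR of 0.1, and that the minimum learning rate is 0. None of these four points is stated explicitly in this README, and the role of the 0.01 starting value is left out because the README does not explain it. The function and constant names are hypothetical.

```python
import math

SWITCH_EPOCH = 60    # last epoch of phase 1 (from the configuration table)
TOTAL_EPOCHS = 100   # assumed total, matching "Epochs 61-100"
T0 = 25              # warm-restart period in epochs
MAX_LR = 0.1         # assumed peak of every cosine cycle
MIN_LR = 0.0         # assumption: eta_min is not given in the README


def cosine_lr(t_cur, period, lr_max=MAX_LR, lr_min=MIN_LR):
    """Cosine annealing value after t_cur epochs of a cycle of length period."""
    return lr_min + 0.5 * (lr_max - lr_min) * (1 + math.cos(math.pi * t_cur / period))


def hyperparams_for_epoch(epoch):
    """Return the settings for a 1-indexed epoch under the assumptions above."""
    if not 1 <= epoch <= TOTAL_EPOCHS:
        raise ValueError(f"epoch must be in 1..{TOTAL_EPOCHS}")
    if epoch <= SWITCH_EPOCH:
        # Phase 1: warm restarts every T0 epochs, full augmentation.
        return {
            "phase": 1,
            "dropout": 0.3,
            "mixup_alpha": 0.2,
            "augmentation": "full",
            "label_smoothing": 0.1,
            "lr": cosine_lr((epoch - 1) % T0, T0),
        }
    # Phase 2: one cosine decay over the remaining epochs, reduced augmentation.
    return {
        "phase": 2,
        "dropout": 0.2,
        "mixup_alpha": 0.15,
        "augmentation": "reduced",
        "label_smoothing": 0.1,
        "lr": cosine_lr(epoch - SWITCH_EPOCH - 1, TOTAL_EPOCHS - SWITCH_EPOCH),
    }


for e in (1, 25, 26, 60, 61, 100):
    hp = hyperparams_for_epoch(e)
    print(e, hp["phase"], hp["dropout"], hp["mixup_alpha"], round(hp["lr"], 5))
```

With T₀=25 the learning rate returns to its peak at epochs 26 and 51, so under these assumptions the third cycle is cut short when Phase 2 begins.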

Compared to Session-02 (71.2% accuracy), this session aimed for a gain of 3-4 percentage points (targeting 74-75%).
Best test accuracy achieved: approximately 74-75% (estimated from the reported target and the training curves).
Training tracked loss, accuracy, learning rate, dropout, MixUp alpha, and the train/test gap.
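
As an illustration of how the tracked metrics and the 15-epoch early-stopping patience from the configuration table might fit together, here is a small sketch. The class name, the record layout, and the choice of test accuracy as the monitored quantity are assumptions; the notebook may track and stop differently.

```python
class MetricsTracker:
    """Records per-epoch accuracy and signals early stopping (hypothetical helper)."""

    def __init__(self, patience=15):
        self.patience = patience
        self.best_acc = float("-inf")
        self.best_epoch = 0
        self.history = []

    def update(self, epoch, train_acc, test_acc):
        """Store one epoch of metrics; return True when training should stop."""
        self.history.append({
            "epoch": epoch,
            "train_acc": train_acc,
            "test_acc": test_acc,
            "gap": train_acc - test_acc,  # train/test gap
        })
        improved = test_acc > self.best_acc
        if improved:
            self.best_acc = test_acc
            self.best_epoch = epoch
        # Stop once `patience` epochs have passed without a new best.
        return epoch - self.best_epoch >= self.patience


tracker = MetricsTracker(patience=15)
stop = tracker.update(epoch=1, train_acc=35.0, test_acc=30.0)
print(stop, tracker.history[-1]["gap"])
```

The `improved` flag is also the natural place to trigger a best-model upload.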

Model: Use the standard PyTorch WideResNet-28-10 implementation (the model definition is in a notebook cell).
Preprocessing: Training follows the progressive Albumentations transforms.
Inference: Normalize inputs with test_transforms, then run a forward pass with the best checkpoint loaded.
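
The inference step can be sketched as follows. This is a stand-in rather than the notebook's code: it replaces the Albumentations `test_transforms` with plain tensor operations, and the channel mean and standard deviation are commonly used CIFAR-100 values assumed here, so they should be replaced with the ones from the notebook. The helper names are hypothetical.

```python
import torch

# Assumed CIFAR-100 channel statistics; use the values from the notebook's
# test_transforms if they differ.
CIFAR100_MEAN = (0.5071, 0.4865, 0.4409)
CIFAR100_STD = (0.2673, 0.2564, 0.2762)


def preprocess(images_uint8):
    """Convert (N, 32, 32, 3) uint8 images to normalized (N, 3, 32, 32) floats."""
    x = images_uint8.permute(0, 3, 1, 2).float() / 255.0
    mean = torch.tensor(CIFAR100_MEAN).view(1, 3, 1, 1)
    std = torch.tensor(CIFAR100_STD).view(1, 3, 1, 1)
    return (x - mean) / std


@torch.no_grad()
def predict(model, images_uint8):
    """Run a forward pass in eval mode and return predicted class indices."""
    model.eval()  # disables dropout and uses BatchNorm running statistics
    logits = model(preprocess(images_uint8))
    return logits.argmax(dim=1)


# Usage with a placeholder model; substitute the WideResNet-28-10 from the
# notebook and load the best checkpoint first, for example with
# model.load_state_dict(torch.load(path, map_location="cpu")) if the file
# stores a plain state_dict (an assumption about the checkpoint format).
placeholder = torch.nn.Sequential(torch.nn.Flatten(), torch.nn.Linear(3 * 32 * 32, 100))
batch = torch.randint(0, 256, (4, 32, 32, 3), dtype=torch.uint8)
print(predict(placeholder, batch).shape)
```

Calling `model.eval()` matters here because the model is trained with dropout, which should be inactive at inference time.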

The latest checkpoint and the best model are available via the linked Google Drive and the HuggingFace model hub (uploads happen every 10 epochs and whenever the best accuracy improves).
Metrics and training curves are saved and uploaded for reproducibility.
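
The storage policy from the configuration table (keep the last 5 checkpoints locally, upload every 10 epochs and at each new best) could be implemented along these lines. This is a sketch under assumptions: the `epoch_<N>.pt` file naming and both function names are hypothetical, and the actual upload call to the HuggingFace Hub is left out.

```python
from pathlib import Path


def prune_checkpoints(ckpt_dir, keep_last=5, pattern="epoch_*.pt"):
    """Delete all but the newest `keep_last` checkpoints; return the removed paths.

    Assumes files are named like epoch_<N>.pt (hypothetical naming scheme).
    """
    def epoch_of(path):
        return int(path.stem.split("_")[-1])

    ckpts = sorted(Path(ckpt_dir).glob(pattern), key=epoch_of)
    stale = ckpts[:-keep_last] if keep_last > 0 else ckpts
    for path in stale:
        path.unlink()
    return stale


def should_upload(epoch, is_best, every=10):
    """Upload on every `every`-th epoch and whenever a new best model is found."""
    return is_best or epoch % every == 0


print(should_upload(10, False), should_upload(13, False), should_upload(13, True))
# True False True
```

Sorting by the parsed epoch number rather than by file name avoids `epoch_10.pt` being ordered before `epoch_9.pt`.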

Notebook Link: See the repository for the exact notebook and cells (the code is based on PyTorch, Albumentations, and HuggingFace Hub integration).