This project is a single, self-contained Jupyter notebook that demonstrates a full workflow for:
- Pretraining a CNN backbone on CIFAR-10, then
- Fine-tuning (transfer learning) the pretrained backbone on EMNIST Letters (A–Z).
It also includes a small architecture/training ablation (batch size, learning rate, batch norm, dropout, and global pooling) and basic error analysis.
**CIFAR-10 (pretraining)**
- Input: 32×32 RGB images (3 channels)
- Output: 10 classes (airplane, automobile, bird, cat, deer, dog, frog, horse, ship, truck)
- Goal: learn general-purpose visual features (edges, textures, shapes, etc.)
**EMNIST Letters (fine-tuning)**
- Input: grayscale handwritten letters (1 channel)
- Output: 26 classes (A–Z)
- Goal: reuse the pretrained features and adapt the classifier + later layers to the new dataset
**Architecture**
- ConvBlock: Conv → BN → ELU → Dropout2d → Conv → BN, with a skip/shortcut connection
- Stacked stages (64 → 128 → 256 channels)
- Global pooling + MLP classifier head during pretraining
- During fine-tuning:
- The first conv layer is adapted from 3-channel → 1-channel by averaging pretrained RGB weights into grayscale weights.
- Classifier head is replaced for 26 classes.
- Earlier layers are frozen, later stage(s) + head remain trainable.
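The three fine-tuning adaptations above can be sketched as follows. The real notebook model is larger; `backbone` here is a hypothetical stand-in used only to show the mechanics (channel-averaging the first conv, swapping the head, freezing early layers):

```python
import torch
import torch.nn as nn

# Stand-in for the pretrained CIFAR-10 model (hypothetical layout).
backbone = nn.Sequential(
    nn.Conv2d(3, 64, 3, padding=1),    # first conv, pretrained on RGB
    nn.Conv2d(64, 128, 3, padding=1),  # later stage
    nn.AdaptiveAvgPool2d(1),
    nn.Flatten(),
    nn.Linear(128, 10),                # 10-way CIFAR-10 head
)

# 1) Adapt the first conv from 3 -> 1 channel by averaging the RGB kernels.
old = backbone[0]
new_conv = nn.Conv2d(1, old.out_channels, old.kernel_size, padding=old.padding)
with torch.no_grad():
    new_conv.weight.copy_(old.weight.mean(dim=1, keepdim=True))
    if old.bias is not None:
        new_conv.bias.copy_(old.bias)
backbone[0] = new_conv

# 2) Replace the classifier head for 26 letter classes.
backbone[-1] = nn.Linear(128, 26)

# 3) Freeze the earlier layers; later stage(s) + head stay trainable.
for p in backbone[0].parameters():
    p.requires_grad = False

out = backbone(torch.randn(2, 1, 32, 32))
print(out.shape)  # torch.Size([2, 26])
```

Averaging the RGB filters preserves the pretrained edge/texture detectors while making them accept a single grayscale channel.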
- CIFAR-10 split used in the notebook:
- Train: 45,000
- Validation: 5,000
- Test: 10,000
- Augmentations used:
- RandomCrop(32, padding=4)
- RandomHorizontalFlip
- RandAugment
- Normalize (dataset mean/std computed in notebook)
- RandomErasing (p=0.10)
**EMNIST Letters data**
- Train: 124,800
- Test: 20,800
- Normalization: mean=0.5, std=0.5
- Labels are shifted from [1..26] → [0..25] inside the training loop.
**Pretraining setup (CIFAR-10)**
- Epochs: 55
- Optimizer: SGD (momentum=0.9, weight_decay=5e-4)
- Loss: CrossEntropyLoss with label smoothing (0.1)
- LR schedule:
- Linear warmup (first 5 epochs)
- then Cosine annealing
- Checkpointing: saves `model_checkpoint_epoch_<N>.pth` after each epoch
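The optimization setup above can be sketched like this. The warmup-then-cosine schedule is expressed here with `SequentialLR`; the notebook may chain the two phases differently, and the toy `nn.Linear` model is just a stand-in for the CNN:

```python
import torch
import torch.nn as nn

model = nn.Linear(10, 10)  # stand-in for the CNN backbone
criterion = nn.CrossEntropyLoss(label_smoothing=0.1)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1,
                            momentum=0.9, weight_decay=5e-4)

epochs, warmup = 55, 5
scheduler = torch.optim.lr_scheduler.SequentialLR(
    optimizer,
    schedulers=[
        # Linear warmup over the first 5 epochs...
        torch.optim.lr_scheduler.LinearLR(optimizer, start_factor=0.1,
                                          total_iters=warmup),
        # ...then cosine annealing for the remaining epochs.
        torch.optim.lr_scheduler.CosineAnnealingLR(optimizer,
                                                   T_max=epochs - warmup),
    ],
    milestones=[warmup],
)

for epoch in range(epochs):
    # ... forward/backward passes over the training set would go here ...
    optimizer.step()   # dummy step; in the real loop this follows backward()
    scheduler.step()   # stepped once per epoch
    # torch.save(model.state_dict(), f"model_checkpoint_epoch_{epoch}.pth")
```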
**Pretraining results**
- Best validation accuracy: 76.40%
- Test accuracy: 76.43%
- Misclassifications collected: 2357 examples
**Fine-tuning (EMNIST Letters)**
- Pretrained checkpoint loaded: `model_checkpoint_epoch_30.pth`
- Fine-tuning epochs: 15
- Optimizer: Adam on trainable parameters (lr=1e-3, weight_decay=1e-4)
- Scheduler: ReduceLROnPlateau
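A sketch of this fine-tuning setup: Adam is built over only the parameters left trainable after freezing, and `ReduceLROnPlateau` is stepped with the validation metric each epoch. The small `nn.Sequential` model and `val_acc` value are placeholders:

```python
import torch
import torch.nn as nn

model = nn.Sequential(nn.Linear(128, 64), nn.ReLU(), nn.Linear(64, 26))
for p in model[0].parameters():   # freeze the "early" part, as in the notebook
    p.requires_grad = False

# Optimize only the trainable parameters.
optimizer = torch.optim.Adam(
    (p for p in model.parameters() if p.requires_grad),
    lr=1e-3, weight_decay=1e-4,
)
# mode="max" because the monitored metric is accuracy (higher is better).
scheduler = torch.optim.lr_scheduler.ReduceLROnPlateau(optimizer, mode="max")

val_acc = 0.5                     # placeholder validation accuracy
scheduler.step(val_acc)           # called once per epoch with the metric
```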
- Best EMNIST test accuracy: 93.72%
- Saved model: `best_emnist_finetuned.pth`
**How to run**
- Clone the repo:
  `git clone https://github.com/maheenrazza/cifar10-pretraining-emnist-finetuning.git`
  `cd cifar10-pretraining-emnist-finetuning`
- Install dependencies:
  `pip install -r requirements.txt`
- Launch Jupyter:
  `jupyter notebook`
- Open and run `pretraining_and_finetuning.ipynb`
Alternatively, upload the notebook to Google Colab and run the cells top to bottom (GPU recommended).
**Repository contents**
- `pretraining_and_finetuning.ipynb` — main notebook (pretraining + fine-tuning + experiments)
- `requirements.txt` — Python dependencies
- `.gitignore` — ignores notebook checkpoints, caches, etc.