Handwritten Alphanumeric model
Model details
Dataset
Accuracy achieved
Model trained and inference on
Average across all datasets
Reasoning - As model is trained on the existing dataset, doesn't perform good on NIST dataset
Model details
Dataset
Accuracy achieved
Existing dataset + NIST misclassifications
Reasoning - After training the model on NIST misclassifications, improvement in accuracy
Model details
Dataset
Accuracy achieved
Existing dataset + NIST misclassifications
Reasoning - After training the model on NIST misclassifications as per previous checkpoint, improvement in accuracy
Model details
Dataset
Accuracy achieved
Existing dataset + manually collected dataset from sheets (~1K samples)
Reasoning - Training the model upon previous checkpoint and adding manually collected data, improvement in accuracy
Handwritten Digits model
Model details
Dataset
Accuracy achieved
Model trained and inference on
Obtained production data (~50 samples)
Average across all datasets
Reasoning - As model is trained on the existing dataset, doesn't perform good on NIST dataset
Model details
Dataset
Accuracy achieved
Existing dataset + NIST misclasifications
Reasoning - After training the model on NIST misclassifications, improvement in accuracy
Model details
Dataset
Accuracy achieved
Existing dataset + NIST misclasifications + production dataset (~50 samples)
Reasoning: After training the model on NIST misclassifications and production dataset, improvement in accuracy
Model details
Dataset
Accuracy achieved
Existing dataset + NIST misclasifications + manually collected dataset from sheets (~8.6k samples)
Reasoning: As averaging upon a large production dataset, the accuracy slightly dips as compared to iteration 3
Sample dataset images
Existing dataset
Handwritten alphanumeric

Handwritten digits

Handwritten alphanumeric

Handwritten digits

Manually collected
Handwritten alphanumeric

Handwritten digits

Some unhandled misclassifications

Reasoning: Generally occurs if the digits are written in corners of the cell