ARDIS (Arkiv Digital Sweden) is a new image-based dataset of historical handwritten digits. The images are extracted from 15,000 Swedish church records written by different priests, in a variety of handwriting styles, during the nineteenth and twentieth centuries. The dataset consists of three single-digit datasets and one digit-string dataset. The digit-string dataset includes 10,000 samples in the Red-Green-Blue (RGB) color space, whereas the other datasets contain 7,600 single-digit images in different color spaces. Figure 1 illustrates handwritten digit images from the different datasets in ARDIS.
I. Description of the Data Sets
II. Use of the Materials
III. Download Links
#### ARDIS DATASET_II:
This dataset contains 7600 corrupted and noisy handwritten digit images. You can use 6600 images for training and 1000 for testing.
ARDIS_DATASET_II download link: Click here
#### ARDIS DATASET_III:
This dataset contains 7600 handwritten digit images with clean background. You can use 6600 images for training and 1000 for testing.
ARDIS_DATASET_III download link: Click here
ARDIS_DATASET_IV download link: Click here
#### In Python
    # reshape to be [samples][channels][height][width]
    # (shape[0] is the number of samples; each image is 28 x 28 with one channel)
    x_train = x_train.reshape(x_train.shape[0], 1, 28, 28).astype('float32')
    x_test = x_test.reshape(x_test.shape[0], 1, 28, 28).astype('float32')
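The reshape step can be tried without downloading anything. The following is a minimal sketch, not the dataset authors' loading code: it assumes that each image arrives as a flat row of 28 * 28 = 784 pixel values, uses random stand-in arrays instead of real ARDIS files, and introduces a hypothetical helper named `to_channels_first`. The 6600/1000 sample counts follow the split suggested above.

```python
import numpy as np

# Stand-in data: random values in place of the real ARDIS arrays.
# Assumption: each image is stored as a flat row of 28 * 28 = 784 pixels.
rng = np.random.default_rng(0)
x_train = rng.integers(0, 256, size=(6600, 784))
x_test = rng.integers(0, 256, size=(1000, 784))


def to_channels_first(x, height=28, width=28):
    """Hypothetical helper: reshape flat rows to
    [samples][channels][height][width] and convert to float32."""
    return x.reshape(x.shape[0], 1, height, width).astype('float32')


x_train = to_channels_first(x_train)
x_test = to_channels_first(x_test)

print(x_train.shape, x_train.dtype)  # (6600, 1, 28, 28) float32
print(x_test.shape, x_test.dtype)    # (1000, 1, 28, 28) float32
```

Note that `x.shape` is a tuple, so the sample count has to be taken as `x.shape[0]`. If the framework in use expects channels last, the same data can be reshaped to `(x.shape[0], 28, 28, 1)` instead.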
V. Feedback or Comments
Blekinge Institute of Technology