Victorian400

Victorian400 Dataset

Victorian400 Dataset for Colorizing Victorian Illustrations | by Hoyeol Kim


Dataset: Victorian400 (Kaggle) (521mb)

Paper: Victorian400: Colorizing Victorian Illustrations

Here is example code for loading the Victorian400 dataset:

_URL = URL = '~/Victorian400.tar.gz'
path_to_zip = tf.keras.utils.get_file('Victorian400.tar.gz', origin=_URL, extract=True)
path1 = os.path.join(os.path.dirname(path_to_zip), 'Victorian400/')

Dataset

This dataset was originally created to colorize Charles Dickens’ illustrations. Due to a lack of data, however, the scope of this dataset has been expanded to all nineteenth century illustrations. Colorful images from the nineteenth century were selected to be used in the dataset, which were converted to black and white images for training data.

Victorian400

This dataset contains three different folders: original, resized, and gray. Images in the ‘original’ folder were curated for significant features, such as faces and bodies. In the ‘resized’ folder, the original images were resized to fit the 256*256 dimensions needed for the process of deep learning. Based on the resized images, I created the ‘gray’ folder, which contains black and white images converted from the colorful resized images.


The Victorian400 dataset was tested with pix2pix developed by Isola et al.

Test Results

Test Results


Citation

Please use the following reference to cite the dataset:

@article{Victorian400,
    title        = {Victorian400: Colorizing Victorian Illustrations},
    author       = {Kim, Hoyeol},
    journal      = {International Journal of Humanities and Arts Computing},
    publisher    = {Edinburgh University Press},
    volume       = {15},
    number       = {1-2},
    year         = {2021},
    pages        = {186-202},
    doi          = {10.3366/ijhac.2021.0269},
    }

or

Kim, Hoyeol, "Victorian400: Colorizing Victorian Illustrations." International Journal of Humanities and Arts Computing, vol. 15, no. 1-2, Oct. 2021, pp. 186-202. doi: 10.3366/ijhac.2021.0269.

Funding and Grant

In recognition of my work on the Victorian400 dataset, I received a $563 Project Development Grant from the Center of Digital Humanities Research, as well as Graduate Merit funding for $400 by the Department of English, Texas A&M University.


License

License