Automated classification of paintings (2012) – Research Unit Virtual & Augmented Reality

Abstract

This thesis deals with the automated classification of paintings against the painting styles cubism, expressionism, impressionism, pointillism and renaissance painting by the means of a prototypical proof of concept system. A feature set has been developed which covers a wide range of feature extraction approaches such as edge-based, frequency-based or color-based features. Some approaches have been adopted from other research project where they have been applied on related problems. The classification of a painting for a human is a highly emotional and intuitive process. As a consequence it is difficult to operationalize and it contains a certain amount of unpredictability.
And indeed, it has turned out that some of the developed features behave as intended whereas others behave against intuition and expectation. It is, however, just this unpredictability which makes the underlying classification problem interesting. In order to find the best parameter tuple for each feature they have been optimized on a training set of 1000 images. In order to investigate how far the optimized feature set is reducible a principle component analysis has been applied on it.
Succeeding the feature optimization stage the classification algorithms C4.5 - decision tree, random forest, naive bayes, multilayer perceptron and k-nearest neighbor have been applied on the feature set. As it has been done in the feature optimization stage, a wide range of parameter tuples have been tested for each classification algorithm whereas a stratified 10 x 10 cross validation on the training set has been used as the measurement for the classification performance. The classifier which has shown the best results in the cross validation test has been used to classify an independent test set of 200 images. By this means it could be shown that the classification task can be solved in this particular domain. Further, it could be shown that a principle component analysis allows a reduction of the feature space to no more than 24% of the original size while still gaining approximately the same classification performance. Finally, it could be shown that the related retrieval problem is also worth investigating.

Reference

Gerger, R. (2012). Automated classification of paintings [Diploma Thesis, Technische Universität Wien]. reposiTUm. http://hdl.handle.net/20.500.12708/160589