Face Recognition Using Unlabeled Data

Martínez, Carmen; Fuentes, Olac

Services on Demand

Journal

Article

Indicators

Cited by SciELO
Access statistics

Computación y Sistemas

On-line version ISSN 2007-9737Print version ISSN 1405-5546

Comp. y Sist. vol.7 n.2 Ciudad de México Oct./Dec. 2003

Artículo

Face Recognition Using Unlabeled Data

Reconocimiento de Rostros usando Datos No Etiquetados

Carmen Martínez and Olac Fuentes

Instituto Nacional de Astrofísica, Óptica y Electrónica Luis Enrique Erro # 1 Santa Maria Tonanzintla, Puebla, 72840, México. E–mails: carmen@ccc.inaoep.mx ; fuentes@inaoep.mx

Abstract

Face recognition systems can normally attain good accuracy when they are provided with a large set of training examples. However, when a large training set is not available, their performance is commonly poor. In this work we describe a method for face recognition that achieves good results when only a very small training set is available (one image per person). The method is based on augmenting the original training set with previously unlabeled data (that is, face images for which the identity of the person is not known). Initially, we apply the well–known eigenfaces technique to reduce the dimensionality of the image space, then we perform an iterative process, classifying all the unlabeled data with an ensemble of classifiers built from the current training set, and appending to the training set the previously unlabeled examples that are believed to be correctly classified with a high confidence level, according to the ensemble.

We experimented with ensembles based on the k–nearest neighbors, feed forward artificial neural networks and locally weighted linear regression learning algorithms. Our experimental results show that using unlabeled data improves the accuracy in all cases. The best accuracy, 92.07%, was obtained with locally weighted linear regression using 30 eigenfaces and appending 3 examples of every class in each iteration. In contrast, using only labeled data, an accuracy of only 34.81% was obtained.

Resumen

Los sistemas de reconocimiento de rostros normalmente obtienen buenos resultados cuando tienen disponibles conjuntos de entrenamiento grandes. Sin embargo, cuando no hay un conjunto de entrenamiento grande disponible, su desempeño no es satisfactorio. En este trabajo presentamos un método para reconocimiento de rostros que obtiene buenos resultados cuando solo se tiene disponible un conjunto de entrenamiento pequeño (incluso una sola imagen por persona). El método se basa en expandir el conjunto de entrenamiento original usando datos no etiquetados previamente (esto es, imágenes de rostros con identidad desconocida). Inicialmente, aplicamos la técnica de eigenrostros para reducir la dimensionalidad del espacio de atributos, después realizamos un proceso iterativo, clasificando todos los datos no etiquetados con un ensamble de clasificadores construido a partir del conjunto de entrenamiento actual y agregando al conjunto de entrenamiento los ejemplos que han sido clasificados correctamente con un alto nivel de confianza, de acuerdo al ensamble.

Realizamos experimentos usando ensambles basados en el algoritmo de k vecinos más cercanos, redes neuronales artificiales, y regresión lineal localmente ponderada. Los resultados experimentales demuestran que el uso de datos no etiquetados mejora la clasificación en todos los casos. Los mejores resultados, con un porcentaje de clasificación correcta de 92.07, fueron obtenidos con regresión lineal localmente ponderada usando 30 eigenrostros y agregando 3 ejemplos de cada clase en cada iteración. Como comparación, usando únicamente los datos etiquetados, solo se clasificaron correctamente el 34.81% de los ejemplos.

DESCARGAR ARTÍCULO EN FORMATO PDF

References

1. C. M. Bishop. Neural Networks for Pattern Recognition. Oxford Universiy Press, Oxford, England, 1996. [ Links ]

2. A. Blum and T. Mitchell. Learning to classify text from labeled and unlabeled documents. Conference on Computational Learning Theory, 1998. [ Links ]

3. T. G. Dietterich. Machine learning research: Four current directions. The AI Magazine, 1997. [ Links ]

4. L. Fausett. Fundamentals of Neural Networks: Architectures, Algorithms and Aplications. Prentice– Hall, 1994. [ Links ]

5. Y. Freund, H. Seung, E. Shamir, and N. Tishby. Selective sampling using the query by committee algorithm. Machine Learning, 1997. [ Links ]

6. D. Graham and N. Allinson. Character–rising Virtual Eigensignatures for Face Recognition. In Face Recognition: From Theory to Applications. Springer–Verlag, 1998. [ Links ]

7. A. McCallum and K. Nigam. Employing EM in pool–based active learning for text classification. In Proceeding of the 15th International Conference on Machine Learning, 1998. [ Links ]

8. T. M. Mitchell. Machine Learning. McGraw–Hill, 1997. [ Links ]

9. K. Nigam, A. McCallum, S. Thrun, and T. Mitchell. Learning to classify text from labeled and unlabeled documents. Machine Learning, 1999. [ Links ]

10. T. Solorio. Using unlabeled data to improve classifier accuracy. Master's thesis, Computer Science Department, Instituto Nacional de Astrofísica, Óptica y Electrónica, 2002. [ Links ]

11. T. Solorio and O. Fuentes. Improving classifier accuracy using unlabeled data. International Conference on Artificial Intelligence and Applications, 2001. [ Links ]

12. M. A. Turk and A. P. Pentland. Face recognition using eigenfaces. Proc. of IEEE Conf. on Computer Vision and Pattern Recognition, pages 586–591, 1991. [ Links ]