Radiotekhnika
Publishing house Radiotekhnika

"Publishing house Radiotekhnika":
scientific and technical literature.
Books and journals of publishing houses: IPRZHR, RS-PRESS, SCIENCE-PRESS


Тел.: +7 (495) 625-9241

 

The application of meeting participant registration method in intelligent room

Keywords:

Al. L. Ronzhin – Ph.D. (Eng.), Research Scientist, Laboratory of Speech and Multimodal Interfaces, St. Petersburg Institute for Informatics and Automation of the Russian Academy of Sciences (SPIIRAS). E-mail: ronzhinal@iias.spb.su


At designing of intelligent rooms for meetings, lectures, scientific and educational events the following methods of audio- and video signals processing are used most often: 1) detection and tracking of users based on video monitoring; 2) determination of the position, recognition and identification of their faces; 3) sound source localization; 4) speech recognition; 5) identification of a speaker; 6) speech synthesis. The paper presents a method for automatic registration of the participants, based on multistage procedures for monitoring zones of the room, which provides detecting of participants position, recording audiovisual data, which are needed for identification of them. Experimental evaluation of the method was carried out on a database with more than 55 thousand photos of 36 participants. During experiments three methods of face recognition LBPH, PCA and LDA were compared. Conditions at the experiments carrying out (different distances from the camera to a participant, lighting, mobility of participants while taking pictures), affecting on quality and quantity of extracted facial features from an image, which is directly effect on the accuracy of recognition and occurrence of false positives, the LBPH method shows best result 79.5% was achieved by using the LBPH method and the lowest false positive rate 1.3% was obtained with the PCA method.
References:

  1. Fillinger A., Hamchi I., Degré S., Diduch L., Rose T., Fiscus J., Stanford V. Middleware and Metrology for the Pervasive Future // IEEE Pervasive Computing Mobile and Ubiquitous Systems. 2009. V. 8. № 3. R. 74-83.
  2. Nakashima H., Aghajan H. K., Augusto J. C. Handbook of Ambient Intelligence and Smart Environments. Springer. 2010. 1294 p.
  3. Yusupov R.M., Ronzhin A.L. Ot umnykh priborov k intellektual'nomu prostranstvu // Vestnik Rossiyskoy Akademii Nauk: nauchnyy i obshchestvenno-politicheskiy zhurnal. 2010. T. 80. Vyp. 1. C. 45-51.
  4. Aldrich F. Smart Homes: Past, Present and Future / Inside the Smart Home / Ed. R. Harper. London: Springer-Verlag. 2003. R. 17-39.
  5. Lampi F. Automatic Lecture Recording. Dissertation. The University of Mannheim, Germany. 2010. 229 p.
  6. Calonder M., Lepetit V., Fua P. BRIEF: Binary Robust Independent Elementary Features // Computer Vision – ECCV’10. 2010. R. 778-792.
  7. Ekenel1 H.K., Fischer M., Jin Q., Stiefelhagen R. Multi-modal Person Identification in a Smart Environment // Proc. of the Computer Vision and Pattern Recognition, CVPR '07. 2007. R. 1(8.
  8. Ronzhin A.L., Karpov A.A. Sravnenie metodov lokalizatsii pol'zovatelya mnogomodal'noy sistemy po ego rechi // Izvestiya vuzov. Priborostroenie. 2008. T. 51. № 11. S. 41-47.
  9. Zhang C., Yin P., Rui Y., Cutler R., Viola P., Sun X., Pinto N., Zhang Z. Boosting-Based Multimodal Speaker Detection for Distributed Meeting Videos // IEEE Trans. on Multimedia. 2008. V. 10. № 8. R. 1541-1552.
  10. Ronzhin A.L. Topologicheskie osobennosti morfofonemnogo sposoba predstavleniya slovarya dlya raspoznavaniya russkoy rechi // Vestnik komp'yuternykh i informatsionnykh tekhnologiy. 2008. № 9. S. 12-19.
  11. Kipyatkova I.S., Karpov A.A. Analiticheskiy obzor sistem raspoznavaniya russkoy rechi s bol'shim slovarem // Trudy SPIIRAN. 2010. Vyp. 12. S. 7-20.
  12. Imseng D.; Friedland G. Tuning-Robust Initialization Methods for Speaker Diarization // IEEE Transactions on Audio, Speech, and Language Processing. 2010. V. 18. № 8. R. 2028-2037.
  13. Lobanov B.M., Tsirul'nik L.I., Zhelezny M., Krnoul Z., Ronzhin A., Karpov A. Sistema audiovizual'nogo sinteza russkoy rechi // Informatika. 2008. № 4(20). S. 67-78.
  14. Schneiderman H., Kanade T. Object detection using the statistic of parts // International Journal of Computer Vision. 2004. V. 56(3). R. 151-177.
  15. Abate A. F., Nappi M., Riccio D. and Sabatino G. 2D and 3D face recognition: A survey // Pattern Recognition Letters. 2007. V. 28. № 14. R. 1885–1906.
  16. Gorodnichy D. Video-Based Framework for Face Recognition in Video // Second Workshop on Face Processing in Video (FPiV'05) in Proceedings of Second Canadian Conference on Computer and Robot Vision (CRV'05). 2005.
  17. Castrill´on-Santana M., D´eniz-Su´arez O., Guerra-Artal C., Hern´andez-Tejera M. Real-time Detection of Faces in Video Streams // Second Canadian Conference on Computer and Robot Vision (CRV’05). 2005. R. 298-305.
  18. Dorogiy Ya.Yu. Postroenie oftal'mogeometricheskogo klassifikatora dlya zadachi raspoznavaniya cheloveka po litsu // Informatsionnye tekhnologii. Radioelektronika. Telekommunikatsii. 2012. T. 2. № 2. S. 24-33.
  19. Kozlov P.V., Lipin Yu.N., Yuzhakov A.A. Algoritm raspoznavaniya litsa cheloveka // Voprosy zashchity informatsii. 2011. № 1. S. 52-57.
  20. Yushchenkova D.V., Meshcheryakov B.G. Raspoznavanie otdel'nykh chert litsa kak osnova uznavaniya tselogo litsa // Eksperimental'naya psikhologiya. 2010. № 3. S. 84-92.
  21. Kashapova L.Kh., Latysheva Ye.Yu., Spiridonov I.N. Algoritm raspoznavaniya emotsional'nogo sostoyaniya po izobrazheniyam litsa s ispol'zovaniem diskriminantnogo analiza i fil'trov gabora // Meditsinskaya tekhnika. 2012. № 3. S. 1-4.
  22. Ul'yanov S.V., Petrov S.P. Kvantovoe raspoznavanie lits i kvantovaya vizual'naya kriptografiya: modeli i algoritmy // Sistemnyy analiz v nauke i obrazovanii. 2012. № 1(15). S. 160-176.
  23. Kukharev G.A., Shchegoleva N.L. Algoritmy dvumernogo analiza glavnykh komponent dlya zadach raspoznavaniya izobrazheniy lits // Komp'yuternaya optika. 2010. T. 34. № 4. S. 545-551.
  24. Kukharev G.A., Kamenskaya Ye.I. Dvumernyy kanonicheskiy korrelyatsionnyy analiz v prilozhenii k obrabotke izobrazheniy lits // Izvestiya SPbGETU «LETI». 2010. № 1. S. 23-28.
  25. Kozin N.Ye., Fursov V.A. Postroenie klassifikatorov dlya raspoznavaniya lits na osnove pokazateley sopryazhennosti // Komp'yuternaya optika. 2005. № 28. S. 160-163.
  26. Tropchenko A.A., Tropchenko A.Yu. Neyrosetevye metody identifikatsii cheloveka po izobrazheniyu litsa // Izvestiya vysshikh uchebnykh zavedeniy. Priborostroenie. 2012. T. 55. № 10. S. 31-36.
  27. Zegzhda D.P., Moskvin D.A., Bosov Yu.O. Raspoznavanie obrazov na osnove fraktal'nogo szhatiya // Problemy informatsionnoy bezopasnosti. Komp'yuternye sistemy. 2012. № 2. S. 86-90.
  28. Druki A.A. Sistema poiska, vydeleniya i raspoznavaniya lits na izobrazheniyakh // Izvestiya Tomskogo politekhnicheskogo universiteta. 2011. T. 318. № 5. S. 64-70.
  29. Petruk V., Samorodov A.V., Spiridonov I.N. Primenenie lokal'nykh binarnykh shablonov k resheniyu zadachi raspoznavaniya lits // Vestnik Moskovskogo gosudarstvennogo tekhnicheskogo universiteta im. N.E. Baumana. Seriya: Priboro­stroenie. 2011. № 5. S. 58-63.
  30. Turk M.A., Pentland A.P. Face recognition using eigenfaces // IEEE Conference on Computer Vision and Pattern Recognition (CVPR. 1991. R. 586-591.
  31. Belhumeur P.N., Hespanha J., Kriegman D. Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection // IEEE Transactions on Pattern Analysis and Machine Intelligence 1997. V. 19. № 7. R. 711-720.
  32. Ahonen T., Hadid A., Pietikainen M. Face Recognition with Local Binary Patterns // Computer Vision (ECCV 2004. 2004. R. 469–481.
  33. Yusupov R.M., Ronzhin A.L., Prishchepa M.V., Ronzhin Al.L. Modeli i programmno-apparatnye resheniya avtomatizirovannogo upravleniya intellektual'nym zalom // Avtomatika i telemekhanika. 2011. № 7. S. 39-49.
  34. Ronzhin A.L., Budkov V.Yu., Ronzhin Al.L. Tekhnologii formirovaniya audiovizual'nogo interfeysa sistemy telekonferentsiy // Avtomatizatsiya i sovremennye tekhnologii. 2011. № 5. S. 20-26.
  35. Ronzhin Al.L., Ronzhin An.L. Sistema audiovizual'nogo monitoringa uchastnikov soveshchaniya v intellektual'nom zale // Doklady TUSURa. 2011. № 1 (22). Ch. 1. S. 153-157.
  36. Ronzhin Al.L., Budkov V.Yu., Ronzhin An.L. Formirovanie profilya pol'zovatelya na osnove audiovizual'nogo analiza situatsii v intellektual'nom zale soveshchaniy // Trudy SPIIRAN. 2012. Vyp. 23. S. 482-494.

Sept. 2, 2020
Aug. 27, 2020
June 24, 2020

© Издательство «РАДИОТЕХНИКА», 2004-2017            Тел.: (495) 625-9241                   Designed by [SWAP]Studio