| 研究生: |
江翰霖 Han-Lin Chiang |
|---|---|
| 論文名稱: |
人臉圖片生成與增益之可用性與效率探討分析 |
| 指導教授: | 柯士文 |
| 口試委員: | |
| 學位類別: |
碩士 Master |
| 系所名稱: |
管理學院 - 資訊管理學系在職專班 Executive Master of Information Management |
| 論文出版年: | 2020 |
| 畢業學年度: | 108 |
| 語文別: | 中文 |
| 論文頁數: | 49 |
| 中文關鍵詞: | 人臉辨識 、資料生成 、資料增益 |
| 外文關鍵詞: | Face detection, Data generation, Data augmentation |
| 相關次數: | 點閱:6 下載:0 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
人臉辨識一直是相當熱門的話題,而在近年機器學習技術的興起,更將人臉辨識推往更上一層樓。經由大量的數據做訓練,能夠有效的提升辨識準確度,因此如何更大量的收集人臉辨識訓練圖片,便是一門重要的課題。
本研究個案公司因應研發計畫,在人臉辨識專案,採用監督式學習方式,對人臉圖片進行訓練,以產生人臉偵測模組,因此常有大量人臉圖片的需求。在長期的需求之下,利用資料生成與資料增益,是相當適合的方式。
本研究針對人臉資料生成與人臉資料增益,探討兩種方式何者在產出人臉圖片較有效率,包含了產生圖片花費的時間以及產生的圖片可用性。
經過實驗後,可得知在兩者相較之下,兩者的可用性相似,並無太大差異,但在時間效率上,使用人臉資料增益的方式,遠比使用人臉生成的方式快上許多。因此,對本研究個案公司而言,使用人臉資料增益的方式產生人臉圖片,是較符合個案需求的方式。
Face detection has always been a very popular topic, and in recent years, the rise of machine learning technology has pushed face detection to a higher level.
Training through a large amount of data can effectively improve the detection accuracy, so how to collect a larger number of face detection training pictures is an important topic.
In response to the research and development plan, the case of A Company in this study adopted a supervised learning method in the face detection project to train the face pictures to generate face detection modules, so there is often a demand for a large number of face pictures. Under long-term demand, the use of data generation and data augmentation is a very suitable way.
This study focuses on face data generation and face data augmentation, and explores which of the two methods is more efficient in producing face images, including the time it takes to generate the images and the availability of the generated images.
Comparing the two, we can find that the two are similar in terms of usability, but in terms of time efficiency, the way of using face data augmentation is much faster than the way of using face generation.
After confirming the efficiency and availability, it will be of great help to the future development of A company in this research case.
1. Antoniou, A., Storkey, A., & Edwards, H. (2017). Data augmentation generative adversarial networks. arXiv preprint arXiv:1711.04340.
2. Buslaev, A., Iglovikov, V. I., Khvedchenya, E., Parinov, A., Druzhinin, M., & Kalinin, A. A. (2020). Albumentations: fast and flexible image augmentations. Information, 11(2), 125.
3. Chatfield, K., Simonyan, K., Vedaldi, A., & Zisserman, A. (2014). Return of the devil in the details: Delving deep into convolutional nets. arXiv preprint arXiv:1405.3531.
4. Chen, X., Duan, Y., Houthooft, R., Schulman, J., Sutskever, I., & Abbeel, P. (2016). Infogan: Interpretable representation learning by information maximizing generative adversarial nets. In Advances in neural information processing systems (pp. 2172-2180).
5. Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., ... & Bengio, Y. (2014). Generative adversarial nets. In Advances in neural information processing systems (pp. 2672-2680).
6. He, Z., Zuo, W., Kan, M., Shan, S., & Chen, X. (2019). Attgan: Facial attribute editing by only changing what you want. IEEE Transactions on Image Processing, 28(11), 5464-5478.
7. Iglovikov, V. Albumentations. Retrieved from GitHub: https://github.com/albumentations-team/albumentations (2020)
8. Jain, A. FaceGAN-Generating-Random-Faces. Retrieved from GitHub: https://github.com/adityajn105/FaceGAN-Generating-Random-Faces (2020)
9. Karras, T., Laine, S., Aila.S. StyleGAN - Official TensorFlow Implementation. Retrieved from GitHub: https://github.com/NVlabs/stylegan (2020).
10. Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2012). Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems (pp. 1097-1105).
11. Lv, J. J., Shao, X. H., Huang, J. S., Zhou, X. D., & Zhou, X. (2017). Data augmentation for face recognition. Neurocomputing, 230, 184-196.
12. Mirza, M., & Osindero, S. (2014). Conditional generative adversarial nets. arXiv preprint arXiv:1411.1784.
13. Radford, A., Metz, L., & Chintala, S. (2015). Unsupervised representation learning with deep convolutional generative adversarial networks. arXiv preprint arXiv:1511.06434.
14. Sajid, M., Ali, N., Dar, S. H., Iqbal Ratyal, N., Butt, A. R., Zafar, B., ... & Baig, S. (2018). Data augmentation-assisted makeup-invariant face recognition. Mathematical Problems in Engineering, 2018.
15. Shorten, C., & Khoshgoftaar, T. M. (2019). A survey on image data augmentation for deep learning. Journal of Big Data, 6(1), 60.
16. Surma, G. Celebrities Images. Retrieved from Kaggle: https://www.kaggle.com/greg115/celebrities-100k (2020).
17. Wang, X., Wang, K., & Lian, S. (2019). A survey on face data augmentation. arXiv preprint arXiv:1904.11685.
18. Zhang,R. BicyleGAN. Retrieved from GitHub:
https://github.com/junyanz/BicycleGAN (2020)
19. Zhu, J. Y., Park, T., Isola, P., & Efros, A. A. (2017). Unpaired image-to-image translation using cycle-consistent adversarial networks. In Proceedings of the IEEE international conference on computer vision (pp. 2223-2232).
20. Zhu, J. Y. CycleGAN. Retrieved from GitHub: https://junyanz.github.io/CycleGAN/ (2020)
21. Zoph, B., Cubuk, E. D., Ghiasi, G., Lin, T. Y., Shlens, J., & Le, Q. V. (2019). Learning data augmentation strategies for object detection. arXiv preprint arXiv:1906.11172.