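| Graduate student: | 黃怡庭 Yi-Ting Huang |
|---|---|
| Thesis title: | 不可靠偏標籤學習:新的資料集生成方法和新的解決框架 Unreliable Partial Label Learning: Novel Dataset Generation Method and Solution Frameworks |
| Advisor: | 陳弘軒 |
| Oral defense committee: | |
| Degree: | Master |
| Department: | College of Electrical Engineering and Computer Science - Department of Computer Science & Information Engineering |
| Publication year: | 2024 |
| Graduation academic year: | 112 |
| Language: | Chinese |
| Pages: | 53 |
| Keywords: | unreliable partial label learning, noisy partial label learning, contrastive learning, weakly supervised learning, classification, real-world label noise |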
Large, high-quality datasets are crucial for training deep neural networks (DNNs), but datasets collected in practice are often inaccurately or noisily labeled. To address this label uncertainty, researchers have turned their attention to Unreliable Partial Label Learning (UPLL), in which the candidate label set of an instance is not guaranteed to contain its true label. This setting is more realistic than traditional Partial Label Learning (PLL), which assumes the true label is always among the candidates.
Publicly available UPLL datasets are currently lacking, so previous studies have usually had to synthesize them artificially. This thesis proposes a novel method for generating UPLL datasets, called Candidate Label Inference Generation (CLIG). CLIG combines a model trained on a complete dataset with statistics from a self-collected dataset to generate candidate label sets that align with real-world labeling tendencies. Experimental results indicate that the candidate label sets generated by CLIG are more realistic and reasonable than those produced by previous methods.
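As a rough illustration of model-driven candidate generation, the sketch below builds a candidate label set per instance from a classifier's predicted class probabilities. It is not the CLIG procedure itself, which the abstract does not specify in detail. The function name `generate_candidate_sets`, the `drop_rate` parameter, and the rule of sampling each distractor in proportion to its predicted probability are all assumptions made for this example.

```python
import random


def generate_candidate_sets(probs, true_labels, drop_rate=0.1, seed=0):
    """Build one candidate label set per instance (illustrative sketch only).

    probs:       list of per-class probability lists, one row per instance,
                 e.g. softmax outputs of a model trained on the clean labels.
    true_labels: list of ground-truth class indices.
    drop_rate:   hypothetical probability of leaving the true label out,
                 which is what makes the resulting sets "unreliable".
    """
    rng = random.Random(seed)
    candidate_sets = []
    for row, y in zip(probs, true_labels):
        top = max(row)
        # Assumption: a distractor class enters the set more often the more
        # probability the model assigns to it, relative to the top class.
        cands = {j for j, p in enumerate(row) if j != y and rng.random() < p / top}
        # Keep the true label unless it is dropped.
        if rng.random() >= drop_rate:
            cands.add(y)
        # Never return an empty set: fall back to the most probable distractor.
        if not cands:
            others = [j for j in range(len(row)) if j != y]
            cands.add(max(others, key=lambda j: row[j]))
        candidate_sets.append(cands)
    return candidate_sets


probs = [[0.7, 0.2, 0.1], [0.1, 0.8, 0.1], [0.3, 0.3, 0.4]]
labels = [0, 1, 2]
print(generate_candidate_sets(probs, labels, drop_rate=0.3))
```

Setting `drop_rate=0.0` gives ordinary PLL data, where every candidate set contains the true label. Any positive value gives the unreliable setting.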
Additionally, this thesis introduces two UPLL frameworks: Feature Alignment Pseudo-Target Learning (FAPT) and Feature Alignment Temporarily Expanded Labels Set (FATEL). Both frameworks use contrastive learning to improve feature extraction and supervised learning for classification. FAPT converts each candidate label set into a pseudo-target and updates it at the end of each epoch, whereas FATEL temporarily adds one label that may be the true label to the candidate label set before each epoch ends. Experimental results show that FAPT and FATEL outperform current state-of-the-art UPLL methods on multiple image datasets.
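The two per-epoch label updates described above can be sketched in simplified form as follows. This is not the thesis implementation: the contrastive feature-alignment component is omitted, and both update rules are assumptions chosen for illustration. The pseudo-target rule renormalizes the model's probabilities over the candidate labels, and the expansion rule adds the single most confident label outside the candidate set. The function names `update_pseudo_targets` and `temporarily_expand` are hypothetical.

```python
def update_pseudo_targets(probs, candidate_set):
    """Pseudo-target sketch: renormalize model probabilities over candidates.

    Labels outside the candidate set receive zero weight.
    """
    num_classes = len(probs)
    mass = sum(probs[j] for j in candidate_set)
    if mass == 0:
        # Degenerate case: fall back to a uniform target over the candidates.
        return [1.0 / len(candidate_set) if j in candidate_set else 0.0
                for j in range(num_classes)]
    return [probs[j] / mass if j in candidate_set else 0.0
            for j in range(num_classes)]


def temporarily_expand(probs, candidate_set):
    """Expansion sketch: return a copy of the candidate set plus the most
    confident label outside it. The original set is left unchanged, so the
    expansion only lasts as long as the caller keeps the returned copy.
    """
    outside = [j for j in range(len(probs)) if j not in candidate_set]
    if not outside:
        return set(candidate_set)
    best = max(outside, key=lambda j: probs[j])
    return set(candidate_set) | {best}


model_probs = [0.5, 0.3, 0.2]   # hypothetical model output for one instance
candidates = {1, 2}             # the true label (class 0) is missing
print(update_pseudo_targets(model_probs, candidates))
print(temporarily_expand(model_probs, candidates))
```

In this example the pseudo-target can never place weight on class 0, because it is restricted to the given candidates. The expanded copy does include class 0, which shows how a temporary expansion could let a missing true label back into training.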