Coarse2Fine: Fine-grained Text Classification on Coarsely-grained Annotated Data

被引:0
|
作者
Mekala, Dheeraj [1 ]
Gangal, Varun [2 ]
Shang, Jingbo [1 ,3 ]
机构
[1] Univ Calif San Diego, Dept Comp Sci & Engn, San Diego, CA 92103 USA
[2] Carnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA 15213 USA
[3] Univ Calif San Diego, Halicioglu Data Sci Inst, San Diego, CA 92103 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing text classification methods mainly focus on a fixed label set, whereas many real-world applications require extending to new fine-grained classes as the number of samples per label increases. To accommodate such requirements, we introduce a new problem called coarse-to-fine grained classification, which aims to perform fine-grained classification on coarsely annotated data. Instead of asking for new fine-grained human annotations, we opt to leverage label surface names as the only human guidance and weave in rich pre-trained generative language models into the iterative weak supervision strategy. Specifically, we first propose a label-conditioned finetuning formulation to attune these generators for our task. Furthermore, we devise a regularization objective based on the coarse-fine label constraints derived from our problem setting, giving us even further improvements over the prior formulation. Our framework uses the fine-tuned generative models to sample pseudo-training data for training the classifier, and bootstraps on real unlabeled data for model refinement. Extensive experiments and case studies on two real-world datasets demonstrate superior performance over SOTA zero-shot classification baselines.
引用
收藏
页码:583 / 594
页数:12
相关论文
共 50 条
  • [31] Scratch Testing of Coarse-Grained and Ultra Fine-Grained Copper
    Filippov, A. V.
    Tarasov, S. Yu.
    Fortuna, S. V.
    Podgornyh, O. A.
    Shamarin, N. N.
    Filippova, E. O.
    PROCEEDINGS OF THE ADVANCED MATERIALS WITH HIERARCHICAL STRUCTURE FOR NEW TECHNOLOGIES AND RELIABLE STRUCTURES, 2018, 2051
  • [32] Fine-grained and coarse-grained entropy in problems of statistical mechanics
    Kozlov, V. V.
    Treshchev, D. V.
    THEORETICAL AND MATHEMATICAL PHYSICS, 2007, 151 (01) : 539 - 555
  • [33] Hierarchical classification based on coarse- to fine-grained knowledge transfer
    Qiu, Zeyu
    Hu, Minjie
    Zhao, Hong
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2022, 149 : 61 - 69
  • [34] Test-Time Amendment with a Coarse Classifier for Fine-Grained Classification
    Jain, Kanishk
    Karthik, Shyamgopal
    Gandhi, Vineet
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [35] Maximum Entropy Fine-Grained Classification
    Dubey, Abhimanyu
    Gupta, Otkrist
    Raskar, Ramesh
    Naik, Nikhil
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [36] Fine-Grained Scalable Streaming from Coarse-Grained Videos
    Ni, Pengpeng
    Eichhorn, Alexander
    Griwodz, Carsten
    Halvorsen, Pal
    NOSSDAV 09: 18TH INTERNATIONAL WORKSHOP ON NETWORK AND OPERATING SYSTEMS SUPPORT FOR DIGITAL AUDIO AND VIDEO, 2009, : 103 - 108
  • [37] Unsupervised Topic Modeling with BERTopic for Coarse and Fine-Grained News Classification
    Al Sayed, Mohamad
    Brasoveanu, Adrian M. P.
    Nixon, Lyndon J. B.
    Scharl, Arno
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2023, PT I, 2023, 14134 : 162 - 174
  • [38] Fine-Grained Classification with Noisy Labels
    Wei, Qi
    Feng, Lei
    Sun, Haoliang
    Wang, Ren
    Guo, Chenhui
    Yin, Yilong
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 11651 - 11660
  • [39] Learning to Navigate for Fine-Grained Classification
    Yang, Ze
    Luo, Tiange
    Wang, Dong
    Hu, Zhiqiang
    Gao, Jun
    Wang, Liwei
    COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 : 438 - 454
  • [40] Malware Visualization for Fine-Grained Classification
    Fu, Jianwen
    Xue, Jingfeng
    Wang, Yong
    Liu, Zhenyan
    Shan, Chun
    IEEE ACCESS, 2018, 6 : 14510 - 14523