abstractive summarization;
legal case summarization;
keyword extraction;
D O I:
10.1145/3594536.3595120
中图分类号:
TP18 [人工智能理论];
学科分类号:
081104 ;
0812 ;
0835 ;
1405 ;
摘要:
Since state-of-the-art machine learning models like Transformers can not handle long text well, the quality of the summarization of legal documents is still not desirable. In order to improve the ability of machine learning models to understand the context of a long document, we introduce the keywords into the models to guide the summarization to locate and capture key information from long documents such as legal cases. Different from other works leveraging keywords to enhance the model, we further investigate how keyword quality impacts summarization. To improve the performance of the summarization, we also explore different methods for effectively encoding exceptionally lengthy documents and models for keyword extraction. The experiment results demonstrated that the keywords-based augmentation method is effective for summarization and higher-quality keywords can enhance the summarization models.
机构:
Hokkaido Univ, Grad Sch Informat Sci & Technol, Kita Ku, N14 W9, Sapporo, Hokkaido, JapanHokkaido Univ, Grad Sch Informat Sci & Technol, Kita Ku, N14 W9, Sapporo, Hokkaido, Japan
Aoki, Yasuhiro
论文数: 引用数:
h-index:
机构:
Yoshioka, Masaharu
Suzuki, Youta
论文数: 0引用数: 0
h-index: 0
机构:
Hokkaido Univ, Grad Sch Informat Sci & Technol, Kita Ku, N14 W9, Sapporo, Hokkaido, JapanHokkaido Univ, Grad Sch Informat Sci & Technol, Kita Ku, N14 W9, Sapporo, Hokkaido, Japan