Second-Order Text Matching Algorithm for Agricultural Text

Authors
Sun, Xiaoyang [1]
Song, Yunsheng [1, 2]
Huang, Jianing [1]
Affiliations
[1] Shandong Agr Univ, Sch Informat Sci & Engn, Tai An 271018, Peoples R China
[2] Minist Agr & Rural Affairs, Key Lab Huang Huai Hai Smart Agr Technol, Tai An 271018, Peoples R China
Source
APPLIED SCIENCES-BASEL | 2024 / Vol. 14 / Issue 16
Keywords
natural language processing; deep learning; text matching; agriculture text;
DOI
10.3390/app14167012
Chinese Library Classification number
O6 [Chemistry];
Subject classification code
0703;
Abstract
Text matching advances the research and application of deep text understanding, and it underpins information retrieval, recommendation systems, and natural language processing by exploring similar structures in text data. Owing to their outstanding performance and their ability to extract text features automatically for the target task, methods based on pre-trained models have gradually become mainstream. However, such models usually suffer from slow retrieval speed and low running efficiency. Moreover, previous text matching algorithms have mainly focused on horizontal-domain research, and relatively few vertical-domain algorithms exist for agricultural text, which needs further investigation. To address these issues, a second-order text matching algorithm is developed. This paper first collects a large amount of text about typical agricultural crops through web crawlers, relevant textbooks, and other sources, and constructs a database from it. The BM25 algorithm is then used to generate a candidate set, and a BERT model selects the optimal match from that candidate set. Experiments show that the Precision@1 of this second-order algorithm reaches 88.34% on the dataset constructed in this paper, and the average time to match a piece of text is only 2.02 s. Compared with the BERT model and the BM25 algorithm, Precision@1 increases by 8.81% and 13.73%, respectively. In terms of the average time required to match a text, the algorithm is 55.2 s faster than the BERT model and only 2 s slower than the BM25 algorithm. It can improve the efficiency and accuracy of agricultural information retrieval, agricultural decision support, agricultural market analysis, etc., and promote the sustainable development of agriculture.
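The two-stage idea described in the abstract (BM25 builds a small candidate set, then a stronger model picks the best match from it) can be sketched roughly as follows. This is an illustrative sketch, not the authors' implementation: the `BM25` class, the parameters `k1=1.5` and `b=0.75`, the candidate-set size `k=3`, and the toy corpus are all assumptions. In particular, `jaccard_scorer` is a deliberately simple stand-in for the BERT scoring step so that the example runs without any model download; a real system would presumably plug a BERT-based relevance scorer into the same slot.

```python
import math
from collections import Counter


def tokenize(text):
    # Naive whitespace tokenizer (assumption; real agricultural text would need more).
    return text.lower().split()


class BM25:
    """Minimal Okapi BM25 scorer over a small in-memory corpus."""

    def __init__(self, corpus, k1=1.5, b=0.75):  # k1 and b are assumed defaults
        self.docs = [tokenize(d) for d in corpus]
        self.k1, self.b = k1, b
        self.avgdl = sum(len(d) for d in self.docs) / len(self.docs)
        self.tfs = [Counter(d) for d in self.docs]
        n = len(self.docs)
        df = Counter(t for d in self.docs for t in set(d))
        # Smoothed IDF that stays positive even for very common terms.
        self.idf = {t: math.log(1 + (n - f + 0.5) / (f + 0.5)) for t, f in df.items()}

    def score(self, query, i):
        tf, dl = self.tfs[i], len(self.docs[i])
        total = 0.0
        for t in tokenize(query):
            if t not in tf:
                continue
            norm = tf[t] + self.k1 * (1 - self.b + self.b * dl / self.avgdl)
            total += self.idf[t] * tf[t] * (self.k1 + 1) / norm
        return total

    def top_k(self, query, k):
        # Stage 1: indices of the k highest-scoring documents.
        order = sorted(range(len(self.docs)),
                       key=lambda i: self.score(query, i), reverse=True)
        return order[:k]


def jaccard_scorer(query, doc):
    # Stand-in for the BERT relevance score (NOT the method from the paper).
    q, d = set(tokenize(query)), set(tokenize(doc))
    return len(q & d) / len(q | d) if q | d else 0.0


def two_stage_match(query, corpus, bm25, scorer=jaccard_scorer, k=3):
    # Stage 2: rescore only the BM25 candidates and return the best index.
    candidates = bm25.top_k(query, k)
    return max(candidates, key=lambda i: scorer(query, corpus[i]))


def precision_at_1(predictions, gold):
    # Fraction of queries whose top-ranked result is the correct one.
    return sum(p == g for p, g in zip(predictions, gold)) / len(gold)


# Toy data, invented for illustration only.
corpus = [
    "wheat rust is treated with fungicide in early spring",
    "corn borer larvae damage maize stalks in summer",
    "rice blast spreads quickly in humid paddy fields",
    "apple trees need pruning in late winter",
]
bm25 = BM25(corpus)
queries = ["how to treat wheat rust", "rice blast in humid fields"]
gold = [0, 2]
predictions = [two_stage_match(q, corpus, bm25) for q in queries]
print(predictions, precision_at_1(predictions, gold))
```

The design point is that the expensive scorer only ever sees `k` documents per query instead of the whole database, which is consistent with the abstract's report that the combined approach is far faster than running BERT alone while staying close to BM25 in latency.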
Pages: 20