Computing Idioms Frequency in Text Corpora

被引:0
|
作者
Busta, Jan [1 ]
机构
[1] Masaryk Univ, Fac Informat, Brno, Czech Republic
来源
RASLAN 2008: RECENT ADVANCES IN SLAVONIC NATURAL LANGUAGE PROCESSING: SECOND WORKSHOP | 2008年
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The idioms are phrases which meaning is not composed from the meanings of each word in the phrase. This is one of the natural examples of violating the principle of compositionality that means that idioms are in area of natural language processing problem of meaning mining. To count the frequency of phrases such idioms in corpora has one big aim: To get to know which phrases we use often and which less. We do it to be able to start with getting the meaning of the whole phrases not just each word. This improves the understanding natural language.
引用
收藏
页码:71 / 74
页数:4
相关论文
共 50 条
  • [1] IDIOMS AND COMPUTER CORPORA
    Pastae, Veronica
    QUALITY AND EFFICIENCY IN E-LEARNING, VOL 3, 2013, : 322 - 327
  • [2] Frequency of Low-Frequency Words in Text Corpora
    Rychly, Pavel
    RASLAN 2010: RECENT ADVANCES IN SLAVONIC NATURAL LANGUAGE PROCESSING, 2011, : 53 - 58
  • [3] About corpora, intuitions, and idioms
    Temmerman, Tanja
    Van Craenenbroeck, Jeroen
    TIJDSCHRIFT VOOR NEDERLANDSE TAAL-EN LETTERKUNDE, 2019, 135 (04): : 453 - 464
  • [4] Computing Preset Dictionaries from Text Corpora for the Compression of Messages
    Abel, Marc W.
    Chung, Soon M.
    2014 International Conference on Data and Software Engineering (ICODSE), 2014,
  • [5] Idioms in Types of Text
    Mieder, Wolfgang
    NEUPHILOLOGISCHE MITTEILUNGEN, 2013, 114 (03) : 376 - 379
  • [6] Needles and haystacks, idioms and corpora: Gaining insights into idioms, using corpus analysis
    Moon, R
    PERFECT LEARNERS' DICTIONARY (?), 1999, 95 : 265 - 281
  • [7] Modus Questions: Query Models and Frequency in Russian Text Corpora
    Kazakovskaya, Victoria V.
    Khokhlova, Maria V.
    RASLAN 2014: RECENT ADVANCES IN SLAVONIC NATURAL LANGUAGE PROCESSING, 2014, : 49 - 55
  • [8] On the Assessment of Text Corpora
    Pinto, David
    Rosso, Paolo
    Jimenez-Salazar, Hector
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, 2010, 5723 : 281 - +
  • [9] A new computing method for extracting contiguous phraseological sequences from academic text corpora
    Wei, Naixing
    Li, Jingjie
    INTERNATIONAL JOURNAL OF CORPUS LINGUISTICS, 2013, 18 (04) : 506 - 535
  • [10] Text introducers of proverbs and other idioms
    Cermak, Frantisek
    JEZIKOSLOVLJE, 2005, 6 (01): : 57 - 77