Self-Switching Classification Framework for Titled Documents

Authors
Hang Guo
Li-Zhu Zhou
Ling Feng
Affiliations
[1] EMC Research China
[2] Department of Computer Science and Technology, Tsinghua University
Keywords
text analysis; machine learning; Web text analysis
DOI
Not available
Abstract
Ambiguous words are words that have multiple meanings, such as "apple" and "window". In text classification they are usually removed by feature reduction methods such as Information Gain. Sometimes, however, a corpus contains so many ambiguous words that discarding all of them is not viable, as is the case when classifying documents from the Web. In this paper we look for a method to classify titled documents with the help of ambiguous words. Titled documents have a simple structure consisting of a title and an excerpt; news items, messages, and paper abstracts with titles are examples. Instead of introducing another feature reduction method, we describe a framework that makes the best use of the ambiguous words in titled documents. The framework improves the performance of a traditional bag-of-words classifier with the help of a bag-of-word-pairs classifier. As an example, the framework is implemented using one of the most popular classifiers, Multinomial Naive Bayes (MNB). Experiments with three real-life datasets show that, within our framework, the MNB model performs much better than both a traditional MNB classifier and a naive weighted algorithm that simply puts more weight on words in the title.
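The naive weighted baseline mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not specify the switching rule or how word pairs are built, so only the title-weighted bag-of-words MNB baseline is shown. The names `featurize` and `TinyMNB`, the weight value `title_weight=2.0`, the Laplace smoothing constant, and the toy training data are all assumptions made for illustration, not details taken from the paper.

```python
import math
from collections import Counter, defaultdict


def featurize(title, excerpt, title_weight=2.0):
    """Bag-of-words counts; each title token counts title_weight times.

    title_weight=2.0 is an assumed value, not one reported in the paper.
    """
    counts = Counter()
    for tok in title.lower().split():
        counts[tok] += title_weight
    for tok in excerpt.lower().split():
        counts[tok] += 1.0
    return counts


class TinyMNB:
    """Hypothetical minimal Multinomial Naive Bayes with Laplace smoothing."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha

    def fit(self, docs, labels):
        self.class_docs = Counter(labels)        # documents per class
        self.word_counts = defaultdict(Counter)  # weighted word counts per class
        self.vocab = set()
        for feats, y in zip(docs, labels):
            self.word_counts[y].update(feats)
            self.vocab.update(feats)
        self.totals = {y: sum(c.values()) for y, c in self.word_counts.items()}
        return self

    def predict(self, feats):
        n = sum(self.class_docs.values())
        v = len(self.vocab)

        def score(y):
            # log prior + weighted sum of smoothed log likelihoods
            s = math.log(self.class_docs[y] / n)
            for w, c in feats.items():
                if w in self.vocab:  # ignore words never seen in training
                    p = (self.word_counts[y][w] + self.alpha) / (
                        self.totals[y] + self.alpha * v
                    )
                    s += c * math.log(p)
            return s

        return max(self.class_docs, key=score)


# Toy titled documents (invented for illustration): (title, excerpt, label)
train = [
    ("Apple unveils new phone", "the company showed a device with a faster chip", "tech"),
    ("Apple pie recipe", "bake the fruit with sugar and butter", "food"),
    ("Windows update released", "the software patch fixes security bugs", "tech"),
    ("Fresh fruit salad", "mix apple and banana with honey", "food"),
]
clf = TinyMNB().fit([featurize(t, e) for t, e, _ in train], [y for _, _, y in train])
prediction = clf.predict(featurize("Apple chip", "a faster device"))
```

Here the ambiguous word "apple" appears in both classes, so the prediction depends on the surrounding words. Weighting the title only amplifies whatever the title tokens already suggest, which is the limitation the abstract attributes to this baseline.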
Pages: 615-625
Page count: 10
Source: Journal of Computer Science and Technology, 2009, 24 (04): 615-625