Information extraction from multimedia web documents: an open-source platform and testbed

被引:1
|
作者
Dupplaw, David Paul [1 ]
Matthews, Michael [2 ]
Johansson, Richard [3 ]
Boato, Giulia [3 ]
Costanzo, Andrea [4 ]
Fontani, Marco [4 ]
Minack, Enrico [5 ]
Demidova, Elena [5 ]
Blanco, Roi [2 ]
Griffiths, Thomas [6 ]
Lewis, Paul [1 ]
Hare, Jonathon [1 ]
Moschitti, Alessandro [3 ]
机构
[1] Univ Southampton, Southampton, Hants, England
[2] Barcelona Media, Barcelona, Spain
[3] Univ Trento, Trento, Italy
[4] CNIT, Florence, Italy
[5] Leibniz Univ Hannover, Hannover, Germany
[6] SORA, Vienna, Austria
关键词
Multimedia retrieval; Web analysis; Text analysis; Opinion analysis; Image analysis; Open-source software;
D O I
10.1007/s13735-014-0051-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The LivingKnowledge project aimed to enhance the current state of the art in search, retrieval and knowledge management on the web by advancing the use of sentiment and opinion analysis within multimedia applications. To achieve this aim, a diverse set of novel and complementary analysis techniques have been integrated into a single, but extensible software platform on which such applications can be built. The platform combines state-of-the-art techniques for extracting facts, opinions and sentiment from multimedia documents, and unlike earlier platforms, it exploits both visual and textual techniques to supportmultimedia information retrieval. Foreseeing the usefulness of this software in the wider community, the platform has been made generally available as an open-source project. This paper describes the platform design, gives an overviewof the analysis algorithms integrated into the system and describes two applications that utilise the system for multimedia information retrieval.
引用
收藏
页码:97 / 111
页数:15
相关论文
共 50 条
  • [21] Open-source Defect Injection Benchmark Testbed for the Evaluation of Testing
    Bures, Miroslav
    Herout, Pavel
    Ahmed, Bestoun S.
    2020 IEEE 13TH INTERNATIONAL CONFERENCE ON SOFTWARE TESTING, VALIDATION AND VERIFICATION (ICST 2020), 2020, : 442 - 447
  • [22] Information Extraction from Multimedia Documents for e-Government Applications
    Amato, F.
    Mazzeo, A.
    Moscato, V.
    Picariello, A.
    INFORMATION SYSTEMS: PEOPLE, ORGANIZATIONS, INSTITUTIONS, AND TECHNOLOGIES, 2010, : 101 - 108
  • [23] From Database to Web Multimedia Documents
    Philippe Mulhem
    Hervé Martin
    Multimedia Tools and Applications, 2003, 20 : 263 - 282
  • [24] From database to web multimedia documents
    Mulhem, P
    Martin, H
    MULTIMEDIA TOOLS AND APPLICATIONS, 2003, 20 (03) : 263 - 282
  • [25] OpenBCI Meets the Web: A Scalable, Customizable Platform for Open-Source EEG Data Collection
    Stingl, Lukas
    Knierim, Michael T.
    COMPANION OF THE 2024 ACM INTERNATIONAL JOINT CONFERENCE ON PERVASIVE AND UBIQUITOUS COMPUTING, UBICOMP COMPANION 2024, 2024, : 925 - 929
  • [26] Prototype of an open-source web-GIS platform for rapid disaster impact assessment
    Olyazadeh, Roya
    Aye, Zar Chi
    Jaboyedoff, Michel
    Derron, Marc-Henri
    SPATIAL INFORMATION RESEARCH, 2016, 24 (03) : 203 - 210
  • [27] Information extraction from semi-structured web documents
    Yun, Bo-Hyun
    Seo, Chang-Ho
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, 2006, 4092 : 586 - 598
  • [28] Collaborative Information Extraction and Mining from Multiple Web Documents
    Wong, Tak-Lam
    Lam, Wai
    Chan, Shing-Kit
    PROCEEDINGS OF THE SIXTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, 2006, : 442 - 452
  • [29] Extraction of Information from Public Health Emergency Web Documents
    Wang, Li
    Zhang, Yuanpeng
    Qian, Danmin
    Yao, Min
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTOMATION, MECHANICAL CONTROL AND COMPUTATIONAL ENGINEERING, 2015, 124 : 765 - 770
  • [30] Accessible from the open web: a qualitative analysis of the available open-source information involving cyber security and critical infrastructure
    Zhang, Yuxuan
    Frank, Richard
    Warkentin, Noelle
    Zakimi, Naomi
    JOURNAL OF CYBERSECURITY, 2022, 8 (01):