The Document Components Ontology (DoCO)

被引:47
作者
Constantin, Alexandru [1 ]
Peroni, Silvio [2 ,3 ]
Pettifer, Steve [4 ]
Shotton, David [5 ]
Vitali, Fabio [2 ]
机构
[1] Ecole Polytech Fed Lausanne, PFL IC IIF LSIR, BC 159,Batiment BC, CH-1015 Lausanne, Switzerland
[2] Univ Bologna, Dept Comp Sci & Engn, Mura Anteo Zamboni 7, I-40127 Bologna, BO, Italy
[3] CNR, Inst Cognit Sci & Technol, Semant Technol Lab, Via Nomentana 56, I-00161 Rome, RM, Italy
[4] Univ Manchester, Sch Comp Sci, Manchester M13 9PL, Lancs, England
[5] Univ Oxford, Oxford E Res Ctr, 7 Keble Rd, Oxford OX1 3QG, England
关键词
DEO; DoCO; PDFX; SPAR ontologies; Utopia Documents; document components; rhetoric; structural patterns; SWAN;
D O I
10.3233/SW-150177
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The availability in machine-readable form of descriptions of the structure of documents, as well as of the document discourse (e.g. the scientific discourse within scholarly articles), is crucial for facilitating semantic publishing and the overall comprehension of documents by both users and machines. In this paper we introduce DoCO, the Document Components Ontology, an OWL 2 DL ontology that provides a general-purpose structured vocabulary of document elements to describe both structural and rhetorical document components in RDF. In addition to giving a formal description of the ontology, this paper showcases its utility in practice in a variety of our own applications and other activities of the Semantic Publishing community that rely on DoCO to annotate and retrieve document components of scholarly articles.
引用
收藏
页码:167 / 181
页数:15
相关论文
共 41 条
[1]  
[Anonymous], 2010, DOCBOOK 5 DEFINITIVE
[2]   Utopia documents: linking scholarly literature with research data [J].
Attwood, T. K. ;
Kell, D. B. ;
McDermott, P. ;
Marsh, J. ;
Pettifer, S. R. ;
Thorne, D. .
BIOINFORMATICS, 2010, 26 (18) :i568-i574
[3]  
Bartalesi V., 2013, P 2013 WORKSH COLL A, DOI [10.1145/2517978.2517983, DOI 10.1145/2517978.2517983]
[4]  
Beck J., 2010, P INT S XML LONG HAU, DOI DOI 10.4242/BALISAGEVOL6.BECK01
[5]  
Ciccarese P., 2011, ONTOLOGY RHETORICAL
[6]   The SWAN biomedical discourse ontology [J].
Ciccarese, Paolo ;
Wu, Elizabeth ;
Wong, Gwen ;
Ocana, Marco ;
Kinoshita, June ;
Ruttenberg, Alan ;
Clark, Tim .
JOURNAL OF BIOMEDICAL INFORMATICS, 2008, 41 (05) :739-751
[7]   The Collections Ontology: Creating and handling collections in OWL 2 DL frameworks [J].
Ciccarese, Paolo ;
Peroni, Silvio .
SEMANTIC WEB, 2014, 5 (06) :515-529
[8]   CiTO plus SWAN: The web semantics of bibliographic records, citations, evidence and discourse relationships [J].
Ciccarese, Paolo ;
Shotton, David ;
Peroni, Silvio ;
Clark, Tim .
SEMANTIC WEB, 2014, 5 (04) :295-311
[9]  
Constantin A., 2014, THESIS U MANCHESTER
[10]  
Constantin Alexandru., 2013, Proceedings of the 2013 ACM symposium on Document engineering, P177, DOI DOI 10.1145/2494266.2494271