50 entries in total
- [1] Intelligent text extraction from PDF documents. INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE FOR MODELLING, CONTROL & AUTOMATION JOINTLY WITH INTERNATIONAL CONFERENCE ON INTELLIGENT AGENTS, WEB TECHNOLOGIES & INTERNET COMMERCE, VOL 2, PROCEEDINGS, 2006: 2-+
- [2] Extracting Body Text from Academic PDF Documents for Text Mining. PROCEEDINGS OF THE 12TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT (KDIR), VOL 1, 2020: 235-242
- [5] Improved Text Extraction from PDF Documents for Large-Scale Natural Language Processing. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, CICLING 2014, PT I, 2014, 8403: 102-112
- [6] The Use of Historical Documents and Sound Recordings for the Study and Safeguarding of Endangered Languages. ENDANGERED LANGUAGES AND HISTORY, 2009: 27-32
- [8] Endangered Turkic Languages from China. ENDANGERED LANGUAGES OF THE CAUCASUS AND BEYOND, 2017, 15: 135-150
- [9] Automatic Recovery of Corrupted Font Encoding in PDF Documents Using CNN-Based Symbol Recognition with Language Model. 2018 13TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS), 2018: 121-126
- [10] A Benchmark and Evaluation for Text Extraction from PDF. 2017 ACM/IEEE JOINT CONFERENCE ON DIGITAL LIBRARIES (JCDL 2017), 2017: 99-108