A method for extracting data from semi-structured documents

A linguistic method for extracting data from weakly structured documents is developed, tested, and described in detail in the paper. Sample data were taken from the thesis catalogue of the Vernadsky National Library of Ukraine. The sequence of all stages is described: document collection...

Bibliographic Details
Date: 2020
Main Authors: Kudim, K.A., Proskudina, G.Yu.
Format: Article
Language: Russian
Published: Інститут програмних систем НАН України 2020
Online Access: https://pp.isofts.kiev.ua/index.php/ojs1/article/view/388
Journal Title: Problems in programming

Institution

Problems in programming