Slavova, A. and Valkov, B. and Tonchev, K. and Daskalova, N. and Nikolova, M. and Bivas, M. and Mateev, P. and Yordanova, R. and Zhelezova, S. (2013) Estimation of errors in text and data processing. [Study Group Report]
|
PDF
187kB |
Abstract
The company Adiss Lab Lts. obtained 1 000 000 medical reports that are either in free form text, or in XML format. One of the main goals of their development is to integrate an algorithm for information extraction (IE) in their platform. The verification of the algorithm’s output for a report is done by a medical doctor (MD) for a certain fee. Validating the correctness of all data would be overwhelming and very expensive. Hence, the problem, as presented by the company, is to provide a method (algorithm) which determines the minimum amount of reports that will validate the correctness of the IE algorithm and a procedure for selecting these reports.
In order to solve the problem we have considered an algorithm-centric approach uses active learning and semi-supervised learning.
Item Type: | Study Group Report |
---|---|
Problem Sectors: | Medical and pharmaceutical Data processing |
Study Groups: | European Study Group with Industry > ESGI 95 (Sofia, Bugaria, Sept 23-27, 2013) |
Company Name: | Adiss Lab Ltd. |
ID Code: | 630 |
Deposited By: | Matthew Hennessy |
Deposited On: | 04 Dec 2013 23:04 |
Last Modified: | 29 May 2015 20:15 |
Repository Staff Only: item control page