relation: http://miis.maths.ox.ac.uk/miis/630/ title: Estimation of errors in text and data processing creator: Slavova, A. creator: Valkov, B. creator: Tonchev, K. creator: Daskalova, N. creator: Nikolova, M. creator: Bivas, M. creator: Mateev, P. creator: Yordanova, R. creator: Zhelezova, S. subject: Medical and pharmaceutical subject: Data processing description: The company Adiss Lab Lts. obtained 1 000 000 medical reports that are either in free form text, or in XML format. One of the main goals of their development is to integrate an algorithm for information extraction (IE) in their platform. The verification of the algorithm’s output for a report is done by a medical doctor (MD) for a certain fee. Validating the correctness of all data would be overwhelming and very expensive. Hence, the problem, as presented by the company, is to provide a method (algorithm) which determines the minimum amount of reports that will validate the correctness of the IE algorithm and a procedure for selecting these reports. In order to solve the problem we have considered an algorithm-centric approach uses active learning and semi-supervised learning. date: 2013 type: Study Group Report type: NonPeerReviewed format: application/pdf language: en identifier: http://miis.maths.ox.ac.uk/miis/630/1/p2_adiss.pdf identifier: Slavova, A. and Valkov, B. and Tonchev, K. and Daskalova, N. and Nikolova, M. and Bivas, M. and Mateev, P. and Yordanova, R. and Zhelezova, S. (2013) Estimation of errors in text and data processing. [Study Group Report]