20–24 May 2024
Baia Mare, Technical University of Cluj-Napoca
Europe/Bucharest timezone

Creating medical datasets using Robot Process Automation

20 May 2024, 15:45
15m
A3 (Baia Mare, Technical University of Cluj-Napoca)

A3

Baia Mare, Technical University of Cluj-Napoca

Medical Informatics Medical Informatics

Speaker

Mr Iulian Roman (West University of Timisoara)

Description

A dataset is a collection of information, usually structured in a format that can be used in research or training various applications such as machine learning. Datasets are fundamental to many analytical processes because they provide raw data that can be used in different applications. While some datasets are readily available, others must be specially curated. Creating a dataset often involves automated data collection methods, with web scraping being a popular technique. Web scraping employs scripts to extract data from online sources, demanding robust programming skills and considerable maintenance effort. It is important to note that the method can break site policies because it may violate terms of service agreements. A proposed solution for gathering the required information is to create a Robot Process Automation (RPA) flow, which collects information from various sources as a normal user would do manually. A use case of RPA flow involves using it to compile a medical dataset that contains information about clinical trials for various drugs. The dataset will be utilized in researching the effects of placebo drugs on people. The information is available on the clinicaltrails.gov site and the results of different trials can be downloaded manually as files. Automating this process can be done by creating a flow in UiPath by linking predefined activity blocks, which simplifies the creation and maintenance of the workflow. Running the flow will follow the steps that a normal user would do for downloading the results of different trials from the site. This approach eases data collection, allowing medical staff to generate datasets, without the need for deep technical skills, that can be used in different investigations. In conclusion, an RPA flow is a straightforward way to gather information for a dataset, circumventing the complexities of traditional programming-based approaches like web scraping.

Keywords: Datasets; Robot Process Automation; UiPath; Data Collection; Data Extraction

Primary authors

Mr Călin Munteanu (“Victor Babeș” University of Medicine and Pharmacy) Mr Iulian Roman (West University of Timisoara) Mr Sebastian-Aurelian Ștefănigă (Romanian Society of Medical Informatics)

Presentation materials

There are no materials yet.

Proceedings

Slides