11 October 2024 - "Code-free Information Retrieval (IR) over a Medical Text Base using RapidMiner Studio" by Dr. Nikos Giatrakos

Talk by Asst. Prof. Nikos Giatrakos titled "Code-free Information Retrieval (IR) over a Medical Text Base using RapidMiner Studio"

When

11 October 2024, 17:00 Athens time, Science Building 145Π58

Abstract

Information Retrieval (IR) is the part of data science concerned with obtaining relevant information from large collections of unstructured or semi-structured data, such as documents, web pages, or multimedia, in response to a user’s query. It typically relies on algorithms for indexing, searching, ranking, and filtering data to improve search relevance and efficiency. In this seminar, we will introduce the basic concepts of IR and build a basic IR system, without coding, using Altair RapidMiner Studio. As our case study, we will use a medical text base, namely the CFC Cystic Fibrosis text base, a specialized collection of scientific and medical abstracts, bibliographic records, and research articles focused on cystic fibrosis. It covers topics such as clinical studies, microbiology (e.g., Pseudomonas aeruginosa), treatment outcomes, and related biomedical research. Participants will gain hands-on experience by following the steps of IR system development alongside the lecturer. They are therefore encouraged to download and install RapidMiner Studio and its Text Processing Extension before the seminar.

Short Bio

Nikos Giatrakos is an Assistant Professor at the School of Electrical and Computer Engineering of the Technical University of Crete (Greece). He received his BSc degree in Computer Science from the University of Piraeus (Greece) in 2006, his MSc degree in Information Systems from the Athens University of Economics and Business (Greece) in 2007, and his PhD degree in Computer Science from the University of Piraeus (Greece) in 2012. His research interests lie in the broad area of Big Data Management algorithms, software architectures, and systems, including Big Streaming Data & Real-Time Analytics, Distributed/Decentralized Big Data Processing, Federated Machine Learning, Edge-to-Cloud Big Data Management, Synopses for Massive Data/Approximate Query Processing, and Complex Event Processing. He was a recipient of the Best Demo Award at ACM CIKM 2020. He has contributed as one of the key investigators in several recent EU projects, and he has served as the co-coordinator of the EU H2020 project INFORE and as a Principal Investigator in the EU Horizon project EVENFLOW.