cv
Basics
Name | Panagiotis (Panos) Kaliosis |
Label | PhD Student |
pkaliosis@cs.stonybrook.edu | |
Summary | PhD Student in Computer Science at Stony Brook University, working on multimodal learning under the supervision of Prof. Dimitris Samaras. |
Work
-
2024.08 - Present Research Assistant
Computer Vision Laboratory, Stony Brook University, NY, USA
As a member of SBU's CVLab I have been working on two funded research projects under the supervision of Prof. Dimitris Samaras, Prof. Andrew Schwartz and Prof. Owen Rambow. I am focusing on modeling vision and language in its human and cognitive contexts in an attempt to improve the way machines 'think' aiming to align them with human's Theory of Mind (ToM).
- Computer Vision
- Natural Language Processing
-
2024.02 - 2024.07 Research Engineer
OrbDB / KTH Royal Institute of Technology, Stockholm, Sweden
Doing research and engineering on a Graph Machine Learning and Uncertainty Quantification project. OrbDB is a fresh start in database design, combining the latest advances in AI and Data Systems to build world’s first responsible knowledge graph database.
- Graph Neural Networks
- Graph Databases
-
2023.04 - 2024.06 Research Engineer
National Center for Scientific Research Demokritos, Athens, Greece
As a member of the Multimedia Analysis Group / Computational Intelligence Laboratory, I worked on several multimodal learning projects revolving around speech-to-text applications, as well as signal processing challenges focusing on mental health state detection.
- Speech Recognition
- Computer Vision
-
2022.10 - 2024.03 Research Assistant
Information Processing Laboratory, Athens University of Economics and Business, Athens, Greece
As a member of the NLP Group / Information Processing Laboratory, I wrote my thesis on automatic medical image diagnosis (Diagnostic Captioning) under the supervision of Assistant Prof. John Pavlopoulos and Prof. Ion Androutsopoulos. This work led to an ACL Findings publication, as well as capturing the 1st place in ImageCLEFmedical 2025 Automatic Image Captioning task.
- Speech Recognition
- Computer Vision
Education
-
2024.08 - Present New York, USA
-
2022.10 - 2023.11 Athens, Greece
-
2022.01 - 2022.06 Stockholm, Sweden
-
2018.10 - 2022.10 Athens, Greece
Publications
-
2024.08.15 A Data-Driven Guided Decoding Mechanism for Diagnostic Captioning
Association for Computational Linguistics (ACL)
We proposed a novel training framework for HTR and ASR designed to align the model’s output character frequency distributions with empirical ones, aiming to improve robustness to temporal and contextual intra-dataset shifts.
-
2024.04.18 A Self-Supervised Learning Approach for Detecting Non-Psychotic Relapses Using Wearable-Based Digital Phenotyping
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
it concerned MagCIL’s approach for the 1st track of the '2nd e-Prevention challenge: Psychotic and Non-Psychotic Relapse Detection using Wearable-Based Digital Phenotyping'. First we present our approach for preprocessing and extracting features from the wearable’s raw data. We then propose a Transformer model for learning self-supervised representations from augmented features, trained on data from non-relapse days from each of the 9 patients of the challenge. We adopt two unsupervised methods for detecting relapse days as outliers. A separate unsupervised model is tuned for each patient using the validation data of the challenge.
-
2024.04.02 Greek2MathTex: A Greek Speech-to-Text Framework for LaTeX Equations Generation
Proceedings of the 13th Hellenic Conference on Artificial Intelligence
We proposed an end-to-end system that harnesses the power of Automatic Speech Recognition (ASR) and Natural Language Processing (NLP) techniques to enable users to verbally dictate mathematical expressions and equations in natural language, which are subsequently converted into LaTeX format.
-
2023.09.17 AUEB NLP group at ImageCLEFmedical caption 2023
Proceedings of the 2023 Conference and Labs of the Evaluation Forum (CLEF)
We presented the methods that the AUEB NLP Group experimented with during its participation in the 7th edition of the ImageCLEFmedical Caption sub-tasks, namely Concept Detection and Caption Prediction. We ranked 1st in Concept Detection and 3rd in Caption Prediction.
Languages
Greek | |
Native speaker |
English | |
Fluent |
Spanish | |
Intermediate |
German | |
Beginner |