cv | Panagiotis (Panos) Kaliosis

Basics

Name	Panagiotis (Panos) Kaliosis
Label	PhD Student
Email	pkaliosis@cs.stonybrook.edu
Summary	PhD Student in Computer Science at Stony Brook University, working on multimodal learning under the supervision of Prof. Dimitris Samaras.

Work

2024.08 - Present
Research Assistant

Computer Vision Laboratory, Stony Brook University, NY, USA

As a member of SBU's CVLab, I have been working on two funded research projects under the supervision of Prof. Dimitris Samaras, Prof. Andrew Schwartz and Prof. Owen Rambow. I focus on modeling vision and language in their human and cognitive contexts, with the goal of improving the way machines 'think' and aligning them with human Theory of Mind (ToM).
- Computer Vision
- Natural Language Processing
2024.02 - 2024.07
Research Engineer

OrbDB / KTH Royal Institute of Technology, Stockholm, Sweden

Conducted research and engineering on a Graph Machine Learning and Uncertainty Quantification project. OrbDB is a fresh start in database design, combining the latest advances in AI and Data Systems to build the world’s first responsible knowledge graph database.
- Graph Neural Networks
- Graph Databases
2023.04 - 2024.06
Research Engineer

National Center for Scientific Research Demokritos, Athens, Greece

As a member of the Multimedia Analysis Group / Computational Intelligence Laboratory, I worked on several multimodal learning projects revolving around speech-to-text applications, as well as signal processing challenges focusing on mental health state detection.
- Speech Recognition
- Computer Vision
2022.10 - 2024.03
Research Assistant

Information Processing Laboratory, Athens University of Economics and Business, Athens, Greece

As a member of the NLP Group / Information Processing Laboratory, I wrote my thesis on automatic medical image diagnosis (Diagnostic Captioning) under the supervision of Assistant Prof. John Pavlopoulos and Prof. Ion Androutsopoulos. This work led to an ACL Findings publication, as well as 1st place in the ImageCLEFmedical 2025 Automatic Image Captioning task.
- Speech Recognition
- Computer Vision

Education

2024.08 - Present

New York, USA
PhD

Stony Brook University

Computer Science
2022.10 - 2023.11

Athens, Greece
MSc

Athens University of Economics and Business

Computer Science
2022.01 - 2022.06

Stockholm, Sweden
BSc/Exchange Semester

Stockholm University

Computer Science
2018.10 - 2022.10

Athens, Greece
BSc

Athens University of Economics and Business

Computer Science

Publications

2024.08.15

A Data-Driven Guided Decoding Mechanism for Diagnostic Captioning

Association for Computational Linguistics (ACL)

We proposed a novel training framework for HTR and ASR designed to align the model’s output character frequency distributions with empirical ones, aiming to improve robustness to temporal and contextual intra-dataset shifts.
2024.04.18

A Self-Supervised Learning Approach for Detecting Non-Psychotic Relapses Using Wearable-Based Digital Phenotyping

IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper presents MagCIL’s approach for the 1st track of the '2nd e-Prevention challenge: Psychotic and Non-Psychotic Relapse Detection using Wearable-Based Digital Phenotyping'. First, we present our approach for preprocessing and extracting features from the wearable’s raw data. We then propose a Transformer model for learning self-supervised representations from augmented features, trained on data from the non-relapse days of each of the 9 patients in the challenge. We adopt two unsupervised methods for detecting relapse days as outliers. A separate unsupervised model is tuned for each patient using the validation data of the challenge.
2024.04.02

Greek2MathTex: A Greek Speech-to-Text Framework for LaTeX Equations Generation

Proceedings of the 13th Hellenic Conference on Artificial Intelligence

We proposed an end-to-end system that harnesses the power of Automatic Speech Recognition (ASR) and Natural Language Processing (NLP) techniques to enable users to verbally dictate mathematical expressions and equations in natural language, which are subsequently converted into LaTeX format.
2023.09.17

AUEB NLP group at ImageCLEFmedical caption 2023

Proceedings of the 2023 Conference and Labs of the Evaluation Forum (CLEF)

We presented the methods that the AUEB NLP Group experimented with during its participation in the 7th edition of the ImageCLEFmedical Caption sub-tasks, namely Concept Detection and Caption Prediction. We ranked 1st in Concept Detection and 3rd in Caption Prediction.

Languages

	Greek
	Native speaker

	English
	Fluent

	Spanish
	Intermediate

	German
	Beginner
