Date of Award
5-2024
Document Type
Dissertation
Degree Name
Doctor of Philosophy (PhD)
Department
Mathematics and Systems Engineering
First Advisor
Ryan T. White
Second Advisor
Xianqi Li
Third Advisor
Munevver Mine Subasi
Fourth Advisor
Venkat Keshav Chivukula
Abstract
This dissertation explores applications of representation learning and generative models to challenges in healthcare, astronautics, and aviation.
The first part investigates the use of Generative Adversarial Networks (GANs) to synthesize realistic electronic health record (EHR) data. An initial attempt at training a GAN on the MIMIC-IV dataset encountered stability and convergence issues, motivating a deeper study of 1-Lipschitz regularization techniques for Auxiliary Classifier GANs (AC-GANs). An extensive ablation study on the CIFAR-10 dataset found that Spectral Normalization is key for AC-GAN stability and performance, while Weight Clipping fails to converge without Spectral Normalization. Analysis of the training dynamics provided further insights into the impact of regularization on AC-GAN convergence behavior and stability.
The second part explores approaches for adapting Large Language Models (LLMs) to specialized domains. One study focused on fine-tuning BERT variants to identify suicidal ideation in social media posts. The fine-tuned models were interpreted using Layer Integrated Gradients to understand the linguistic cues used to detect suicidal ideation. Token attributions were leveraged as features for an efficient lightweight classifier. Another study adapted BERT to the aerospace domain through continual pre-training on an aerospace corpus (aeroBERTv2). Aerospace-specific pre-training yielded significant gains over general-purpose checkpoints, especially for smaller models, highlighting the value of tailoring representations to the target domain.
The third part applies 3D Gaussian Splatting to reconstruct satellites from 2D images under the computational constraints of spacecraft hardware. The approach achieves nearly 100x speedup over prior neural rendering methods, enabling real-time performance on-board spacecraft. The learned 3D representations are leveraged to generate synthetic satellite images for data augmentation, significantly improving the performance of object detection models for recognizing key spacecraft components.
Together, these studies advance representation learning and generative modeling techniques to derive insights and solve practical problems across healthcare, astronautics, and aviation. The work provides novel technical contributions while yielding broader insights into the behavior of generative models, the adaptation of LLMs to specialized domains, and the application of 3D reconstruction to space robotics.
Recommended Citation
Nguyen, Van Minh, "Representation Learning for Generative Models with Applications to Healthcare, Astronautics, and Aviation" (2024). Theses and Dissertations. 1460.
https://repository.fit.edu/etd/1460
Included in
Applied Statistics Commons, Artificial Intelligence and Robotics Commons, Databases and Information Systems Commons, Data Science Commons, Health Information Technology Commons, Longitudinal Data Analysis and Time Series Commons, Management and Operations Commons, Numerical Analysis and Computation Commons, Other Mathematics Commons, Statistical Methodology Commons, Statistical Models Commons