Publications | Sahil Vora

2024

Towards Unsupervised Denoising of Magnetic Resonance Imaging

Sahil Vora

2024

Master's thesis for the completion of the Master of Science in Computer Science

Image denoising, a fundamental task in computer vision, poses significant challenges because it is an ill-posed inverse problem. Despite advancements in traditional methods and supervised learning approaches, particularly in medical imaging such as Magnetic Resonance Imaging (MRI) scans, the reliance on paired datasets and known noise distributions remains a practical hurdle. Recent progress in noise statistical independence theory and diffusion models has revitalized research interest, offering promising avenues for unsupervised denoising. However, existing methods often yield overly smoothed results or introduce hallucinated structures, limiting their clinical applicability. This thesis tackles the core challenge of progressing towards unsupervised denoising of MRI scans. It aims to retain intricate details without over-smoothing or introducing artificial structures, thus producing high-quality MRI images. The thesis makes a three-fold contribution. First, it presents a detailed analysis of traditional techniques, early machine learning algorithms for denoising, and new statistics-based models, with an extensive evaluation study of self-supervised denoising methods that highlights their limitations. Second, it conducts an evaluation study of an emerging class of diffusion-based denoising methods, accompanied by additional empirical findings and discussion of their effectiveness and limitations, and proposes solutions to enhance their utility. Lastly, it introduces a novel approach, Unsupervised Multi-stage Ensemble Deep Learning with diffusion models for denoising MRI scans (MEDL). Leveraging diffusion models, this approach operates independently of signal or noise priors and incorporates weighted rescaling of multi-stage reconstructions to balance over-smoothing and hallucination tendencies. Evaluation on benchmark datasets demonstrates average gains of 1 dB in PSNR and 2% in SSIM over existing approaches.

Transferable Variational Feedback Network for Vendor Generalization in Accelerated MRI

Riti Paul, Sahil Vora, Kevin Pak Lun Ding, and 4 more authors

In International Conference on AI in Healthcare, 2024

Magnetic Resonance Imaging (MRI) is a widely used diagnostic tool in medicine. The long acquisition time of MRI remains a practical concern, leading to suboptimal patient experiences. Existing deep learning models for fast MRI acquisition struggle to handle data heterogeneity caused by scanners from different vendors. This study explores the transfer learning capabilities of variational deep learning architectures to address this problem. Using standard ACR protocols, we acquired 135 ACR phantom samples from GE and Siemens 3.0T MR scanners and conducted comprehensive experiments to compare the reconstruction quality of the images produced by different models. Our experiments identified vendor differences as a major challenge in the generalization of accelerated MRI. We propose a feature refinement-based transfer learning method that outperforms the baseline networks by 2.0 dB (PSNR) and 1.8% (SSIM) for GE, and by 3.0 dB (PSNR) and 0.8% (SSIM) for Siemens. Furthermore, we used experience replay to address the problem of catastrophic forgetting. Our experiments establish it as a robust baseline, reducing the drop in PSNR and SSIM performance by 25.55% and 9.5%, respectively.
@inproceedings{paul2024transferable,
  title = {Transferable Variational Feedback Network for Vendor Generalization in Accelerated MRI},
  author = {Paul, Riti and Vora, Sahil and Ding, Kevin Pak Lun and Patel, Ameet and Hu, Leland and Li, Baoxin and Zhou, Yuxiang},
  booktitle = {International Conference on AI in Healthcare},
  pages = {48--63},
  year = {2024},
  organization = {Springer},
}

2023

Instance Adaptive Prototypical Contrastive Embedding for Generalized Zero Shot Learning

Riti Paul, Sahil Vora, and Baoxin Li

2023

Accepted in IJCAI 2023 Workshop on Generalizing from Limited Resources in the Open World

Generalized zero-shot learning (GZSL) aims to classify samples from seen and unseen labels, assuming unseen labels are not accessible during training. Recent advancements in GZSL have been expedited by incorporating contrastive-learning-based (instance-based) embedding in generative networks and leveraging the semantic relationship between data points. However, existing embedding architectures suffer from two limitations: (1) limited discriminability of the embeddings of synthetic features, because fine-grained cluster structures are not considered; (2) inflexible optimization due to restricted scaling mechanisms in existing contrastive embedding networks, leading to overlapping representations in the embedding space. To address (1) and enhance the quality of representations in the embedding space, we propose a margin-based prototypical contrastive learning embedding network that reaps the benefits of prototype-data (cluster quality enhancement) and implicit data-data (fine-grained representations) interaction while providing substantial cluster supervision to the embedding network and the generator. To tackle (2), we propose an instance adaptive contrastive loss that leads to generalized representations for unseen labels with increased inter-class margin. Through comprehensive experimental evaluation, we show that our method can outperform the current state-of-the-art on three benchmark datasets. Our approach also consistently achieves the best unseen performance in the GZSL setting.
@misc{paul2023instance,
  title = {Instance Adaptive Prototypical Contrastive Embedding for Generalized Zero Shot Learning},
  author = {Paul, Riti and Vora, Sahil and Li, Baoxin},
  year = {2023},
  eprint = {2309.06987},
  archiveprefix = {arXiv},
  primaryclass = {cs.CV},
  note = {Accepted in IJCAI 2023 Workshop on Generalizing from Limited Resources in the Open World},
}

2021

Phase Recovery for Holography using Deep Learning

S. Vora

International Research Journal of Engineering and Technology, Mar 2021

Computer-generated holography (CGH) is the method of digitally generating holographic interference patterns. A hologram is a recording of an interference pattern that uses diffraction to reproduce a 3D light field, resulting in an image that has the depth, parallax, and other properties of the original scene. A holographic image can be generated, for example, by digitally computing a holographic interference pattern and printing it onto a mask or film for subsequent illumination by a suitable coherent light source. However, CGH computes this interference pattern iteratively, which is demanding in time and resources. This paper proposes a technique based on deep learning networks that uses a non-iterative computation, which is efficient compared with CGH, and motivates the idea of using this technique to bring together computer vision and the field of optics.
@article{irjet,
  title = {Phase Recovery for Holography using Deep Learning},
  author = {Vora, S.},
  journal = {International Research Journal of Engineering and Technology},
  volume = {08},
  issue = {03},
  pages = {577--580},
  numpages = {4},
  year = {2021},
  month = mar,
  publisher = {Fast Track Publications},
  dimensions = {true},
}