🏥 Production ML
Lung Cancer Detection with Vision Transformers
A research project applying Vision Transformer architectures to lung cancer detection from medical imaging, exploring attention-based models as an alternative to convolutional approaches.
Overview
A deep-learning research project investigating Vision Transformers (ViT) for lung cancer detection from medical images. The goal was to evaluate whether attention-based architectures — which model global relationships across an image rather than purely local features — could be effective for medical image classification where subtle, spatially-distributed cues matter.
Approach
- Vision Transformer backbone — images are split into patches, embedded, and processed through self-attention layers, letting the model weigh relationships across the whole image instead of only local neighbourhoods the way a CNN does.
- Transfer learning — started from pretrained ViT weights and fine-tuned on the medical imaging dataset, a standard strategy for getting transformers to converge on the limited data typical of medical domains.
- Data pipeline — preprocessing and augmentation to expand effective dataset size and improve generalization, with a held-out split for honest evaluation.
- Implementation — built in a Jupyter / PyTorch workflow, keeping the training and evaluation loop transparent and reproducible.
What It Demonstrates
- Comfort with transformer architectures applied to vision, not just NLP.
- Practical handling of the realities of medical ML — small datasets, transfer learning, and careful evaluation.
- An experimental mindset: testing a then-emerging architecture against an established problem to understand its trade-offs.
An academic / research project focused on exploring the architecture and methodology rather than a deployed clinical system.